June 19, 2023

Clinical utility, not 'prettiness,' best metric for evaluating AI improvements to medical imaging, say engineers

by Shawn Ballard, Washington University in St. Louis

Medical imaging plays an essential role in diagnosis and treatment for an array of conditions. From X-rays to see a broken bone or a tooth cavity to SPECT scans for spotting heart defects, doctors use medical imaging to look inside the body, find disease and treat it appropriately. But what happens when those images aren't clear?

Recent advances in artificial intelligence have opened the door to using AI-based methods for denoising, or cleaning up, medical images. However, before these tools can be used in clinical settings for real patient care, they need to be rigorously evaluated, said Abhinav Jha, assistant professor of biomedical engineering in the McKelvey School of Engineering and of radiology at Mallinckrodt Institute of Radiology (MIR) in the School of Medicine, both at Washington University in St. Louis.

In a study published in Medical Physics, Jha and collaborators at MIR evaluated a commonly used AI-based approach to denoise cardiac SPECT images. The team assessed the performance of the approach in two ways: How visually similar were denoised images to normal images and how well did the denoised image perform in the clinically relevant task of detecting heart defects?

"Rather alarmingly, while the visual-similarity-based metrics suggested that the AI-based denoising technique improved performance, it was actually having no significant impact, and in some cases, it was even degrading performance on clinical tasks," Jha said. "This emphasizes the important need for performing evaluation of AI algorithms on clinical tasks and not just relying on visual similarity as a measure of performance."

In the study, first author Zitong Yu, a doctoral student in Jha's lab, found that the AI denoising technique tended to smooth out cardiac SPECT images, which reduced noise as intended, but also reduced the contrast of the heart defect that doctors need to make accurate diagnoses. "This is precisely what we want to prevent from happening in actual medical practice," Yu said.

The study advocates for task-based evaluation of AI-based denoising methods to assess the usefulness of AI-processed images. "Ensuring AI-based denoising works well for real clinical tasks—not just aesthetically—would mean big benefits for patients by producing high-quality images in less time or with reduced radiation doses," said collaborator Robert J. Gropler, professor of radiology and senior vice chair and division director of radiological sciences at MIR.

Jha and his team have been developing a new denoising technique along this direction, and their presentation on this topic received an honorable mention at the SPIE Medical Imaging meeting. Jha also led a multi-institutional, multi-agency team tasked with developing a framework for evaluating AI-based medical imaging methods. Their guidelines, Recommendations for Evaluation of AI for Nuclear Medicine (RELAINCE), were released in 2022 and informed this latest research.

More information: Zitong Yu et al, Need for objective task‐based evaluation of deep learning‐based denoising methods: A study in the context of myocardial perfusion SPECT, Medical Physics (2023). DOI: 10.1002/mp.16407

Provided by Washington University in St. Louis

Citation: Clinical utility, not 'prettiness,' best metric for evaluating AI improvements to medical imaging, say engineers (2023, June 19) retrieved 15 August 2024 from https://medicalxpress.com/news/2023-06-clinical-prettiness-metric-ai-medical.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Evaluation of AI for medical imaging: A key requirement for clinical translation

50 shares

Feedback to editors

3D body scanner with AI predicts metabolic syndrome risk

7 hours ago

Sick days: Assessing the economic costs of long COVID

7 hours ago

New way to extend 'shelf life' of blood stem cells can improve gene therapy

8 hours ago

Novel test helps identify patients at high risk of esophageal cancers

8 hours ago

Mouse study finds probiotics during pregnancy help moms and babies

8 hours ago

New study uncovers how brain cells form precise circuits before experience is able to shape wiring

8 hours ago

The brain creates parallel copies for a single memory, new study reveals

8 hours ago

New research discovers differences in oxygen physiology in people with Down syndrome

9 hours ago

Nasal spray flu vaccine candidate shows promise when administered alongside high dose annual shot

9 hours ago

Researchers confirm genetic link between Alzheimer's and heart disease

9 hours ago

Load comments (0)

Clinical utility, not 'prettiness,' best metric for evaluating AI improvements to medical imaging, say engineers

3D body scanner with AI predicts metabolic syndrome risk

Sick days: Assessing the economic costs of long COVID

New way to extend 'shelf life' of blood stem cells can improve gene therapy

Novel test helps identify patients at high risk of esophageal cancers

Mouse study finds probiotics during pregnancy help moms and babies

New study uncovers how brain cells form precise circuits before experience is able to shape wiring

The brain creates parallel copies for a single memory, new study reveals

New research discovers differences in oxygen physiology in people with Down syndrome

Nasal spray flu vaccine candidate shows promise when administered alongside high dose annual shot

Researchers confirm genetic link between Alzheimer's and heart disease

Evaluation of AI for medical imaging: A key requirement for clinical translation

Distinguishing real from fake in the age of synthetic images

Researchers develop a new method for denoising images

Photon-counting CT offers superior imaging in babies with heart defects

Biomarkers for Parkinson's disease sought through imaging

Decreasing noise in acoustic underwater transmissions using deep neural networks

AI sperm checker enhances IVF success

Leading AI models struggle to identify genetic conditions from patient-written descriptions, researchers find

Swipe up! Health apps deliver real results en masse

Algorithm achieves 98% accuracy in disease prediction via tongue color

Quantitative ultrasound parameters offer new tool for diagnosing lung disease

AI accurately diagnoses genetic condition from facial photographs

Phys.org

Tech Xplore

Science X

Clinical utility, not 'prettiness,' best metric for evaluating AI improvements to medical imaging, say engineers

3D body scanner with AI predicts metabolic syndrome risk

Sick days: Assessing the economic costs of long COVID

New way to extend 'shelf life' of blood stem cells can improve gene therapy

Novel test helps identify patients at high risk of esophageal cancers

Mouse study finds probiotics during pregnancy help moms and babies

New study uncovers how brain cells form precise circuits before experience is able to shape wiring

The brain creates parallel copies for a single memory, new study reveals

New research discovers differences in oxygen physiology in people with Down syndrome

Nasal spray flu vaccine candidate shows promise when administered alongside high dose annual shot

Researchers confirm genetic link between Alzheimer's and heart disease

Related Stories

Evaluation of AI for medical imaging: A key requirement for clinical translation

Distinguishing real from fake in the age of synthetic images

Researchers develop a new method for denoising images

Photon-counting CT offers superior imaging in babies with heart defects

Biomarkers for Parkinson's disease sought through imaging

Decreasing noise in acoustic underwater transmissions using deep neural networks

Recommended for you

AI sperm checker enhances IVF success

Leading AI models struggle to identify genetic conditions from patient-written descriptions, researchers find

Swipe up! Health apps deliver real results en masse

Algorithm achieves 98% accuracy in disease prediction via tongue color

Quantitative ultrasound parameters offer new tool for diagnosing lung disease

AI accurately diagnoses genetic condition from facial photographs

Newsletter sign up

Donate and enjoy an ad-free experience