September 15, 2022

No labels? No problem! New tool overcomes major hurdle in clinical AI design

Chest X-ray. Credit: Pixabay/CC0 Public Domain

Harvard Medical School scientists and colleagues at Stanford University have developed an artificial intelligence diagnostic tool that can detect diseases on chest X-rays directly from natural-language descriptions contained in accompanying clinical reports.

The step is deemed a major advance in clinical AI design because most current AI models require laborious human annotation of vast reams of data before the labeled data are fed into the model to train it.

A report on the work, published Sept. 15 in Nature Biomedical Engineering, shows that the model, called CheXzero, performed on par with human radiologists in its ability to detect pathologies on chest X-rays.

The team has made the code for the model publicly available for other researchers.

Most AI models require labeled datasets during their "training" so they can learn to correctly identify pathologies. This process is especially burdensome for medical image-interpretation tasks since it involves large-scale annotation by human clinicians, which is often expensive and time-consuming. For instance, to label a chest X-ray dataset, expert radiologists would have to look at hundreds of thousands of X-ray images one by one and explicitly annotate each one with the conditions detected. While more recent AI models have tried to address this labeling bottleneck by learning from unlabeled data in a "pre-training" stage, they eventually require fine-tuning on labeled data to achieve high performance.

By contrast, the new model is self-supervised: it needs no hand-labeled data before or after training. It relies solely on chest X-rays and the English-language notes found in the accompanying X-ray reports.

"We're living the early days of the next-generation medical AI models that are able to perform flexible tasks by directly learning from text," said study lead investigator Pranav Rajpurkar, assistant professor of biomedical informatics in the Blavatnik Institute at HMS. "Up until now, most AI models have relied on manual annotation of huge amounts of data—to the tune of 100,000 images—to achieve a high performance. Our method needs no such disease-specific annotations.

"With CheXzero, one can simply feed the model a chest X-ray and corresponding radiology report, and it will learn that the image and the text in the report should be considered as similar—in other words, it learns to match chest X-rays with their accompanying report," Rajpurkar added. "The model is able to eventually learn how concepts in the unstructured text correspond to visual patterns in the image."

The model was "trained" on a publicly available dataset containing more than 377,000 chest X-rays and more than 227,000 corresponding clinical notes. Its performance was then tested on two separate datasets of chest X-rays and corresponding notes collected from two different institutions, one of which was in a different country. This diversity of datasets was meant to test whether the model performs equally well when exposed to clinical notes that use different terminology to describe the same finding.
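The matching objective Rajpurkar describes, in which each X-ray is pulled toward its own report and away from the others, is commonly implemented as a symmetric contrastive loss. The following is a minimal sketch of that idea in plain Python, not the team's code: the function names, the temperature value and the two-dimensional toy embeddings are all assumptions made for illustration, and a real system would produce the embeddings with trained image and text encoders.

```python
import math

def normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def similarity_matrix(image_embs, text_embs):
    """Cosine similarity between every image and every report in the batch."""
    imgs = [normalize(v) for v in image_embs]
    txts = [normalize(v) for v in text_embs]
    return [[sum(a * b for a, b in zip(i, t)) for t in txts] for i in imgs]

def cross_entropy_diagonal(logits):
    """Mean cross-entropy where the correct column for row k is column k."""
    total = 0.0
    for k, row in enumerate(logits):
        m = max(row)  # subtract the max for numerical stability
        log_sum = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_sum - row[k]
    return total / len(logits)

def contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric loss: match images to reports and reports to images.

    The temperature of 0.07 is an assumed illustrative value.
    """
    sims = similarity_matrix(image_embs, text_embs)
    logits = [[s / temperature for s in row] for row in sims]
    transposed = [list(col) for col in zip(*logits)]
    return 0.5 * (cross_entropy_diagonal(logits) + cross_entropy_diagonal(transposed))

# Hypothetical toy embeddings: image k belongs with report k.
image_embs = [[1.0, 0.0], [0.0, 1.0]]
report_embs = [[0.9, 0.1], [0.1, 0.9]]
swapped_reports = [report_embs[1], report_embs[0]]

loss_matched = contrastive_loss(image_embs, report_embs)
loss_swapped = contrastive_loss(image_embs, swapped_reports)
print(loss_matched < loss_swapped)
```

The loss is small when each image sits closest to its own report and large when the pairs are swapped, which is the signal that lets training proceed without any disease labels.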

Upon testing, CheXzero successfully identified pathologies that were not explicitly annotated by human clinicians. It outperformed other self-supervised AI tools and performed with accuracy similar to that of human radiologists.
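One way a model trained only to match images with text can flag a pathology it was never labeled for is to compare the image against a pair of short text prompts, one asserting the finding and one denying it. The sketch below assumes that prompt-pair approach for illustration; the article itself does not spell out the scoring step. The embeddings are hypothetical toy vectors standing in for the outputs of trained encoders, and the function names and temperature are likewise assumptions.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def zero_shot_probability(image_emb, positive_emb, negative_emb, temperature=0.07):
    """Softmax over the image's similarity to a positive and a negative prompt.

    Returns the share of probability assigned to the positive prompt.
    """
    pos = cosine(image_emb, positive_emb) / temperature
    neg = cosine(image_emb, negative_emb) / temperature
    m = max(pos, neg)  # subtract the max for numerical stability
    e_pos = math.exp(pos - m)
    e_neg = math.exp(neg - m)
    return e_pos / (e_pos + e_neg)

# Hypothetical embedding of one chest X-ray.
image_emb = [0.8, 0.2, 0.1]

# Hypothetical embeddings for prompt pairs such as "pneumonia" / "no pneumonia".
prompt_embs = {
    "pneumonia": ([0.9, 0.1, 0.0], [0.0, 0.2, 0.9]),
    "edema": ([0.0, 0.1, 0.9], [0.9, 0.2, 0.1]),
}

scores = {
    name: zero_shot_probability(image_emb, pos, neg)
    for name, (pos, neg) in prompt_embs.items()
}

for name, score in scores.items():
    print(name, round(score, 3))
```

Because the finding is named in text at query time, adding a new pathology in this setup means writing a new prompt pair rather than collecting a new labeled dataset.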

The approach, the researchers said, could eventually be applied to imaging modalities well beyond X-rays, including CT scans, MRIs, and echocardiograms.

"CheXzero shows that accuracy of complex medical image interpretation no longer needs to remain at the mercy of large labeled datasets," said study co-first author Ekin Tiu, an undergraduate student at Stanford and a visiting researcher at HMS. "We use chest X-rays as a driving example, but in reality CheXzero's capability is generalizable to a vast array of medical settings where unstructured data is the norm, and precisely embodies the promise of bypassing the large-scale labeling bottleneck that has plagued the field of medical machine learning."

More information: Pranav Rajpurkar, Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning, Nature Biomedical Engineering (2022). DOI: 10.1038/s41551-022-00936-9. www.nature.com/articles/s41551-022-00936-9

Journal information: Nature Biomedical Engineering

Provided by Harvard Medical School

Citation: No labels? No problem! New tool overcomes major hurdle in clinical AI design (2022, September 15) retrieved 11 July 2024 from https://medicalxpress.com/news/2022-09-problem-tool-major-hurdle-clinical.html

