June 15, 2023

Researchers test AI-powered chatbot's medical diagnostic ability

by Jacqueline Mitchell, Beth Israel Deaconess Medical Center

In a recent experiment published in JAMA, physician-researchers at Beth Israel Deaconess Medical Center (BIDMC) tested one well-known publicly available chatbot's ability to make accurate diagnoses in challenging medical cases. The team found that the generative AI, Chat-GPT 4, selected the correct diagnosis as its top diagnosis nearly 40 percent of the time and provided the correct diagnosis in its list of potential diagnoses in two-thirds of challenging cases.

Generative AI refers to a type of artificial intelligence that uses patterns and information it has been trained on to create new content, rather than simply processing and analyzing existing data. Some of the most well-known examples of generative AI are so-called chatbots, which use a branch of artificial intelligence called natural language processing (NLP) that allows computers to understand, interpret and generate human-like language.

Generative AI chatbots are powerful tools poised to revolutionize creative industries, education, customer service and more. However, little is known about their potential performance in the clinical setting, such as complex diagnostic reasoning.

"Recent advances in artificial intelligence have led to generative AI models that are capable of detailed text-based responses that score highly in standardized medical examinations," said Adam Rodman, MD, MPH, co-director of the Innovations in Media and Education Delivery (iMED) Initiative at BIDMC and an instructor in medicine at Harvard Medical School.

"We wanted to know if such a generative model could 'think' like a doctor, so we asked one to solve standardized complex diagnostic cases used for educational purposes. It did really, really well."

To assess the chatbot's diagnostic skills, Rodman and colleagues used clinicopathological case conferences (CPCs), a series of complex and challenging patient cases including relevant clinical and laboratory data, imaging studies, and histopathological findings published in the New England Journal of Medicine for educational purposes.

Evaluating 70 CPC cases, the artificial intelligence exactly matched the final CPC diagnosis in 27 (39 percent) of cases. In 64 percent of the cases, the final CPC diagnosis was included in the AI's differential—a list of possible conditions that could account for a patient's symptoms, medical history, clinical findings and laboratory or imaging results.

"While Chatbots cannot replace the expertise and knowledge of a trained medical professional, generative AI is a promising potential adjunct to human cognition in diagnosis," said first author Zahir Kanjee, MD, MPH, a hospitalist at BIDMC and assistant professor of medicine at Harvard Medical School.

"It has the potential to help physicians make sense of complex medical data and broaden or refine our diagnostic thinking. We need more research on the optimal uses, benefits and limits of this technology, and a lot of privacy issues need sorting out, but these are exciting findings for the future of diagnosis and patient care."

"Our study adds to a growing body of literature demonstrating the promising capabilities of AI technology," said co-author Byron Crowe, MD, an internal medicine physician at BIDMC and an instructor in medicine at Harvard Medical School.

"Further investigation will help us better understand how these new AI models might transform health care delivery."

More information: Zahir Kanjee et al, Accuracy of a Generative Artificial Intelligence Model in a Complex Diagnostic Challenge, JAMA (2023). DOI: 10.1001/jama.2023.8288

Journal information: Journal of the American Medical Association , New England Journal of Medicine

Provided by Beth Israel Deaconess Medical Center

Citation: Researchers test AI-powered chatbot's medical diagnostic ability (2023, June 15) retrieved 11 August 2024 from https://medicalxpress.com/news/2023-06-ai-powered-chatbot-medical-diagnostic-ability.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Meta guru says ChatGPT-style AI is out-of-date

22 shares

Feedback to editors

US health regulator rejects MDMA treatment for PTSD, for now

Aug 10, 2024

Study finds baked potatoes can improve heart health for diabetics

Aug 9, 2024

National study shows how internal medicine chief residency has changed over 20 years

Aug 9, 2024

Vegan diet better than Mediterranean, finds new research

Aug 9, 2024

Memory problems in old age linked to a key enzyme, study in mice finds

Aug 9, 2024

Key factor found in drug-context links, relapse

Aug 9, 2024

Researchers outline promises, challenges of understanding AI for biological discovery

Aug 9, 2024

The dengue vaccine is effective and safe: Confirmation from the first global meta-analysis

Aug 9, 2024

'PTNM' system provides new classification for Peyronie's disease and penile curvature

Aug 9, 2024

Researchers crack a key celiac mystery: Where the gluten reaction begins

Aug 9, 2024

Load comments (0)

Researchers test AI-powered chatbot's medical diagnostic ability

US health regulator rejects MDMA treatment for PTSD, for now

Study finds baked potatoes can improve heart health for diabetics

National study shows how internal medicine chief residency has changed over 20 years

Vegan diet better than Mediterranean, finds new research

Memory problems in old age linked to a key enzyme, study in mice finds

Key factor found in drug-context links, relapse

Researchers outline promises, challenges of understanding AI for biological discovery

The dengue vaccine is effective and safe: Confirmation from the first global meta-analysis

'PTNM' system provides new classification for Peyronie's disease and penile curvature

Researchers crack a key celiac mystery: Where the gluten reaction begins

Meta guru says ChatGPT-style AI is out-of-date

New research suggests AI image generation using DALL-E 2 has promising future in radiology

Amazon joins generative AI race, targets tech at cloud customers

Is generative AI bad for the environment? A computer scientist explains the carbon footprint of ChatGPT and its cousins

Researchers use new deep learning approach to enable analysis of electrocardiograms as language

ChatGPT does not pass American College of Gastroenterology tests

Researchers outline promises, challenges of understanding AI for biological discovery

A new way to measure bipolar disorder: Focus on the 'spikes'

Researchers report potent antibiotic that overcomes resistance

New findings on CARD14 protein's role in eczema and psoriasis

Computer simulations clarify how breast cancer spreads

MAPLEX exosome-based delivery system carries therapeutic proteins into cells

Phys.org

Tech Xplore

Science X

Researchers test AI-powered chatbot's medical diagnostic ability

US health regulator rejects MDMA treatment for PTSD, for now

Study finds baked potatoes can improve heart health for diabetics

National study shows how internal medicine chief residency has changed over 20 years

Vegan diet better than Mediterranean, finds new research

Memory problems in old age linked to a key enzyme, study in mice finds

Key factor found in drug-context links, relapse

Researchers outline promises, challenges of understanding AI for biological discovery

The dengue vaccine is effective and safe: Confirmation from the first global meta-analysis

'PTNM' system provides new classification for Peyronie's disease and penile curvature

Researchers crack a key celiac mystery: Where the gluten reaction begins

Related Stories

Meta guru says ChatGPT-style AI is out-of-date

New research suggests AI image generation using DALL-E 2 has promising future in radiology

Amazon joins generative AI race, targets tech at cloud customers

Is generative AI bad for the environment? A computer scientist explains the carbon footprint of ChatGPT and its cousins

Researchers use new deep learning approach to enable analysis of electrocardiograms as language

ChatGPT does not pass American College of Gastroenterology tests

Recommended for you

Researchers outline promises, challenges of understanding AI for biological discovery

A new way to measure bipolar disorder: Focus on the 'spikes'

Researchers report potent antibiotic that overcomes resistance

New findings on CARD14 protein's role in eczema and psoriasis

Computer simulations clarify how breast cancer spreads

MAPLEX exosome-based delivery system carries therapeutic proteins into cells

Newsletter sign up

Donate and enjoy an ad-free experience