February 9, 2023

ChatGPT can (almost) pass the US Medical Licensing Exam

ChatGPT can score at or around the approximately 60% passing threshold for the United States Medical Licensing Exam (USMLE), with responses that make coherent, internal sense and contain frequent insights, according to a study published February 9, 2023, in the open-access journal PLOS Digital Health by Tiffany Kung, Victor Tseng, and colleagues at AnsibleHealth.

ChatGPT is a new artificial intelligence (AI) system, known as a large language model (LLM), designed to generate human-like writing by predicting upcoming word sequences. Unlike most chatbots, ChatGPT cannot search the internet. Instead, it generates text using word relationships predicted by its internal processes.

Kung and colleagues tested ChatGPT's performance on the USMLE, a highly standardized and regulated series of three exams (Steps 1, 2CK, and 3) required for medical licensure in the United States. Taken by medical students and physicians-in-training, the USMLE assesses knowledge spanning most medical disciplines, ranging from biochemistry, to diagnostic reasoning, to bioethics.

After screening to remove image-based questions, the authors tested the software on 350 of the 376 public questions available from the June 2022 USMLE release.

After indeterminate responses were removed, ChatGPT scored between 52.4% and 75.0% across the three USMLE exams. The passing threshold each year is approximately 60%. ChatGPT also demonstrated 94.6% concordance across all its responses and produced at least one significant insight (something that was new, non-obvious, and clinically valid) for 88.9% of its responses. Notably, ChatGPT exceeded the performance of PubMedGPT, a counterpart model trained exclusively on biomedical domain literature, which scored 50.8% on an older dataset of USMLE-style questions.

While the relatively small input size restricted the depth and range of analyses, the authors note their findings provide a glimpse of ChatGPT's potential to enhance medical education, and eventually, clinical practice. For example, they add, clinicians at AnsibleHealth already use ChatGPT to rewrite jargon-heavy reports for easier patient comprehension.

"Reaching the passing score for this notoriously difficult expert exam, and doing so without any human reinforcement, marks a notable milestone in clinical AI maturation," say the authors.

Author Dr. Tiffany Kung added that ChatGPT's role in this research went beyond being the study subject: "ChatGPT contributed substantially to the writing of [our] manuscript... We interacted with ChatGPT much like a colleague, asking it to synthesize, simplify and offer counterpoints to drafts in progress... All of the co-authors valued ChatGPT's input."

More information: Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models, PLOS Digital Health (2023). DOI: 10.1371/journal.pdig.0000198

Journal information: PLOS Digital Health

Provided by Public Library of Science

Citation: ChatGPT can (almost) pass the US Medical Licensing Exam (2023, February 9) retrieved 25 April 2024 from https://medicalxpress.com/news/2023-02-chatgpt-medical-exam.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

ChatGPT found to be capable of passing exams for MBA and Medical Licensing Exam

223 shares

Feedback to editors

Link between depression and cardiovascular disease explained: They partly develop from same gene module

1 hour ago

Study highlights increased risk of second cancers among breast cancer survivors

6 hours ago

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

7 hours ago

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

7 hours ago

A closed-loop drug-delivery system could improve chemotherapy

8 hours ago

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

8 hours ago

Solving the riddle of the sphingolipids in coronary artery disease

8 hours ago

New AI technology estimates brain age using low-cost EEG device

8 hours ago

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

10 hours ago

Trials reveal that internet-based conversations help sustain brain function in older adults

10 hours ago

Load comments (1)

ChatGPT can (almost) pass the US Medical Licensing Exam

Link between depression and cardiovascular disease explained: They partly develop from same gene module

Study highlights increased risk of second cancers among breast cancer survivors

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

A closed-loop drug-delivery system could improve chemotherapy

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

Solving the riddle of the sphingolipids in coronary artery disease

New AI technology estimates brain age using low-cost EEG device

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

Trials reveal that internet-based conversations help sustain brain function in older adults

ChatGPT found to be capable of passing exams for MBA and Medical Licensing Exam

ChatGPT bot passes US law school exam

Top French university bans students from using ChatGPT

Colombian judge uses ChatGPT in ruling

ChatGPT maker fields tool for spotting AI-written text

ChatGPT bot 'for professional use' on the way

New AI technology estimates brain age using low-cost EEG device

New device improves stem cell generation and chance for accessible Alzheimer's cell therapy

New algorithm could provide early warning for asthma attacks

A flexible microdisplay that can monitor brain activity in real-time during brain surgery

Study finds AI can develop treatments to prevent 'superbugs'

Experimental strategy is the first to tackle fibrosis and scarring at the cellular level

Phys.org

Tech Xplore

Science X

ChatGPT can (almost) pass the US Medical Licensing Exam

Link between depression and cardiovascular disease explained: They partly develop from same gene module

Study highlights increased risk of second cancers among breast cancer survivors

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

A closed-loop drug-delivery system could improve chemotherapy

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

Solving the riddle of the sphingolipids in coronary artery disease

New AI technology estimates brain age using low-cost EEG device

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

Trials reveal that internet-based conversations help sustain brain function in older adults

Related Stories

ChatGPT found to be capable of passing exams for MBA and Medical Licensing Exam

ChatGPT bot passes US law school exam

Top French university bans students from using ChatGPT

Colombian judge uses ChatGPT in ruling

ChatGPT maker fields tool for spotting AI-written text

ChatGPT bot 'for professional use' on the way

Recommended for you

New AI technology estimates brain age using low-cost EEG device

New device improves stem cell generation and chance for accessible Alzheimer's cell therapy

New algorithm could provide early warning for asthma attacks

A flexible microdisplay that can monitor brain activity in real-time during brain surgery

Study finds AI can develop treatments to prevent 'superbugs'

Experimental strategy is the first to tackle fibrosis and scarring at the cellular level

Newsletter sign up

Donate and enjoy an ad-free experience