
GPT-4 with vision has poor accuracy for image-based radiology questions

The large language model GPT-4 with vision (GPT-4V) has high accuracy for text-only radiology questions, but much lower accuracy for image-based questions, according to a study published online Sept. 3 in Radiology.

Nolan Hayden, M.D., from Henry Ford Health in Detroit, and colleagues examined the performance of GPT-4V on radiology in-training examination questions to gauge the model's baseline knowledge in radiology. The September 2023 release of GPT-4V was assessed using 386 retired questions (189 image-based and 197 text-based) from the American College of Radiology Diagnostic Radiology In-Training Examinations; 377 questions were unique.

The researchers found that GPT-4V answered 65.3 percent of the unique questions correctly, with significantly higher accuracy observed on text-only versus image-based questions (81.5 versus 47.8 percent). For text-based questions, accuracy differed between prompts: chain-of-thought prompting outperformed long instruction, basic prompting, and the original prompting style by 6.1, 6.8, and 8.9 percent, respectively. For image-based questions, no differences were seen between prompts.
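The article does not reproduce the prompts used in the study, but the distinction between a basic prompt and a chain-of-thought prompt can be illustrated with a rough sketch. In the Python example below, the model name, prompt wording, and placeholder question are illustrative assumptions, not the authors' materials.

```python
# Minimal sketch of "basic" vs. "chain-of-thought" prompting styles.
# Assumptions: the prompt wording, placeholder question, and model name are
# illustrative only; they are not the prompts or model snapshot from the study.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

question = "<text of a multiple-choice radiology in-training question>"

# Basic style: ask for the answer directly.
basic_prompt = f"{question}\nAnswer with the single best option."

# Chain-of-thought style: ask the model to reason step by step before answering.
cot_prompt = (
    f"{question}\nThink through the relevant findings step by step, "
    "then state the single best option on the last line."
)

for label, prompt in [("basic", basic_prompt), ("chain-of-thought", cot_prompt)]:
    response = client.chat.completions.create(
        model="gpt-4o",  # placeholder; the study evaluated the September 2023 GPT-4V release
        messages=[{"role": "user", "content": prompt}],
    )
    print(label, "->", response.choices[0].message.content)
```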

"We found that while GPT-4V shows relatively good performance on text-based questions, it shows deficits in accurately interpreting key radiologic images. This highlights the model's limitations in visual analysis," the authors write. "We also noted an alarming tendency for GPT-4V to provide correct diagnoses based on incorrect image interpretations, which could have significant clinical implications."

More information: Nolan Hayden et al, Performance of GPT-4 with Vision on Text- and Image-based ACR Diagnostic Radiology In-Training Examination Questions, Radiology (2024). DOI: 10.1148/radiol.240153

Francis Deng, Multimodal Models Are Still a Novice at Radiology Vision, Radiology (2024). DOI: 10.1148/radiol.242286

Journal information: Radiology

Copyright © 2024 HealthDay. All rights reserved.

Citation: GPT-4 with vision has poor accuracy for image-based radiology questions (2024, September 8) retrieved 8 September 2024 from https://medicalxpress.com/news/2024-09-gpt-vision-poor-accuracy-image.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without written permission. The content is provided for information purposes only.
