
Trust your doctor: Study shows human medical professionals are more reliable than artificial intelligence tools

Artificial intelligence (AI) tools are not reliable enough to be substituted for medical professionals in providing accurate medical information, research in the American Journal of Preventive Medicine shows. Credit: American Journal of Preventive Medicine (2024). DOI: 10.1016/j.amepre.2024.02.006

When looking for medical information, people can use web search engines or large language models (LLMs) like ChatGPT-4 or Google Bard. However, these artificial intelligence (AI) tools have their limitations and can sometimes generate incorrect advice or instructions. A new study in the American Journal of Preventive Medicine assesses the accuracy and reliability of AI-generated advice against established medical standards and finds that LLMs are not trustworthy enough to replace human medical professionals just yet.
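
To make the comparison concrete, here is a minimal sketch of how such a question can be posed to a model programmatically, using OpenAI's publicly documented Python client; the sample question is illustrative only and is not one of the study's prompts.

```python
# Minimal sketch of querying an LLM for medical information.
# Requires the official client: pip install openai
# Assumes OPENAI_API_KEY is set in the environment.
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4",  # the study evaluated ChatGPT-4; the model name here is illustrative
    messages=[
        {"role": "user",
         "content": "What lifestyle changes are recommended to lower blood pressure?"},
    ],
)
print(response.choices[0].message.content)  # the answer still needs expert verification
```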

Andrei Brateanu, MD, Department of Internal Medicine, Cleveland Clinic Foundation, says, "Web search engines can provide access to reputable sources of information, offering accurate details on a variety of topics, including general medical questions.

"Similarly, LLMs can offer that may look very accurate and convincing when, in fact, it may be occasionally inaccurate. Therefore, we thought it would be important to compare the answers from LLMs with data obtained from recognized medical organizations. This comparison helps validate the reliability of the medical information by cross-referencing it with trusted health care data."

In the study, 56 questions were posed to ChatGPT-4 and Bard, and their responses were evaluated by two physicians for accuracy, with a third resolving any disagreements. Final assessments found 28.6% of ChatGPT-4's answers accurate, 28.6% inaccurate, and 42.8% partially accurate but incomplete. Bard performed better, with 53.6% of answers accurate, 17.8% inaccurate, and 28.6% partially accurate.
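
For readers who want to check the arithmetic, the reported percentages correspond to whole-number counts out of 56 questions. The sketch below back-calculates those counts; the three-category rating lists are a reconstruction for illustration, not the study's actual data.

```python
from collections import Counter

# Per-question ratings reconstructed from the reported percentages
# (16/56, 24/56, 30/56, 10/56); illustrative lists, not the study's data.
chatgpt4 = ["accurate"] * 16 + ["inaccurate"] * 16 + ["partial"] * 24
bard     = ["accurate"] * 30 + ["inaccurate"] * 10 + ["partial"] * 16

def summarize(ratings):
    """Tally each rating category as a percentage of all questions."""
    n = len(ratings)
    return {cat: round(100 * count / n, 1) for cat, count in Counter(ratings).items()}

print("ChatGPT-4:", summarize(chatgpt4))  # {'accurate': 28.6, 'inaccurate': 28.6, 'partial': 42.9}
print("Bard:", summarize(bard))           # {'accurate': 53.6, 'inaccurate': 17.9, 'partial': 28.6}
```

(The article prints 42.8% and 17.8% where simple rounding of 24/56 and 10/56 gives 42.9% and 17.9%; the whole-number counts above are the closest fit to the published figures.)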

Dr. Brateanu explains, "All LLMs, including ChatGPT-4 and Bard, operate using complex mathematical algorithms. The fact that both models produced responses with inaccuracies or omitted crucial information highlights the ongoing challenge of developing AI tools that can provide dependable medical advice. This might come as a surprise, considering the advanced technology behind these models and their anticipated role in health care environments."

This research underscores the importance of being cautious and critical of medical information obtained from AI sources, reinforcing the need to consult health care professionals for accurate medical advice. For health care professionals, it points to the potential and limitations of using AI as a supplementary tool in providing care, and emphasizes the ongoing need for oversight and verification of AI-generated information.

Dr. Brateanu concludes, "AI tools should not be seen as substitutes for medical professionals. Instead, they can be considered as additional resources that, when combined with human expertise, can enhance the overall quality of information provided. As we incorporate AI technology into health care, it's crucial to ensure that the essence of health care continues to be fundamentally human."

More information: Joseph Kassab et al, Accuracy of Online Artificial Intelligence Models in Primary Care Settings, American Journal of Preventive Medicine (2024). DOI: 10.1016/j.amepre.2024.02.006

Provided by Elsevier
Citation: Trust your doctor: Study shows human medical professionals are more reliable than artificial intelligence tools (2024, April 2) retrieved 30 April 2024 from https://medicalxpress.com/news/2024-04-doctor-human-medical-professionals-reliable.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without written permission. The content is provided for information purposes only.
