February 8, 2024

Evaluating the performance of AI-based large language models in radiation oncology

In a new study published in the journal AI in Precision Oncology, Nikhil Thaker, from Capital Health and Bayta Systems, and co-authors evaluated the performance of various large language models (LLMs), including OpenAI's GPT-3.5-turbo, GPT-4, and GPT-4-turbo; Meta's Llama-2 models; and Google's PaLM-2-text-bison. The LLMs were given a 300-question exam, and their answers were compared with the performance of radiation oncology trainees.

The results showed that OpenAI's GPT-4-turbo performed best, answering 74.2% of questions correctly, while all three Llama-2 models underperformed. The LLMs tended to excel in statistics but to underperform in clinical areas, with the exception of GPT-4-turbo, which performed comparably to upper-level radiation oncology trainees and better than lower-level trainees.

"Future research will need to evaluate the performance of models that are fine-tune trained in clinical oncology," concluded the investigators. "This study also underscores the need for rigorous validation of LLM-generated information against established medical literature and expert consensus, necessitating expert oversight in their application in medical education and practice."

"The study highlights the potential of generative AI to revolutionize radiation oncology education and practice. OpenAI's GPT-4-turbo demonstrates that AI can complement medical training, suggesting a future where AI aids in improving patient outcomes. It's essential, though, to validate these technologies rigorously and involve experts to ensure their reliable and effective use in health care," says Douglas Flora, MD, Editor-in-Chief of AI in Precision Oncology.

More information: Nikhil G. Thaker et al, Large Language Models Encode Radiation Oncology Domain Knowledge: Performance on the American College of Radiology Standardized Examination, AI in Precision Oncology (2024). DOI: 10.1089/aipo.2023.0007

Journal information: AI in Precision Oncology

Provided by Mary Ann Liebert, Inc

Citation: Evaluating the performance of AI-based large language models in radiation oncology (2024, February 8) retrieved 28 April 2024 from https://medicalxpress.com/news/2024-02-ai-based-large-language-oncology.html

