January 10, 2024

Creating medical exam questions with ChatGPT

GPT — Credit: Unsplash/CC0 Public Domain

For a recent study, UKB researchers created two sets of 25 multiple-choice questions (MCQs), each with five possible answers, one of which was correct. The first set of questions was written by an experienced medical lecturer; the second set was created by ChatGPT. 161 students answered all questions in random order. For each question, students also indicated whether they thought it was created by a human or by ChatGPT.

Matthias Laupichler, one of the study authors and research associate at the Institute for Medical Didactics at the UKB, explains, "We were surprised that the difficulty of human-generated and ChatGPT-generated questions was virtually identical. Even more surprising for us is that the students could not correctly identify the question's origin in almost half of the cases."

"Although the results obviously need to be replicated in further studies, the automated generation of exam questions using ChatGPT and co. appears to be a promising tool for medical studies."

His colleague and co-author of the study, Johanna Rother, adds, "Lecturers can use ChatGPT to generate ideas for exam questions, which are then checked and, if necessary, revised by the lecturers. In our opinion, however, students in particular benefit from the automated generation of medical practice questions, as it has long been known that self-testing one's own knowledge is very beneficial for learning."

Tobias Raupach, Director of the Institute of Medical Didactics, continues, "We knew from previous studies that language models such as ChatGPT can answer the questions in medical state examinations. We have now shown for the first time that the software can also be used to write new questions that hardly differ from those of experienced teachers."

Tizian Kaiser, who is studying human medicine in his seventh semester, comments, "When working on the mock exam, I was quite surprised at how difficult it was for me to tell the questions apart. My approach was to differentiate between the questions based on their length, the complexity of their sentence structure, and the difficulty of their content."

"But to be honest, in some situations, I simply had to guess, and the evaluation showed that I could barely differentiate between them. This leads me to the conviction that a meaningful knowledge query, as in this exam, is also possible exclusively through questions posed by the AI."

He is convinced that ChatGPT has great potential for student learning. It allows students to repeat what they have learned in different ways and in different ways again and again.

"There is the option of being quizzed by the AI on predefined topics, having mock exams designed, or simulating oral exams in writing. The repetition of the material is thus tailored to the exam concept, and the training possibilities are endless," says the study participant, while also qualifying, "However, I would only use Chat-GPT for this purpose and not beforehand in the learning process, in which the study topics have to be worked through and summarized."

"Because Chat-GPT is excellent for repetition, I fear errors can occur when preparing learning content. I wouldn't notice these errors without a prior overview of the topic."

It is known from other studies that regular testing—even and especially without grading—helps students to remember learning content more sustainably. Such tests can now be created with little effort. However, the current study should first be transferred to other contexts (e.g., other subjects, semesters, and countries), and it should be investigated whether ChatGPT can also write questions other than the multiple choice questions commonly used in medicine.

The research is published in Academic Medicine.

More information: Matthias Carl Laupichler et al, Large Language Models in Medical Education: Comparing ChatGPT- to Human-Generated Exam Questions, Academic Medicine (2024). DOI: 10.1097/ACM.0000000000005626

Journal information: Academic Medicine

Provided by University Hospital Bonn

Citation: Creating medical exam questions with ChatGPT (2024, January 10) retrieved 17 July 2024 from https://medicalxpress.com/news/2024-01-medical-exam-chatgpt.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Study finds AI language model failed to produce appropriate questions, answers for medical school exam

0 shares

Feedback to editors

Researchers identify the 'broken gate' causing unstoppable brain signals in severe childhood epilepsy

4 minutes ago

When the brain speaks, the heart feels it—the link between the brain's reward system and acute myocardial infarction

18 minutes ago

Llama nanobodies: New therapy can neutralize a wide variety of HIV-1 strains

19 minutes ago

New study finds cell donor's socioeconomic status shapes cancer treatment outcomes

41 minutes ago

Distinct signaling pathway identified as key driver for epithelial cancer development

55 minutes ago

New gene therapy for muscular dystrophy offers hope

1 hour ago

Immune cells monitor blood platelet maturation in bone marrow, researchers discover

1 hour ago

Scientists define new type of memory loss in older adults

1 hour ago

Boost in infant genetics research could change lives, say researchers

1 hour ago

Scientists develop first bone marrow model that supports human stem cells

1 hour ago

Load comments (0)

Creating medical exam questions with ChatGPT

Researchers identify the 'broken gate' causing unstoppable brain signals in severe childhood epilepsy

When the brain speaks, the heart feels it—the link between the brain's reward system and acute myocardial infarction

Llama nanobodies: New therapy can neutralize a wide variety of HIV-1 strains

New study finds cell donor's socioeconomic status shapes cancer treatment outcomes

Distinct signaling pathway identified as key driver for epithelial cancer development

New gene therapy for muscular dystrophy offers hope

Immune cells monitor blood platelet maturation in bone marrow, researchers discover

Scientists define new type of memory loss in older adults

Boost in infant genetics research could change lives, say researchers

Scientists develop first bone marrow model that supports human stem cells

Study finds AI language model failed to produce appropriate questions, answers for medical school exam

ChatGPT shows poor performance in answering drug-related questions

ChatGPT bot passes US law school exam

ChatGPT scores nearly 50% on board certification practice test for ophthalmology, study shows

ChatGPT is still no match for humans when it comes to accounting

ChatGPT outscores med students on complex clinical exam questions

New study finds cell donor's socioeconomic status shapes cancer treatment outcomes

Scientists discover switching off inflammatory protein leads to longer, healthier lifespans in mice

Machine learning helps define new subtypes of Parkinson's disease

World-first international guidelines weeds-out potentially critical scientific fraud

A treatment for metastasis? Using ferroptosis to attack migrating cancer cells

Study shows AI tool successfully responds to patient questions in electronic health record

Phys.org

Tech Xplore

Science X

Creating medical exam questions with ChatGPT

Researchers identify the 'broken gate' causing unstoppable brain signals in severe childhood epilepsy

When the brain speaks, the heart feels it—the link between the brain's reward system and acute myocardial infarction

Llama nanobodies: New therapy can neutralize a wide variety of HIV-1 strains

New study finds cell donor's socioeconomic status shapes cancer treatment outcomes

Distinct signaling pathway identified as key driver for epithelial cancer development

New gene therapy for muscular dystrophy offers hope

Immune cells monitor blood platelet maturation in bone marrow, researchers discover

Scientists define new type of memory loss in older adults

Boost in infant genetics research could change lives, say researchers

Scientists develop first bone marrow model that supports human stem cells

Related Stories

Study finds AI language model failed to produce appropriate questions, answers for medical school exam

ChatGPT shows poor performance in answering drug-related questions

ChatGPT bot passes US law school exam

ChatGPT scores nearly 50% on board certification practice test for ophthalmology, study shows

ChatGPT is still no match for humans when it comes to accounting

ChatGPT outscores med students on complex clinical exam questions

Recommended for you

New study finds cell donor's socioeconomic status shapes cancer treatment outcomes

Scientists discover switching off inflammatory protein leads to longer, healthier lifespans in mice

Machine learning helps define new subtypes of Parkinson's disease

World-first international guidelines weeds-out potentially critical scientific fraud

A treatment for metastasis? Using ferroptosis to attack migrating cancer cells

Study shows AI tool successfully responds to patient questions in electronic health record

Newsletter sign up

Donate and enjoy an ad-free experience