March 14, 2023 report

AI language models open a potential Pandora's box of medical research fraud

by Justin Jackson , Medical Xpress

chatbot — Credit: Unsplash/CC0 Public Domain

Medical student and researcher Faisal Elali of the State University of New York Downstate Health Sciences University and medical scribe and researcher Leena Rachid from the New York-Presbyterian/Weill Cornell Medical Center wanted to see if artificial intelligence could write a fabricated research paper and then investigate how best to detect it.

Artificial intelligence is an increasingly valuable and vital part of scientific research. It is used as a tool to analyze complicated data sets, but it is never used to generate the actual paper for publication. AI-generated research papers, on the other hand, can look convincing even when based on an entirely fabricated study. But exactly how convincing?

In a paper published in the open-access journal Patterns, the research duo demonstrated the feasibility of fabricating a research paper using ChatGPT, an AI-based language model. Simply by asking, they were able to have ChatGPT produce a number of well-written, entirely made-up abstracts. A hypothetical fraudster could then submit these fake abstracts to multiple journals seeking publication. If accepted, the same process could be used to write an entire study with false data, nonexistent participants and meaningless results. However, it could appear legitimate, especially if the subject is particularly abstract or not screened by an expert in the specific field.

In a previous experiment cited in the current paper, humans were given both human-created and AI-generated abstracts to consider. In that experiment, humans incorrectly identified 32% of the AI-generated research abstracts as real and 14% of the human-written abstracts as fake.

The current research team decided to test their ChatGPT fabricated study against three online AI detectors. The texts were overwhelmingly identified as AI-generated, suggesting the adoption of AI detection tools by journals could be a successful diverter of fraudulent applications. However, when they took the same text and ran it through a free, online, AI-powered rephrasing tool first—the consensus unanimously flipped to "likely human," suggesting we need better AI detection tools.

Actual science is hard work, and communicating the details of that work is a crucial aspect of science requiring substantial effort. But any mostly hairless ape can string sensible sounding words together given enough time and coffee—as the writer of this article can firmly attest. Creating a fake study with enough detail to seem credible would take tremendous effort, requiring hours of researching how best to sound believable, and might be too tedious a task for someone interested in malicious mischief. With AI completing the task in minutes, that mischief could become an entirely achievable objective. As the researchers point out in their paper, that mischief could have terrible consequences.

They give an example of a legitimate study that supports the use of drug A over drug B for treating a medical condition. Now, suppose a fabricated study makes the opposite claim and is not detected (as a side note, even if it is detected, clawing back citations and reprints of retracted studies is notoriously difficult). It could impact subsequent meta-analyses and systematic reviews of these studies—studies that guide health care policies, standards of care and clinical recommendations.

Beyond the simple mischief motive, the authors of the paper point to the pressure on medical professionals to quickly produce a high volume of publications to gain research funding or entry into higher career positions. In part, they point out that the United States Medical Licensing Examination recently switched from a graded exam to a pass/fail model, meaning ambitious students rely more heavily on published research to distinguish them from the pack. This raises the stakes for a trustworthy AI detection system to remove potentially fraudulent medical research that could pollute the publishing environment—or worse still, practitioners who submit fraudulent papers from practicing on patients.

The goal of AI language models has long been to produce texts that are indistinguishable from human text. That we need AI that can detect when a human is using AI to produce fraudulent work indistinguishable from reality should not come as a surprise. What might be surprising is just that we may need it so soon.

More information: Faisal R. Elali et al, AI-generated research paper fabrication and plagiarism in the scientific community, Patterns (2023). DOI: 10.1016/j.patter.2023.100706

Journal information: Patterns

Citation: AI language models open a potential Pandora's box of medical research fraud (2023, March 14) retrieved 24 April 2024 from https://medicalxpress.com/news/2023-03-ai-language-potential-pandora-medical.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

ChatGPT writes convincing fake scientific abstracts that fool reviewers in study

193 shares

Feedback to editors

Study highlights increased risk of second cancers among breast cancer survivors

36 minutes ago

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

1 hour ago

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

1 hour ago

A closed-loop drug-delivery system could improve chemotherapy

3 hours ago

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

3 hours ago

Solving the riddle of the sphingolipids in coronary artery disease

3 hours ago

New AI technology estimates brain age using low-cost EEG device

3 hours ago

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

4 hours ago

Trials reveal that internet-based conversations help sustain brain function in older adults

4 hours ago

Study finds that a dash of exercise can help students focus and enjoy university lectures

4 hours ago

Load comments (0)

AI language models open a potential Pandora's box of medical research fraud

Study highlights increased risk of second cancers among breast cancer survivors

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

A closed-loop drug-delivery system could improve chemotherapy

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

Solving the riddle of the sphingolipids in coronary artery disease

New AI technology estimates brain age using low-cost EEG device

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

Trials reveal that internet-based conversations help sustain brain function in older adults

Study finds that a dash of exercise can help students focus and enjoy university lectures

ChatGPT writes convincing fake scientific abstracts that fool reviewers in study

ChatGPT maker fields tool for spotting AI-written text

Real or fake text? We can learn to spot the difference

Exploring how to add hidden electronic watermarks to works written by AI systems

We pitted ChatGPT against tools for detecting AI-written text, and the results are troubling

Human writer or AI? Scholars build a detection tool

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

New algorithm could provide early warning for asthma attacks

New potential avenues for cancer therapies through RNA-binding proteins

Study finds AI can develop treatments to prevent 'superbugs'

New study uses AI to predict malaria outbreaks in South Asia

Researchers develop deep-learning model capable of predicting cardiac arrhythmia 30 minutes before it happens

Phys.org

Tech Xplore

Science X

AI language models open a potential Pandora's box of medical research fraud

Study highlights increased risk of second cancers among breast cancer survivors

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

A closed-loop drug-delivery system could improve chemotherapy

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

Solving the riddle of the sphingolipids in coronary artery disease

New AI technology estimates brain age using low-cost EEG device

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

Trials reveal that internet-based conversations help sustain brain function in older adults

Study finds that a dash of exercise can help students focus and enjoy university lectures

Related Stories

ChatGPT writes convincing fake scientific abstracts that fool reviewers in study

ChatGPT maker fields tool for spotting AI-written text

Real or fake text? We can learn to spot the difference

Exploring how to add hidden electronic watermarks to works written by AI systems

We pitted ChatGPT against tools for detecting AI-written text, and the results are troubling

Human writer or AI? Scholars build a detection tool

Recommended for you

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

New algorithm could provide early warning for asthma attacks

New potential avenues for cancer therapies through RNA-binding proteins

Study finds AI can develop treatments to prevent 'superbugs'

New study uses AI to predict malaria outbreaks in South Asia

Researchers develop deep-learning model capable of predicting cardiac arrhythmia 30 minutes before it happens

Newsletter sign up

Donate and enjoy an ad-free experience