April 3, 2024

Reliable emergency room translations might be job for humans, AI together

translate — Credit: Pixabay/CC0 Public Domain

While the garbled translation of a newspaper article in a foreign language may be nothing more than an annoyance, uses of machine translation technology extend to higher-stakes settings as well: In a hospital emergency room, incorrectly translated discharge instructions or medication protocols could have life-threatening consequences.

Researchers from the University of Maryland's Computational Linguistics and Information Processing (CLIP) Lab looked into this problem, studying data collected from English-to-Chinese machine translation systems used in emergency rooms at the University of California, San Francisco.

The paper is published in the journal Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

They found that neither an artificial intelligence tool to monitor translation quality nor more manual approaches could fully overcome errors—but that combining human and computerized abilities held promise for improving such systems.

For this study, the CLIP team reviewed data from 65 English-speaking physicians to evaluate two distinct methods for assessing the quality of machine-generated translations used for Chinese-speaking patients.

One group of physicians used a quality estimation tool—AI-driven software that can automatically predict the accuracy of a machine translation output. According to the researchers, this tool helped doctors rely on machine translation more appropriately by deciding to show "good" translations to patients overall. But the tool was not perfect; it failed to flag some critical errors that could harm the health of the patient.

A second set of doctors used a technique known as backtranslation, where the user retranslates the Chinese output using Google Translate to assess its English output. The researchers observed complementary trends for these doctors: backtranslation does not improve their ability to assess translation quality on average, but does help identify clinically critical errors that quality estimation tools fail to flag.

The CLIP team believes its study paves the way for future work in designing methods that combine the strengths of both methods tested, resulting in a human-centered evaluation design that can be used to further improve machine translation tools used in clinical settings.

"Our study confirms that lay users often trust AI systems even when they should not, and that the strategies that people develop on their own to decide whether to trust an output—such as backtranslation—can be misleading," said Marine Carpuat, an associate professor of computer science who co-authored the study.

"However, we show that AI techniques can also be used to provide feedback that helps people calibrate their trust in systems. We view this as a first step toward developing trustworthy AI."

Sweta Agrawal Ph.D. '23, a co-author on the study who is now a postdoctoral fellow at the Instituto de Telecomunicações in Portugal, said that the project has important implications for medical care and society at large.

"This work provides support for the usefulness of providing actionable feedback to users in high-risk scenarios," she said. "Moreover, the findings contribute to the ongoing research efforts to design reliable metrics, especially for critical domains like health care."

Other UMD co-authors included Ge Gao, an assistant professor of information studies and Yimin Xiao, a third-year information studies doctoral student; researchers from the University of California (UC) Berkeley, and UC San Francisco also numbered among the co-authors.

Carpuat and Gao both have appointments in the University of Maryland Institute for Advanced Computer Studies, which provides technical and administrative support for their work in the CLIP Lab.

Based on their findings, the researchers will develop new techniques to assist people in using these imperfect systems more effectively.

More information: Nikita Mehandru et al, Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (2023). DOI: 10.18653/v1/2023.emnlp-main.712

Provided by University of Maryland

Citation: Reliable emergency room translations might be job for humans, AI together (2024, April 3) retrieved 30 April 2024 from https://medicalxpress.com/news/2024-04-reliable-emergency-room-job-humans.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Strengthening the partnership between humans and AI: The case of translators

1 shares

Feedback to editors

Exercise programs benefit a wide range of long-term health conditions, finds health data analysis

19 minutes ago

Treatment-related pain may be 'socially contagious'

55 minutes ago

Gene expression analyses identify potential drivers of chronic allergic inflammation

1 hour ago

Researchers reveal a new approach for treating degenerative diseases

1 hour ago

Early gestational diabetes treatment shown to reduce birth complications, health costs for those at higher risk

2 hours ago

Omega-6 fatty acids could cut risk of bipolar disorder, study suggests

2 hours ago

Study finds Bcl6 protein is an important transcription factor for formation of certain dendritic cells

2 hours ago

Development in cancer treatment focuses on re-educating cells to combat resistance

2 hours ago

All women need mammograms beginning at age 40, expert panel says

2 hours ago

Staying fit boosts kids' mental health

2 hours ago

Load comments (0)

Reliable emergency room translations might be job for humans, AI together

Exercise programs benefit a wide range of long-term health conditions, finds health data analysis

Treatment-related pain may be 'socially contagious'

Gene expression analyses identify potential drivers of chronic allergic inflammation

Researchers reveal a new approach for treating degenerative diseases

Early gestational diabetes treatment shown to reduce birth complications, health costs for those at higher risk

Omega-6 fatty acids could cut risk of bipolar disorder, study suggests

Study finds Bcl6 protein is an important transcription factor for formation of certain dendritic cells

Development in cancer treatment focuses on re-educating cells to combat resistance

All women need mammograms beginning at age 40, expert panel says

Staying fit boosts kids' mental health

Strengthening the partnership between humans and AI: The case of translators

Facebook unveils machine learning translator for 100 languages

Faulty machine translations litter the web

Google translates doctor's orders into Spanish and Chinese with few significant errors

Study assesses the quality of AI literary translations by comparing them with human translations

Machine translation for cuneiform tablets

GPT-4, Google Gemini fall short in breast imaging classification, study finds

AI algorithms can determine how well newborns nurse, study shows

Using AI to improve diagnosis of rare genetic disorders

Researchers create an AI-powered digital imaging system to speed up cancer biopsy results

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

New algorithm could provide early warning for asthma attacks

Phys.org

Tech Xplore

Science X

Reliable emergency room translations might be job for humans, AI together

Exercise programs benefit a wide range of long-term health conditions, finds health data analysis

Treatment-related pain may be 'socially contagious'

Gene expression analyses identify potential drivers of chronic allergic inflammation

Researchers reveal a new approach for treating degenerative diseases

Early gestational diabetes treatment shown to reduce birth complications, health costs for those at higher risk

Omega-6 fatty acids could cut risk of bipolar disorder, study suggests

Study finds Bcl6 protein is an important transcription factor for formation of certain dendritic cells

Development in cancer treatment focuses on re-educating cells to combat resistance

All women need mammograms beginning at age 40, expert panel says

Staying fit boosts kids' mental health

Related Stories

Strengthening the partnership between humans and AI: The case of translators

Facebook unveils machine learning translator for 100 languages

Faulty machine translations litter the web

Google translates doctor's orders into Spanish and Chinese with few significant errors

Study assesses the quality of AI literary translations by comparing them with human translations

Machine translation for cuneiform tablets

Recommended for you

GPT-4, Google Gemini fall short in breast imaging classification, study finds

AI algorithms can determine how well newborns nurse, study shows

Using AI to improve diagnosis of rare genetic disorders

Researchers create an AI-powered digital imaging system to speed up cancer biopsy results

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

New algorithm could provide early warning for asthma attacks

Newsletter sign up

Donate and enjoy an ad-free experience