Automatic adverse drug reaction extraction from electronic health records

Patients' electronic health records convey crucial information. The application of natural language processing techniques to these records may be an effective means of extracting information that may improve clinical decision making, clinical documentation and billing, disease prediction and the detection of adverse drug reactions. Adverse drug reactions are a major health problem, resulting in hospital re-admissions and even the death of thousands of patients. An automatic detection system can highlight said reactions in a document, summarize them and automatically report them.

In this context, the Basurto University Hospital and the Galdakao Hospital "were interested in creating a system that would use natural language processing techniques to analyze patient health records in order to automatically identify any adverse effects," explains the engineer Sara Santiso, who also holds a Ph.D. in Computer Science. After the hospitals contacted the IXA group at the UPV/EHU, several researchers started working to build a robust model with which to extract adverse drug reactions from electronic health records written in Spanish, based on clinical text mining.

To this end, "not only have we used techniques based on traditional machine learning algorithms, we have also explored deep learning techniques, reaching the conclusion that these are better able to detect adverse reactions," explains Santiso, one of the authors of the study. Machine learning and deep learning imitate the way the human brain learns, although they use different types of algorithms to do so.

Difficulties finding a corpus in Spanish

Santiso underscores the difficulties the team encountered when trying to find a large enough corpus with which to work: "At first, we started with only a few health records, because they are difficult to obtain due to privacy issues; you have to sign confidentiality agreements in order to work with them," she explains. The research team has found that "having a larger corpus helps the system learn the examples contained in it more effectively, thereby giving rise to better results."

Through this study, which was carried out with health records written in Spanish, "we are contributing to closing the gap between clinical text mining in English and that carried out in other languages, which accounts for less than 5% of all papers published in the field. Indeed, the extraction of clinical information is not yet fully developed due (among other things) to the potential for extracting information from other hospitals and in other languages," claims the researcher.

Although natural language processing has been of inestimable help in the computer-aided detection of adverse drug reactions, there is still room for improvement: "To date, systems have tended to focus on detecting drug-disease pairs located in the same sentence. However, health records contain implicit information that might reveal underlying relations (for example, information about antecedents might be relevant for determining the causes of an adverse event). In other words, future research should strive to detect both explicitly and implicitly-stated inter-sentence relationships." Moreover, another issue that should be the subject of future research is the lack of electronic health records written in Spanish.

More information: Sara Santiso et al, Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants, Computer Methods and Programs in Biomedicine (2020). DOI: 10.1016/j.cmpb.2020.105891

Provided by University of the Basque Country

Automatic adverse drug reaction extraction from electronic health records

Difficulties finding a corpus in Spanish

'Trending' doctors' notes could help hospitals predict COVID-19 surges

Researchers explore causal machine learning, a new advancement for AI in health care

Using AI to trace the origins of metastatic cancer cells

Geneticists develop world's first bioinformatic tool to identify amyloids consisting of multiple copies of same protein

Large genomic study finds tri-ancestral origins for Japanese population

New heart disease calculator could save lives by identifying high-risk patients missed by current tools

Researchers reduce bias in pathology AI algorithms and enhance accuracy using foundation models

AI tool predicts responses to cancer therapy using information from each cell of the tumor

Researchers develop a new way to safely boost immune cells to fight cancer

New compound from blessed thistle may promote functional nerve regeneration

New research defines specific genomic changes associated with the transmissibility of the mpox virus

New study confirms community pharmacies can help people quit smoking

Researchers discover glial hyper-drive for triggering epileptic seizures

Deeper dive into the gut microbiome shows changes linked to body weight

A new therapeutic target for traumatic brain injury

Dozens of COVID virus mutations arose in man with longest known case, research finds

Analyzing the progression in retinal thickness could predict cognitive progression in Parkinson's patients

Researchers discover dynamic DNA structures that regulate the formation of memory

Potential modifier gene identified as cause of ciliary pathology in retinitis pigmentosa patient

Donate and enjoy an ad-free experience

Automatic adverse drug reaction extraction from electronic health records

Difficulties finding a corpus in Spanish

Let us know if there is a problem with our content

Thank you for taking time to provide your feedback to the editors

Donate and enjoy an ad-free experience

Share article

E-MAIL THE STORY