Identifying correlations in electronic patient records

August 25, 2011

A new study demonstrates how text mining of electronic health records can be used to create medical term profiles of patients, which can be used both to identify co-occurrence of diseases and to cluster patients into groups with highly similar clinical features. The study, carried out in Denmark by a multi-disciplinary group of bioinformaticians, systems biologists and clinicians, will be published in the open-access journal PLoS Computational Biology on 25th August 2011.

Health records contain detailed phenotypic information on the clinical profile of each individual patient; however, a large part of the clinical features are described in free text produced by staff often covering many years of hospitalization.

"Using our text mining approach on the free text in the records, we identified roughly ten times as many medical terms characterizing each patient as were manually included by the . Worldwide, the manually inserted medical terms in medical records are heavily biased by local practice and billing purposes. Using our method we obtained a much more fine-grained clinical characterization of each patient, which ultimately also may be very valuable for choosing personalized treatment regimes", says Professor Søren Brunak from the Technical University of Denmark and the University of Copenhagen who led the team behind the research project.

The team used the "International Classification of Disease" terminology, maintained by the WHO as a controlled vocabulary, as the basis for the analysis. "The fact that terminologies like ICD have been translated word by word between languages makes it possible in principle to use the same term profiles across language barriers and combine cohorts across countries" says author Professor Lars Juhl Jensen from the University of Copenhagen.

The research group identified a large number of diseases and symptoms which co-occur much more than expected when compared to the individual frequencies of the diseases. The group subsequently mapped these correlations to the genetic level by investigating gene overlaps in protein interaction networks already linked to the individual diseases. "The aim here is to discover a possible genetic cause behind the disease correlations observed, thus interfacing the electronic patient record data directly to the DNA sequencing of human individuals", says Brunak.

Explore further: Electronic medical records speed genetic health studies

More information: Roque FS, Jensen PB, Schmock H, Dalgaard M, Andreatta M, et al. (2011) Using Electronic Patient Records to Discover Disease Correlations and Stratify Patient Cohorts. PLoS Comput Biol 7(8): e1002141. doi:10.1371/journal.pcbi.1002141

Related Stories

Electronic medical records speed genetic health studies

April 20, 2011
Recruiting thousands of patients to collect health data for genetic clues to disease is expensive and time consuming. But that arduous process of collecting data for genetic studies could be faster and cheaper by instead ...

Electronic medical record text search tool shows promise for identifying postoperative complications

August 23, 2011
Use of natural language processing, such as in the form of free-text searches of electronic medical records (EMRs) of clinical and progress notes of patients performed better at identifying postoperative surgical complications ...

Recommended for you

Exploring the potential of human echolocation

June 25, 2017
People who are visually impaired will often use a cane to feel out their surroundings. With training and practice, people can learn to use the pitch, loudness and timbre of echoes from the cane or other sounds to navigate ...

Team eradicates hepatitis C in 10 patients following lifesaving transplants from infected donors

April 30, 2017
Ten patients at Penn Medicine have been cured of the Hepatitis C virus (HCV) following lifesaving kidney transplants from deceased donors who were infected with the disease. The findings point to new strategies for increasing ...

'bench to bedside to bench': Scientists call for closer basic-clinical collaborations

March 24, 2017
In the era of genome sequencing, it's time to update the old "bench-to-bedside" shorthand for how basic research discoveries inform clinical practice, researchers from The Jackson Laboratory (JAX), National Human Genome Research ...

The ethics of tracking athletes' biometric data

January 18, 2017
(Medical Xpress)—Whether it is a FitBit or a heart rate monitor, biometric technologies have become household devices. Professional sports leagues use some of the most technologically advanced biodata tracking systems to ...

Financial ties between researchers and drug industry linked to positive trial results

January 18, 2017
Financial ties between researchers and companies that make the drugs they are studying are independently associated with positive trial results, suggesting bias in the evidence base, concludes a study published by The BMJ ...

Best of Last Year – The top Medical Xpress articles of 2016

December 23, 2016
(Medical Xpress)—It was a big year for research involving overall health issues, starting with a team led by researchers at the UNC School of Medicine and the National Institutes of Health who unearthed more evidence that ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.