Identifying correlations in electronic patient records

A new study demonstrates how text mining of electronic health records can be used to create medical term profiles of patients, which can be used both to identify co-occurrence of diseases and to cluster patients into groups with highly similar clinical features. The study, carried out in Denmark by a multi-disciplinary group of bioinformaticians, systems biologists and clinicians, will be published in the open-access journal PLoS Computational Biology on 25th August 2011.

Health records contain detailed phenotypic information on the clinical profile of each individual patient; however, a large part of the clinical features are described in free text produced by staff often covering many years of hospitalization.

"Using our text mining approach on the free text in the records, we identified roughly ten times as many medical terms characterizing each patient as were manually included by the . Worldwide, the manually inserted medical terms in medical records are heavily biased by local practice and billing purposes. Using our method we obtained a much more fine-grained clinical characterization of each patient, which ultimately also may be very valuable for choosing personalized treatment regimes", says Professor Søren Brunak from the Technical University of Denmark and the University of Copenhagen who led the team behind the research project.

The team used the "International Classification of Disease" terminology, maintained by the WHO as a controlled vocabulary, as the basis for the analysis. "The fact that terminologies like ICD have been translated word by word between languages makes it possible in principle to use the same term profiles across language barriers and combine cohorts across countries" says author Professor Lars Juhl Jensen from the University of Copenhagen.

The research group identified a large number of diseases and symptoms which co-occur much more than expected when compared to the individual frequencies of the diseases. The group subsequently mapped these correlations to the genetic level by investigating gene overlaps in protein interaction networks already linked to the individual diseases. "The aim here is to discover a possible genetic cause behind the disease correlations observed, thus interfacing the electronic patient record data directly to the DNA sequencing of human individuals", says Brunak.

More information: Roque FS, Jensen PB, Schmock H, Dalgaard M, Andreatta M, et al. (2011) Using Electronic Patient Records to Discover Disease Correlations and Stratify Patient Cohorts. PLoS Comput Biol 7(8): e1002141. doi:10.1371/journal.pcbi.1002141

add to favorites email to friend print save as pdf

Related Stories

Electronic medical records speed genetic health studies

Apr 20, 2011

Recruiting thousands of patients to collect health data for genetic clues to disease is expensive and time consuming. But that arduous process of collecting data for genetic studies could be faster and cheaper by instead ...

Recommended for you

Hormonal therapy for transsexualism safe and effective

8 hours ago

Hormonal therapy for transsexual patients is safe and effective, a multicenter European study indicates. The results will be presented Saturday at The Endocrine Society's 95th Annual Meeting in San Francisco.

Royalty Pharma lets Elan takeover bid expire

12 hours ago

Royalty Pharma has let its latest takeover bid for Irish drugmaker Elan lapse as it decided against pressing ahead with a court challenge of a requirement that it withdraw the offer.

FDA approves new silicone breast implants

Jun 17, 2013

(HealthDay)—MemoryShape breast implants have been approved by the U.S. Food and Drug Administration for breast augmentation in women 22 and older, and for breast reconstruction, the FDA said Friday.

User comments

More news stories

Study suggests new approach to fight lung cancer

Recent research has shown that cancer cells have a much different – and more complex – metabolism than normal cells. Now, scientists at The University of Texas at Dallas have found that exploiting these differences might ...

Getting enough sleep could help prevent type 2 diabetes

Men who lose sleep during the work week may be able to lower their risk of developing Type 2 diabetes by getting more hours of sleep, according to Los Angeles Biomedical Research Institute (LA BioMed) research findings presented ...