This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

peer-reviewed publication

trusted source

proofread

Ehrapy: A new open-source tool for analyzing complex health data

Ehrapy: A new open-source tool for analyzing complex health data
Logo ehrapy. Credit: Meshal Ansari

Led by Helmholtz Munich, scientists have developed an accessible software solution specifically designed for the analysis of complex medical health data. The open-source software called "ehrapy" enables researchers to structure and systematically examine large, heterogeneous datasets. The software is available to the global scientific community to use and further develop.

Ehrapy is intended to fill a critical gap in the analysis of health data, says Lukas Heumos, one of the main developers and a scientist at the Institute of Computational Biology at Helmholtz Munich and the Technical University of Munich (TUM). "Until now, there have been no standardized tools for systematically and efficiently analyzing diverse and complex medical data. We've changed that with ehrapy," said Heumos.

The team behind ehrapy comes from biomedical research and has extensive experience in analyzing complex scientific datasets. "The health care sector faces similar challenges in data analysis as those working in laboratories," noted Heumos at the start of the ehrapy project.

The study was published in Nature Medicine.

Exploratory approach—hypothesis-free analysis

Together with many other contributors, Heumos has used his expertise in scientific software development to create a solution for analyzing patient data. Heumos said, "Ehrapy can uncover new patterns and generate insights without needing to analyze the data based on a specific assumption or hypothesis." This exploratory approach, says Heumos, is a unique feature of ehrapy.

Ehrapy allows researchers to sort, group, and analyze large, heterogeneous, and complex datasets without any pre-existing hypotheses. This opens up new insights that can then be explored further.

Heumos explained, "The exploratory approach brings fresh perspectives to health data analysis. Due to their complexity and heterogeneity, these data are often not analyzed as effectively as they could be." Ehrapy thus opens new avenues for making health data more useful for and practice.

The long-term goal: Routine use in clinical practice

Ehrapy was designed as from the beginning. "It was important to us to make the software available to the from day one," emphasized Heumos.

The software is available as a Python package on GitHub, an for software development, and can be used and further developed by researchers worldwide.

Currently, ehrapy focuses on efficiently and quickly analyzing research datasets, such as those stored in large health research centers. "Routine use in is a long-term goal, but for now, we are concentrating on providing the with a powerful tool," said Heumos.

In the future, the team plans to provide standardized databases for electronic health records (EHRs). These databases will enable better integration and analysis of large volumes of medical data. Additionally, this will facilitate the development of EHR atlases that can serve as reference datasets for contextualizing and annotating new datasets.

A long journey

"Ehrapy enables comprehensive data analysis across systems, which can be a key step for future AI systems in medicine. I therefore hope for a relatively quick adoption at various sites," says Prof. Fabian Theis, Director of the Institute of Computational Biology at Helmholtz Munich and TUM Professor. "Establishing such technologies in medicine is a lengthy process that can take decades. Our goal is to bridge the gap between and practical application in medicine."

Theis further explains that the development team is focusing on exploratory data analysis methods in a holistic form to more easily reveal hidden connections and added, "We are also trying to support academic and commercial players in the health care sector."

More information: Exploratory electronic health record analysis with ehrapy, Nature Medicine (2024). DOI: 10.1038/s41591-024-03214-0

Ehrapy on GitHub: https://github.com/theislab/ehrapy

Journal information: Nature Medicine
Citation: Ehrapy: A new open-source tool for analyzing complex health data (2024, September 12) retrieved 12 September 2024 from https://medicalxpress.com/news/2024-09-ehrapy-source-tool-complex-health.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Interpreting large-scale medical datasets: Generative model enables multi-scale representations of cells and samples

2 shares

Feedback to editors