July 11, 2018

New informatics tool makes the most of genomic data

by University of Illinois at Urbana-Champaign

The rise of genomics, the shift from considering genes singly to collectively, is adding a new dimension to medical care; biomedical researchers hope to use the information contained in human genomes to make better predictions about individual health, including responses to therapeutic drugs. A new computational tool developed through a collaboration between the University of Illinois and the Mayo Clinic combines multiple types of genomic information to make stronger predictions about what genomic features are associated with specific drug responses.

The tool, described in Genome Research, was developed by members of KnowEnG, a Center of Excellence established by an NIH Big Data to Knowledge (BD2K) Initiative award to the University of Illinois in partnership with the Mayo Clinic. KnowEnG stands for Knowledge Engine for Genomics, representing the center's mission to develop analytical resources for biomedical work with genomic data. The Center is housed within the Carl R. Woese Institute for Genomic Biology at the University of Illinois.

"We all know treatment outcomes for complex diseases like cancers vary dramatically among individuals, from lacking of efficacy resulting in disease recurring to severe toxicity resulting in noncompliance in patients who cannot tolerate these life-saving drugs," said Liewei Wang, a professor of pharmacology at the Mayo Clinic. "Therefore, it is extremely important for us to understand better of how and why patients respond differently, so that we can truly individualize their therapies by choosing the right drug at the right dose."

The researchers' first step toward this goal was a large-scale data collection effort. They assembled a panel of lab-reared tumor cells derived from a diverse set of individuals, and exposed samples of those cells to one of a set of common anticancer drugs. This allowed them to quantify the drug responses of different genetic backgrounds in a directly comparable way.

Using these data, Mayo Clinic researchers wanted to ask what characteristics of cells from each individual helped determine their unique set of responses to the drugs tested. They collected data on the "expression" of every gene in the genome: how often each gene was being read by the cell and used to create the protein it encodes.

The team also wanted to look at where those differences in gene expression might come from. DNA sequences surrounding genes in the genome influence when genes are expressed. So do the actions of special proteins called transcription factors, which bind to DNA and make it easier or harder for genes to be read by cellular machinery. Finally, how different regions of the long DNA strands of the genome are coiled up, the "epigenetic state" of genomic DNA, also helps determine how likely a gene is to be expressed.

The team decided to collect data on all of these characteristics of their lines of cells. They had built a comprehensive dataset, but lacked something vital—an analytical tool that could use it to full advantage.

"There was no tool that would exploit all of these together," said Professor of Computer Science and Willett Faculty Scholar Saurabh Sinha, who co-directs the BD2K Center. "From the question came the data . . . then came our part, what do you do with it?"

Sinha and graduate student Casey Hanson developed an algorithm that takes in data on gene expression, genomic factors that help control gene expression, and resulting traits (such as drug response) and uses these to predict which genes are most important in determining the latter. They based their work on a tool they had previously developed named "Gene Expression in the Middle," or GENMi. Their new model, because of its ability to appropriately weight and integrate multiple sources of data, is named "probabilistic GENMi" or pGENMi.

"It's a more rigorous tool; it should automatically handle how to weight different aspects of the data when it's trying to look at many different types of data to reach a common conclusion," Sinha said. "Methodologically, that was the most challenging part, the development of the probabilistic model."
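The idea of letting a model decide how much weight each type of regulatory evidence deserves can be illustrated with a small sketch. This is not pGENMi itself; it is a toy likelihood comparison built on assumptions that do not come from the article: each gene has a p-value for its expression-to-drug-response association, p-values of truly associated genes follow an assumed Beta(ALPHA, 1) density while the rest are uniform, regulatory evidence raises or lowers a gene's prior through a logistic function, and a coarse grid search stands in for proper model fitting. All names and data values below are hypothetical.

```python
import itertools
import math

ALPHA = 0.2  # assumed shape of the Beta(ALPHA, 1) density for associated genes
GRID = (-4.0, -2.0, -1.0, 0.0, 1.0, 2.0, 4.0)  # coarse candidate values for each weight


def log_likelihood(pvalues, evidence, bias, weights):
    """Log-likelihood of the p-values under a two-component mixture.

    evidence[g] holds 0/1 flags, one per evidence type, for gene g.
    """
    total = 0.0
    for p, flags in zip(pvalues, evidence):
        # Regulatory evidence shifts the prior chance that the gene is associated.
        score = bias + sum(w * f for w, f in zip(weights, flags))
        prior = 1.0 / (1.0 + math.exp(-score))
        associated = ALPHA * p ** (ALPHA - 1.0)  # Beta(ALPHA, 1) density at p
        unassociated = 1.0                       # Uniform(0, 1) density at p
        total += math.log(prior * associated + (1.0 - prior) * unassociated)
    return total


def evidence_gain(pvalues, evidence):
    """Best evidence-aware log-likelihood minus best evidence-blind one."""
    n_types = len(evidence[0])
    # Evidence-blind model: every evidence weight is fixed at zero.
    blind = max(
        log_likelihood(pvalues, evidence, b, [0.0] * n_types) for b in GRID
    )
    # Evidence-aware model: each evidence type gets its own weight.
    aware = max(
        log_likelihood(pvalues, evidence, b, ws)
        for b in GRID
        for ws in itertools.product(GRID, repeat=n_types)
    )
    return aware - blind


# Made-up data: six genes, one evidence type (e.g. a transcription factor
# binding near the gene), for illustration only.
pvalues = [0.001, 0.002, 0.004, 0.5, 0.7, 0.9]
informative = [[1], [1], [1], [0], [0], [0]]    # evidence tracks the small p-values
uninformative = [[1], [0], [1], [0], [1], [0]]  # evidence unrelated to the p-values

print("gain with informative evidence:  ", evidence_gain(pvalues, informative))
print("gain with uninformative evidence:", evidence_gain(pvalues, uninformative))
```

In this toy setup, evidence that lines up with the strongly associated genes yields a larger gain than evidence that does not, which is one way a model of this general kind could rank regulators for follow-up experiments.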

Because this tool is the first of its kind, the team had to get creative to assess how well it was working. They had no prior standard of performance for comparison, and the results generated by pGENMi are the basis for further experimental work, not an endpoint.

"Our end result was testable predictions . . . a ranking of what experiments to do and verify that this transcription factor indeed has a role in regulating the response to that drug," Sinha said.

"In a lot of computer science and bioinformatics papers, there is a gold standard database to validate predictions against—but we didn't have the luxury of that," Hanson said. "We had to search a vast literature to try to find, among the myriad ways of doing so and stating that one has done so, experiments that [could] confirm our hypothesis." The team's mix of computer science and biological knowledge was what made this task possible.

Hanson and his coauthors examined whether the predictions generated by the algorithm included associations that were already confirmed by the studies he identified. The literature searches revealed examples in which transcription factors highlighted by pGENMi had been experimentally manipulated, resulting in changes in drug responsiveness. Many of the predictions generated by pGENMi were supported by previous work, making it likely that those not supported by prior work are novel but real associations.

"For example . . . we found a paper in which rapamycin [an anticancer drug] decreased GATA1 [a transcription factor's] binding with DNA. Another paper, we found that . . . rapamycin increased expression of a gene, ERCC1," Hanson said. The same paper linked the transcription factor, GATA1, to ERCC1's expression. Hanson noted that "our own experiments showed that knocking down GATA1 changed the sensitivity of cells to rapamycin," in agreement with the previous work.

To test pGENMi's results even further, the group selected transcription factors predicted to impact drug responsiveness, as well as several predicted to have little impact, and reduced their function in lab-grown cancer cells. For the majority of the transcription factors examined, these experimental results were consistent with pGENMi's predictions.

Although in this initial project pGENMi was used to explore the factors that influence the response of cancer cells to therapeutic drugs, its flexibility would allow for a wide range of applications.

"We have generated tools that can be used broadly by the research community. These tools will be open to anyone who might have the right data sets to both help generate hypotheses and also to help refine the algorithms," Wang said. "This is a perfect example of how expertise in complementary research areas, in this case, computational science and pharmacogenomics, come together to make a difference."

More information: Casey Hanson et al, Principled multi-omic analysis reveals gene regulatory mechanisms of phenotype variation, Genome Research (2018). DOI: 10.1101/gr.227066.117

Journal information: Genome Research

Provided by University of Illinois at Urbana-Champaign

Citation: New informatics tool makes the most of genomic data (2018, July 11) retrieved 17 July 2024 from https://medicalxpress.com/news/2018-07-informatics-tool-genomic.html
