Researchers create powerful new method to analyze genetic data

June 12, 2012

University of Texas Medical Branch at Galveston researchers have developed a powerful visual analytical approach to explore genetic data, enabling scientists to identify novel patterns of information that could be crucial to human health.

The method, which combines three different "bipartite visual representations" of genetic data, is described in an article to appear in the Journal of the American Medical Informatics Association (JAMIA). The work won a distinguished paper award when it was presented at the AMIA Summit on Translational Bioinformatics in March 2012.

In the paper, the authors use their technique to analyze data on genetic variations in humans known as single nucleotide polymorphisms, or SNPs. Among other things, the frequencies of particular SNPs are associated with an individual's ancestral origins; for the study, the researchers chose to examine SNP data from 60 individuals from Nigeria and 60 individuals from Utah.

"We selected SNPs that we already knew differentiated between the two groups, and then showed that our method can reveal more about the data than traditional methods," said UTMB associate professor Suresh Bhavnani, lead author on the JAMIA paper and a member of UTMB's Institute for Translational Sciences. "This is a fresh way of looking at [genetic data], a methodological contribution that we believe can help biologists and clinicians make better sense of a variety of [biomedical data]."

Like many kinds of biomedical data, Bhavnani said, datasets describing individuals and their SNPs are particularly suited to visual representations that are bipartite: that is, they simultaneously present two different classes of data. In the case of the Utah-Nigeria SNP data, Bhavnani and his colleagues started with what is known as a bipartite network visualization, an intricate computer-generated arrangement of colored dots and black, gray and white lines.

"In the bipartite network you see both the individuals and their genetic profiles simultaneously, and cognitively that's really important," Bhavnani said. "You can look at the individuals and know immediately which SNPs make them different from others, and conversely you can look at the SNPs to see how they are co-occurring, and with which individuals they are co-occurring. This rich representation enables you to quickly comprehend the complex bipartite relationships in the data."

The bipartite network visualization of the Utah-Nigeria individual-SNP data has distinct clusters on its left and right sides that correspond to the Utah and Nigerian subjects and SNPs. It also accurately portrays a genetic phenomenon called admixture, in which an individual possesses SNPs that are characteristic of individuals from Utah as well as from Nigeria. Admixed individuals are placed on the edges of their clusters, relatively close to the center of the visualization. The identification of admixed individuals and the implicated SNPs could help in the design of case-control studies that require homogeneous sets of individuals from different ancestral origins.
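For readers who want a concrete picture of the underlying data, the following is a minimal Python sketch of a bipartite individual-SNP structure and a simple check for admixture. It is an illustration only, not the researchers' method: the identifiers, the toy data, the `snp_cluster` labels and the rule "has SNPs from more than one cluster" are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical toy data: (individual, SNP) pairs. All names are invented.
edges = [
    ("utah_1", "snp_a"), ("utah_1", "snp_b"),
    ("utah_2", "snp_a"),
    ("nigeria_1", "snp_c"), ("nigeria_1", "snp_d"),
    ("mixed_1", "snp_b"), ("mixed_1", "snp_c"),
]

# Assumed cluster label for each SNP (in practice this would come from analysis).
snp_cluster = {"snp_a": "utah", "snp_b": "utah",
               "snp_c": "nigeria", "snp_d": "nigeria"}


def build_bipartite(pairs):
    """Index the pairs in both directions: individual -> SNPs and SNP -> individuals."""
    snps_of = defaultdict(set)
    individuals_with = defaultdict(set)
    for person, snp in pairs:
        snps_of[person].add(snp)
        individuals_with[snp].add(person)
    return snps_of, individuals_with


def cluster_shares(snps, snp_cluster):
    """Fraction of an individual's SNPs that belong to each cluster."""
    counts = {}
    for snp in snps:
        label = snp_cluster[snp]
        counts[label] = counts.get(label, 0) + 1
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


snps_of, individuals_with = build_bipartite(edges)

# An individual is flagged here if their SNPs span more than one cluster.
admixed = sorted(
    person for person, snps in snps_of.items()
    if len(cluster_shares(snps, snp_cluster)) > 1
)

print(sorted(individuals_with["snp_b"]))  # ['mixed_1', 'utah_1']
print(admixed)                            # ['mixed_1']
```

Keeping both lookup directions mirrors the point of a bipartite view: one can start from an individual and list their SNPs, or start from a SNP and list the individuals who carry it.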

To produce an even more detailed picture of the individual-SNP information, the researchers applied two other bipartite visualization techniques to the data: the bipartite heat map and the bipartite Circos ideogram. In the heat map, rectangular cells laid out in a spreadsheet-like arrangement and colored white, gray, or black helped precisely define the boundaries of the clusters by clarifying individual-SNP relationships. In the Circos ideogram, individuals and SNPs were placed around the perimeter of a circle and linked with curved lines, enabling the researchers to more closely examine the admixed individuals' ties to SNPs in the clusters associated with both Utah and Nigeria.
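The spreadsheet-like layout of a bipartite heat map can be sketched in a few lines of Python. This is a rough illustration under stated assumptions, not the researchers' software: the toy data are invented, and the article does not say what the white, gray and black shades encode, so the three levels 0, 1 and 2 used here are a stand-in.

```python
# Hypothetical toy data: individual -> {SNP: level}. Levels 0, 1 and 2 stand in
# for white, gray and black cells; their real meaning is an assumption here.
profiles = {
    "utah_1": {"snp_a": 2, "snp_b": 1},
    "utah_2": {"snp_a": 1},
    "nigeria_1": {"snp_c": 2, "snp_d": 2},
    "mixed_1": {"snp_b": 1, "snp_c": 1},
}

# Text stand-ins for the three shades.
SHADES = {0: ".", 1: "+", 2: "#"}


def to_matrix(profiles, row_order, col_order):
    """Rows are individuals, columns are SNPs; a missing entry counts as level 0."""
    return [[profiles[r].get(c, 0) for c in col_order] for r in row_order]


def render(matrix, row_order):
    """Draw the matrix as text, one labeled row per individual."""
    lines = []
    for name, row in zip(row_order, matrix):
        lines.append(f"{name:<10} " + "".join(SHADES[v] for v in row))
    return "\n".join(lines)


# Ordering rows and columns by cluster is what makes the block pattern visible.
rows = ["utah_1", "utah_2", "mixed_1", "nigeria_1"]
cols = ["snp_a", "snp_b", "snp_c", "snp_d"]
matrix = to_matrix(profiles, rows, cols)
print(render(matrix, rows))
```

With rows and columns grouped by cluster, the Utah and Nigeria cells form two blocks along the diagonal, and the hypothetical admixed individual `mixed_1` shows up as a row with cells in both blocks.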

"The network representation is very powerful because it gives you the overall structure of the data, but to really understand the complex relationships, you need these additional bipartite representations," Bhavnani said.

The JAMIA paper, according to Bhavnani, represents a proof of concept for the researchers' novel combination of methods, which can be applied to a wide range of biomedical questions. "You can think of anything – for example you could examine cases and controls in Alzheimer's disease, or you could compare children who are prone to ear infections and those who aren't prone," Bhavnani said. "Whatever your disease or trait of interest is, our approach can handle it."
