CMU will tap advanced computer methods to help doctors make sense of their patients' DNA

January 10, 2012, Carnegie Mellon University

Scientists at Carnegie Mellon University say advanced computational tools will be the key to a new research project that, if successful, could enable doctors to routinely use information extracted from a patient's DNA to diagnose and guide treatment of diseases.

Ion Torrent, a unit of Life Technologies Corporation (NASDAQ: LIFE), is sponsoring the project during its first year, and more funding is expected to come through federal grants and other sources. Robert F. Murphy, director of the Lane Center for Computational Biology in Carnegie Mellon's School of Computer Science, will lead a multidisciplinary team of researchers that will collaborate with scientists at the Baylor College of Medicine and Yale University.

The ultimate dream, Murphy said, is to develop what Ion Torrent Founder and CEO Jonathan M. Rothberg dubbed "doctor in a box" software. Doctor-in-a-box would take a patient's DNA sequence and use it to diagnose disease, identify a patient's susceptibility to disease, and predict which therapies might be most effective or cause the fewest side effects. The size and complexity of the human genome, which was first sequenced in its entirety in 2003, have stymied efforts to date to create such software.

"There's just way too much information for doctors to make sense of it all," Murphy said. But new machine — statistically driven software that can detect associations within mountains of data — may soon be able to translate the genetic and other hereditary information encoded in the human in a way that is clinically relevant to doctors and patients, he added. His team isn't the first to use to analyze whole genomes, however it will employ some unique software developed at Carnegie Mellon.

The Lane Center includes a number of faculty who are leaders in aspects of the problem, including Eric Xing, Ziv Bar-Joseph, Kathryn Roeder, Russell Schwartz and Seyoung Kim.

The team's software will be trained specifically to analyze the type of whole-genome sequence data produced by Ion Torrent's unique sequencing technology, which is ideal for clinical applications because it is designed to sequence the entire human genome in a day for just $1,000. Up to now, routine clinical use of whole genome sequencing has been impractical because it's taken weeks to complete at a cost of about $10,000. Now that Ion Torrent can reduce the time and expense, the next step is creating a tool to enable doctors to easily integrate whole genome sequencing into medical practice, Rothberg said.

"The promise of 'doctor-in-a-box' is that by using artificial intelligence, like we've seen with IBM's 'Watson' computer, we will be able to associate the variations in the human genome with the vast amount of information we have about human health," said Rothberg (E'85). "The work the Carnegie Mellon team is undertaking opens up the possibility that practicing physicians will be able to diagnose disease, identify disease susceptibility and guide therapy selection as easily as they can now use Apple's Siri on the iPhone."

"It's an enormous undertaking," Murphy agreed, "but we are creating a framework that will allow us to tackle this problem one piece at a time and to do so at a scale that makes sense when all of those pieces are put together."

The sheer size of the problem necessitates collaboration with other groups trying to understand the genome, so Murphy said the team intends to make its software available as open source.

During the first year, researchers will focus on identifying the genomic features associated with a single disease or patient population, which has yet to be selected. Researchers at Baylor's Human Genome Sequencing Center and Yale's Center for Genome Analysis will perform the whole genome sequencing of patients and provide longitudinal medical records, such as disease treatments and outcomes and results of clinical tests.

This information, scrubbed of patient identity information, will be analyzed by the Carnegie Mellon researchers, who include biologists, statisticians and computational biologists, as well as other computer scientists. Machine learning programs will tease out the relationships between the genomic data and the clinical outcomes for each of the anonymous patients, while incorporating information from biomedical literature regarding gene and protein expression and disease pathways.

This analysis will yield models based on personal genome sequences that can be used to predict disease and treatment responsiveness, as well as choose preventive therapies.

To provide impetus to the research program, Rothberg will sponsor an "Analyzing the $1,000 Genome" Conference to be held at Carnegie Mellon sometime in the summer or fall of 2012. The scientific conference will highlight outstanding work on computational analysis of genome sequences and foster discussion of new directions and strategies for extending this research.

In addition to Murphy, the research program leadership includes Jaime Carbonell, director of CMU's Language Technologies Institute; Tom Mitchell, director of CMU's Machine Learning Department; Richard Gibbs, director of Baylor's sequencing center; and Shrikant Mane, director of Yale's genome center.

Rothberg also established the Rothberg Research Awards in Human Brain Imaging at Carnegie Mellon to support the university's faculty and students in creatively pushing research boundaries in how the brain thinks, learns and ages.
