Team develops tool to better visualize, analyze human genomic data
Scientists at the University of Maryland have developed a new, web-based tool that enables researchers to quickly and easily visualize and compare large amounts of genomic information resulting from high-throughput sequencing experiments. The free tool, called Epiviz, was described in a paper published online on August 3, 2014 in the journal Nature Methods.
Next-generation sequencing has revolutionized functional genomics. These techniques are key to understanding the molecular mechanisms underlying cell function in healthy and diseased individuals and the development of diseases like cancer. Data from multiple experiments need to be integrated, but the growing number of data sets makes a thorough comparison and analysis of results challenging.
To visualize and browse entire genomes, graphical interfaces that display information from a database of genomic data—called "genome browsers"—were created. Epiviz offers a major advantage over browsers currently available: Epiviz seamlessly integrates with the open-source Bioconductor analysis software widely used by genomic scientists, through its Epivizr Bioconductor package.
"Prior tools limited visualization to presentation and dissemination, rather than a hybrid tool integrating interactive visualization with algorithmic analysis," says Héctor Corrada Bravo, assistant professor in computer science at UMD. He also has an appointment in the Center for Bioinformatics and Computational Biology of the university's Institute for Advanced Computer Studies.
Because Epiviz is based on the Bioconductor infrastructure, the tool supports many popular next-generation sequencing techniques, such as ChIP-seq, which is used to analyze protein interactions with DNA; RNA-seq, which reveals a comprehensive snapshot of the abundance of RNAs in cells; and DNA methylation analyses.
Epiviz implements multiple visualization methods for location-based data (such as genomic regions of interest) and feature-based data (such as gene expression), using interactive data visualization techniques not available in web-based genome browsers. For example, because display objects are mapped directly to data elements, Epiviz links data across different visualizations giving users visual insights of the spatial relationships of multiple data sets. The tool is designed to allow biomedical scientists to easily incorporate their own visualizations.
In the Nature Methods paper, Corrada Bravo, UMD computer science doctoral student Florin Chelaru, and undergraduate research assistants from Williams College in Mass. and Washington University in St. Louis used Epiviz to visualize and analyze DNA methylation and gene expression data in colon cancer. Changes in DNA methylation patterns compared with normal tissue have been associated with a large number of human malignancies.
Using Epiviz and Bioconductor, the research team found consistent regions of DNA methylation changes in colon cancer samples generated by the public Cancer Genome Atlas project and similar gene expression in these regions of DNA methylation changes in other cancer types. The results were in agreement with previous experiments, which were conducted by researchers at Johns Hopkins University in collaboration with Corrada Bravo, showing DNA methylation changes across large regions in the colon cancer genome.
"Epiviz helps biomedical scientists meet the challenge of visualizing large genomic data sets while supporting creative data analysis in a collaborative environment," says Corrada Bravo.
Héctor Corrada Bravo website: http://www.cbcb.umd.edu/~hcorrada/