UC Santa Cruz builds national data center for cancer genome research
The emerging field of "personalized" or "precision" medicine holds great promise in the fight against cancer. If scientists can identify the genetic changes that drive each patient's cancer cells, they can use that information to develop targeted treatments. But achieving this goal will require massive amounts of genomic and clinical data and a sophisticated infrastructure to manage and analyze the data.
The University of California, Santa Cruz, has now completed a first step in building this infrastructure, said UC Santa Cruz bioinformatics expert David Haussler. Haussler's team has established the Cancer Genomics Hub (CGHub), a large-scale data repository and user portal for the National Cancer Institute's cancer genome research programs. CGHub's initial "beta" release is providing cancer researchers with efficient access to a large and rapidly growing store of valuable biomedical data. The project is funded by the National Cancer Institute (NCI) through a $10.3 million subcontract with SAIC-Frederick Inc., the prime contractor for the Frederick National Laboratory for Cancer Research.
"By providing researchers with comprehensive catalogs of the key genomic changes in many major types and subtypes of cancer, these efforts will support the development of more effective ways to diagnose and treat cancer," said Haussler, a distinguished professor of biomolecular engineering in the Baskin School of Engineering at UCSC and a Howard Hughes Medical Institute investigator.
In personalized care, doctors design treatments to target specific genetic changes found in a patient's cancer cells. Researchers are trying to catalog all the genetic abnormalities found in different types of cancers and find connections between specific genetic changes and how patients respond to different treatments. The scale and complexity of the information being gathered creates a critical challenge in the area of data management.
Although recent studies using genetically targeted treatments have shown promising results, much more research is needed to enable their widespread use, Haussler said. "There won't be one magic bullet, because cancer is not one disease, or even 100 diseases. Every instance of cancer is different. We have to improve our understanding of the molecular biology of cancer and develop computer algorithms so that we can analyze the genetic changes in each individual patient. It will take time. But with cancer genomics, we will eventually know our enemy completely."
Haussler's team assembled the first draft of the human genome sequence in 2000 and created and maintains the UCSC Genome Browser, a web-based tool that is used extensively in biomedical research and serves as the platform for several large-scale genomics projects. His group's contributions to cancer genomics research include creation of a Cancer Genomics Browser for analyzing data from large-scale cancer studies.
Haussler's group built CGHub to support all three major NCI cancer genome sequencing programs: The Cancer Genome Atlas (TCGA), Therapeutically Applicable Research to Generate Effective Treatments (TARGET), and the Cancer Genome Characterization Initiative (CGCI). TCGA is a collaborative effort led by NCI and the National Human Genome Research Institute to map the genomic changes that occur in at least 20 major types and subtypes of adult cancer. The TARGET program is a related effort focusing on the five most common childhood cancers, and the CGCI makes available genomic data from HIV-associated cancers and certain lymphoid and childhood cancers.
These programs are laying the foundation for personalized cancer care by creating a database that scientists around the world can use to connect specific genomic changes with clinical outcomes. Haussler's group has been closely involved in data analysis for TCGA.
"TCGA is allowing us for the first time to look at cancer in full molecular detail," Haussler said. "Cancer is a disease caused by disruption of DNA molecules within the cell. When life starts, every cell in the body has the same DNA. In the course of a person's lifetime, however, some cells may accumulate changes in their DNA that cause them to go rogue and multiply without control, creating the disease we call cancer. For the first time now, we are able to look into an individual patient's cancer cells and see all the genetic disruptions, among which are the molecular drivers of that person's cancer."
There are currently only a few situations in which doctors can prescribe a treatment plan based on the specific genetic mutations in a patient's cancer cells. That is expected to change as projects like TCGA, TARGET, and CGCI yield a comprehensive catalog that researchers can use to find new targets for medicines and discover clues to improve patient outcomes. But there is an urgent need for an efficient and user-friendly portal to give researchers access to the data. The NCI genome projects are producing staggering amounts of data.
"The scale of this is far beyond anything faced in medical research before," Haussler said. "Each genome file, the DNA record from a tumor or normal tissue, is 300 billion bytes. And for every case there are two of these files, the cancer genome and the normal genome. Add to this RNA sequence data, and the prospect of deeper sequencing in the future, and we must plan for up to a terabyte (1,000 billion bytes) for each case."
TCGA currently generates about 10 terabytes of data each month. For comparison, the Hubble Space Telescope amassed about 45 terabytes of data in its first 20 years of operation. TCGA's output will increase tenfold or more over the next two years. Over the next four years, if the project produces a terabyte of DNA and RNA data from each of more than 10,000 patients, it will have produced 10 petabytes of data (a petabyte is 1,000 terabytes). And TCGA is just the beginning of the data deluge, Haussler said, noting that 10,000 cases is a small fraction of the 1.5 million new cancer cases diagnosed every year in the United States alone.
New data compression schemes are expected to reduce the total storage space needed, so the CGHub repository is designed initially to hold 5 petabytes and to allow further growth as needed. That is still a massive amount of data, and CGHub will need to accommodate transfers of extremely large data files.
Managed by the UCSC team, the CGHub computer system is located at the San Diego Supercomputer Center. It is connected by high-performance national research networks to major centers nationwide that are participating in these projects, including UCSC. Haussler's team designed and oversees the storage and computing infrastructure for the repository, which has an automated query and download interface for large-scale, high-speed use. It will eventually also include an interactive web-based interface to allow researchers to browse and query the system and download custom datasets.
It may take years for cancer genomics research to bring about major changes in cancer care. The first step, and the focus of the NCI cancer genomics programs, is to determine which genomic changes are involved in each type of cancer and to understand the molecular and clinical effects of those changes. Then biomedical researchers must identify or develop treatments to block those effects.
"Right now, cancer research needs something on a very large scale, like the Large Hadron Collider in physics," Haussler said. "Instead of bringing subatomic particles together in high-energy collisions and computing their behavior, we're bringing cancer genomes together in a common database and computing the disease drivers."
CGHub program director is Robert Zimmerman, and project team members include technical director Mark Diekhans; operations manager Linda Rosewood; hardware systems lead Erich Weiler; engineering lead Chris Wilks; engineering consultant Brian Craft; and networking consultants Brad Smith and Jim Warner. The core code, including GT software for downloading data, was licensed from Annai Systems. The cancer genomics group at UCSC also includes co-principal investigator Joshua Stuart, an associate professor of biomolecular engineering at UCSC; assistant research scientist Jing Zhu; engineers Kyle Ellrott, Teresa Swatloski, and Singer Ma; user testing engineer Mary Goldman; postdoctoral scholars Adam Ewing, Benedict Paten, and Daniel Zerbino; research associate Charlie Vaske; and graduate students Tracy Ballinger, Steve Benz, Daniel Carlin, James Durbin, Ted Goldstein, Mia Grifford, Sam Ng, Amie Radenbaugh, Zack Sanborn, and Chris Szeto.
Provided by University of California - Santa Cruz
- Cancer Genomics Browser gives cancer researchers a powerful new tool Mar 30, 2009 | not rated yet | 0
- Future of personalized cancer care is promising and near Apr 19, 2011 | not rated yet | 0
- The Cancer Genome Atlas identifies distinct subtypes of deadly brain cancer Jan 19, 2010 | not rated yet | 0
- Genome 10K Project announces first 101 species for genome sequencing Nov 30, 2010 | not rated yet | 0
- Researchers discover new molecular subtype of brain cancer Apr 15, 2010 | not rated yet | 0
- Motion perception revisited: High Phi effect challenges established motion perception assumptions Apr 23, 2013 | 3 / 5 (2) | 2
- Anything you can do I can do better: Neuromolecular foundations of the superiority illusion (Update) Apr 02, 2013 | 4.5 / 5 (11) | 5
- The visual system as economist: Neural resource allocation in visual adaptation Mar 30, 2013 | 5 / 5 (2) | 9
- Separate lives: Neuronal and organismal lifespans decoupled Mar 27, 2013 | 4.9 / 5 (8) | 0
- Sizing things up: The evolutionary neurobiology of scale invariance Feb 28, 2013 | 4.8 / 5 (10) | 14
How can there be villous adenoma in colon, if there are no villi there
17 hours ago As title suggest. Thanks :smile:
How can there be a term called "intestinal metaplasia" of stomach
May 21, 2013 Hello everyone, Ok Stomach's normal epithelium is simple columnar, now in intestinal type of adenocarcinoma of stomach it undergoes "intestinal...
Pressure-volume curve: Elastic Recoil Pressure don't make sense
May 18, 2013 From pressure-volume curve of the lung and chest wall (attached photo), I don't understand why would the elastic recoil pressure of the lung is...
If you became brain-dead, would you want them to pull the plug?
May 17, 2013 I'd want the rest of me to stay alive. Sure it's a lousy way to live but it beats being all-the-way dead. Maybe if I make it 20 years they'll...
MRI bill question
May 15, 2013 Dear PFers, The hospital gave us a $12k bill for one MRI (head with contrast). The people I talked to at the hospital tell me that they do not...
Ratio of Hydrogen of Oxygen in Dessicated Animal Protein
May 13, 2013 As an experiment, for the past few months I've been consuming at least one portion of Jell-O or unflavored Knox gelatin per day. I'm 64, in very...
- More from Physics Forums - Medical Sciences
More news stories
(HealthDay)—The American Cancer Society, which is celebrating on Wednesday a century of fighting a disease once viewed as a death sentence, is making a pledge to put itself out of business.
Cancer 7 hours ago | not rated yet | 0
National Lung Screening Trial (NLST) investigators also conclude that the 20 percent reduction in lung cancer mortality with low-dose computed tomography (LDCT) versus chest X-ray (CXR) screening previously reported in the ...
Cancer 7 hours ago | not rated yet | 0
Researchers have developed a new drug delivery system that allows inhalation of chemotherapeutic drugs to help treat lung cancer, and in laboratory and animal tests it appears to reduce the systemic damage ...
Cancer 11 hours ago | not rated yet | 0 |
When turned on, the gene p53 turns off cancer. However, when existing drugs boost p53, only a few tumors die – the rest resist the challenge. A study published in the journal Cell Reports shows how: tumors that live even i ...
Cancer 11 hours ago | not rated yet | 0 |
Study leader, Professor John Mathews from the University of Melbourne said this small increase in cancer risk must be weighed against the undoubted benefits from CT scans in diagnosing and monitoring disease.
Cancer 15 hours ago | not rated yet | 0
Existing research shows that bicyclists who wear helmets have an 88 percent lower risk of brain injury, but researchers at Boston Children's Hospital found that simply having bicycle helmet laws in place showed a 20 percent ...
50 minutes ago | not rated yet | 0
Swiss scientists reveal the mechanism responsible for aging hidden deep within mitochondria—and dramatically slow it down in worms by administering antibiotics to the young.
11 hours ago | 4.9 / 5 (7) | 0 |
Researchers from Queen Mary, University of London have led the largest sequencing study of human disease to date, investigating the genetic basis of six autoimmune diseases.
11 hours ago | 4 / 5 (1) | 0 |
Until now, little was scientifically known about the human potential to cultivate compassion—the emotional state of caring for people who are suffering in a way that motivates altruistic behavior.
8 hours ago | 5 / 5 (2) | 2 |
(HealthDay)—Migraines and depression can each cause a great deal of suffering, but new research indicates the combination of the two may be linked to something else entirely—a smaller brain.
8 hours ago | 4 / 5 (2) | 0 |
A new approach for immunizing against influenza elicited a more potent immune response and broader protection than the currently licensed seasonal influenza vaccines when tested in mice and ferrets. The vaccine ...
9 hours ago | not rated yet | 0 |