ENCODE project: In massive genome analysis new data suggests 'gene' redefinition

September 5, 2012 in Genetics

Most people understand genes to be specific segments of DNA that determine traits or diseases that are inherited. Textbooks suggest that genes are copied ("transcribed") into RNA molecules, which are then used as templates for making protein – the highly diverse set of molecules that act as building blocks and engines of our cells. The truth, it now appears, is not so simple.

As part of a huge collaborative effort called ENCODE (Encyclopedia of DNA Elements), a research team led by Cold Spring Harbor Laboratory (CSHL) Professor Thomas Gingeras, Ph.D., today publishes a -wide analysis of , called transcripts, produced within .

Their analysis – one component of a massive release of research results by ENCODE teams from 32 institutes in 5 countries, with 30 papers appearing today in 3 different high-level — shows that three-quarters of the genome is capable of being transcribed. This figure is important because it indicates that nearly all of our genome is dynamic and active. It stands in marked contrast to consensus views prior to ENCODE's comprehensive research efforts, which suggested that only the small protein-encoding fraction of the genome was transcribed, and therefore important.

The vast amount of data generated with advanced technologies by Gingeras' group and others in the ENCODE project is likely to radically change the prevailing understanding of what defines a gene, the unit we routinely use, for instance, to speak of inheritable traits like eye color or to explain the causes of and susceptibility to most diseases, running the gamut from cancer to schizophrenia to heart disease.

In 2003 the ENCODE project consortium was set up by the U.S. government's National Institute (NHGRI) to examine the newly minted sequence of the in greater depth. At the time, the genome was thought of as a linear molecule of DNA with "genes" being contained within isolated sections that make up just 1%-2% of its total length. The long stretches of DNA between these gene islands were once thought to be mostly functionless spacers, padding, or even "junk DNA."

Through the work of Gingeras and others in this latest phase of the ENCODE project consortium, we now know that most of the DNA around protein-encoding genes is also capable of being transcribed into RNA – another way of saying that it has the potential of performing useful functions in cells.

In preliminary ENCODE results published in 2007, the researchers closely examined about 1% of the human genome. The initial results showed that much more of our DNA could be transcribed than previously thought. Far from being padding, many of these RNA messages appeared to be functional.

The Gingeras lab discovered potentially new classes of functional RNAs in this preliminary work. The additional knowledge that parts of one gene or functional RNA can reside within another were surprising, and suggested a picture of the architecture of our genome that was much more complex than previously thought.

What the new ENCODE data reveals

Two of the 30 papers published by Gingeras and other ENCODE colleagues, including CSHL Professor and HHMI Investigator Gregory Hannon, Ph.D., who is also a co-author in this study, today mark the culmination of project's second phase. What distinguishes the data analyzed in this phase is comprehensiveness. The initial observations of 2007 are now extended to cover the entire human genome – a tour-de-force effort in which the transcribed RNA from different sub-cellular compartments of 15 human cell lines was analyzed

Although the results vary between cell lines, a consensus picture is emerging. In addition to showing that up to three-quarters of our DNA may be transcribed into RNA, the data strongly suggests, according to Gingeras, that a large percent of non-protein-coding RNAs are localized within cells in a manner consistent with their having functional roles.

The current outstanding question concerns the nature and range of those functions. It is thought that these "non-coding" RNA transcripts act something like components of a giant, complex switchboard, controlling a network of many events in the cell by regulating the processes of replication, transcription and translation – that is, the making of proteins based on information carried by messenger RNAs.

With the understanding that so much of our DNA can be transcribed into RNA comes the realization that there is much less space between what we previously thought of as genes, Gingeras points out.

"We see the boundaries of what were assumed to be the regions between genes shrinking in length," he says, "and genic regions making many overlapping RNAs." It appears, he continues, that the boundaries of conventionally described genes are melding together, challenging the notion that a gene is a discrete, localized region of a genome separated by inert DNA. "New definitions of a gene are needed," Gingeras says.

What are the practical implications? According to Gingeras, they include being able to identify possible causes for natural traits such as height or hair loss and disease states such as cancer. Many genetic variations associated with a trait often map to what were formally believed to be "spacer" regions.

"With our increasingly deeper understanding that such regions are related to the neighboring or "distal" protein coding regions – via the creation of non-coding RNAs – we will now seek underlying explanations of the association of the genetic variation and traits of interest." This topic is explored in a second paper published today that summarizes the finding of all the consortium groups participating in the current phase of the ENCODE project.

"Exploration of the genome is akin to our efforts at exploring our physical universe," Gingeras says. "We expect to be amazed and excited by our future efforts to map and explore our personal genetic universes."

More information:

"Landscape of transcription in human cells" is published online in Nature on September 5, 2012. The authors are: Sarah Djebali, Carrie A. Davis and 83 others. The paper can be obtained online at doi:10.1038/nature11233 .

The ENCODE Project Consortium. 2012. An integrated encyclopedia of DNA elements in the human genome. Naturedoi:10.1038/nature11247 .

Other new ENCODE results can be found in the following journals: Nature (6 papers); Genome Research (18 papers); and Genome Biology (6 papers).

Journal reference: Nature search and more info website Genome Research search and more info website Genome Biology search and more info website

Provided by Cold Spring Harbor Laboratory search and more info website

5 /5 (8 votes)  

Filter


Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

nanotech_republika_pl
Sep 05, 2012

Rank: not rated yet
"The analysis shows that three-quarters of the genome is capable of being transcribed." Lots of changes in science books. These paper release are like tsunami in terms of changes to the knowledge about DNA, not? The recent history of human genomics is pre-genome map, circa 2000, pre-ENCODE project publication in 2012, and post ENCODE project publication. The history in making.
baudrunner
Sep 05, 2012

Rank: not rated yet
Studying genetic differences between protein synthesizing characteristics of different cells based on what we know of those cells' functions; of the differentiation of the chromosomes during cell division from the time of conception giving rise to stem cell production (but do those stem cells really carry unique instructions and can we say that we have the level of knowledge to merit altering their nature to make them generic?); how uniqueness is expressed in individuals; top-down approach - bottom-up approach, etc.
Torbjorn_Larsson_OM
Sep 05, 2012

Rank: not rated yet
nanotech, much is anticipated epigenetics (histones et cetera), much is non-functional pseudogenes, much is neutral low rate transcripts. The genetic regulation is larger than expected, but still on the order of early ENCODE finds.

For me, it has an interesting astrobiology potential, since it finds a) transcripts are core functionality, b) RNA is much more relaxed vs selective pressures, variation allowed despite folding functionality.

This opens up a large window to the earlier RNA world, since bacteria have clusters of RNA like their operon clusters of genes, and thousands of small RNAs scattered. These can still have functionality and lineage heritage.
Rank 5 /5 (8 votes)
Relevant PhysicsForums posts

More news stories

Researchers identify new circadian clock component

Northwestern University scientists have shown a gene involved in neurodegenerative disease also plays a critical role in the proper function of the circadian clock.

Genetics created May 16, 2013 | popularity 3 / 5 (1) | comments 1 | with audio podcast

Returning genetic incidental findings without patient consent violates basic rights, experts say

Informed consent is the backbone of patient care. Genetic testing has long required patient consent and patients have had a "right not to know" the results. However, as 21st century medicine now begins to use the tools of ...

Genetics created May 16, 2013 | popularity 5 / 5 (1) | comments 3 | with audio podcast

Ethicists provide framework supporting new recommendations on reporting incidental findings in gene sequencing

In a paper published in Science Express, a group of experts led by bioethicists in the Center for Medical Ethics and Health Policy at Baylor College of Medicine provide a framework for the new American College of Medical Geneti ...

Genetics created May 16, 2013 | popularity not rated yet | comments 0

Experts urge caution over use of new genetic sequencing techniques

The use of genome-wide analysis (GWA), where the entirety of an individual's DNA is examined to look for the genomic mutations or variants which can cause health problems is a massively useful technology for diagnosing disease. ...

Genetics created May 16, 2013 | popularity not rated yet | comments 0

Collecting DNA for human rights: How to help while safeguarding privacy

DNA databases might help identify victims of crime and human trafficking, but how do we safeguard the personal privacy of innocent victims and family members? A new report online May 15 in the Cell Press journal Trends in ...

Genetics created May 15, 2013 | popularity not rated yet | comments 0


Music therapy reduces anxiety, use of sedatives for patients receiving ventilator support

New research suggests that for some hospitalized ICU patients on mechanical ventilators, using headphones to listen to their favorite types of music could lower anxiety and reduce their need for sedative medications.

Tiny, implantable coil promises hope for emphysema patients

A small, easily implantable device called the Lung Volume Reduction Coil (LVRC) may play a key role in the treatment of two types of emphysema, according to a study conducted in Europe. Results of the study indicate the beneficial ...

Early IV nutrition for certain patients does improve survival or reduce ICU length of stay

The early (within 24 hours of intensive care unit [ICU] admission) provision of intravenous nutrition among critically ill patients with contraindications (a condition that makes a particular procedure potentially inadvisable) ...

Having a nighttime critical care physician in the ICU doesn't improve patient outcomes, research finds

With little evidence to guide them, many hospital intensive care units (ICUs) have been employing critical care physicians at night with the notion it would improve patients' outcomes. However, new results from a one-year ...

CT radiation risk less than risk of examination indicator

(HealthDay)—For young adults needing either a chest or abdominopelvic computed tomography (CT), the short-term risk of death from underlying morbidity is greater than the long-term risk of radiation-induced ...

Extra vitamin D may ease Crohn's symptoms, study finds

(HealthDay)—Vitamin D supplements may help those with Crohn's disease overcome the fatigue and decreased muscle strength associated with the inflammatory bowel disease, according to new research.