Staking out unknown genomic territory

January 4, 2013
Through nearly a decade of effort, the ENCODE Consortium has started to construct a functional framework for the human genome. Credit: 2012

Scientists have long known that the human genome is incredibly complex. However, after almost 10 years of hard work, a team of more than 400 scientists at 32 research institutions worldwide has finally made serious headway in beginning to understand the structure, function and internal logic of the approximately 3.2 billion bases found within every cell of our body.

The Encyclopedia of DNA Elements (ENCODE) Consortium is coordinated by the US National Institute and draws upon intellectual firepower from the world's leading geneticists—including Piero Carninci and colleagues at the RIKEN Omics Science Center (OSC) in Yokohama, Japan. In early September 2012, ENCODE finally shared the initial fruits of its labors with the world.

The results revealed some surprises. For example, ENCODE's most inclusive model suggests that up to 80% of the genome serves some in at least one of the studied. ENCODE scientists also found that considerably more of the genome is dedicated to regulating gene function than to genes themselves. They have mapped many previously identified disease-associated genomic variants to such .

The RIKEN team was well-versed in the complexities of through their experiences with FANTOM, a major genomics consortium headquartered at OSC, but Carninci says the guidance of lead analysis coordinator Ewan Birney was essential to the success of such an ambitious effort as ENCODE. Standardization was also a challenge, as different cells can have highly divergent patterns of gene expression. ENCODE selected 147 human cell lines and prioritized them so that all groups focused their efforts on common sets of targets.

Every group had its own specialization, and Carninci and colleagues used techniques devised at RIKEN to map genome-wide sites where DNA gets transcribed into RNA2. Their team confirmed striking differences between cell lines, with no one cell type expressing more than 56.7% of the pool of RNA molecules identified in the total sample set. They also identified many cell-specific 'enhancers' of and characterized fundamental differences in expression behavior between genes that encode proteins and those that do not.

The ENCODE effort will continue but Carninci sees great value in the data already uncovered. "I believe this information will be generally used to broadly classify functional parts of the genome in many unrelated biomedical studies," he says. "We have better programs to identify regulatory elements and rules to define those elements, and can now expand this to examine, for instance, biological samples related to diseases."

Explore further: ENCODE project: In massive genome analysis new data suggests 'gene' redefinition

More information: The ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012). … ull/nature11247.html

Djebali, S., Davis, C.A., Merkel, A., Dobin, A., Lassmann, T., Mortazavi, A., Tanzer, A., Lagarde, J., Lin, W., Schlesinger, F. et al. Landscape of transcription in human cells. Nature 489, 101–108 (2012). … ull/nature11233.html

Related Stories

ENCODE project: In massive genome analysis new data suggests 'gene' redefinition

September 5, 2012
Most people understand genes to be specific segments of DNA that determine traits or diseases that are inherited. Textbooks suggest that genes are copied ("transcribed") into RNA molecules, which are then used as templates ...

ENCODE project: Researchers catalogue functional elements of the genome

September 5, 2012
Most of the DNA alterations that are tied to disease do not alter protein-coding genes, but rather the "switches" that control them. Characterizing these switches is one of many goals of the ENCODE project – a sweeping, ...

Gene mapping reveals architecture that controls expression of genes responsible for our sense of smell

May 18, 2012
Within the nasal cavity, millions of sensory neurons in a postage-stamp-sized patch of tissue called the olfactory epithelium control our sense of smell. Thanks to the exquisitely controlled expression of some 300 different ...

Non-coding antisense RNA can be used to stimulate protein production

October 16, 2012
While studying Parkinson's disease, an international research group made a discovery which can improve industrial protein synthesis for therapeutic use. They managed to understand a novel function of non-protein coding RNA: ...

Recommended for you

Scientists provide insight into genetic basis of neuropsychiatric disorders

July 21, 2017
A study by scientists at the Children's Medical Center Research Institute at UT Southwestern (CRI) is providing insight into the genetic basis of neuropsychiatric disorders. In this research, the first mouse model of a mutation ...

Scientists identify new way cells turn off genes

July 19, 2017
Cells have more than one trick up their sleeve for controlling certain genes that regulate fetal growth and development.

South Asian genomes could be boon for disease research, scientists say

July 18, 2017
The Indian subcontinent's massive population is nearing 1.5 billion according to recent accounts. But that population is far from monolithic; it's made up of nearly 5,000 well-defined sub-groups, making the region one of ...

Mutant yeast reveals details of the aberrant genomic machinery of children's high-grade gliomas

July 18, 2017
St. Jude Children's Research Hospital biologists have used engineered yeast cells to discover how a mutation that is frequently found in pediatric brain tumor high-grade glioma triggers a cascade of genomic malfunctions.

Late-breaking mutations may play an important role in autism

July 17, 2017
A study of nearly 6,000 families, combining three genetic sequencing technologies, finds that mutations that occur after conception play an important role in autism. A team led by investigators at Boston Children's Hospital ...

Newly discovered gene variants link innate immunity and Alzheimer's disease

July 17, 2017
Three new gene variants, found in a genome wide association study of Alzheimer's disease (AD), point to the brain's immune cells in the onset of the disorder. These genes encode three proteins that are found in microglia, ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.