Solving pieces of the genetic puzzle

May 10, 2018 by Lori Dajose, California Institute of Technology
The circular E. coli genome. Sets of genes, or operons, which lack regulatory understanding are shown in red. Though E. coli is the most well-studied organism on the planet, scientists do not understand the regulatory mechanisms of 67 percent of E. coli's genes. Credit: Courtesy of the Phillips laboratory

Every living thing on the planet contains DNA, the molecular sequence that encodes the genetic blueprint of an organism. Genome sequencing can reveal your likelihood of getting certain diseases like Alzheimer's, and it can tell you whether you have straight or curly hair or are likely to sneeze when sunlight hits your eyes. But for all this information, scientists only understand the functions of a small portion of the genome. The bacterium Escherichia coli (E. coli) is widely considered the most well-studied organism on Earth, but still, scientists have no idea how more than half of E. coli's genes are regulated.

Now, recent research from Caltech illustrates a new technique to help crack the code of certain mysterious regions of DNA called noncoding DNA sequences. Many mutations in these poorly understood regions have been implicated in disease in organisms such as humans, so understanding the function of noncoding DNA is critical.

The work was done in the laboratory of Rob Phillips, Fred and Nancy Morris Professor of Biophysics, Biology, and Physics. A paper describing the research appears online on May 4, ahead of print in the journal Proceedings of the National Academy of Sciences.

"Humans have such a wide variety of cells—muscle cells, neurons, photoreceptors, blood cells, to name a few," says Phillips. "They all have the same DNA, so how do they each turn out so differently? The answer lies in the fact that genes can be regulated—turned on or off, dialed up and dialed down—differently in different tissues. Until now, there have been no general principles to help us understand how this regulation was encoded."

The most well-studied parts of the genome are the so-called coding regions—the genes that encode for the production of the proteins that allow a cell to function.

However, more than 50 percent of the genes in E. coli have noncoding regions whose functions remain completely mysterious. These regions of the DNA contain sites where proteins called transcription factors bind and are able to dial up or down expression of other genes—in other words, noncoding regions contain information about how the genome regulates itself. 

In the new work, postdoctoral scholar Nathan Belliveau (PhD '18) applied a method called Sort-Seq to mutate small pieces of noncoding regions in E. coli and determine which regions contain binding sites. Binding sites are the locations where specialized proteins that are involved in transcription—the first step in the process of gene expression—attach to DNA.

First, the researchers cut out potentially interesting sections of noncoding DNA that they wanted to learn about. To these, they attached DNA encoding for the production of a glowing green fluorescent protein (GFP). Then, each little engineered section of DNA was placed inside an individual E. coli bacterium, causing it to produce these green proteins.

When Belliveau randomly mutated parts of the unknown regions, he noted observable changes in the amount of GFP produced in some of the bacteria, indicating that the mutated DNA is altering the level of gene expression. Through DNA sequencing, the researchers were then able to pinpoint the exact location of these important mutations and use this information to identify new binding sites.

Phillips gives a literary analogy: "This is as if I went through a book, randomly took 10 percent of the letters in words, and changed them. If the first letter of 'walk' gets changed to a T, making the word 'talk,' then you change the meaning of the word completely—your comprehension changes. We wanted to know: Which parts of the genome affect cellular comprehension the most?"

After examining many noncoding regions to determine binding sites, the team aimed to match the regions with the corresponding proteins that bind there.

"This was literally like finding a needle in a haystack," says Phillips. "There are roughly 3 million proteins in E. coli, and maybe 10 copies of a particular protein that will correspond to a given binding site. That's finding one protein in 300,000 proteins."

Belliveau developed a method to find the proverbial needle: He took a piece of noncoding DNA that contained a binding site, poured the contents of an E. coli cell over that DNA, and then identified the that had stuck to the site. 

"This work is a demonstration that we can use our approach to go from nothing—complete ignorance—to actually understanding mechanisms of regulation," says Belliveau. "The next step is to try to scale this up to allow us to go after the entire genome."

"We live in a genomic era," says Phillips. "We have to be able to figure out how, where, and when are turned off and on."

The paper is titled "A systematic approach for dissecting the molecular mechanisms of transcriptional regulation in bacteria."

Explore further: Researchers pinpoint key regulatory role of noncoding genes in prostate cancer development

More information: Systematic approach for dissecting the molecular mechanisms of transcriptional regulation in bacteria, Proceedings of the National Academy of Sciences (2018). DOI: 10.1073/pnas.1722055115

Related Stories

Researchers pinpoint key regulatory role of noncoding genes in prostate cancer development

August 15, 2016
Prostate cancer researchers studying genetic variations have pinpointed 45 genes associated with disease development and progression.

Scientists characterize regulatory DNA sequences responsible for human diseases

August 24, 2017
Scientists from the Children's Medical Center Research Institute at UT Southwestern (CRI) have developed an innovative system to identify and characterize the molecular components that control the activities of regulatory ...

Mapping the genome jungle: Unique animal traits could offer insight into human disease

March 6, 2018
From a bat's wings to an elephant's cancer resistance, an interdisciplinary team of scientists at University of Utah Health are using animals' unique traits to pinpoint regions of the human genome that might affect health. ...

'Mysterious' non-protein-coding RNAs play important roles in gene expression

January 12, 2017
In cells, DNA is transcribed into RNAs that provide the molecular recipe for cells to make proteins. Most of the genome is transcribed into RNA, but only a small proportion of RNAs are actually from the protein-coding regions ...

Scientists reveal how epigenetic changes in DNA are interpreted

May 4, 2017
A new study in Science from Karolinska Institutet maps out how different DNA-binding proteins in human cells react to certain biochemical modifications of the DNA molecule. The scientists report that some 'master' regulatory ...

Discovery of genetic mutation may boost cancer therapies

February 24, 2017
A newly discovered type of genetic mutation that occurs frequently in cancer cells may provide clues about the disease's origins and offer new therapeutic targets, according to research from Weill Cornell Medicine and the ...

Recommended for you

Researchers find potential new gene therapy for blinding disease

August 20, 2018
The last year has seen milestones in the gene therapy field, with FDA approvals to treat cancer and an inherited blinding disorder. New findings from a team led by University of Pennsylvania vision scientists, who have in ...

New study explains why genetic mutations cause disease in some people but not in others

August 20, 2018
Researchers at the New York Genome Center (NYGC) and Columbia University have uncovered a molecular mechanism behind one of biology's long-standing mysteries: why individuals carrying identical gene mutations for a disease ...

Critical role of DHA on foetal brain development revealed

August 17, 2018
Duke-NUS researchers have found evidence that a natural form of Docosahexaenoic Acid (DHA) made by the liver called Lyso-Phosphatidyl-Choline (LPC-DHA), is critical for normal foetal and infant brain development, and that ...

New algorithm could improve diagnosis of rare diseases

August 17, 2018
Today, diagnosing rare genetic diseases requires a slow process of educated guesswork. Gill Bejerano, Ph.D., associate professor of developmental biology and of computer science at Stanford, is working to speed it up.

Gene silencing critical for normal breast development

August 17, 2018
Researchers have discovered that normal breast development relies on a genetic 'brake', a protein complex that keeps swathes of genes silenced.

Officials remove special rules for gene therapy experiments

August 16, 2018
U.S. health officials are eliminating special regulations for gene therapy experiments, saying that what was once exotic science is quickly becoming an established form of medical care with no extraordinary risks.


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.