Solving pieces of the genetic puzzle

May 10, 2018 by Lori Dajose, California Institute of Technology
The circular E. coli genome. Sets of genes, or operons, which lack regulatory understanding are shown in red. Though E. coli is the most well-studied organism on the planet, scientists do not understand the regulatory mechanisms of 67 percent of E. coli's genes. Credit: Courtesy of the Phillips laboratory

Every living thing on the planet contains DNA, the molecular sequence that encodes the genetic blueprint of an organism. Genome sequencing can reveal your likelihood of getting certain diseases like Alzheimer's, and it can tell you whether you have straight or curly hair or are likely to sneeze when sunlight hits your eyes. But for all this information, scientists only understand the functions of a small portion of the genome. The bacterium Escherichia coli (E. coli) is widely considered the most well-studied organism on Earth, but still, scientists have no idea how more than half of E. coli's genes are regulated.

Now, recent research from Caltech illustrates a new technique to help crack the code of certain mysterious regions of DNA called noncoding DNA sequences. Many mutations in these poorly understood regions have been implicated in disease in organisms such as humans, so understanding the function of noncoding DNA is critical.

The work was done in the laboratory of Rob Phillips, Fred and Nancy Morris Professor of Biophysics, Biology, and Physics. A paper describing the research appears online on May 4, ahead of print in the journal Proceedings of the National Academy of Sciences.

"Humans have such a wide variety of cells—muscle cells, neurons, photoreceptors, blood cells, to name a few," says Phillips. "They all have the same DNA, so how do they each turn out so differently? The answer lies in the fact that genes can be regulated—turned on or off, dialed up and dialed down—differently in different tissues. Until now, there have been no general principles to help us understand how this regulation was encoded."

The most well-studied parts of the genome are the so-called coding regions—the genes that encode for the production of the proteins that allow a cell to function.

However, more than 50 percent of the genes in E. coli have noncoding regions whose functions remain completely mysterious. These regions of the DNA contain sites where proteins called transcription factors bind and are able to dial up or down expression of other genes—in other words, noncoding regions contain information about how the genome regulates itself. 

In the new work, postdoctoral scholar Nathan Belliveau (PhD '18) applied a method called Sort-Seq to mutate small pieces of noncoding regions in E. coli and determine which regions contain binding sites. Binding sites are the locations where specialized proteins that are involved in transcription—the first step in the process of gene expression—attach to DNA.

First, the researchers cut out potentially interesting sections of noncoding DNA that they wanted to learn about. To these, they attached DNA encoding for the production of a glowing green fluorescent protein (GFP). Then, each little engineered section of DNA was placed inside an individual E. coli bacterium, causing it to produce these green proteins.

When Belliveau randomly mutated parts of the unknown regions, he noted observable changes in the amount of GFP produced in some of the bacteria, indicating that the mutated DNA is altering the level of gene expression. Through DNA sequencing, the researchers were then able to pinpoint the exact location of these important mutations and use this information to identify new binding sites.

Phillips gives a literary analogy: "This is as if I went through a book, randomly took 10 percent of the letters in words, and changed them. If the first letter of 'walk' gets changed to a T, making the word 'talk,' then you change the meaning of the word completely—your comprehension changes. We wanted to know: Which parts of the genome affect cellular comprehension the most?"

After examining many noncoding regions to determine binding sites, the team aimed to match the regions with the corresponding proteins that bind there.

"This was literally like finding a needle in a haystack," says Phillips. "There are roughly 3 million proteins in E. coli, and maybe 10 copies of a particular protein that will correspond to a given binding site. That's finding one protein in 300,000 proteins."

Belliveau developed a method to find the proverbial needle: He took a piece of noncoding DNA that contained a binding site, poured the contents of an E. coli cell over that DNA, and then identified the that had stuck to the site. 

"This work is a demonstration that we can use our approach to go from nothing—complete ignorance—to actually understanding mechanisms of regulation," says Belliveau. "The next step is to try to scale this up to allow us to go after the entire genome."

"We live in a genomic era," says Phillips. "We have to be able to figure out how, where, and when are turned off and on."

The paper is titled "A systematic approach for dissecting the molecular mechanisms of transcriptional regulation in bacteria."

Explore further: Researchers pinpoint key regulatory role of noncoding genes in prostate cancer development

More information: Systematic approach for dissecting the molecular mechanisms of transcriptional regulation in bacteria, Proceedings of the National Academy of Sciences (2018). DOI: 10.1073/pnas.1722055115

Related Stories

Researchers pinpoint key regulatory role of noncoding genes in prostate cancer development

August 15, 2016
Prostate cancer researchers studying genetic variations have pinpointed 45 genes associated with disease development and progression.

Scientists characterize regulatory DNA sequences responsible for human diseases

August 24, 2017
Scientists from the Children's Medical Center Research Institute at UT Southwestern (CRI) have developed an innovative system to identify and characterize the molecular components that control the activities of regulatory ...

'Mysterious' non-protein-coding RNAs play important roles in gene expression

January 12, 2017
In cells, DNA is transcribed into RNAs that provide the molecular recipe for cells to make proteins. Most of the genome is transcribed into RNA, but only a small proportion of RNAs are actually from the protein-coding regions ...

Discovery of genetic mutation may boost cancer therapies

February 24, 2017
A newly discovered type of genetic mutation that occurs frequently in cancer cells may provide clues about the disease's origins and offer new therapeutic targets, according to research from Weill Cornell Medicine and the ...

Recommended for you

Scientists identify critical cancer immunity genes using new genetic barcoding technology

October 20, 2018
Scientists at Mount Sinai have developed a novel technology for simultaneously analyzing the functions of hundreds of genes with resolution reaching the single cell level. The technology relies on a barcoding approach using ...

A single missing gene leads to miscarriage

October 19, 2018
A single gene from the mother plays such a crucial role in the development of the placenta that its dysfunction leads to miscarriages. Researchers from the Medical Faculty of Ruhr-Universität Bochum (RUB) have observed this ...

Making gene therapy delivery safer and more efficient

October 18, 2018
Viral vectors used to deliver gene therapies undergo spontaneous changes during manufacturing which affects their structure and function, found researchers from the Perelman School of Medicine at the University of Pennsylvania ...

Student develops microfluidics device to help scientists identify early genetic markers of cancer

October 16, 2018
As anyone who has played "Where's Waldo" knows, searching for a single item in a landscape filled with a mélange of characters and objects can be a challenge. Chrissy O'Keefe, a Ph.D. student in the Department of Biomedical ...

Researchers use brain cells in a dish to study genetic origins of schizophrenia

October 16, 2018
A study in Biological Psychiatry has established a new analytical method for investigating the complex genetic origins of mental illnesses using brain cells that are grown in a dish from human embryonic stem cells. Researchers ...

Why heart contractions are weaker in those with hypertrophic cardiomyopathy

October 16, 2018
When a young athlete suddenly dies of a heart attack, chances are high that they suffer from familial hypertrophic cardiomyopathy (HCM). Itis the most common genetic heart disease in the US and affects an estimated 1 in 500 ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.