These mutations could be key to understanding how some harmful conditions develop

September 11, 2017

A team of researchers led by a bioinformatician at the University of California San Diego has developed a method to help determine whether certain hard-to-study mutations in the human genome, called short tandem repeats or microsatellites, are likely to be involved in harmful conditions.

The team, which also includes scientists from the New York Genome Center, Harvard University, and the Massachusetts Institute of Technology, details their findings in the Sept. 11 issue of Nature Genetics.

In short tandem repeats, sequences of one to six of DNA's basic components, called nucleotides, repeat over and over again, sometimes up to hundreds or thousands of times.

These mutations already have been implicated in about 30 conditions. The best known is perhaps Huntington's Disease, which causes the progressive breakdown of nerve cells in the brain. About 30,000 people suffer from the condition in the United States. These people all have more than 40 copies of a specific repeat. The more copies they have, the sooner they are affected by the disease and the more severe it is.

The Nature Genetics paper is part of the ongoing, decades-long effort to pinpoint harmful mutations in the human . Tandem repeats are often overlooked in these efforts, and have sometimes been disregarded as "junk DNA." But researchers led by Melissa Gymrek, an assistant professor at UC San Diego, believe that tandem repeats are likely to play key roles in human health and need to be studied in depth.

"When you look for signals for disease in the human genome, you get too many answers. We are looking for a way to narrow these answers down," said Gymrek, who holds appointments at both the UC San Diego School of Medicine and the Jacobs School of Engineering.

In the next step of their research, scientists plan to use their model to examine the genomes of families with autistic members.

Analyzing repeats

Tandem repeats are difficult to analyze with current genome sequencing techniques. That's because they're usually fairly long, and current tools usually look only at short pieces of DNA. In addition, the process of amplifying DNA for sequencing creates more errors that get in the way.

In this paper, researchers detail how they were able to create a mathematical model that predicts how frequently and in what way the repeats appear and mutate in the human genome. Gymrek and colleagues were able to do this because of the extraordinary amount of genetic data that they had access to—more than 1.5 million repeats from the genomes of 300 individuals.

The researchers based their new algorithm on a method called MUTEA that they previously developed to precisely estimate individual for tandem repeats on the Y chromosome. They modified the algorithm so it would analyze pairs of DNA variations, called haplotypes. The key insight the method provided is that different classes of mutations happen at regular, predictable intervals in time, constituting what they refer to as a molecular clock. This clock can be used to determine how often mutations occur within a genome.

Finding constraints

Next, the researchers used the model to calculate actual mutation rates and compare those to expected mutation rates. This is what geneticists call constraint. For example, regions of the genome that are home to mutations that occur early in life and lead to severe health conditions tend to have fewer in the population than expected by chance—geneticists say they're highly constrained. That's because those suffering from these conditions, like autism, are less likely to pass their genes on to the next generation. Regions of the genome that cause diseases that occur later in life, after patients have had children, like Huntington's Disease, are usually not constrained.

The team used their model on a number of different tandem repeats related to both late and early onset conditions, such as limb malformations. The model correctly identified that repeats involved in early-onset conditions were subject to constraint. They calibrated their method by using a set of tandem repeats that are not associated with specific conditions, which the FBI uses to identify people. As expected, these repeats mutate at the expected rate and are not constrained.

Gymrek and her team are now getting ready to apply their model to find signals for other conditions inside the .

Explore further: New technique searches 'dark genome' for disease mutations

More information: Melissa Gymrek et al. Interpreting short tandem repeat variations in humans using mutational constraint, Nature Genetics (2017). DOI: 10.1038/ng.3952

Related Stories

New technique searches 'dark genome' for disease mutations

August 10, 2017
When doctors can't find a diagnosis for patient's disease, they turn to genetic detectives. Equipped with genomic sequencing technologies available for less than 10 years, these sleuths now routinely search through a patient's ...

Is Huntington's disease more common than we thought?

June 22, 2016
More people may have the potential to develop Huntington's disease than previously thought, according to a study published in the June 22, 2016, online issue of Neurology, the medical journal of the American Academy of Neurology. ...

Recommended for you

Genome editing reveals role of gene important for human embryo development

September 20, 2017
Researchers have used genome editing technology to reveal the role of a key gene in human embryos in the first few days of development. This is the first time that genome editing has been used to study gene function in human ...

A piece of the puzzle: Eight autism-related mutations in one gene

September 19, 2017
Scientists have identified a hotspot for autism-related mutations in a single gene.

Scientists identify key regulator of male fertility

September 19, 2017
When it comes to male reproductive fertility, timing is everything. Now scientists are finding new details on how disruption of this timing may contribute to male infertility or congenital illness.

New assay leads to step toward gene therapy for deaf patients

September 18, 2017
Scientists at Oregon State University have taken an important step toward gene therapy for deaf patients by developing a way to better study a large protein essential for hearing and finding a truncated version of it.

Biologists identify gene involved in kidney-related birth defects

September 18, 2017
A team led by University of Iowa researchers has identified a gene linked to rare, often fatal kidney-related birth defects.

Genomic recycling: Ancestral genes take on new roles

September 18, 2017
One often hears about the multitude of genes we have in common with chimps, birds or other living creatures, but such comparisons are sometimes misleading. The shared percentage usually refers only to genes that encode instructions ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.