Global genome effort seeks genetic roots of disease

By decoding the genomes of more than 1,000 people whose homelands stretch from Africa and Asia to Europe and the Americas, scientists have compiled the largest and most detailed catalog yet of human genetic variation. The massive resource will help medical researchers find the genetic roots of rare and common diseases in populations worldwide. Credit: National Human Genome Research Institute

By decoding the genomes of more than 1,000 people whose homelands stretch from Africa and Asia to Europe and the Americas, scientists have compiled the largest and most detailed catalog yet of human genetic variation. The massive resource will help medical researchers find the genetic roots of rare and common diseases in populations worldwide.

The 1000 Genomes Project involved some 200 scientists at Washington University School of Medicine in St. Louis and other institutions. Results detailing the DNA variations of individuals from 14 ethnic groups are published Oct. 31 in the journal Nature. Eventually, the initiative will involve 2,500 individuals from 26 populations.

"With this resource, researchers have a roadmap to search for the of diseases in populations around the globe," says one of the study's co-principal investigators, Elaine Mardis, PhD, co-director of The Institute at Washington University. "We estimate that each person carries up to several hundred rare DNA variants that could potentially contribute to disease. Now, scientists can investigate how detrimental particular rare variants are in different ethnic groups."

At the , any two people are more than 99 percent alike. But rare variants – those that occur with a frequency of 1 percent or less in a population – are thought to contribute to as well as common conditions like cancer, heart disease and diabetes. Rare variants may also explain why some medications are not effective in certain people or cause side effects such as nausea, vomiting, insomnia and sometimes even or death.

Identifying rare variants across different populations is a major goal of the project. During the pilot phase of the effort, the researchers found that most rare variants differed from one population to another, and that they developed recently in human evolutionary history, after populations in Europe, Africa, Asia and the Americas diverged from a single group. The current study bears this out.

"This information is crucial and will improve our interpretation of individual genomes," says another of the study's co-, Richard K. Wilson, PhD, director of The Genome Institute and a pioneer in cancer genome sequencing. "Now, if we want to study cancer in Mexican Americans or Japanese Americans, for example, we can do so in the context of their diverse geographic or ancestry-based genetic backgrounds."

Results of the new study are based on DNA sequencing of the following populations: Yoruba in Nigeria; Han Chinese in Beijing; Japanese in Tokyo; Utah residents with ancestry from northern and western Europe; Luhya in Kenya; people of African ancestry in the southwestern United States; Toscani in Italy; people of Mexican ancestry in Los Angeles; Southern Han Chinese in China; Iberian from Spain; British in England and Scotland; Finnish from Finland; Colombians in Columbia; and Puerto Rican in Puerto Rico.

All study participants submitted anonymous DNA samples and agreed to have their genetic data included in an online database. To catalog the variants, the researchers first sequenced the entire genome – all the DNA – of each individual in the study about five times. Surveying the genome in this way finds common DNA changes but misses many rare variants.

Then, to find rare variants, they repeatedly sequenced the small portion of the genome that contains genes – about 80 times for each participant to ensure accuracy – and they looked closely for single letter changes in the DNA sequence called SNPs (for single-nucleotide polymorphisms).

Using special tools developed to analyze and integrate the data, the researchers discovered a total of 38 million SNPs, including more than 99 percent of the variants with at frequency of at least one percent in the participants' DNA samples. They also found numerous structural variations, including 1.4 million short stretches of insertions or deletions and 14,000 large DNA deletions.

SNPs and structural variants can help explain an individual's susceptibility to disease, response to drugs or reaction to environmental factors such as air pollution or stress. Other studies have found an association between small insertions and deletions and diseases such as autism and schizophrenia.

The 1000 Genomes Project has generated massive amounts of genomic data. Simply recording the raw information took up some 180 terabytes of hard-drive space, enough to fill more than 40,000 DVDs. All of the information is freely available on the Internet through public databases.

"This tremendous resource builds on the knowledge of the Human Genome Project," says co-author George Weinstock, PhD, associate director of The Genome Institute. "Scientists and, ultimately, patients worldwide will benefit from the extensive effort to understand the shared features and geographic diversity of the human genome."

More information: The 1000 Genomes Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature. Oct. 31, 2012.

Related Stories

Johns Hopkins to participate in 1000 Genomes Project

Jan 22, 2008

Researchers at the McKusick-Nathans Institute of Genetic Medicine (IGM) at Johns Hopkins will join other national and international scientists in the 1000 Genomes Project, an ambitious effort that will involve sequencing ...

1000 Genomes Project releases pilot data

Jun 21, 2010

The completion of three pilot projects designed to determine how best to build an extremely detailed map of human genetic variation begins a new chapter in the international project called 1,000 Genomes, said the director ...

Diseases prove elusive in global genetic search

Sep 02, 2010

(PhysOrg.com) -- A failure to use large international sample groups when searching for the genetic basis of common diseases is contributing to a lack of knowledge about the true frequency of illnesses across ...

Researchers sequence exomes of 12 people (w/ Video)

Aug 16, 2009

In a pioneering effort that generated massive amounts of DNA sequence data from 12 people, a team supported by the National Institutes of Health (NIH) has demonstrated the feasibility and value of a new strategy for identifying ...

Recommended for you

Refining the language for chromosomes

Apr 17, 2014

When talking about genetic abnormalities at the DNA level that occur when chromosomes swap, delete or add parts, there is an evolving communication gap both in the science and medical worlds, leading to inconsistencies in ...

Down's chromosome cause genome-wide disruption

Apr 16, 2014

The extra copy of Chromosome 21 that causes Down's syndrome throws a spanner into the workings of all the other chromosomes as well, said a study published Wednesday that surprised its authors.

User comments