Comparing Japanese genome data with distant relatives

October 15, 2018, Tohoku University
Polishing Japanese genome data with distant relatives
The dataset from Tohoku Medical Megabank Organization represents the entire Japanese population. Credit: Tohoku University Tohoku Medical Megabank Organization

A genome bank for the Japanese population can better identify rare genetic variants and disease susceptibilities by adding samples from distant areas of the country.

Incorporating samples from different areas far apart in Japan makes it easier to find on a nationwide scale, according to a new study published in BMC Genomics. It is important to identify these markers as they might indicate which genetic variants lead to certain diseases in a population.

Genetic variations or mutations are common, and pinpointing which variations are responsible for specific traits or diseases is a challenge. One way geneticists try to do this is with studies that cover an entire population and look for rare sequences that have not been identified before. They use genotype imputation, a statistical technique to identify unknown genotypes in order to search through large datasets and pick out a group of genes inherited from a parent called haplotypes, which contain a cluster of variations called (SNPs).

Previously, a dataset of 1,070 genomes containing haplotype information was produced as part of the Tohoku Medical Megabank project. Launched in the aftermath of the 2011 earthquake, the project aims to develop tailor-made medical diagnosis and treatment using people's genetic information in Miyagi and Iwate prefectures. The data, known as the '1KJPN' reference panel, was created based on samples from Miyagi.

In a present study, a research team from the Tohoku Medical Megabank Organization at Tohoku University investigated how accurately Miyagi samples represent the diversity of genome variations of the entire Japanese mainland population. They compared the 1KJPN panel with 144, 39 and 35 genome samples from the areas of Iwate, Nagahama and Aki, respectively.

The results showed that while the Miyagi data was a sufficient representative of the entire population, combining the 1KJPN dataset with genome samples from Iwate, Nagahama and Aki improved the efficiency of the genotype imputation, particularly in identifying rare variants or SNPs on a nationwide scale. The combined data was also more accurate than Japanese samples in the 1000 Genome Project, the largest human genotype database created by the European Bioinformatics Institute.

The comparison of the genome samples from the four areas indicates why the combined data is stronger. The researchers found genetic differentiations increased with distance from Miyagi. For example, populations in the neighbouring regions of Miyagi and Iwate were most similar to each other than populations in Nagahama and Aki, which are more than 700 km (435 miles) and 1,000 km (620 miles) south, respectively.

A deeper analysis of individual genomes identified rare SNPs that are present in Iwate, Nagahama and Aki, but not in Miyagi. More variants were observed in the areas furthest from Miyagi, showing the importance of collecting genomic data from disperse areas to capture a wide range of rare variations.

Interestingly, the team found that Aki samples formed a distinct genome cluster, indicating the Aki population on Shikoku Island is genetically different from populations on neighbouring Honshu Island. This is contrary to the existing notion that genetic differentiation is minimal among populations on Japan's main four islands, which are close together and connected by bridges or tunnels. However, the genetic differences between Aki and the other areas are much smaller than the genetic differences found between Japanese populations and mainland China populations.

In the next step, the researchers hope to collaborate with other groups to produce genetic data from other parts of the country. A combined dataset could be used to identify specific genes associated with common diseases.

Explore further: Rare variant discovered through deep whole-genome sequencing of 1,070 Japanese people

More information: Jun Yasuda et al. Regional genetic differences among Japanese populations and performance of genotype imputation using whole-genome reference panel of the Tohoku Medical Megabank Project, BMC Genomics (2018). DOI: 10.1186/s12864-018-4942-0

Related Stories

Rare variant discovered through deep whole-genome sequencing of 1,070 Japanese people

October 15, 2015
A research group at Tohoku Medical Megabank Organization (ToMMo) has successfully constructed a Japanese population reference panel (1KJPN), from the genome information of 1,070 individuals who had participated in the cohort ...

Genetic diversity of enzymes alters metabolic individuality

September 1, 2016
Scientists from Tohoku University's Tohoku Medical Megabank Organization (ToMMo) have published research about genetic diversity and metabolome in Scientific Reports.

New genetic links for heart disease risk factors identified

September 26, 2016
Scientists from the Welcome Trust Sanger Institute and their collaborators have discovered 17 rare human genetic variations associated with risk factors for diseases such as heart disease and diabetes.

Japonica Array: Improved genotype imputation by designing a population-specific SNP array

September 2, 2015
A research group, led by Professor Masao Nagasaki and Senior Assistant Professor Yosuke Kawai at Tohoku University Tohoku Medical Megabank Organization, has successfully designed the first ever SNP array that has been optimized ...

Estonia to map DNA of over 10% of population

May 21, 2018
A trail-blazer in internet technology, cyber-savvy Estonia is rolling out a high-tech DNA database holding the genetic details of over 150,000 residents to improve the prevention, diagnosis and treatment of chronic disease.

People in Miyagi coastal areas continue to show higher levels of depressive tendencies

April 14, 2016
Tohoku University Tohoku Medical Megabank Organization (ToMMo) has revealed that in 2014, three years after the Great East Japan Earthquake and tsunami, depressive symptoms continue to be higher in the coastal areas than ...

Recommended for you

Study bridges a divide in cell aging in neurodegenerative diseases

November 21, 2018
Research from the University of Toronto has shown that in some neurodegenerative diseases, two hallmarks of cell aging – protein aggregation and a type of DNA instability – are linked. They were previously thought to ...

New method for studying gene expression could improve understanding of brain disease

November 21, 2018
It takes a lot of cells to make a human brain. The organ houses not only an enormous quantity of neurons (tens of billions), but also an impressive diversity of neuron types. In recent years, scientists have been developing ...

Parental 'feeding styles' reflect children's genes

November 20, 2018
New research from King's College London and UCL challenges the idea that a child's weight largely reflects the way their parents feed them. Instead, parents appear to adopt feeding styles in response to their children's natural ...

Scientists identify new genetic causes linked to abnormal pregnancies and miscarriages

November 20, 2018
A team of scientists at the Research Institute of the McGill University Health Centre (RI-MUHC) and McGill University have identified three genes responsible for recurrent molar pregnancies, a rare complication that occurs ...

A study suggests that epigenetic treatments could trigger the development of aggressive tumours

November 20, 2018
A study headed by the Institute for Research in Biomedicine (IRB Barcelona) and published in the journal Nature Cell Biology examined whether the opening of chromatin (a complex formed by DNA bound to proteins) is the factor ...

Redefining colorectal cancer subtypes

November 20, 2018
There is a long-standing belief that colorectal cancer (CRC), which causes some 50,000 deaths in the United States each year, can be categorized into distinct molecular subtypes. In a paper published recently in the journal Genome ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.