May 5, 2022

New tool more accurately uses genomic data to predict disease risk across diverse populations

Polygenic risk scores (PRS) are promising tools for predicting disease risk, but current versions have built-in bias that can reduce their accuracy in some populations and result in health disparities. However, a team of researchers from Massachusetts General Hospital (MGH), the Broad Institute of MIT and Harvard, and Shanghai Jiao Tong University in Shanghai, China, has designed a new method for generating PRS that more accurately predict disease risk across populations. The team reports its findings in Nature Genetics.

Alterations in a gene's DNA sequence can produce a genetic variant that increases the risk for disease. Some genetic variants are closely linked to certain diseases, such as the BRCA1 mutation and breast cancer. "However, most common human diseases—such as type 2 diabetes, high blood pressure, and depression, for example—are influenced not by single genes, but by hundreds or thousands of genetic variants across the genome. Each variant contributes a small effect," says Tian Ge, Ph.D., an applied mathematician and biostatistician in the Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine at MGH, and co-senior author of the paper. PRS aggregate the effects of genetic variants across the genome and have shown promise for one day being used to predict individual patients' chances of developing diseases. That would allow clinicians to recommend preventive measures and monitor patients closely for early diagnosis and intervention.
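To make the idea of aggregating many small effects concrete, here is a minimal sketch of a polygenic score as a weighted sum: each variant's risk-allele count (0, 1, or 2) is multiplied by an estimated effect size, and the products are added up. The variant names, effect sizes, and genotype below are hypothetical values chosen for illustration; they do not come from the study, and real scores draw on hundreds or thousands of variants.

```python
# Hypothetical per-variant effect sizes (weights); illustrative only, not from the study.
effect_sizes = {"variant_a": 0.02, "variant_b": -0.01, "variant_c": 0.05}


def polygenic_score(genotype, weights):
    """Add up risk-allele counts (0, 1, or 2) weighted by per-variant effect sizes.

    Variants missing from the genotype are treated as a count of 0.
    """
    return sum(weights[v] * genotype.get(v, 0) for v in weights)


# Hypothetical individual: two copies of variant_a, one of variant_b, none of variant_c.
person = {"variant_a": 2, "variant_b": 1, "variant_c": 0}
score = polygenic_score(person, effect_sizes)
print(f"polygenic score: {score:.3f}")
```

Each variant moves the total only slightly, which is why the weights have to be estimated from large genomic studies before a score becomes informative.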

However, a PRS must be "trained" to predict disease risk using data from studies in which genomic information is collected from large groups of individuals. While many disease-causing variants are shared, explains Ge, there are important differences in the genetic basis of a disease between individuals of different ancestries. For example, a common genetic variant that is associated with a specific disease in one population may have a lower frequency or even be missing in other populations. Even when a genetic variant linked to a disease is shared across populations, its effect size, or how much it increases risk, may vary from one ancestral group to another. PRS trained using data from one population therefore often have attenuated, or reduced, performance when applied to other populations.

"A major problem with existing methods for PRS calculation is that, to date, most of the genomic studies used data collected from individuals of European ancestry," says Ge. That creates a Eurocentric bias in existing PRS, he says, producing substantially less-accurate predictions and raising the possibility that they could over- or underestimate disease risk in non-European populations.

Fortunately, investigators have increased efforts to collect genomic data from underrepresented populations. Leveraging these resources, Ge and his colleagues created a new tool called PRS-CSx that can integrate data from multiple populations and account for genetic similarities and differences between them. While there's still significantly more genomic data on individuals of European ancestry, the investigators used computational methods that allowed them to maximize the value of non-European data and improve prediction accuracy in ancestrally diverse individuals.

In the study, the investigators used genomic data from individuals in several different populations to predict a wide range of physical measures (such as height, body mass index, and blood pressure), blood biomarkers (such as glucose and cholesterol), and the risk for schizophrenia. Then they compared the predicted trait or disease risk with actual measures or reported disease status to measure PRS-CSx's prediction accuracy. The study's results demonstrated that PRS-CSx is significantly more accurate than existing PRS tools in non-European populations.
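As a rough illustration of comparing predictions with actual measures, the sketch below computes the squared Pearson correlation between predicted scores and observed trait values. Using this metric is an assumption made for illustration, since the article does not say which accuracy measure the investigators used. The numbers are made up.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Made-up predicted scores and observed trait values (for example, height in cm).
predicted = [0.10, 0.40, 0.35, 0.80, 0.60]
observed = [160.0, 171.0, 168.0, 182.0, 175.0]

# Squared correlation: the share of trait variance tracked by the score.
r_squared = pearson_r(predicted, observed) ** 2
print(f"squared correlation: {r_squared:.3f}")
```

A value near 1 means the score tracks the trait closely, and a value near 0 means it carries little information. Computing the same quantity separately within each population is one way to see the accuracy gap the article describes.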

"The goal of our work was to narrow the gap between the prediction accuracy in underrepresented populations relative to European individuals, and narrow the gap in health disparities when implementing PRS in clinical settings," says Ge, who notes that the new tool will continue to be refined with the hope that clinicians may one day use it to inform treatment choices and make recommendations about patient care.

PRS-CSx could also have a role in basic research, says the study's lead author, Yunfeng Ruan, Ph.D., a postdoctoral research fellow at the Broad Institute of MIT and Harvard. It could be used, for example, to explore gene-environment interactions, such as how the effect of genetic risk would depend on the level of environmental risk factors in global populations.

Even with PRS-CSx, the gap in prediction accuracy between European and non-European populations remains considerable. Broadening sample diversity across global populations is crucial to further improving the prediction accuracy of PRS in diverse populations. "The expansion of non-European genomic resources, coupled with advanced analytic methods like PRS-CSx, will accelerate the equitable deployment of PRS in clinical settings," says Hailiang Huang, Ph.D., a statistical geneticist in the Analytic and Translational Genetics Unit at MGH and the Stanley Center for Psychiatric Research at the Broad Institute, and co-senior author of the paper.

More information: Yunfeng Ruan et al., Improving polygenic prediction in ancestrally diverse populations, Nature Genetics (2022). DOI: 10.1038/s41588-022-01054-7

Journal information: Nature Genetics

Provided by Massachusetts General Hospital

Citation: New tool more accurately uses genomic data to predict disease risk across diverse populations (2022, May 5) retrieved 16 July 2024 from https://medicalxpress.com/news/2022-05-tool-accurately-genomic-disease-diverse.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
