November 8, 2022

New informatics software helps identify rare genetic variants

by Indiana University School of Medicine

A team of researchers at Indiana University School of Medicine has developed specialized bioinformatics software designed to identify rare genetic variants in whole-genome sequencing studies. Zilin Li, Ph.D., assistant professor of biostatistics and health data science, was the first and co-corresponding author of the recent publication in Nature Methods which details the variant-Set Test for Association using Annotation infoRmation pipeline (or STAARpipeline) framework.

"Even though there are hundreds of millions of rare genetic variants, they have been challenging to study because there was no convenient, scalable and robust pipeline for comprehensive rare-variant analysis, which requires the evaluation of variant sets rather than single variants," Li said.

The STAARpipeline allows researchers to evaluate sets of rare, noncoding genetic variants, which will help enable genetic research. Noncoding genetic variants are parts of the genome that do not code for amino acids, the molecules that combine to form proteins. More than 98 percent of a person's DNA is noncoding.

"Rare variants are observed in 99% of the human genome and are a major source of the missing heritability of complex traits and diseases," Li said.

To use the STAARpipeline, researchers input genotype (genetic code) and phenotype (complex trait or disease code) data into the program. The software analyzes that data and identifies rare variants, grouping the variants into eight functional categories in the gene-centric analysis and into fixed-size sliding windows and newly proposed data-adaptive dynamic windows in the non-gene-centric analysis. The gene-centric analysis focuses on variants in or near genes, while the non-gene-centric analysis focuses on variants in the intergenic region, which is the stretch of DNA located between genes. The program then incorporates multiple variant functional annotations for each variant set to increase analysis power further and summarizes the results for the user.

The research team has already tested the STAARpipeline on large sample sizes, including 40,000 from the National Heart, Lung and Blood Institute (NHLBI) Trans-Omics Precision Medicine Program. During that analysis, STAARpipeline found 49 significant associations in gene-centric noncoding analysis, 35 of which were found based on six new proposed noncoding categories. In addition, data-adaptive size dynamic window analysis detected 43 non-overlapping significant associations in the noncoding genome, 19.4% more than the classical fixed-size sliding window procedure.

The STAARpipeline builds on STAAR, another program Li and his colleagues established, which is a genetic variant-set test for finding connections and associations by using annotation information.

"We believe the STAARpipeline can be expanded to analyze hundreds of millions of variants worth of whole genome sequencing data," Li said. "Since rare variants have been found in 99 percent of the human genome, this program addresses an important gap in informatic analysis."

More information: STAARpipeline: an all-in-one rare-variant tool for biobank-scale whole-genome sequencing data, Nature Methods (2022). DOI: 10.1038/s41592-022-01641-w

Journal information: Nature Methods

Provided by Indiana University School of Medicine

Citation: New informatics software helps identify rare genetic variants (2022, November 8) retrieved 25 April 2024 from https://medicalxpress.com/news/2022-11-informatics-software-rare-genetic-variants.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Novel inherited variants may raise risk for Hodgkin lymphoma

118 shares

Feedback to editors

Genetic variations may predispose people to Parkinson's disease following long-term pesticide exposure, study finds

2 hours ago

Link between depression and cardiovascular disease explained: They partly develop from same gene module

6 hours ago

Study highlights increased risk of second cancers among breast cancer survivors

11 hours ago

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

12 hours ago

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

12 hours ago

A closed-loop drug-delivery system could improve chemotherapy

14 hours ago

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

14 hours ago

Solving the riddle of the sphingolipids in coronary artery disease

14 hours ago

New AI technology estimates brain age using low-cost EEG device

14 hours ago

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

15 hours ago

Load comments (0)

New informatics software helps identify rare genetic variants

Genetic variations may predispose people to Parkinson's disease following long-term pesticide exposure, study finds

Link between depression and cardiovascular disease explained: They partly develop from same gene module

Study highlights increased risk of second cancers among breast cancer survivors

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

A closed-loop drug-delivery system could improve chemotherapy

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

Solving the riddle of the sphingolipids in coronary artery disease

New AI technology estimates brain age using low-cost EEG device

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

Novel inherited variants may raise risk for Hodgkin lymphoma

Using MPRA to determine links between genetic variants and human phenotypes

A new data analysis approach identifies disease-associated splicing variants

New online resource helps connect rare genetic variants to human health and disease

Efficient, systematic genetic analysis helps dissect disease inheritance

Family ties: Inherited genetic variants increase risk of Hodgkin lymphoma

Genetic variations may predispose people to Parkinson's disease following long-term pesticide exposure, study finds

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

Genetic association study opens up new treatment avenues for Pick's disease, a rare form of early-onset dementia

New algorithm could provide early warning for asthma attacks

Study finds AI can develop treatments to prevent 'superbugs'

Immune cells on standby are constantly stimulated by healthy tissue, new study finds

Phys.org

Tech Xplore

Science X

New informatics software helps identify rare genetic variants

Genetic variations may predispose people to Parkinson's disease following long-term pesticide exposure, study finds

Link between depression and cardiovascular disease explained: They partly develop from same gene module

Study highlights increased risk of second cancers among breast cancer survivors

Opioids during pregnancy not linked to substantially increased risk of psychiatric disorders in children

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

A closed-loop drug-delivery system could improve chemotherapy

It's easier now to treat opioid addiction with medication—but use has changed little, study finds

Solving the riddle of the sphingolipids in coronary artery disease

New AI technology estimates brain age using low-cost EEG device

Alteration of brain network condition could predict painful vaso-occlusive crisis in patients with sickle cell disease

Related Stories

Novel inherited variants may raise risk for Hodgkin lymphoma

Using MPRA to determine links between genetic variants and human phenotypes

A new data analysis approach identifies disease-associated splicing variants

New online resource helps connect rare genetic variants to human health and disease

Efficient, systematic genetic analysis helps dissect disease inheritance

Family ties: Inherited genetic variants increase risk of Hodgkin lymphoma

Recommended for you

Genetic variations may predispose people to Parkinson's disease following long-term pesticide exposure, study finds

Research identifies pitfalls and opportunities for generative AI in patient messaging systems

Genetic association study opens up new treatment avenues for Pick's disease, a rare form of early-onset dementia

New algorithm could provide early warning for asthma attacks

Study finds AI can develop treatments to prevent 'superbugs'

Immune cells on standby are constantly stimulated by healthy tissue, new study finds

Newsletter sign up

Donate and enjoy an ad-free experience