An algorithm is sped up to predict harmful effects from specific gene mutations

May 6, 2016

In 2001, researchers developed a formula, or algorithm, that predicts whether a specific change in a gene sequence can result in harmful effects. While useful, the algorithm was slow; the computations underpinning these predictions used multiple central processing units (CPUs) and a significant amount of time. Now A*STAR researchers have adapted the algorithm to work on a graphical processing unit, a specialized electronic circuit that can process huge amounts of data in parallel.

The faster computational time has allowed the team to expand their "database of predictions" from just the human genome to include more than 200 additional organisms.

Similarities exist between the same of different organisms. Even so, individual organisms have differences in parts of their genomes when compared to other organisms of the same species. Some of these differences affect how proteins function and may lead to disease. By comparing genetic sequences, researchers are able to pinpoint disease-causing gene mutations. But this requires sifting through huge amounts of data.

The SIFT (Sorting Intolerant From Tolerant) predicts which changes in a gene — known as variants — could affect the function of the protein that gene encodes. Using SIFT, A*STAR researchers computed potential changes that can occur to gene sequences in humans to compile a database of predictions. Researchers provide SIFT with the gene variants they are investigating as a possible source of disease. SIFT then looks up the variants in its database of predictions. Variants that are predicted deleterious by SIFT are highlighted and may be considered worthy of further investigation.

Compiling SIFT's database for the involved performing computations on multiple CPUs, which took about four minutes to analyse a single gene sequence.

"I had wanted to make SIFT databases for a lot more organisms, but making the human database took significant time," says systems biologist Pauline Ng from the Genome Institute of Singapore.

SIFT was adapted for use with a to make faster predictions. This allowed the team to expand the scope of the algorithm's predictions to cover more than 200 other organisms. SIFT 4G, the updated algorithm, takes only 2.6 seconds to analyse a compared to SIFT's four minutes.

The updated and algorithm will not only facilitate the identification of disease-causing gene mutations but will help researchers understand the genetic variations that make some animal breeds or plants strains more robust or prone to disease.

Explore further: Yeast against the machine: Bakers' yeast could improve diagnosis

More information: Robert Vaser et al. SIFT missense predictions for genomes, Nature Protocols (2015). DOI: 10.1038/nprot.2015.123

Related Stories

Yeast against the machine: Bakers' yeast could improve diagnosis

April 6, 2016
It's easier than ever to sequence our DNA, but doctors still can't exactly tell from our genomes which diseases might befall us. Professor Fritz Roth is setting out to change this by going to basics—to our billion-year-old ...

Genetic risk factors of disparate diseases share similar biological underpinnings

April 28, 2016
The discovery of shared biological properties among independent variants of DNA sequences offers the opportunity to broaden understanding of the biological basis of disease and identify new therapeutic targets, according ...

Recommended for you

Maternal diet may program child for disease risk, but better nutrition later can change that

October 20, 2017
Research has shown that a mother's diet during pregnancy, particularly one that is high-fat, may program her baby for future risk of certain diseases such as diabetes. A new study from nutrition researchers at the University ...

New gene editing approach for alpha-1 antitrypsin deficiency shows promise

October 20, 2017
A new study by scientists at UMass Medical School shows that using a technique called "nuclease-free" gene editing to correct cells with the mutation that causes a rare liver disease leads to repopulation of the diseased ...

Researchers drill down into gene behind frontotemporal lobar degeneration

October 19, 2017
Seven years ago, Penn Medicine researchers showed that mutations in the TMEM106B gene significantly increased a person's risk of frontotemporal lobar degeneration (FTLD), the second most common cause of dementia in those ...

New clues to treat Alagille syndrome from zebrafish

October 18, 2017
A new study led by researchers at Sanford Burnham Prebys Medical Discovery Institute (SBP) identifies potential new therapeutic avenues for patients with Alagille syndrome. The discovery, published in Nature Communications, ...

Genetic variants associated with obsessive-compulsive disorder identified

October 18, 2017
(Medical Xpress)—An international team of researchers has found evidence of four genes that can be linked to obsessive-compulsive disorder (OCD). In their paper published in the journal Nature Communications, the group ...

An architect gene is involved in the assimilation of breast milk

October 17, 2017
A family of "architect" genes called Hox coordinates the formation of organs and limbs during embryonic life. Geneticists from the University of Geneva (UNIGE) and the Swiss Federal Institute of Technology in Lausanne (EPFL), ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.