April 12, 2021

More than the sum of mutations: 165 new cancer genes identified with the help of machine learning

A new algorithm can predict which genes cause cancer, even if their DNA sequence is not changed. A team of researchers in Berlin combined a wide variety of data, analyzed it with "Artificial Intelligence" and identified numerous cancer genes. This opens up new perspectives for targeted cancer therapy in personalized medicine and for the development of biomarkers.

In cancer, cells get out of control. They proliferate and push their way into tissues, destroying organs and thereby impairing essential vital functions. This unrestricted growth is usually induced by an accumulation of DNA changes in cancer genes—i.e. mutations in these genes that govern the development of the cell. But some cancers have only very few mutated genes, which means that other causes lead to the disease in these cases.

A team of researchers at the Max Planck Institute for Molecular Genetics (MPIMG) in Berlin and at the Institute of Computational Biology of Helmholtz Zentrum München developed a new algorithm using machine learning technology to identify 165 previously unknown cancer genes. The sequences of these genes are not necessarily altered—apparently, already a dysregulation of these genes can lead to cancer. All of the newly identified genes interact closely with well-known cancer genes and have been shown to be essential for the survival of tumor cells in cell culture experiments.

Additional targets for personalized medicine

The algorithm, dubbed "EMOGI" for Explainable Multi-Omics Graph Integration, can also explain the relationships in the cell's machinery that make a gene a cancer gene. As the team of researchers headed by Annalisa Marsico describe in the journal Nature Machine Intelligence, the software integrates tens of thousands of data sets generated from patient samples. These contain information about DNA methylations, the activity of individual genes and the interactions of proteins within cellular pathways in addition to sequence data with mutations. In these data, a deep-learning algorithm detects the patterns and molecular principles that lead to the development of cancer.

"Ideally, we obtain a complete picture of all cancer genes at some point, which can have a different impact on cancer progression for different patients", says Marsico, head of a research group at the MPIMG until recently and now at Helmholtz Zentrum München. "This is the foundation for personalized cancer therapy."

Unlike with conventional cancer treatments such as chemotherapy, personalized therapy approaches tailor medication precisely to the type of tumor. "The goal is to select the best therapy for each patient—that is, the most effective treatment with the fewest side effects. Additionally, we would be able to identify cancers already at early stages, based on their molecular characteristics."

"Only if we know the causes of the disease will we be able to counteract or correct them effectively," the researcher says. "That's why it's so important to identify as many mechanisms as possible that can induce cancers."

Better results by combination

"Until now, most research has focused on pathogenic changes in the genetic sequence, i.e., in the blueprint of the cell," says Roman Schulte-Sasse, a doctoral student on Marsico's team and first author of the publication. "At the same time, it has become apparent in recent years that epigenetic perturbations or dysregulated gene activity can lead to cancer as well."

This is why the researchers merged sequence data that reflect faults in the blueprint with information that represents events inside the cell. Initially, the scientists confirmed that mutations, or the multiplication of segments of the genome, are indeed the main drivers of cancer. Then, in a second step, they pinpointed gene candidates that are in a less direct context to the actual cancer-driving gene.

"For instance, we found genes whose sequence is mostly unchanged in cancer, and yet are indispensable to the tumor because they regulate energy supply," Schulte-Sasse says. These genes are out of control by other means, e.g. because of chemical changes on the DNA like methylations. These modifications leave the sequence information intact but govern a gene's activity. "Such genes are promising drug targets, but because they operate in the background, we can only find them by using complex algorithms."

In search of hints for further studies

The researcher's new program adds a considerable number of new entries to the list of suspected cancer genes, which has grown to between 700 and 1,000 in recent years. It was only through a combination of bioinformatics analysis and the newest Artificial Intelligence (AI) methods that the researchers were able to track down the hidden genes.

"The interactions of proteins and genes can be mapped as a mathematical network, known as a graph," Schulte-Sasse says. "You can think of it like trying to guess a railroad network; each station corresponds to a protein or gene, and each interaction among them is the train connection."

With the help of deep learning—the very algorithms that have helped artificial intelligence make a breakthrough in recent years—the researchers were able to discover even those train connections that had previously gone unnoticed. Schulte-Sasse had the computer analyze tens of thousands of different network maps from 16 different cancer types, each containing between 12,000 and 19,000 data points.

Suitable for other types of diseases as well

Hidden in the data are many more interesting details. "We see patterns that are dependent on the particular cancer and tissue" Marsico says. "We see this as evidence that tumors are triggered by different molecular mechanisms in different organs."

The EMOGI program is not limited to cancer, the researchers emphasize. In theory, it can be used to integrate diverse sets of biological data and find patterns there, explains Marsico. "It could be useful to apply our algorithm for similarly complex diseases for which multifaceted data are collected and where genes play an important role. An example might be complex metabolic diseases such as diabetes."

More information: Roman Schulte-Sasse et al. Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms, Nature Machine Intelligence (2021). DOI: 10.1038/s42256-021-00325-y

Journal information: Nature Machine Intelligence

Provided by Max Planck Society

Citation: More than the sum of mutations: 165 new cancer genes identified with the help of machine learning (2021, April 12) retrieved 26 April 2024 from https://medicalxpress.com/news/2021-04-sum-mutations-cancer-genes-machine.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New prediction algorithm identifies previously undetected cancer driver genes

125 shares

Feedback to editors

Gene linked to epilepsy and autism decoded in new study

3 hours ago

Blood test finds knee osteoarthritis up to eight years before it appears on X-rays

3 hours ago

Researchers find pregnancy cytokine levels impact fetal brain development and offspring behavior

3 hours ago

Study finds biomarkers for psychiatric symptoms in patients with rare genetic condition 22q

3 hours ago

Clinical trial evaluates azithromycin for preventing chronic lung disease in premature babies

4 hours ago

Scientists report that new gene therapy slows down amyotrophic lateral sclerosis disease progression

4 hours ago

Using stem cell-derived heart muscle cells to advance heart regenerative therapy

5 hours ago

Analysis identifies 50 new genomic regions associated with kidney cancer risk

5 hours ago

Illusion demystifies the way vision works: Experiments imply brightness perception occurs deeper in brain than thought

5 hours ago

How buildings influence the microbiome and human health

5 hours ago

Load comments (0)

More than the sum of mutations: 165 new cancer genes identified with the help of machine learning

Additional targets for personalized medicine

Better results by combination

In search of hints for further studies

Suitable for other types of diseases as well

Gene linked to epilepsy and autism decoded in new study

Blood test finds knee osteoarthritis up to eight years before it appears on X-rays

Researchers find pregnancy cytokine levels impact fetal brain development and offspring behavior

Study finds biomarkers for psychiatric symptoms in patients with rare genetic condition 22q

Clinical trial evaluates azithromycin for preventing chronic lung disease in premature babies

Scientists report that new gene therapy slows down amyotrophic lateral sclerosis disease progression

Using stem cell-derived heart muscle cells to advance heart regenerative therapy

Analysis identifies 50 new genomic regions associated with kidney cancer risk

Illusion demystifies the way vision works: Experiments imply brightness perception occurs deeper in brain than thought

How buildings influence the microbiome and human health

New prediction algorithm identifies previously undetected cancer driver genes

New computational method detects disrupted pathways in cancer

Algorithm turns cancer gene discovery on its head

The co-occurrence of cancer driver genes, key to precision medicine

Study eyes partially lost essential genes as possible targets in cancer

Cancer susceptibility genes

Scientists report that new gene therapy slows down amyotrophic lateral sclerosis disease progression

Gene linked to epilepsy and autism decoded in new study

Analysis identifies 50 new genomic regions associated with kidney cancer risk

People with rare longevity mutation may also be protected from cardiovascular disease

Study finds biomarkers for psychiatric symptoms in patients with rare genetic condition 22q

Study finds RNA modification is responsible for disruption of mitochondrial protein synthesis in Alzheimer's disease

Phys.org

Tech Xplore

Science X

More than the sum of mutations: 165 new cancer genes identified with the help of machine learning

Additional targets for personalized medicine

Better results by combination

In search of hints for further studies

Suitable for other types of diseases as well

Gene linked to epilepsy and autism decoded in new study

Blood test finds knee osteoarthritis up to eight years before it appears on X-rays

Researchers find pregnancy cytokine levels impact fetal brain development and offspring behavior

Study finds biomarkers for psychiatric symptoms in patients with rare genetic condition 22q

Clinical trial evaluates azithromycin for preventing chronic lung disease in premature babies

Scientists report that new gene therapy slows down amyotrophic lateral sclerosis disease progression

Using stem cell-derived heart muscle cells to advance heart regenerative therapy

Analysis identifies 50 new genomic regions associated with kidney cancer risk

Illusion demystifies the way vision works: Experiments imply brightness perception occurs deeper in brain than thought

How buildings influence the microbiome and human health

Related Stories

New prediction algorithm identifies previously undetected cancer driver genes

New computational method detects disrupted pathways in cancer

Algorithm turns cancer gene discovery on its head

The co-occurrence of cancer driver genes, key to precision medicine

Study eyes partially lost essential genes as possible targets in cancer

Cancer susceptibility genes

Recommended for you

Scientists report that new gene therapy slows down amyotrophic lateral sclerosis disease progression

Gene linked to epilepsy and autism decoded in new study

Analysis identifies 50 new genomic regions associated with kidney cancer risk

People with rare longevity mutation may also be protected from cardiovascular disease

Study finds biomarkers for psychiatric symptoms in patients with rare genetic condition 22q

Study finds RNA modification is responsible for disruption of mitochondrial protein synthesis in Alzheimer's disease

Newsletter sign up

Donate and enjoy an ad-free experience