November 17, 2017

A math concept from the engineering world points to a way of making massive transcriptome studies more efficient

by Tom Ulrich, Broad Institute of MIT and Harvard

To most people, data compression refers to shrinking existing data—say from a song or picture's raw digital recording—by removing some data, but not so much as to render it unrecognizable (think MP3 or JPEG files). Now, biologists propose to bring a kind of data compression to molecular biology.

A Broad research team has proposed a new compression approach for gene expression (a.k.a. "transcriptomic") experiments, where the data volume per experiment is growing dramatically. Their approach—reported in Cell—leverages a mathematical framework called compressed sensing to collect a relatively small amount of data in the laboratory and mathematically "decompress" it. The result is a very close representation of a cell or tissue's full expression profile.

Engineers can use compressed sensing to reconstruct a signal's full content from just a few direct measurements, making data acquisition faster and cheaper. Some MRI machines, for example, use compressed sensing to scan patients more quickly.

To apply compressed sensing to transcriptomes, the team—led by graduate student Brian Cleary, postdoctoral researcher Le Cong, institute director Eric Lander, and core institute member and Klarman Cell Observatory (KCO) director Aviv Regev—relied on the fact that expression is:

modular—cells do not express genes individually, but as sets in discrete programs—and
sparse—each cell expresses only a limited number of modules at a time.

Taking advantage of these properties, the team reasoned that it might be possible to reconstruct transcriptomes from a small number of "composite" expression measurements, each of which sums the weighted abundances of multiple genes into a single value, instead of measuring every gene's expression individually. The number of composite measurements could be up to 100-fold smaller than the number of genes. The researchers then developed an algorithm called BCS-SMAF (for Blind Compressed Sensing-Sparse Module Activity Factorization) that uses randomly collected composite measurements to identify active expression "modules."
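
To make the idea of a composite measurement concrete, here is a minimal Python sketch. It is an illustration only, not the team's code: the module sizes, gene counts, Gaussian weights, and every function name (make_modules, expression_profile, composite_measurements) are assumptions chosen for the example. It builds a modular, sparse expression profile and then summarizes 1,000 genes with 10 weighted sums.

```python
import random

def make_modules(n_genes, n_modules, genes_per_module, rng):
    """Hypothetical modules: each is a small set of genes with positive weights."""
    modules = []
    for _ in range(n_modules):
        members = rng.sample(range(n_genes), genes_per_module)
        modules.append({g: rng.uniform(0.5, 2.0) for g in members})
    return modules

def expression_profile(n_genes, modules, activity):
    """Add up the active modules to get one expression value per gene."""
    x = [0.0] * n_genes
    for m, level in activity.items():
        for g, w in modules[m].items():
            x[g] += level * w
    return x

def composite_measurements(x, n_measurements, rng):
    """Each composite measurement is a single weighted sum over all genes."""
    weights = [[rng.gauss(0.0, 1.0) for _ in x] for _ in range(n_measurements)]
    y = [sum(w * v for w, v in zip(row, x)) for row in weights]
    return weights, y

rng = random.Random(0)
n_genes, n_modules = 1000, 50
modules = make_modules(n_genes, n_modules, genes_per_module=20, rng=rng)

# Sparse activity: only 3 of the 50 modules are switched on in this cell.
activity = {3: 2.0, 17: 1.0, 41: 0.5}
x = expression_profile(n_genes, modules, activity)

# 10 composite measurements for 1000 genes, i.e. 100-fold fewer numbers.
weights, y = composite_measurements(x, n_measurements=10, rng=rng)
print(len(x), "genes summarized by", len(y), "composite measurements")
```

The sketch covers only the measurement step. Whether a given number of composites is enough to recover the full profile depends on how many modules are active and on the recovery algorithm.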

The algorithm then reconstructs individual genes' expression within each module. Interestingly, BCS-SMAF doesn't need prior information about which genes might constitute a module (e.g., cellular respiration genes or mTOR pathway genes).
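
The sketch below shows the recovery step in a deliberately simplified setting. Unlike BCS-SMAF, which learns the modules blindly, it assumes the module definitions are already known, and it swaps in orthogonal matching pursuit (a standard greedy compressed-sensing solver) to find which modules are active. All sizes, the weight pattern, and the helper names (dot, solve, omp) are assumptions made for illustration.

```python
import math
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def solve(G, b):
    """Solve the small square system G c = b by Gaussian elimination."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(G, b)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    c = [0.0] * n
    for i in reversed(range(n)):
        c[i] = (M[i][n] - sum(M[i][j] * c[j] for j in range(i + 1, n))) / M[i][i]
    return c

def omp(columns, y, k):
    """Orthogonal matching pursuit: greedily pick k columns that explain y."""
    support, coef, residual = [], [], y[:]
    for _ in range(k):
        # Score each unused column by its normalized correlation with the residual.
        scores = [0.0 if j in support else abs(dot(col, residual)) / math.sqrt(dot(col, col))
                  for j, col in enumerate(columns)]
        support.append(max(range(len(columns)), key=scores.__getitem__))
        chosen = [columns[j] for j in support]
        # Least-squares refit of y on all chosen columns (normal equations).
        G = [[dot(a, b) for b in chosen] for a in chosen]
        coef = solve(G, [dot(a, y) for a in chosen])
        residual = [yi - sum(c * col[i] for c, col in zip(coef, chosen))
                    for i, yi in enumerate(y)]
    return dict(zip(support, coef))

rng = random.Random(1)
n_modules, genes_per_module, n_measurements = 100, 5, 50
pattern = [1.0, 0.8, 1.2, 0.5, 1.5]      # assumed gene weights inside every module
n_genes = n_modules * genes_per_module   # 500 genes, modules do not overlap

def profile(activity):
    """Expand module activity levels into a gene-level expression vector."""
    x = [0.0] * n_genes
    for m, level in activity.items():
        for i, p in enumerate(pattern):
            x[m * genes_per_module + i] = level * p
    return x

w_true = {12: 4.0, 57: 2.0}              # only 2 of 100 modules are active
x_true = profile(w_true)

# 50 random composite measurements of the 500-gene profile.
A = [[rng.gauss(0.0, 1.0) for _ in range(n_genes)] for _ in range(n_measurements)]
y = [dot(row, x_true) for row in A]

# One column per module: how each measurement responds to one unit of that module.
D = [[sum(p * row[m * genes_per_module + i] for i, p in enumerate(pattern)) for row in A]
     for m in range(n_modules)]

w_hat = omp(D, y, k=2)
x_hat = profile(w_hat)
print("active modules found:", sorted(w_hat))
print("largest gene-level error:", max(abs(a - b) for a, b in zip(x_hat, x_true)))
```

Here 50 measurements stand in for 500 genes. The harder problem BCS-SMAF addresses, learning the modules themselves from the composite data, is outside the scope of this sketch.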

In proof-of-concept experiments using various kinds of data (including simulated, published, or existing single-cell and bulk transcriptome data), the team found that BCS-SMAF produced composite-based expression profiles that closely fit the true profiles.
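
One common way to quantify how closely a reconstructed profile fits the true one is the Pearson correlation between the two. The sketch below assumes that metric for illustration (the article does not say which measure the team used), and the numbers are made up.

```python
import math

def pearson(a, b):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

# Made-up expression values for six genes: measured directly vs. reconstructed.
true_profile = [0.0, 2.0, 5.0, 1.0, 0.0, 8.0]
reconstructed = [0.1, 1.8, 5.3, 0.9, 0.2, 7.6]

r = pearson(true_profile, reconstructed)
print(f"correlation between true and reconstructed profile: {r:.3f}")
```

A value near 1 means the reconstruction tracks the true profile closely. Values near 0 would indicate little agreement.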

If validated in larger studies, the approach could provide deep insights into cells' active circuitry at greatly reduced experimental and computational costs—benefits that could spill over to other data-intensive biological fields such as proteomics or metabolomics.

More information: Brian Cleary et al. Efficient Generation of Transcriptomic Profiles by Random Composite Measurements, Cell (2017). DOI: 10.1016/j.cell.2017.10.023

Journal information: Cell

Provided by Broad Institute of MIT and Harvard

Citation: A math concept from the engineering world points to a way of making massive transcriptome studies more efficient (2017, November 17) retrieved 17 July 2024 from https://medicalxpress.com/news/2017-11-math-concept-world-massive-transcriptome.html

