November 15, 2023 feature

Study unveils similarities between the auditory pathway and deep learning models for processing speech

by Ingrid Fadelli, Medical Xpress

The human auditory pathway is a highly sophisticated biological system that includes both physical structures and brain regions specialized in the perception and processing of sounds. The sounds that humans pick up through their ears are processed in various brain regions, including the cochlear and superior olivary nuclei, the lateral lemniscus, the inferior colliculus and the auditory cortex.

Over the past few decades, computer scientists have developed increasingly advanced computational models that can process sounds and speech, thus artificially replicating the function of the human auditory pathway. Some of these models have achieved remarkable results and are now used worldwide, for instance allowing voice assistants (e.g., Alexa and Siri) to understand users' requests.

Researchers at the University of California, San Francisco, recently set out to compare these models with the human auditory pathway. Their paper, published in Nature Neuroscience, revealed striking similarities between how deep neural networks and the biological auditory pathway process speech.

"AI speech models have become very good in recent years because of deep learning in computers," Edward F. Chang, one of the authors of the paper, told Medical Xpress. "We were interested to see if what the models learn is similar to how the human brain processes speech."

To compare deep neural networks to the human auditory pathway, the researchers first looked at the speech representations produced by the models, that is, the ways in which the models encode speech in their different layers.

Subsequently, Chang and his colleagues compared these representations to the activity that takes place in the different parts of the brain associated with the processing of sounds. Remarkably, they found a correlation between the two, unveiling possible similarities between artificial and biological speech processing.

"We used several commercial deep learning models of speech and compared how the artificial neurons in those models compared to real neurons in the brain," Chang explained. "We compared how speech signals are processed across the different layers, or processing stations, in the neural network, and directly compared those to processing across different brain areas."
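As a rough illustration of this kind of layer-by-layer comparison, the sketch below fits a simple linear (ridge) encoding model from each layer's features to a recorded response and scores it by how well it predicts held-out data. This is not the authors' pipeline: the data are synthetic, the choice of ridge regression is an assumption, and all names (`ridge_fit`, `encoding_score`, `simulate_layer_scores`) are hypothetical.

```python
import numpy as np


def ridge_fit(X, y, alpha=1.0):
    """Closed-form ridge weights: (X^T X + alpha * I)^-1 X^T y."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(n_features), X.T @ y)


def encoding_score(features, response, alpha=1.0, train_frac=0.8):
    """Fit on the first part of the data and return the Pearson correlation
    between predicted and actual responses on the held-out remainder."""
    n_train = int(len(response) * train_frac)
    # Center using training statistics only, to avoid leaking test data.
    x_mean = features[:n_train].mean(axis=0)
    y_mean = response[:n_train].mean()
    weights = ridge_fit(features[:n_train] - x_mean,
                        response[:n_train] - y_mean, alpha)
    predicted = (features[n_train:] - x_mean) @ weights + y_mean
    return float(np.corrcoef(predicted, response[n_train:])[0, 1])


def simulate_layer_scores(seed=0, n_samples=500, n_dims=20,
                          n_layers=4, source_layer=2):
    """Synthetic stand-in: random 'layer activations' and one 'recording
    site' whose response is driven by a single layer plus noise."""
    rng = np.random.default_rng(seed)
    layers = [rng.standard_normal((n_samples, n_dims))
              for _ in range(n_layers)]
    mixing = rng.standard_normal(n_dims)
    response = layers[source_layer] @ mixing + 0.5 * rng.standard_normal(n_samples)
    return [encoding_score(layer, response) for layer in layers]


scores = simulate_layer_scores()
best_layer = int(np.argmax(scores))
print("held-out correlation per layer:", [round(s, 2) for s in scores])
print("best-matching layer:", best_layer)
```

In a real analysis the features would be activations extracted from a trained speech model and the response would be neural activity recorded while a listener hears the same speech. Repeating the fit for each recording site shows which layer best predicts which part of the pathway.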

Interestingly, the researchers also found that models trained to process speech in either English or Mandarin could predict the responses in the brain of native speakers of the corresponding language. This suggests that deep learning techniques process speech similarly to the human brain, also encoding language-specific information.

"AI models that capture context and learn the important statistical properties of speech sounds do well at predicting brain responses," Chang said. "In fact, they are better than traditional linguistic models. The implication is that there is huge potential to harness AI to understand the human brain in the coming years."

The recent work by Chang and his collaborators improves the general understanding of deep neural networks designed to decode human speech, showing that they might be more like the biological auditory system than researchers had anticipated. In the future, it could guide the development of further computational techniques designed to artificially reproduce the neural underpinnings of audition.

"We are now trying to understand how the AI models can be redesigned to better understand the brain. Right now, we are just getting started and there is so much to learn," said Chang.

More information: Yuanning Li et al, Dissecting neural computations in the human auditory pathway using deep neural networks for speech, Nature Neuroscience (2023). DOI: 10.1038/s41593-023-01468-4

Journal information: Nature Neuroscience

Citation: Study unveils similarities between the auditory pathway and deep learning models for processing speech (2023, November 15) retrieved 27 April 2024 from https://medicalxpress.com/news/2023-11-unveils-similarities-auditory-pathway-deep.html

