August 18, 2023

Diagnosis of voice condition from call audio

Assistant Professor Yuya Hosoda of the Center for IT-Based Education (CITE), Toyohashi University of Technology, has developed a method for estimating the pitch of human vocal cord vibrations from call audio.

In this method, the pitch is estimated by integrating, on the complex plane, the feature quantities extracted from the amplitude and phase spectra of speech. Through experiments, we have demonstrated that the proposed method is not only effective for call audio whose frequency band is restricted by communication standards, but also works robustly in an environment with background noise. The research is published in the journal IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Early diagnosis of dysarthria, an early symptom of neurodegenerative diseases such as Parkinson's disease, is desirable to keep these diseases from worsening.

Dysarthria is characterized by tremors in the voice and disturbed breathing. Although clinical tests diagnose symptoms from the patient's voice, they are time-consuming and labor-intensive. Additionally, conducting face-to-face interviews in remote locations such as mountainous areas is difficult. Therefore, in this research, we aim to develop a system that automatically diagnoses dysarthria through telemedicine, with ward rounds performed via communication devices.

In patients with dysarthria, abnormalities occur during vocalization, the process in which air released from the lungs makes the vocal cords vibrate and the resulting sound is shaped into voice in the throat and oral cavity. In this study, our purpose is to estimate the vibration period (pitch) in order to diagnose the condition of these vocal cord vibrations.

Pitch measurement methods that are robust against background noise have previously been devised based on feature quantities of the amplitude spectrum obtained via the frequency analysis of speech. However, because communication standards restrict the frequency band, call audio used in telemedicine lacks part of the amplitude spectrum. Extracting feature quantities from an amplitude spectrum with reduced information can therefore lead to errors in pitch estimation.
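To illustrate how band limitation removes part of the amplitude spectrum, the following minimal sketch lists which harmonics of a voice survive a telephone-style passband. The passband of roughly 300 to 3400 Hz is an assumption based on conventional narrowband telephony; the article does not state the band, and the function name and example pitch are hypothetical.

```python
def surviving_harmonics(f0_hz, band_hz=(300.0, 3400.0), max_harmonic=40):
    """Return the harmonic numbers h whose frequency h * f0 lies inside the band.

    band_hz is an ASSUMED narrowband telephony passband, not a value from the article.
    """
    low, high = band_hz
    return [h for h in range(1, max_harmonic + 1) if low <= h * f0_hz <= high]


# Hypothetical speaker with a pitch of 120 Hz.
kept = surviving_harmonics(120.0)
lost_low = list(range(1, kept[0]))  # harmonics cut off below the passband

print("kept harmonics:", kept[0], "to", kept[-1])
print("lost low harmonics:", lost_low)
```

Under these assumptions the fundamental itself (120 Hz) and the second harmonic (240 Hz) fall below the passband, so an estimator that relies only on the amplitude spectrum has to infer the pitch from the spacing of the higher harmonics that remain.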

In this research, we propose a method that extracts additional feature quantities from the phase spectrum, a by-product of frequency analysis, alongside those from the amplitude spectrum. By deriving a relational equation between the phase shift and pitch in the time and frequency directions, we verified that pitch can be estimated by applying the observed phase shift to this equation.
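The article does not give the relational equation itself. As a rough illustration of the general idea in the time direction only, the sketch below uses the standard phase-vocoder relation: for a steady sinusoid of frequency f, the phase of a spectral bin advances by 2π·f·hop/fs between two analysis frames that are hop samples apart. This is a textbook technique swapped in for illustration, not the paper's algorithm, and the function names, frame size, hop size, and test signal are all assumptions.

```python
import cmath
import math


def dft_bin(frame, k):
    """Single DFT coefficient of `frame` at bin k."""
    n = len(frame)
    return sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame))


def frequency_from_phase_shift(signal, fs, n_fft, hop, k):
    """Estimate the frequency near bin k from the phase advance between two frames."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / n_fft) for i in range(n_fft)]  # Hann
    frame1 = [signal[i] * window[i] for i in range(n_fft)]
    frame2 = [signal[i + hop] * window[i] for i in range(n_fft)]
    observed = cmath.phase(dft_bin(frame2, k)) - cmath.phase(dft_bin(frame1, k))
    expected = 2 * math.pi * k * hop / n_fft  # advance at the bin's centre frequency
    # Wrap the difference into [-pi, pi) to undo the 2*pi ambiguity of the phase.
    deviation = (observed - expected + math.pi) % (2 * math.pi) - math.pi
    return (expected + deviation) * fs / (2 * math.pi * hop)


# Hypothetical test signal: a pure 150 Hz tone sampled at 8 kHz.
fs, n_fft, hop, true_f0 = 8000, 256, 32, 150.0
tone = [math.cos(2 * math.pi * true_f0 * i / fs) for i in range(n_fft + hop)]
k = round(true_f0 * n_fft / fs)  # nearest bin to the tone
estimated_f0 = frequency_from_phase_shift(tone, fs, n_fft, hop, k)
print(round(estimated_f0, 1))
```

The bin spacing here is 31.25 Hz, yet the phase advance recovers the tone's frequency far more finely than that, which is why the phase spectrum can carry pitch information that the amplitude spectrum alone does not.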

Based on this finding, we extracted new feature quantities from the phase spectrum that quantitatively evaluate the degree of fit to the relational equation. Finally, by integrating them on the complex plane with the feature quantities extracted from the amplitude spectrum, we compensated for the lack of feature quantities that arises in the pitch estimation of call audio while maintaining robustness against background noise.

In previous studies that used only the amplitude spectrum, band limitation reduced the amount of available information, and the pitch was estimated to be higher than its original value. In the proposed method, by contrast, the pitch is accurately estimated from call audio using the feature quantities related to both the amplitude and phase spectra.

Further, the gross pitch error (GPE), an evaluation index that indicates the percentage of segments where errors occurred, improved to 9.5% with the proposed method, compared with 42.2% in the previous study. In addition, even for call audio with background noise, this method achieved a GPE of 15.2%, demonstrating its robustness.
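As a minimal sketch of how a GPE figure of this kind can be computed, the example below counts the voiced segments whose estimated pitch deviates from a reference pitch by more than a tolerance. The 20% tolerance is a commonly used convention and an assumption here, since the article does not state the threshold; the function name and the sample values are hypothetical and are not data from the study.

```python
def gross_pitch_error(reference_hz, estimated_hz, tolerance=0.2):
    """Percentage of voiced segments whose estimate is off by more than `tolerance`.

    A reference value of 0 marks an unvoiced segment, which is skipped.
    The 20% default tolerance is an ASSUMED convention, not taken from the article.
    """
    voiced = [(r, e) for r, e in zip(reference_hz, estimated_hz) if r > 0]
    if not voiced:
        raise ValueError("no voiced segments to evaluate")
    errors = sum(1 for r, e in voiced if abs(e - r) > tolerance * r)
    return 100.0 * errors / len(voiced)


# Hypothetical values: the second segment is an octave error, the third is unvoiced.
reference = [120.0, 122.0, 0.0, 125.0, 130.0]
estimate = [121.0, 244.0, 90.0, 124.0, 131.0]
gpe = gross_pitch_error(reference, estimate)
print(gpe)  # 25.0: one gross error among four voiced segments
```

The octave error in the second segment is the kind of overestimate described above for amplitude-only methods on band-limited audio.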

Future outlook

Although this study focused on pitch estimation to detect abnormalities in vocal cord vibrations, respiratory and oral abnormalities also cause dysarthria. To detect these symptoms, methods that extract feature quantities from the amplitude spectrum have been devised. However, the use of the phase spectrum has not been sufficiently validated.

In the future, we will work on extracting relevant feature quantities from the phase spectrum for these other abnormalities as well. Further, by comprehensively analyzing these feature quantities, we aim to develop a dysarthria diagnostic system that can function effectively with telemedicine.

More information: Yuya Hosoda et al, Complex-Domain Pitch Estimation Algorithm for Narrowband Speech Signals, IEEE/ACM Transactions on Audio, Speech, and Language Processing (2023). DOI: 10.1109/TASLP.2023.3278488

Provided by Toyohashi University of Technology

Citation: Diagnosis of voice condition from call audio (2023, August 18) retrieved 17 July 2024 from https://medicalxpress.com/news/2023-08-diagnosis-voice-condition-audio.html
