Voice prostheses can help patients regain their lost voice

October 24, 2012

Help is on the way for people who suffer from vocal cord dysfunction. Researchers are developing methods that will contribute to manufacturing voice prostheses with improved affective features. For example, for little girls who have lost their voices, the improved artificial voice devices can produce age-appropriate voices, instead of the usual voice of an adult male. These advances in artificial voice production have been made possible by results achieved in a research project led by Professor Samuli Siltanen, results that are good news indeed for the approximately 30,000 Finns with vocal cord problems. Siltanen's project is part of the Academy of Finland's Computational Science Research Programme (LASTU).

One of the fundamental problems of speech signal analysis is to find the vocal cord excitation signal from a digitally recorded speech sound and to determine the shape of the vocal tract, i.e. the mouth and the throat. This so-called glottal inverse filtering of the speech signal requires a highly specialised form of computer calculation. With traditional techniques, inverse filtration is only possible for low-pitch . Women's and children's voices are trickier cases as the higher pitch comes too close in frequency to the lowest resonance of the vocal tract. The novel inverse calculation method developed by Siltanen and his team significantly improves glottal inverse filtering in these cases.

Besides in speech synthesis, inverse filtering is needed in . In speech synthesis, a computer will transform text into synthetic speech. The old-fashioned way is to record individual words and play them one after the other, but this seldom produces natural-sounding speech.

"Most are a result of a specific process. The air flowing between the makes them vibrate. This vibration, if we could hear it, would produce a weird buzzing sound. However, as it moves through the vocal tract, that buzz is transformed into some familiar vowel," explains Siltanen.

Singing, says Siltanen, is a perfect example of this interplay between the vocal cord response and the : "When we sing the vowel 'a' in different pitches, our vocal tracts remain unchanged but the frequency of the excitation changes. On the other hand, we can also sing different vowels in the same pitch, whereby the shape of the tract changes and the excitation stays the same."

Speech recognition is widely used, for example, in mobile phones and automatic telephone services. High-quality glottal inverse filtering improves the success rate of speech recognition in noisy environments.

Related Stories

Recommended for you

Neural efficiency hypothesis confirmed

July 27, 2015

One of the big questions intelligence researchers grapple with is just how differences in intelligence are reflected in the human brain. Researchers at ETH Zurich have succeeded in studying further details relating to suspected ...

How does color blindness affect color preferences?

July 21, 2015

(Medical Xpress)—Dichromacy is a color vision defect in which one of the three types of cone photoreceptors is missing. The condition is hereditary and sex-linked, mostly affecting males. Although researchers have explored ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.