Solving the 'cocktail party problem'

by Keith Hautala

Ever try to make out a quiet voice in a crowded room, where many conversations are happening all at once?

It's what Kevin Donohue calls the "Cocktail Party Problem."

Donohue, the Databeam Professor of Electrical and Computer Engineering at the University of Kentucky, is working on the technology that just might solve it. For more than 25 years he has researched signal-processing systems. This area deals with systems that mimic the 's ability to extract meaning from audiovisual information. A good example is what's going on, behind your eyes and between your ears, when you watch the video above.

"Your brain is making sense out of the sound and images," Donohue says. "Your ears and eyes function as sensors, which send signals to your brain where they are processed to have meaning."

Electrical and computer engineers are not limited to signals that can be seen or heard naturally by humans. They can employ sensors that use ultrasound, x-rays and electromagnetics to tease a meaningful signal out from a .

For the past six years, the main focus of Donohue's work at UK's Center for Visualization and Virtual Environments (the Vis Center) has been in distributed audio systems. This involves arranging systems of microphones in a room to be able to identify sounds, in particular voices, and to isolate and track them using computers.

This video is not supported by your browser at this time.

This technology has applications in surveillance—for example, enabling investigators to home in on a "person of interest" whispering into a cell phone at a noisy airport—as well as in "smart rooms" that "understand" what is happening in an environment and can respond in useful ways, such as taking minutes at meetings, documenting brainstorming sessions, and archiving information for efficient retrieval.

Donohue's work is featured in the above video, produced by the Vis Center as part of its "What's Next" series.

add to favorites email to friend print save as pdf

Related Stories

Deserts 'greening' from rising CO2

Jul 03, 2013

Increased levels of carbon dioxide (CO2) have helped boost green foliage across the world's arid regions over the past 30 years through a process called CO2 fertilisation, according to CSIRO research.

Uncovering how humans hear one voice among many

Mar 11, 2013

Humans have an uncanny ability to zero in on a single voice, even amid the cacophony of voices found in a crowded party or other large gathering of people. Researchers have long sought to identify the precise ...

Recommended for you

Memory in silent neurons

14 hours ago

According to a generally-accepted model of synaptic plasticity, a neuron that communicates with others of the same kind emits an electrical impulse as well as activating its synapses transiently. This electrical ...

Why your favourite song takes you down memory lane

Aug 28, 2014

Music triggers different functions of the brain, which helps explain why listening to a song you like might be enjoyable but a favourite song may plunge you into nostalgia, scientists said on Thursday.

Transcranial Magnetic Stimulation of brain boosts memory

Aug 28, 2014

Stimulating a particular region in the brain via non-invasive delivery of electrical current using magnetic pulses, called Transcranial Magnetic Stimulation, improves memory, reports a new Northwestern Medicine ...

User comments

Adjust slider to filter visible comments by rank

Display comments: newest first

RobertKarlStonjek
not rated yet Jan 07, 2014
Humans generally can't achieve the cocktail party effect from recordings of cocktail parties unless they listen to the recordings many times. Why?

The answer is that in the real cocktail party environment humans turn their head and move in an effort to zoom in on the voice of interest. This is phase locking along with frequency and transient response profiling of the voice of interest. One can not achieve this from a recording...