UW News

May 11, 2001

New technique for sound transmission makes sweet music on Internet

A new technology for transmitting audio that taps into the subtleties of human sound recognition could make listening to your favorite song on the Internet as clear and uninterrupted as tuning in on a radio — even if your computer is a 90-pound weakling in the bandwidth department.

The technology, fine-grain scalable audio encoding, also has potential for audio fingerprinting, currently a hot issue in the recording industry, according to Les Atlas, professor of electrical engineering at the University of Washington. He developed the technique with graduate student Mark Vinton and is presenting the pair’s findings this week in Salt Lake City at the annual International Conference of Acoustic, Speech and Signal Processing.

The two filed for a provisional patent last week.

Atlas said the technology could help computer users in a couple of ways.

“If you’re listening to music on your computer and all you have to plug into is a phone line, then this will make it sound quite a bit better, without interruptions,” he said. “And if you have something with more bandwidth, like a DSL line, this would allow you to instantly access stations, much like the push buttons on your FM radio, without waiting for a buffer. It may not sound quite as good for the first few seconds, but would be easily recognizable and in a few seconds would sound as good as a CD.”

Fine-grain scalable works by prioritizing the signal according to what’s most important for recognition by the human auditory system. As bandwidth gets pinched, less important information is shed. The signal drops slightly in quality, but the drop is gradual.

“For a typical person, the change would be almost unnoticeable,” Atlas said. “If you’re a musician, you might note a difference, but it wouldn’t be a grating change.” And the audio wouldn’t cut out, as is often the case with such popular all-or-nothing tools as MP3.

The technology is designed to take advantage of a little-explored aspect of the human auditory system.

“What we’ve confirmed is there appears to be a second dimension in the auditory system,” Atlas explained. That dimension appears to prioritize sounds according to duration — the longer, slower aspects of a sound appear to be more critical to recognition. The algorithms that make fine-grain scalable work select and prioritize those aspects.

“It’s not standard frequency that’s critical — it’s modulation, or duration,” Atlas said. “This is the first time we’ve been able to code in that dimension.”

The technology also has promising applications for audio fingerprinting, or the determination of unique identifiers within songs. The fingerprints can be used for copyright management and fraud detection, and have been a hot topic in the music industry as companies and artists scramble to find ways to protect their work in today’s digital free-for-all market.

Atlas and Vinton said they’re experimenting to see if the technique will work with video as well. And they’ll continue to push the audio applications to make tuning in your favorite radio station on your computer a viable option.

“We’re reaching the point where we shouldn’t have to have FM radios on the shelves in our office,” Atlas said. “Why not get there?”

###

Atlas will be in Salt Lake City through Saturday and can be contacted via e-mail at atlas@ee.washington.edu or at the Wyndham Hotel Salt Lake City at (801) 531-7500 (voice) or (801) 328-1289 (fax). His telephone number at the UW is (206) 685-1315.