Printer Friendly
The Free Library
5,675,364 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

The machine's got rhythm: computers are learning to understand music and join the band.


Christopher Raphael begins the third movement of a Mozart oboe oboe (ō`bō, ō`boi) [Ital., from Fr. hautbois] or hautboy (ō`boi, hō`–), woodwind instrument of conical bore, its mouthpiece having a double reed.  quartet. As his oboe sounds its second note, his three fellow musicians come in right on cue. Later, he slows down and embellishes with a trill trill, in music, ornament consisting of the more or less rapid alternation of two adjacent notes. Indicated by any of several conventional symbols, it varies in speed and duration and in the manner of its beginning and ending according to context. , and the other players stay right with him. His accompanists don't complain or tire when he practices a passage over and over. And when he's done, he switches them off.

After all, his fellow musicians exist only as a recording. A software package, written by Raphael, controls their tempo tempo [Ital.,=time], in music, the speed of a composition. The composer's intentions as to tempo are conventionally indicated by a set of Italian terms, of which the principal ones are presto (very fast), vivace (lively), allegro (fast),  and makes them respond to the soloist's cues.

Until recently, computers have had little insight into music. They've merely recorded it, stored it, and offered tools that people can use to produce or manipulate it. But now, researchers are teaching computers to recognize the basic musical elements: beat, rhythm, melody, harmony, tempo, and more. Computers with those skills are becoming musical collaborators.

"Technology is changing our sense of what music can be," Raphael says. "The effect is profound."

LEARNING TO LISTEN With training, people can listen to a piece of music and write down the score with few mistakes. Teaching a computer to perform the same task, though, has proved remarkably difficult.

Raphael, an informatics Same as information technology and information systems. The term is more widely used in Europe.  researcher at the University of Indiana in Bloomington, compares the problem to speech recognition. "There's been a veritable army of people who've worked on speech recognition for several decades, and [the problem] still remains open," he says. "Any time you deal with real data, there is a huge amount of variation that you have to understand."

Researchers have succeeded in programming computers to transcribe To copy data from one medium to another; for example, from one source document to another, or from a source document to the computer. It often implies a change of format or codes.  limited kinds of music. For example, software can reliably identify the notes of a single melodic me·lod·ic  
adj.
Of, relating to, or containing melody.



me·lodi·cal·ly adv.
 line played by one instrument in isolation.

The programs analyze the wavelengths of the sound. Hitting the A below middle C on a piano, for example, produces an audio wave at 220 Hertz hertz (hûrts) [for Heinrich R. Hertz], abbr. Hz, unit of frequency, equal to 1 cycle per second. The term is combined with metric prefixes to denote multiple units such as the kilohertz (1,000 Hz), megahertz (1,000,000 Hz), and gigahertz . But it also produces weaker waves, known as overtones, at 440 Hz, 660 Hz, 880 Hz, and so on. The relative strengths of the overtones differ slightly for each instrument, which is why a piano doesn't sound like a violin violin, family of stringed musical instruments having wooden bodies whose backs and fronts are slightly convex, the fronts pierced by two f-hole-shaped resonance holes. . Nevertheless, the characteristic pattern of an A is similar enough across instruments that a computer can recognize it.

When several notes play simultaneously, however, as in a chord chord, in geometry
chord (kôrd), in geometry, straight line segment both end points of which lie on the circumference of a circle or other curve; it is a segment of a secant. A chord passing through the center of a circle is a diameter.
 from one instrument or music from an ensemble, the audio waves from the different notes mix in ways that are hard to untangle. Echoes, noise, and imperfect imperfect: see tense.  recordings muddy the patterns even more.

But researchers are making progress. Every year, various transcription programs go head-to-head in a competition called MIREX (Music Information Retrieval Music information retrieval or MIR is the interdisciplinary science of retrieving information from music.

This includes:
  • Computational methods for classification, clustering, and modelling — Musical feature extraction for mono- and polyphonic music,
 Exchange). The researchers set their programs loose on the same pieces of music and then compare results. This September, when the competition takes place in Vienna, it will for the first time include full transcriptions of polyphonic music Noun 1. polyphonic music - music arranged in parts for several voices or instruments
concerted music, polyphony

music - an artistic form of auditory communication incorporating instrumental or vocal tones in a structured and continuous manner
, in which multiple notes are playing at the same time.

Most systems slice the sound into brief segments and look for a pattern that they can recognize as a given note. After identifying this note, the programs pull its primary frequency and associated overtones out of the sound wave. Then the software repeats the process, picking out other notes in the remaining audio signal until it has accounted for the entire sound.

The results, however, aren't exact. The pattern of a particular note may be obscured by other notes that are playing at the same time. Furthermore, without information on the characteristics of the instrument producing the sound or the acoustics acoustics (ək`stĭks) [Gr.,=the facts about hearing], the science of sound, including its production, propagation, and effects.  of the room in which it was recorded, the programmed patterns of overtones don't accurately correspond to the actual notes in the music.

As a result, when the program pulls an imperfectly im·per·fect  
adj.
1. Not perfect.

2. Grammar Of or being the tense of a verb that shows, usually in the past, an action or a condition as incomplete, continuous, or coincident with another action.

3.
 modeled note out of the mix, it distorts the remaining sound, making it harder to identify the remaining notes. The more notes that are playing at once, the more those distortions pile up.

SELF-TEACHING MACHINES Music-information researchers are taking advantage of the experiences of their colleagues who study speech recognition. After some early advances in the 1970s, further improvements in speech recognition became increasingly difficult. "To take it to the next level," says Daniel Ellis Dan Ellis (born November 18, 1988 in Bramhall, Stockport, Greater Manchester) is an English football player, who attended Bramhall High School. He plays as a striker for Stockport County, where he has progressed through the club's Centre of Excellence youth system.  of Columbia University Columbia University, mainly in New York City; founded 1754 as King's College by grant of King George II; first college in New York City, fifth oldest in the United States; one of the eight Ivy League institutions. , "you had to do 10 times as much work each time."

By the time Ellis started working on speech recognition in 1996, researchers were trying a new approach. "To some extent, they gave up on trying to understand what speech does," Ellis says. "Instead, they collected a bunch of different examples and used statistical techniques" to identify the patterns that underlie speech.

Ellis continued that strategy when he eventually shifted his focus to the analysis of music. He built a program that uses machine-learning techniques to transcribe polyphonic The ability to play back some number of musical notes simultaneously. For example, 16-voice polyphony means a total of 16 notes, or waveforms, can be played concurrently.  piano music.

He started with a program that had no information about how music works. He then fed into his computer 92 recordings of piano music and their scores. Each recording and score had been broken into 100-millisecond bits so that the computer program could associate the sounds with the written notes. Within those selections, the computer would receive an A note, for example, in the varying contexts in which it occurred in the music. The software could then search out the statistical similarities among all the provided examples of A.

In the process, the system indirectly figured out rules of music. For example, it found that an A is often played simultaneously with an E but seldom with an A-sharp, even though the researchers themselves never programmed in that information. Ellis says that his program can take advantage of that subtle pattern and many others, including some that people may not be aware of.

When presented with a novel recording, the program labels as an A any note that shows enough statistical similarity to the As in the training sequence. In a special issue of EURASIP Journal on Advances in Signal Processing EURASIP Journal on Advances in Signal Processing is a peer-reviewed, open access journal.

The overall aim of EURASIP Journal on Advances in Signal Processing is to bring science and applications together with emphasis on both practical and theoretical aspects of signal processing
, an online journal, Ellis reports that his system accurately identified the notes playing in 68 percent of the novel 100-millisecond snippets that it was given. Ellis expects that when his program has analyzed an·a·lyze  
tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es
1. To examine methodically by separating into parts and studying their interrelations.

2. Chemistry To make a chemical analysis of.

3.
 more examples--ideally, many thousands more--its detection rate will improve. /

He notes that the next-best system, developed by Anssi Klapuri of the Tampere University of Technology Tampere University of Technology (TUT) (Finnish: Tampereen teknillinen yliopisto (TTY) ) is the second-largest of the universities in engineering sciences in Finland. The university is located in Hervanta, a suburb of Tampere.  in Finland, scored only 47 percent on the test snippets. It's a traditional program that incorporates expert knowledge of music rather than machine learning.

Ellis is quick to point out, however, that this comparison isn't quite fair. Klapuri's system can recognize many kinds of music, not just piano music, so comparing the two on piano music alone gave Ellis' system an artificial advantage.

Ellis plans to enter his program in the September 2007 MIREX competition to see how it does head-to-head against more-traditional programs.

Ellis has also used the self-teaching technique to identify melodies in complex pieces of music, picking out the portion that a person might sing. After spending just a few months to develop such a system, he entered it in last year's MIREX competition and came in third out of 10 entries, with an accuracy of 61 percent. In many cases, he says, the transcribed melodies were recognizable, despite the errors.

The top performer in that competition was a more fully developed program that took a traditional approach. Devised by Karin Dressier of the Fraunhofer Institute for Digital Media Technology in Ilmenau, Germany, that program had a 71 percent accuracy rate. The results of the melody competition will appear in an upcoming issue of IEEE (Institute of Electrical and Electronics Engineers, New York, www.ieee.org) A membership organization that includes engineers, scientists and students in electronics and allied fields.  Transactions on Audio, Speech and Language Processing
For the processing of language by computers, see Natural language processing.


Language processing refers to the way human beings process speech or writing and understand it as language.
.

Ellis says that combining machine-learning strategies with expert knowledge of music and acoustics will ultimately offer the best performance.

FOLLOWING THE MUSIC Even as researchers continue to refine transcription methods, the work is spinning off remarkably useful tools. One advance has turned out to be especially handy: Computers can line up a score with a recording of its performance.

This seemingly trivial capability has many applications. Some of the simplest are programs that display supertitles at the opera at just the right moment or that automatically turn the page for musicians.

Score alignment also opens the door to programs that can correct off-kilter notes going into a microphone before they emerge from loudspeakers--a development that could transform the listener's experience at children's recitals everywhere.

Alignment software analyzes a spectrogram spectrogram (spekˑ·tr·gram),
n
, which shows how the energy of sound waves changes over time across all frequencies. In most popular music, the strong drum rhythms that mark out the time appear on the spectrogram as vertical lines, which make it easy for the computer to keep track of where it is in the score. Another approach that some programs use is to recognize repeating harmonic harmonic.

1 Physical term describing the vibration in segments of a sound-producing body (see sound). A string vibrates simultaneously in its whole length and in segments of halves, thirds, fourths, etc.
 patterns that occur in many pieces of music.

Where drumbeats or repeating harmonic patterns aren't apparent, the researchers have the computer identify the melody or employ other techniques developed for transcription. Having the score as a guide makes the task far easier than transcribing the notes from scratch.

Score-alignment programs could be used after a musician records a piece of music to do the kind of fine-tuning that's now performed painstakingly pains·tak·ing  
adj.
Marked by or requiring great pains; very careful and diligent. See Synonyms at meticulous.

n.
Extremely careful and diligent work or effort.
 by recording studios, fixing such problems as notes that are slightly off pitch or come in late. "It'll be kind of like a spell-check for music," says Roger Dannenberg, a computer scientist at Carnegie Mellon University Carnegie Mellon University, at Pittsburgh, Pa.; est. 1967 through the merger of the Carnegie Institute of Technology (founded 1900, opened 1905) and the Mellon Institute of Industrial Research (founded 1913).  in Pittsburgh who is developing the technology.

The process would make it far easier for amateurs to improve their recordings after performance in the way that professional recording studios now do. "I see what I'm doing as democratizing music-making," Dannenberg says.

COMPUTER AS MUSICIAN Score-alignment technology opened the door for Raphael to develop his computerized-accompaniment program. Mimi Zweig, a professor of music at the University of Indiana, is using the system with her violin students to give them a taste of what it's like to have 100 musicians following their every pause or trill. Zweig is impressed with the responsiveness of the system. "After a long cadenza ca·den·za  
n.
1. An elaborate, ornamental melodic flourish interpolated into an aria or other vocal piece.

2. An extended virtuosic section for the soloist usually near the end of a movement of a concerto.
 or a phrase where you want to take time, it's fight with you," she says. "It's even better than an orchestra in some ways."

Raphael says that the soloist's freedom while using his system makes it a valuable learning tool. Few students ever experience having an orchestra accompany them. Raphael says, "It's a fundamental hole in their musical education. [Playing with an orchestra] is how people develop their ideas about musical interpretation and grow as musicians."

The first component of Raphael's program examines the sound waves produced by the soloist and lines up the performance with the score. But that's not enough, because if the program waits until the soloist plays a note before it comes in with the accompaniment, it will always be late. So, the program predicts what the soloist will do next, using information about the performance from which the accompaniment was derived and the performer's speed in the immediately preceding notes as well as knowledge gained from earlier practice sessions. The program then slows down or speeds up the recording without altering the pitch.

Raphael presented the system in Boston last July at a conference of the Association for the Advancement of Artificial Intelligence The Association for the Advancement of Artificial Intelligence or AAAI is an international, nonprofit, scientific society devoted to advancing the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines. .

His program requires recordings that are missing the solo parts. A company called Music Minus One, based in New York City New York City: see New York, city.
New York City

City (pop., 2000: 8,008,278), southeastern New York, at the mouth of the Hudson River. The largest city in the U.S.
, produces such recordings, and soloists have traditionally practiced by playing along with them. Having gotten used to his computer-accompaniment system, Raphael now scorns the use of such recordings. "You're straitjacketed, following orders from the machine;' he says.

Nevertheless, he sometimes uses Music Minus One recordings for his research. But he's also developing methods to strip the soloists' parts out of high-quality recordings by top performers.

In that task, Raphael doesn't know the precise sound waves that the soloists generated, so he faces some of the difficulties that the music-transcription systems encounter. He inevitably inflicts damage when he removes the solo from a recording.

"But there's a saving grace," he says. The new soloist will be producing sound in just the frequencies that are most damaged, so it will mask the parts that sound worst.

Raphael is still refining his stripping software--"I'm like Edison in search of the right filament filament, in astronomy: see chromosphere. ." But he already uses the system, which he calls "Music Plus One," to make recordings to accompany his oboe playing.

Raphael's system relies entirely on the musical sense of the soloist to drive the accompaniment. "If you have a really terrific, sophisticated live player, that's the right thing to do," he says.

But in a teaching situation, a good accompanist partly follows and partly leads, helping a beginning musician develop a more sophisticated sense of the music.

"It's a hard problem for a computer to get musicality into a performance;' Raphael says.

Even without musical sense, Raphael's program is opening new musical possibilities. Jan Beran, a composer and statistician at the University of Constance in Germany, wrote several oboe solos with piano accompaniment especially for Raphael's system.

Raphael has performed the pieces with his system. He says that he doesn't think that those pieces could be played with a live accompanist.

The rhythmic rhyth·mic   also rhyth·mi·cal
adj.
Of, relating to, or having rhythm; recurring with measured regularity.



rhythmi·cal·ly adv.
 interplays are so complex that performers can't handle them, he says. For example, one piece contains many sections where one musician plays 7 notes while the other plays 11. "Human players say, 'I'll play my 7, you play your 11, and let's shoot for where we come out together,'" Raphael says. "But the program can tell at any place in the middle of this complicated polyrhythm pol·y·rhythm  
n. Music
The use or an instance of simultaneous contrasting rhythms.



poly·rhyth
 exactly where it needs to be."

With music this complicated, Raphael says, the software takes on a peculiar leadership role even though it does nothing but follow. "From the very first rehearsal, it understands the way the parts fit together and sort of teaches you this," he explains.

These developments make some musicians uneasy. Dannenberg, who wrote the earliest computer-accompaniment system, notes that the musicians' union
  • There are several organizations calling themselves the Musicians' Union:
  • For the United Kingdom, see: Musicians' Union (UK)
  • For the United States of America, see listing by state:
 opposes "virtual orchestras Virtual Orchestra Is a term used to identify a variety of different types of technology and art forms. Most commonly used to refer to orchestral simulation, either for pre-recorded or live environments, it also has been used to describe other activities, such as IRCAM’s ," synthesizers in the pit at musicals that replace some of the acoustic instruments.

Dannenberg says, "That's not even the stuff you should be afraid of. My computer-accompaniment technology could completely replace the orchestra."

"There's something about the social presence of live music that's going to keep it alive forever. I'm not interested in using computers to replace live musicians," Dannenberg adds. "The reason that I work with computers and music is because of all the potential that computers have to do new things that you can't do otherwise".
COPYRIGHT 2007 Science Service, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2007, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Rehmeyer, Julie J.
Publication:Science News
Article Type:Cover story
Date:Apr 21, 2007
Words:2397
Previous Article:Back to (near) the beginning: galactic springtime.(This Week)
Next Article:Wanted: better yardsticks: measurement inadequacies threaten U.S. competitive edge.



Related Articles
Teacher hits all the right notes with kids.(Schools)(Music: Infectious rhythms and their instructor's funny faces turn students on to learning.)
BRIEFLY.(Entertainment)(MUSIC SIDESHOW)
Movement activities for learning-disabled piano students.(KEEPING THE BEAT)
Misty River enlists guitar star for holiday show.(Entertainment)
A region in harmony: southern music and the sound track of freedom.
Retro rockers go all vintage with vinyl.(Entertainment)(The members of Heavenly Oceans are such big fans of "long-play" records, they made one of...
Cancer forcing band to split.(Entertainment)
New Riders saddle up again.(Entertainment)
Spotlight: Tatsu Aoki; Making art that's collective.
Seriously, Los Mex Pistols isn't just some novelty act.(Entertainment)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles