Printer Friendly
The Free Library
14,506,428 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Picture this: the sounds of speech lead to novel ways of representing complex data.


Picture This

From the faintest whisper to theloudest cry, the human voice has an extraordinary range. It expresses pleasure and pain in innumerable accents and utterances. This remarkable variability, however, makes the analysis of human sounds exceedingly difficult.

Simply recording the pressure variationsover time in the acoustic signal generated by human speech produces a complex waveform. A typical signal includes the smoothly corrugated cor·ru·gate  
v. cor·ru·gat·ed, cor·ru·gat·ing, cor·ru·gates

v.tr.
To shape into folds or parallel and alternating ridges and grooves.

v.intr.
 look of vowel-like sounds and the jagged peaks associated with random noise. Significant silences periodically punctuate punc·tu·ate  
v. punc·tu·at·ed, punc·tu·at·ing, punc·tu·ates

v.tr.
1. To provide (a text) with punctuation marks.

2.
 the waveform. Scattered throughout the signal lie the characteristic signatures of speech sounds known as fricatives and plosives.

Because the human ear alone is not asufficiently reliable tool for rigorously comparing and characterizing sounds, researchers have been developing methods for seeking out the patterns hidden within speech waveforms. Traditional spectrograms, in which sound intensity is plotted against frequency over time, are valuable for showing the general frequency content of a signal. But sometimes both trained and untrained users have difficulty detecting acoustic differences, clearly evident to the ear, in the corresponding spectrograms.

Such difficulties motivated Clifford A.Pickover of the IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries)  Thomas J. Watson Research Center The Thomas J. Watson Research Center is the headquarters for the IBM Research Division.

The center is on three sites, with the main laboratory in Yorktown Heights, New York, 45 miles north of New York City, a building in Hawthorne, New York, and offices in Cambridge,
 in Yorktown Heights, N.Y., to explore several novel methods for representing speech sounds. In some cases, his techniques made subtle differences in acoustic signals considerably more obvious to a human analyst. Furthermore, he found that his methods could be extended to other types of data analysis, including the search for patterns in the sequence of nucleotides along a strand of DNA DNA: see nucleic acid.
DNA
 or deoxyribonucleic acid

One of two types of nucleic acid (the other is RNA); a complex organic compound found in all living cells and many viruses. It is the chemical substance of genes.
.

Pickover's research is an instance ofthe growing interest in finding ways of coping with The Coping With series of books is a series of books aimed at 11-16 year olds, written by Peter Corey and published by Scholastic Hippo. The first book, Coping with Parents, was released in 1989, and the series continued until the last book, Coping with Cash  the vast quantities of data generated by modern instruments and other computer-aided techniques. Particular attention is being paid to identifying patterns or trends. In the past, such efforts have included the use of sound to represent data (SN:6/1/85, p. 348), the use of video and animation, and the use of various computer graphics techniques for manipulating and plotting data.

Pickover's innovations are a mixture ofold ideas used in new contexts and new ideas "New Ideas" is the debut single by Scottish New Wave/Indie Rock act The Dykeenies. It was first released as a Double A-side with "Will It Happen Tonight?" on July 17, 2006. The band also recorded a video for the track.  brought to bear on longstanding data-analysis problems. A summary of much of his work appears in the IBM research IBM Research, a division of IBM, is a research and advanced development organization and currently consists of eight locations throughout the world and hundreds of projects.  report "Computers, Pattern, Chaos and Beauty.' Details of particular techniques are available in other IBM reports. Last week, Pickover presented a "road map' of his "graphics perspectives' at a conference held in New York City New York City: see New York, city.
New York City

City (pop., 2000: 8,008,278), southeastern New York, at the mouth of the Hudson River. The largest city in the U.S.
 on Computer Graphics in the Arts and Sciences.

One of Pickover's most striking andcolorful data-display techniques produces figures that have the sixfold sixfold
Adjective

1. having six times as many or as much

2. composed of six parts

Adverb

by six times as many or as much

Adj. 1.
 symmetry of a snowflake. Although formally termed "symmetrized dot patterns,' these forms could just as well be called "speech-flakes.'

To create a symmetrized dot pattern fora particular sound, Pickover selects a 500-millisecond segment of the corresponding amplitude-vs.-time waveform. He records the amplitudes at 5,000 points along the waveform (every 10,000th of a second). These data are mapped into a snowflake-like pattern.

Suppose the first amplitude is 40 (on ascale from 0 to 50), the second sampled point has an amplitude of 30, and the third has a value of 35. These values are plotted on a graph that looks like a polar view of the earth, with the North Pole North Pole, northern end of the earth's axis, lat. 90°N. It is distinguished from the north magnetic pole. U.S. explorer Robert E. Peary is traditionally credited as being the first to reach (1909) the North Pole. In 1926, Richard E.  at the graph's center. The first number represents an angle, 40|, and the second number the distance, 30 units, from the pole (or origin). That combination marks a spot on the graph. Then the second number becomes the angle and the third represents the distance (30|, 35 units), marking a second spot. This process is repeated until all the amplitude values are accounted for. To create the snowflake-like pattern, the resulting array of dots, which fills roughly one-sixth of the full graph, is reflected as in a kaleidoscope. The addition of color not of the white race; - commonly meaning, esp. in the United States, of negro blood, pure or mixed.

See also: Color
 enhances the patterns.

"This algorithm works well for certainsounds,' says Pickover. For example, it easily distinguishes the "a' in "father' from the "o' in "mom.' The method is particularly sensitive to differences in frequency, producing different curvatures. Further study of the method is needed to determine what the optimal sampling rate and symmetry ought to be for various applications.

Says Pickover, "Intriguing as an artform, these dot patterns may be a way of visually fingerprinting natural and synthetic speech synthetic speech
n.
Speech that is produced by an electronic synthesizer activated by a keyboard, enabling individuals who are incapable of speech to communicate.
 sounds and allowing researchers to detect patterns in data not easily captured with traditional analyses.'

Pickover has tested his algorithm on avariety of sounds, from human and synthetic speech to the full-throated croacking of frogs and the shrill whistles of dolphins. One researcher has already expressed interest in using the technique for studying bird songs. Pickover himself has shown that symmetrized dot patterns may be useful for detecting and characterizing heart abnormalities from the sounds generated by the heart.

Pickover's vectorgrams sometimeslook like the steps a drunkard One who habitually engages in the overindulgence of alcohol.

In order for an individual to be labeled a drunkard, drunkenness must be habitual or must recur on a constant basis.
 would take in wandering across an open area. They can also be used to search for patterns in the sequences of bases in DNA. Essentially, Pickover translates the location of particular bases into movement on a two-dimensional lattice or, more literally, the movement of a pen across a piece of paper. His technique is an application of previous work by other researchers on tracking bit sequences in digital computers and it follows his own efforts to represent the base sequence of a cancer gene's DNA as a waveform.

Pickover assigns the value one to thebases cytosine cytosine (sī`tōsēn'), organic base of the pyrimidine family. It was isolated from the nucleic acid of calf thymus tissue in 1894.  (C) and guanine guanine (gwä`nēn), organic base of the purine family. It was reported (1846) to be in the guano of birds; later (1879–84) it was established as one of the major constituents of nucleic acids.  (G) and the value zero to the bases adenine adenine (ăd`ənĭn, –nīn, –nēn), organic base of the purine family. Adenine combines with the sugar ribose to form adenosine, which in turn can be bonded with from one to three phosphoric acid units, yielding the three  (A) and thymine thymine (thī`mēn), organic base of the pyrimidine family. Thymine was the first pyrimidine to be purified from a natural source, having been isolated from calf thymus and beef spleen in 1893–4.  (T). A typical strand of DNA, consisting of 4,000 bases, becomes a long string of ones and zeros Ones and Zeros is the second full-length release by Canadian indie rock group Immaculate Machine. It is their first official release through Mint Records. Music videos were released for the songs "Broken Ship" and "So Cynical". . Then one of eight possible directions is assigned to each different set of three consecutive digits. For example, from a starting position on a square grid, the string 001 indicates a movement directly upward to an adjacent corner, while the string 101 corresponds to a movement diagonally downward and to the left.

In this way, a string of ones and zeros isconverted into a zigzagging path across a checkerboard checkerboard

the pattern of a chess or draft board; used in many circumstances to display the results of mixing a specific number of variables. The variables are listed in columns designated along the horizontal border and the same or different variables in lines along the vertical
. The pattern seen depends on the base sequence found in the DNA strand. In such vectorgrams, sequences that have a high guanine and cytosine content tend to move downward. A repeating sequence, such as . . . GGGGAAGAATACGAGGGGAA . . ., generates a trace that returns to its starting point. Various characteristic short sequences, found within DNA strands and which appear to control particular DNA properties, turn out to have distinctive signature patterns.

"This permits the human observer,'says Pickover, "to visually detect some important sequence structural properties and patterns not easily captured by traditional means.' However, the method, by failing to distinguish between sequences such as ATATAT . . . and AAAAAA AAAAAA Association for the Abolition of Abused Abbreviations and Asinine Acronyms  . . ., which have different folding and bending properties, can sometimes obscure patterns.

More work is needed on identifyingwhich DNA parameters should be plotted and how those characteristics ought to be represented. Better results may be obtained by using a hexagonal hex·ag·o·nal  
adj.
1. Having six sides.

2. Containing a hexagon or shaped like one.

3. Mineralogy
 grid rather than a square grid, by inspecting pairs or quadruplets of digits instead of triplets, and by assigning values to bases in different ways.

"The exploration of this large parameterspace provides a provocative area for future research,' Pickover writes in the January IBM JOURNAL OF RESEARCH AND DEVELOPMENT. "It may be possible to discover interesting properties and periodicities in the DNA sequence DNA sequence Genetics The precise order of bases–A,T,G,C–in a segment of DNA, gene, chromosome, or an entire genome. See Base pair, Base sequence analysis, Chromosome, Gene, Genome.  by having the [computer] program produce many vectorgrams by automatically iterating ITerating.com is a Wiki-based software guide, where everyone can find, compare and give reviews to thousands of software products. Founded in October of 2005, and based in New York, ITerating.  through a large number of input parameters and mappings. In this way, the program may suggest to the human analyst important features and parameters which would not even be considered otherwise.'

Cartoon faces--some grinning,some sad, some cryptically blank --certainly attract attention. They can also be used to represent complicated data. Introduced in 1973 by statistician Herman Chernoff of Harvard University, the technique has piqued the interest of many data analysts. Using the characteristics of various facial features, such as the face's shape or the mouth's curvature, a single face can convey the value of 10 different variables at the same time. Its effectiveness depends on the human ability to integrate facial features into a meaningful image.

"Such faces have been shown to bemore reliable and more memorable than other tested icons,' says Pickover, "and allow the human analyst to grasp many of the essential regularities and irregularities in the data.'

Pickover has used cartoon faces tocharacterize broad classes of sounds. Different types of sound, when analyzed in this way, appear to generate distinctive types of faces. Such "speech-faces,' says Pickover, may be a useful aid in teaching deaf or near-deaf children how to modify the sounds they make. Although the expressions on the cartoon faces have little to do with facial movements that a person makes when speaking, the cartoon faces provide a vivid feedback target. Students, on seeking a particular face after uttering a sound, would have a better sense of how close they are to the required sound.

IBM researchers have also proposedthe use of cartoon faces on control panels. Pilots in military aircraft, for example, are sometimes overwhelmed by the number of dials and indicators they must monitor to keep an airplane safely in the air. Displaying several key pieces of information in one place is a simple way to reduce this data overload. By learning to recognize certain faces as danger signals, pilots would be able to react more quickly. Cartoon faces also bring together signals that by themselves may not indicate a threatening situation but taken together reveal a potential hazard.

Symmetrized dot patterns, cartoonfaces and DNA vectorgrams represent only a few of the display techniques that Pickover has explored. "For most of these things,' he says, "there really hasn't been a lot of work done on them.' Many techniques look promising, but the conditions under which they work best are not yet known.

Eventually, researchers sitting at theircomputer work stations will be able to call on a broad palette of data-analysis tools. Ironically, the availability in the future of many different ways of displaying data may itself contribute to the data-overload problem.

Photo: A DNA vectorgram converts a long sequenceof nucleotide bases into a walk on a square lattice. Each of four bases is given a value of either 0 or 1 to generate a string of binary digits. Taken in groups of three, these digits are assigned specific directions on the lattice (lower left). A random nucleotide-base sequence produces a vectorgram like the one shown above and to the left. The color changes every 1,000 bases. In contrast, a human bladder cancer bladder cancer

Malignant tumour of the bladder. The most significant risk factor associated with bladder cancer is smoking. Exposure to chemicals called arylamines, which are used in the leather, rubber, printing, and textiles industries, is another risk factor.
 gene has a different, distinctive nucleotide pattern (above right).

Photo: Despite similar waveforms, symmetrizeddot patterns clearly show the difference between a human-made "ee' sound (upper diagram) and a synthesized version of the same sound (lower).

Photo: Symmetrized dot patterns computed fromacoustic waveforms are sensitive to frequency variations. The first set of four diagrams (upper left) demonstrates how the patterns depend on input frequency. The actual waveform is shown at the bottom of each diagram. The lower set of four diagrams shows the differences evident in the sounds of (clockwise, starting at upper left) a rooster rooster

its crowing at dawn heralds each new day. [Western Folklore: Leach, 329]

See : Dawn


rooster

symbol of maleness. [Folklore: Binder, 85]

See : Virility
, a dolphin, a frog and a cat. A normal heart sound (above, top) can be contrasted with cardiac sounds associated with various pathological conditions (middle and bottom).

Photo: Cartoon faces canbe used to represent the values of as many as 10 variables, each variable corresponding to a facial feature. In the upper illustrations, a computer-generated array of faces, in which facial parameters were computed using a random number generator A program routine that produces a random number. Random numbers are created easily in a computer, since there are many random events that take place such as the duration between keystrokes. , demonstrates the diversity of faces available for data representation. Pickover has used cartoon faces to characterize sounds (right). The top row represents the fricative fricative (frik´tiv),
n a speech sound made by forcing the airstream through such a narrow opening that audible high-frequency air
 sound "s'; the second, a "sh'; the third, a "z'; and the fourth, a "v'.
COPYRIGHT 1987 Science Service, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1987, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Peterson, Ivars
Publication:Science News
Date:Jun 20, 1987
Words:1924
Previous Article:End of the world: you won't feel a thing.
Next Article:Antifrost bacteria: so far, so good.
Topics:



Related Articles
Denker done as South's girls basketball coach.(Sports)(The veteran coach resigns after leading the Axemen to two state championships)
EEOC data prove employment discrimination still rife.(national MS society)
What we are doing about symptoms that can't be measured easily.(I want you to know)
Drugs just a click away: online pharmacies can make dangerous drugs easy to get, but also can promote better health care. Should we regulate them?
Brains, bodies, beliefs, and behavior.
Speaking across the chasm: literature as a bridge between science and religion.
Global warming and religious stick fighting.
Complex issues, small fixes.(Letters)(Letter to the editor)
How the human "network" collided with the environment.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles