Printer Friendly
The Free Library
5,670,445 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

World Wide Web Consortium Issues VoiceXML 2.0 and Speech Recognition Grammar as W3C Recommendations.


Business Editors

http://www.w3.org/--(BUSINESS WIRE)--March 16, 2004

Critical components of the W3C (World Wide Web Consortium, www.w3.org) An international industry consortium founded in 1994 by Tim Berners-Lee to develop standards for the Web. It is hosted in the U.S. by the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT (www.csail.mit.edu/index.php).  Speech Interface Framework now

complete

Testimonials from Aspect Communications, Comverse, Genesys, HP,

IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries) , Loquendo, Microsoft, Motorola, Nuance, Openstream,

ScanSoft, TellMe, Vocalocity, VoiceGenie, Voxeo, and Voxpilot

are available at:

http://www.w3.org/2004/03/voicexml2-testimonial

Giving voice to the Web, the World Wide Web Consortium (W3C) has published VoiceXML 2.0 and Speech Recognition Grammar Specification Speech Recognition Grammar Specification (SRGS) is an W3C recommendation that defines syntax for representing grammars for use in speech recognition so that developers can specify the words and patterns of words to be listened for by a speech recognizer.  (SRGS SRGS Speech Recognition Grammar Specification
SRGS Stimulated Raman Gain Spectroscopy
SRGS Survivable Relay Ground Stations
) as W3C Recommendations. The goal of VoiceXML 2.0 is to bring the advantages of Web-based development and content delivery to interactive voice response applications. SRGS is key to VoiceXML's support for speech recognition, and is used by developers to describe end-users responses to spoken prompts.

Today's announcement marks the advancement to Recommendation status of the first two specifications in W3C's Speech Interface Framework. Aimed at the world's estimated two billion fixed line and mobile phones, W3C's Speech Interface Framework will allow an unprecedented number of people to use any telephone to interact with appropriately designed Web-based services via key pads, spoken commands, listening to pre-recorded speech, synthetic speech synthetic speech
n.
Speech that is produced by an electronic synthesizer activated by a keyboard, enabling individuals who are incapable of speech to communicate.
 and music.

"The completion of VoiceXML 2.0 and SRGS marks an exciting milestone in the convergence of telecom technologies and the Web. Historically, there were both technical and cultural gaps between the way voice-based systems have evolved and that of the Internet and Web, leaving the information available only to voice systems or the Web," explained Tim Berners-Lee (person) Tim Berners-Lee - The man who invented the World-Wide Web while working at the Center for European Particle Research (CERN). Now Director of the World-Wide Web Consortium.

Tim Berners-Lee graduated from the Queen's College at Oxford University, England, 1976.
, W3C Director. "With the development of the W3C Speech Interface Framework, including VoiceXML 2.0 and SRGS, we're now able to integrate and benefit from the strengths of both groups - the power and impact of industrial research and broad product testing and deployment, and the extensibility and openness of technical solutions that are consistent with Web technical principles, and can scale accordingly."

A World Wide Web Consortium (W3C) Recommendation is understood by industry and the Web community at large as a Web standard. Each Recommendation is a stable specification developed by a W3C Working Group and reviewed by the W3C Membership. Recommendations promote interoperability of Web technologies of the Web by explicitly conveying the industry consensus formed by the Working Group.

VoiceXML 2.0 and SRGS Lay the Foundations for the W3C Speech

Interface Framework

In the W3C Speech Interface Framework, VoiceXML controls how the application interacts with the user, while the Speech Synthesis Markup Language Speech Synthesis Markup Language (SSML) is an XML-based markup language for speech synthesis applications. It is a recommendation of the W3C's voice browser working group. SSML is often embedded in VoiceXML scripts to drive interactive telephony systems.  (SSML SSML Speech Synthesis Markup Language ) is used for spoken prompts and the Speech Recognition Grammar Specification (SRGS) for guiding the speech recognizers via grammars that describe the expected user responses. Other specifications in the Framework include Voice Browser A voice browser is a web browser that presents an interactive voice user interface to the user. In addition, it typically provides an interface to the PSTN or a PBX. Just as a visual web browser works with HTML pages, a voice browser operates on pages that specify voice dialogues.  Call Control (CCXML CCXML Call Control Extensible Markup Language
CCXML Call Control Xml Interpreter
), which provides telephony call control support for VoiceXML and other dialog systems, and Semantic Interpretation for Speech Recognition Semantic Interpretation for Speech Recognition (SISR) defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification (SRGS). Since 5 April 2007 it is currently a World Wide Web Consortium Recommendation [1]. , which defines how speech grammars bind to application semantics.

VoiceXML 2.0 Delivers Voice and Interactivity to the W3C Speech

Interface Framework

VoiceXML 2.0 allows developers to create audio dialogs that feature synthesized speech, digitized audio, recognition of spoken and Dual tone multi-frequency (DTMF (Dual-Tone MultiFrequency) The type of audio signals that are generated when you press the buttons on a touch-tone telephone. See also DMTF.

DTMF - Dual Tone Multi Frequency
, or touch-tone) key input, recording of spoken input, telephony, and mixed-initiative conversations. VoiceXML is downloaded from HTTP servers in the same way as HTML HTML
 in full HyperText Markup Language

Markup language derived from SGML that is used to prepare hypertext documents. Relatively easy for nonprogrammers to master, HTML is the language used for documents on the World Wide Web.
. This means that application developers can take full advantage of widely deployed and industry proven Web technologies.

"VoiceXML 2.0 has the power to change the way phone-based information and customer services are developed. No longer will we we have to press 'one' for this or 'two' for that. Instead, we will be able to make selections and provide information by speech," explained Dave Raggett, W3C Voice Browser Activity Lead. "In addition, VoiceXML 2.0 creates opportunities for people with visual impairments or those needing Web access while keeping their hands and eyes free for other things, such as getting directions while driving."

SRGS Drives Robust Recognition of User Responses

The Speech Recognition Grammar Specification--SRGS-- allows applications to specify the words and phrases Words and Phrases®

A multivolume set of law books published by West Group containing thousands of judicial definitions of words and phrases, arranged alphabetically, from 1658 to the present.
 that users are prompted to speak. This enables robust speaker independent recognition.

SRGS covers both speech and DTMF input. DTMF input is valuable in noisy conditions or when the social context makes it awkward to speak. Speech recognizers are generally able to report the degree of confidence -- that is, the likelihood of having correctly recognized the word or phrase - and may provide the most likely alternatives when the recognizer is uncertain as to which of them the user actually said. SRGS is applicable to more than speech and has been successfully applied to handwriting recognition Handwriting recognition is the ability of a computer to receive intelligible handwritten input. The image of the written text may be sensed "off line" from a piece of paper by optical scanning (optical character recognition).  where the user input is a constrained set of words.

Adoption Rate of VoiceXML 2.0 and SRGS Already Industry Wide

In order to advance to W3C's Recommendation status, there must be evidence of independent interoperable implementations - it must be proven to work. In the case of VoiceXML 2.0, the implementation evidence is extraordinary, with at least eight known implementations in both prototype and fully released products. A complete list of current implementors is available. The implementation report for SRGS includes at least six complete, independent implementations.

There is an extensive, public test suite. While the initial version contained roughly 300 tests, the final version contains over 600 tests. This complements the test suite provided with the Speech Recognition Grammar Specification and the test suite for Speech Synthesis Markup Language which became a W3C Candidate Recommendation in December 2003. Test suites for the remaining specifications in the W3C Speech Interface Framework, including Semantic Interpretation for Speech Recognition and CCXML, are under development by the W3C Voice Browser Working Group and will be published over the next few months.

In addition to the continued work on the remainder of the Speech Interface Framework, the Voice Browser Working Group is already hard at work designing the requirements for the next major version of the dialog markup language, which will build upon the success of VoiceXML 2.0 and incorporate ideas from SALT, XHTML+Voice, and other W3C Member contributions.

The W3C Voice Browser Working Group is among the largest and most active in W3C. Its participants include: Aspect Communications, BeVocal, Canon, Comverse Technology, Convedia, ERCIM ERCIM - European Research Consortium on Informatics and Mathematics. An association of European research organisations promoting cooperative research on key issues in Information Technology. , France Telecom, HeyAnita, Hitachi, HP, IBM, Intel, IWA-HWG IWA-HWG International Webmasters Association / HTML Writers Guild , Loquendo, Microsoft, MITRE, Mitsubishi Electric, Motorola, Nuance Communications, Openstream, SAP, Scansoft, Siemens, Snowshore Networks, Sun Microsystems, Telera, Tellme Networks, Verscape, VoiceGenie Technologies, Voxeo, and Voxpilot.

Contact Americas, Australia --

Janet Daly, janet@w3.org, +1.617.253.5884 or +1.617.253.2613

Contact Europe --

Marie-Claire Forgue, mcf@w3.org, +33.492.38.75.94

Contact Asia --

Yasuyuki Hirakawa yasuyuki@w3.org, +81.466.49.1170

About the World Wide Web Consortium (W3C)

The W3C was created to lead the Web to its full potential by developing common protocols that promote its evolution and ensure its interoperability. It is an international industry consortium jointly run by the MIT Computer Science and Artificial Intelligence Laboratory It has been suggested that and be merged into this article or section.  (CSAIL CSAIL Computer Science & Artificial Intelligence Laboratory (MIT)
CSAIL Computer Science and Artificial Intelligence Lab
) in the USA, the European Research Consortium for Informatics and Mathematics (ERCIM) headquartered in France and Keio University in Japan. Services provided by the Consortium include: a repository of information about the World Wide Web for developers and users, and various prototype and sample applications to demonstrate use of new technology. To date, nearly 400 organizations are Members of the Consortium. For more information see http://www.w3.org/
COPYRIGHT 2004 Business Wire
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Publication:Business Wire
Date:Mar 16, 2004
Words:1194
Previous Article:Vacation Escrow Service Provides Security for Timeshare Rentals; RedWeek.com and Chicago Title Offer Unique Solution.
Next Article:GartnerG2 Says Businesses Are Becoming More Tactical as Senior Executives Look Ahead to Growth in 2004.



Related Articles
VoiceXML 1.0 Comes To Call.(Technology Information)
Automatic Speech Recognition Fine-Tunes Self-Service.
Beyond salt versus VoiceXML: coping with the wealth of standards in speech and multimodal self-service applications. (Call Center/CRM Management...
SpeechPearl XML. (IT News).
Speaking up for cost savings in the call center: VXML takes on the dinosaur of legacy IVR.(e-CRM)
Voicexml versus salt: selecting a voice application standard.(Call Center/CRM Management Scope)(Interactive voice response)
Vowels: a voice-based Web engine for locating speeches.
Definition Du Jour.(Speech-World News)(what is a voice browser)(Brief Article)
Edify achieves VoiceXML 2.0 certification.(VoiceXML Forum)(Brief Article)
Half the equation: open standards are a first step toward speech automation.(SPEECH-WORLD[TM])

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles