Printer Friendly
The Free Library
14,588,558 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

World Wide Web Consortium Issues VoiceXML 2.0 as a W3C Candidate Recommendation; Cornerstone to the W3C Speech Interface Framework is Ready for Implementors.


Business Editors/High-Tech Writers

http://www.w3.org/--(BUSINESS WIRE)--Jan. 28, 2003

Testimonials in support of this announcement are available at:

http://www.w3.org/2003/01/voicexml2-testimonial

Giving voice to the Web, the World Wide Web Consortium (W3C (World Wide Web Consortium, www.w3.org) An international industry consortium founded in 1994 by Tim Berners-Lee to develop standards for the Web. It is hosted in the U.S. by the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT (www.csail.mit.edu/index.php). ) has published VoiceXML 2.0 as a W3C Candidate Recommendation. Advancement of a technical report to Candidate Recommendation is an explicit, public call for implementation. The goal of VoiceXML 2.0 is to bring the advantages of Web-based development and content delivery to interactive voice response applications. Giving Voice to the Web: W3C's Speech Interface Framework

Since 1999, W3C has been working on its Speech Interface Framework to expand access to the Web to allow people to interact via key pads, spoken commands, listening to prerecorded pre·re·cord  
tr.v. pre·re·cord·ed, pre·re·cord·ing, pre·re·cords
To record (a television program, for example) at an earlier time for later presentation or use.

Adj. 1.
 speech, synthetic speech and music. With the number of telephone lines and mobile phones exceeding one billion units worldwide, the specifications of W3C's Speech Interface Framework will allow an unprecedented number of people to use any telephone to access appropriately designed Web-based services.

VoiceXML 2.0 Delivers Voice and Interactivity to the Speech Interface Framework

VoiceXML 2.0 allows developers to create audio dialogs that feature synthesized speech, digitized audio, recognition of spoken and DTMF (Dual-Tone MultiFrequency) The type of audio signals that are generated when you press the buttons on a touch-tone telephone. See also DMTF.

DTMF - Dual Tone Multi Frequency
 (touch-tone) key input, recording of spoken input, telephony, and mixed-initiative conversations.

"VoiceXML 2.0 has the power to change the way phone-based information and customer services are developed. No longer will we have to press 'one' for this or 'two' for that. Instead, we will be able to make selections and provide information by speech," explained Dave Raggett, W3C Voice Browser Activity Lead. "In addition, VoiceXML 2.0 creates opportunities for people with visual impairments or those needing Web access while keeping their hands and eyes free for other things, such as getting directions while driving."

In the W3C Speech Interface Framework, VoiceXML controls how the application interacts with the user, while the Speech Synthesis Markup Language Speech Synthesis Markup Language (SSML) is an XML-based markup language for speech synthesis applications. It is a recommendation of the W3C's voice browser working group. SSML is often embedded in VoiceXML scripts to drive interactive telephony systems.  (SSML SSML Speech Synthesis Markup Language ) is used for spoken prompts and the Speech Recognition Grammar Specification Speech Recognition Grammar Specification (SRGS) is an W3C recommendation that defines syntax for representing grammars for use in speech recognition so that developers can specify the words and patterns of words to be listened for by a speech recognizer.  (SRGS SRGS Speech Recognition Grammar Specification
SRGS Stimulated Raman Gain Spectroscopy
SRGS Survivable Relay Ground Stations
) for guiding the speech recognizers via grammars that describe the expected user responses. Other specifications in the Framework include Voice Browser Call Control (CCXML CCXML Call Control Extensible Markup Language
CCXML Call Control Xml Interpreter
), which provides telephony call control support for VoiceXML or other dialog systems, and Semantic Interpretation for Speech Recognition Semantic Interpretation for Speech Recognition (SISR) defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification (SRGS). Since 5 April 2007 it is currently a World Wide Web Consortium Recommendation [1]. , which defines the syntax and semantics of the contents of tags in SRGS.

Adoption Rate to Increase with Availability of Test Suites

There is also an extensive set of test suites publically available with the VoiceXML 2.0 Candidate Recommendation. While the initial version contains over 300 tests, the final version is expected to have more than 500 tests. Updates to the test suite will be announced on the Voice Browser's public mailing list.

This complements the test suite provided with the Speech Recognition Grammar Specification, which became a W3C Candidate Recommendation in June 2002. Test suites for the remaining specifications in the W3C Speech Interface Framework, including the Speech Synthesis Markup Language, which enters its Last Call phase today, are under development by the W3C Voice Browser Working Group and will be published over the next few months.

VoiceXML 2.0, Speech Interface Framework to Evolve, Resolve Patent Issues

The W3C Voice Browser Working Group is among the largest and most active in W3C. Its participants include BeVocal Inc., Canon, Comverse, France Telecom, Genesys Telecommunications Laboratories, HP, HeyAnita, Hitachi, IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries) , Intel, Loquendo, Microsoft, MITRE, Mitsubishi, Motorola, Nokia, Nortel Networks, Nuance, Philips, PipeBeach, SAP, ScanSoft, SnowShore Networks, SpeechWorks, Sun, Syntellect, Tellme Networks, Unisys, Verascape, VoiceGenie, Voxeo, and VoxPilot. Support for the continued work and commitments to product implementations are strong, as evidenced by the range of testimonials.

As the Working Group moves forward in its technical work across the range of voice-related specifications, patent issues arising from inconsistencies with the Voice Browser Working Group's Royalty-Free Licensing Mode are to be addressed by a Patent Advisory Group within the W3C, per the W3C's Current Patent Practice. With the vast majority of the W3C Voice Browser Working Group committed to the production of an open specification, the Voice Browser Patent Advisory Group will work towards resolving the remaining issues.

About the World Wide Web Consortium (W3C)

The W3C was created to lead the Web to its full potential by developing common protocols that promote its evolution and ensure its interoperability. It is an international industry consortium jointly run by the MIT MIT - Massachusetts Institute of Technology  Laboratory for Computer Science (MIT LCS LCS - Language for Communicating Systems ) in the USA, the European Research Consortium for Informatics and Mathematics (ERCIM ERCIM - European Research Consortium on Informatics and Mathematics. An association of European research organisations promoting cooperative research on key issues in Information Technology. ) headquartered in France and Keio University in Japan. Services provided by the Consortium include: a repository of information about the World Wide Web for developers and users, and various prototype and sample applications to demonstrate use of new technology. To date, nearly 450 organizations are Members of the Consortium. For more information see http://www.w3.org/
COPYRIGHT 2003 Business Wire
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2003, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Publication:Business Wire
Date:Jan 28, 2003
Words:797
Previous Article:Computer Resolutions Inc. to Market Dell PC/Register; CRI Offers Retail Market Solution From Point-of-Sale to Back Office.
Next Article:Biophan Announces Successful Animal Testing Of MRI-Safe Temporary Photonic Pacemaker.



Related Articles
VoiceXML 1.0 Comes To Call.(Technology Information)
New technology delivers standard API and dynamism to HTML and XHTML 1.0. (Internet Focus).( Document Object Model (DOM) Level 2 HTML )
Beyond salt versus VoiceXML: coping with the wealth of standards in speech and multimodal self-service applications. (Call Center/CRM Management...
XML Pointer fromW3C. (Internet Focus).
W3C issues RDF and OWL recommendations.(Standards)
Vowels: a voice-based Web engine for locating speeches.
Definition Du Jour.(Speech-World News)(what is a voice browser)(Brief Article)
Dialog manager.(Definition Du Jour)(Brief Article)
Edify achieves VoiceXML 2.0 certification.(VoiceXML Forum)(Brief Article)
Half the equation: open standards are a first step toward speech automation.(SPEECH-WORLD[TM])

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles