World Wide Web Consortium Issues VoiceXML 2.0 as a W3C Proposed Recommendation; Cornerstone to the W3C Speech Interface Framework is Nearly Complete.Business Editors/High-Tech Writers http://www.w3.org/--(BUSINESS WIRE)--Feb. 3, 2004 http://www.w3.org/2004/02/voicexml2-pressrelease.html.en Giving voice to the Web, the World Wide Web Consortium (W3C (World Wide Web Consortium, www.w3.org) An international industry consortium founded in 1994 by Tim Berners-Lee to develop standards for the Web. It is hosted in the U.S. by the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT (www.csail.mit.edu/index.php). ) has published VoiceXML 2.0 as a W3C Proposed Recommendation. The goal of VoiceXML 2.0 is to bring the advantages of Web-based development and content delivery to interactive voice response applications. Advancement of a W3C technical report to Proposed Recommendation indicates that the Working Group has successfully completed both public and W3C Working Group review, and has provided evidence of successful interoperable implementations. It is the final review before a document becomes a W3C Recommendation A W3C Recommendation is the final stage of a ratification process of the World Wide Web Consortium (W3C) working group concerning the standard. It is the equivalent of a published standard in many other industries. , commonly understood as a Web standard. Giving Voice to the Web: W3C's Speech Interface Framework Since 1999, W3C has been working on its Speech Interface Framework to expand access to the Web to allow people to interact via key pads, spoken commands, listening to prerecorded pre·re·cord tr.v. pre·re·cord·ed, pre·re·cord·ing, pre·re·cords To record (a television program, for example) at an earlier time for later presentation or use. Adj. 1. speech, synthetic speech and music. With the number of telephone lines and mobile phones exceeding one billion units worldwide, the specifications of W3C's Speech Interface Framework will allow an unprecedented number of people to use any telephone to access appropriately designed Web-based services. VoiceXML 2.0 Delivers Voice and Interactivity to the Speech Interface Framework VoiceXML 2.0 allows developers to create audio dialogs that feature synthesized speech, digitized audio, recognition of spoken and DTMF (Dual-Tone MultiFrequency) The type of audio signals that are generated when you press the buttons on a touch-tone telephone. See also DMTF. DTMF - Dual Tone Multi Frequency (touch-tone) key input, recording of spoken input, telephony, and mixed-initiative conversations. "VoiceXML 2.0 has the power to change the way phone-based information and customer services are developed. No longer will we have to press 'one' for this or 'two' for that. Instead, we will be able to make selections and provide information by speech," explained Dave Raggett, W3C Voice Browser Activity Lead. "In addition, VoiceXML 2.0 creates opportunities for people with visual impairments or those needing Web access while keeping their hands and eyes free for other things, such as getting directions while driving." In the W3C Speech Interface Framework, VoiceXML controls how the application interacts with the user, while the Speech Synthesis Markup Language Speech Synthesis Markup Language (SSML) is an XML-based markup language for speech synthesis applications. It is a recommendation of the W3C's voice browser working group. SSML is often embedded in VoiceXML scripts to drive interactive telephony systems. (SSML SSML Speech Synthesis Markup Language ) is used for spoken prompts and the Speech Recognition Grammar Specification Speech Recognition Grammar Specification (SRGS) is an W3C recommendation that defines syntax for representing grammars for use in speech recognition so that developers can specify the words and patterns of words to be listened for by a speech recognizer. (SRGS SRGS Speech Recognition Grammar Specification SRGS Stimulated Raman Gain Spectroscopy SRGS Survivable Relay Ground Stations ) for guiding the speech recognizers via grammars that describe the expected user responses. Other specifications in the Framework include Voice Browser Call Control (CCXML CCXML Call Control Extensible Markup Language CCXML Call Control Xml Interpreter ), which provides telephony call control support for VoiceXML or other dialog systems, and Semantic Interpretation for Speech Recognition Semantic Interpretation for Speech Recognition (SISR) defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification (SRGS). Since 5 April 2007 it is currently a World Wide Web Consortium Recommendation [1]. , which defines the syntax and semantics of the contents of tags in SRGS. Adoption Rate of VoiceXML 2.0 Already Industry Wide In order to advance to W3C's Proposed Recommendation status, there must be evidence of independent interoperable implementations - it must be proven to work. In the case of VoiceXML2, the implementation evidence is extraordinary, with at least eight known implementations in both prototype and fully released products. A complete list of current implementors is available. There is an extensive, public set of test suites. While the initial version contains over 300 tests, the final version is expected to have more than 500 tests. Updates to the test suite will be announced on the Voice Browser public mailing list. This complements the test suite provided with the Speech Recognition Grammar Specification, which became a W3C Candidate Recommendation in June 2002. Test suites for the remaining specifications in the W3C Speech Interface Framework, including the Speech Synthesis Markup Language, are under development by the W3C Voice Browser Working Group and will be published over the next few months. VoiceXML 2.0, Speech Interface Framework to Evolve, Patent Issues Resolved The W3C Voice Browser Working Group is among the largest and most active in W3C. Its participants include BeVocal Inc., Canon, Comverse, France Telecom, Genesys Telecommunications Laboratories, HP, HeyAnita, Hitachi, IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries) , Intel, Loquendo, Microsoft, MITRE, Mitsubishi, Motorola, Nokia, Nortel Networks, Nuance, Philips, PipeBeach, SAP, ScanSoft, SnowShore Networks, SpeechWorks, Sun, Syntellect, Tellme Networks, Unisys, Verascape, VoiceGenie, Voxeo, and Voxpilot. Patent issues arising from inconsistencies with the Voice Browser Working Group's Royalty-Free Licensing Mode have been addressed by a Patent Advisory Group within the W3C, per the W3C's Current Patent Practice. As a result, the W3C Voice Browser Working Group committed to the production of an open specification, and the Voice Browser Patent Advisory Group resolved the remaining issues, making it possible for the core VoiceXML 2.0 specification to be made available according to W3C's Royalty-Free License. About the World Wide Web Consortium (W3C) The W3C was created to lead the Web to its full potential by developing common protocols that promote its evolution and ensure its interoperability. It is an international industry consortium jointly run by the MIT Computer Science and Artificial Intelligence Laboratory CSAIL Computer Science and Artificial Intelligence Lab ) in the USA, the European Research Consortium for Informatics and Mathematics (ERCIM ERCIM - European Research Consortium on Informatics and Mathematics. An association of European research organisations promoting cooperative research on key issues in Information Technology. ) headquartered in France and Keio University in Japan. Services provided by the Consortium include: a repository of information about the World Wide Web for developers and users, and various prototype and sample applications to demonstrate use of new technology. To date, nearly 400 organizations are Members of the Consortium. For more information see http://www.w3.org/ |
|
||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion