Can't stand the acronyms? Customer Inter@ction Solutions[R] presents "the standards: a primer".The principal standards and standardized APIs (application program interfaces) that guide the operation and interaction of components in speech architecture are listed and described below. The agency responsible for each standard or API is shown in parentheses after the standard's name. CCXML CCXML Call Control Extensible Markup Language CCXML Call Control Xml Interpreter (www.w3.org). Call Control eXtensible Markup Language Call Control eXtensible Markup Language (CCXML) is an XML standard designed to provide telephony support to VoiceXML. Its current status is a W3C Working Draft, adopted 19 January 2007. is designed to provide telephony call control support for dialog systems. CCXML is intended to serve as an adjunct language for use with a VXML, SALT or other dialog implementation platform. HTTP HTTP in full HyperText Transfer Protocol Standard application-level protocol used for exchanging files on the World Wide Web. HTTP runs on top of the TCP/IP protocol. (www.ietf.org). Hypertext Transfer Protocol See HTTP. (protocol) Hypertext Transfer Protocol - (HTTP) The client-server TCP/IP protocol used on the World-Wide Web for the exchange of HTML documents. It conventionally uses port 80. Latest version: HTTP 1.1, defined in RFC 2068, as of May 1997. is an application-level protocol for distributed, collaborative, hypermedia information systems. It is a generic, stateless protocol that can be used for many tasks beyond its use for hypertext, such as name servers and distributed object management systems, through extension of its request methods, error codes and headers. H. 323 (www.itu.int/home/). H.323 is a standard that specifies the components, protocols and procedures that provide multimedia communication services--realtime audio, video and data communications--over packet networks, including Internet protocol (IP)-based networks. H.323 is part of a family of recommendations that provide multimedia communication services over a variety of networks. JDBC (www.sun.com). Java Database Connectivity (database, programming) Java Database Connectivity - (JDBC) Part of the Java Development Kit which defines an application programming interface for Java for standard SQL access to databases from Java programs. http://java.sun.com/products/jdk/1.1/docs/guide/jdbc/index.html. is an API that allows developers access to virtually any tabular data source from the Java programming language. It provides cross-DBMS connectivity to a wide range of SQL databases and, with the JDBC API, it also provides access to other tabular data sources, such as spreadsheets or flat files. ODBC (www.microsoft.com). Online Database Connectivity is a widely accepted API for database access. It is based on the Call-Level Interface (CLI) specifications from X/Open and ISO/IEC ISO/IEC International Organization for Standardization/International Electrotechnical Commission (ITU-T M 3000) for database APIs and uses Structured Query Language See SQL. Structured Query Language - SQL (SQL) as its database access language. SALT (www.w3.org). Speech Application Language Tags For other meanings of the word salt or acronym "SALT", see salt (disambiguation). Speech Application Language Tags (SALT) is an XML based markup language that is used in HTML and XHTML pages to add voice recognition capabilities to web based applications. is a platform-independent standard that makes possible multimodal and telephony-enabled access to information, applications and Web services from PCs, telephones, tablet PCs and wireless PDAs (personal digital assistants). The standard extends existing mark-up languages such as HTML, XHTML and XML. SIP, RTP, MGCP (www.ietf.org). SIP (Session Initiation Protocol (protocol) Session Initiation Protocol - (SIP) A very simple text-based application-layer control protocol. It creates, modifies, and terminates sessions with one or more participants. Such sessions include Internet telephony and multimedia conferences. It is described in RFC 2543. ) is a signaling protocol for Internet conferencing, telephony, presence, events notification and instant messaging. RTP (Real-time Transport Protocol (protocol) Real-Time Transport Protocol - (RTP) An Internet protocol for transmitting real-time data such as audio and video. RTP itself does not guarantee real-time delivery of data, but it does provide mechanisms for the sending and receiving applications to support streaming ) is a protocol for the transport of real-time data, including audio and video. MGCP/MEGACO (Media Gateway Control Protocol) addresses the relationship between the media gateway, which converts circuit-switched voice to packet-based traffic, and the media gateway controller (sometimes called a softswitch), which dictates the service logic of that traffic. SRGS (www.w3.org). Speech Recognition Grammar Specification Speech Recognition Grammar Specification (SRGS) is an W3C recommendation that defines syntax for representing grammars for use in speech recognition so that developers can specify the words and patterns of words to be listened for by a speech recognizer. defines the syntax for grammar representation intended for use by speech recognizers and other grammar processors so that developers can specify the words and patterns of words to be listened for by a speech recognizer. SSML (www.w3.org). Speech Synthesis Markup Language Speech Synthesis Markup Language (SSML) is an XML-based markup language for speech synthesis applications. It is a recommendation of the W3C's voice browser working group. SSML is often embedded in VoiceXML scripts to drive interactive telephony systems. is a rich, XML-based markup language for assisting the generation of synthetic speech in Web and other applications. Its essential role is to give authors of synthesizable content a standard way to control aspects of speech output such as pronunciation, volume, pitch, rate, etc., across different synthesis-capable platforms. SS7/ISUP (www.ietf.org). Signaling System 7 is an architecture for performing out-of-band signaling in support of the call-establishment, billing, routing and information-exchange functions of the PSTN (public switched telephone network). It identifies functions to be performed by a signaling-system network and a protocol to enable their performance. ISUP (ISDN User Part The ISDN User Part or ISUP is part of the Signaling System #7 which is used to set up telephone calls in Public Switched Telephone Networks. It is specified by the ITU-T as part of the Q.76x series, ANSI (T1.113-YEAR) and Telcordia former Bellcore GR-246 series. ) defines the messages and protocol used in the establishment and tear down of voice and data calls over the PSTN, managing the trunk network on which they rely. VoiceXML (www.w3.org). VoiceXML (Voice eXtensible Markup Language See XML. (language, text) Extensible Markup Language - (XML) An initiative from the W3C defining an "extremely simple" dialect of SGML suitable for use on the World-Wide Web. http://w3.org/XML/. ) is designed for creating audio dialogs that feature synthesized speech, digitized audio, recognition of spoken and DTMF key input, recording of spoken input, telephony and mixed-initiative conversations. Its major goal is to bring the advantages of Web-based development and content delivery to interactive voice response applications. WAP/WML (www.openmobilealliance.org). Wireless Application Protocol and Wireless Markup Language Wireless Markup Language, based on XML, is a content format for devices that implement the Wireless Application Protocol (WAP) specification, such as mobile phones, and preceded the use of other markup languages now used with WAP, such as XHTML and even standard HTML (which are refer to a markup language based on XML that is intended for use in specifying content and user interface for narrow band devices, including cellular phones and pagers. XHTML (www.w3.org). eXtended Hyper Text Markup Language is a family of current and future document types and modules that reproduce, subset and extend HTML 4. The XHTML document types are XML-based and are ultimately designed to work in conjunction with XML-based user agents. XML (www.w3.org). eXtensible Markup Language is a simple, very flexible text format derived from SGML (Standard Generalized Markup Language (language, text) Standard Generalized Markup Language - (SGML) A generic markup language for representing documents. SGML is an International Standard that describes the relationship between a document's content and its structure. ). Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere. X + V (www.w3.org). XHTML + Voice brings spoken interaction to standard Web content by integrating a set of mature Web technologies such as XHTML and XML Events with XML vocabularies were developed as part of the W3C Speech Interface Framework. The profile includes voice modules that support speech synthesis, speech dialogs, command and control, speech grammars and the ability to attach voice event handlers. For information and subscriptions, visit www.TMCnet.com or call 203-852-6800. RELATED ARTICLE: Definition Du Jour Speaker-independence In the earlier days of speech recognition technology, the ultimate user of the speech application had to "train" the system to his or her voice, therefore increasing the potential for success that the system will recognize his or her voice. Developers realized the impracticality of this method and the nuisance of having to continually "retrain" the system for different voices, particularly in enterprise applications. Today, many applications are "speaker-independent," meaning that the system can recognize a multitude of different voices and speech patterns, eliminating the need for constant retraining. By Dr. K. W. (Bill) Scholz, Ph.D. Unisys Corp. |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion