Printer Friendly
The Free Library
14,487,175 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Can't stand the acronyms? Customer Inter@ction Solutions[R] presents "the standards: a primer".


The principal standards and standardized APIs (application program interfaces) that guide the operation and interaction of components in speech architecture are listed and described below. The agency responsible for each standard or API is shown in parentheses after the standard's name.

CCXML CCXML Call Control Extensible Markup Language
CCXML Call Control Xml Interpreter
 (www.w3.org). Call Control eXtensible Markup Language Call Control eXtensible Markup Language (CCXML) is an XML standard designed to provide telephony support to VoiceXML. Its current status is a W3C Working Draft, adopted 19 January 2007.  is designed to provide telephony call control support for dialog systems. CCXML is intended to serve as an adjunct language for use with a VXML, SALT or other dialog implementation platform.

HTTP HTTP
 in full HyperText Transfer Protocol

Standard application-level protocol used for exchanging files on the World Wide Web. HTTP runs on top of the TCP/IP protocol.
 (www.ietf.org). Hypertext Transfer Protocol See HTTP.

(protocol) Hypertext Transfer Protocol - (HTTP) The client-server TCP/IP protocol used on the World-Wide Web for the exchange of HTML documents. It conventionally uses port 80.

Latest version: HTTP 1.1, defined in RFC 2068, as of May 1997.
 is an application-level protocol for distributed, collaborative, hypermedia information systems. It is a generic, stateless protocol that can be used for many tasks beyond its use for hypertext, such as name servers and distributed object management systems, through extension of its request methods, error codes and headers.

H. 323 (www.itu.int/home/). H.323 is a standard that specifies the components, protocols and procedures that provide multimedia communication services--realtime audio, video and data communications--over packet networks, including Internet protocol (IP)-based networks. H.323 is part of a family of recommendations that provide multimedia communication services over a variety of networks.

JDBC (www.sun.com). Java Database Connectivity (database, programming) Java Database Connectivity - (JDBC) Part of the Java Development Kit which defines an application programming interface for Java for standard SQL access to databases from Java programs.

http://java.sun.com/products/jdk/1.1/docs/guide/jdbc/index.html.
 is an API that allows developers access to virtually any tabular data source from the Java programming language. It provides cross-DBMS connectivity to a wide range of SQL databases and, with the JDBC API, it also provides access to other tabular data sources, such as spreadsheets or flat files.

ODBC (www.microsoft.com). Online Database Connectivity is a widely accepted API for database access. It is based on the Call-Level Interface (CLI) specifications from X/Open and ISO/IEC ISO/IEC International Organization for Standardization/International Electrotechnical Commission (ITU-T M 3000)  for database APIs and uses Structured Query Language See SQL.

Structured Query Language - SQL
 (SQL) as its database access language.

SALT (www.w3.org). Speech Application Language Tags For other meanings of the word salt or acronym "SALT", see salt (disambiguation).
Speech Application Language Tags (SALT) is an XML based markup language that is used in HTML and XHTML pages to add voice recognition capabilities to web based applications.
 is a platform-independent standard that makes possible multimodal and telephony-enabled access to information, applications and Web services from PCs, telephones, tablet PCs and wireless PDAs (personal digital assistants). The standard extends existing mark-up languages such as HTML, XHTML and XML.

SIP, RTP, MGCP (www.ietf.org). SIP (Session Initiation Protocol (protocol) Session Initiation Protocol - (SIP) A very simple text-based application-layer control protocol. It creates, modifies, and terminates sessions with one or more participants. Such sessions include Internet telephony and multimedia conferences.

It is described in RFC 2543.
) is a signaling protocol for Internet conferencing, telephony, presence, events notification and instant messaging. RTP (Real-time Transport Protocol (protocol) Real-Time Transport Protocol - (RTP) An Internet protocol for transmitting real-time data such as audio and video. RTP itself does not guarantee real-time delivery of data, but it does provide mechanisms for the sending and receiving applications to support streaming ) is a protocol for the transport of real-time data, including audio and video. MGCP/MEGACO (Media Gateway Control Protocol) addresses the relationship between the media gateway, which converts circuit-switched voice to packet-based traffic, and the media gateway controller (sometimes called a softswitch), which dictates the service logic of that traffic.

SRGS (www.w3.org). Speech Recognition Grammar Specification Speech Recognition Grammar Specification (SRGS) is an W3C recommendation that defines syntax for representing grammars for use in speech recognition so that developers can specify the words and patterns of words to be listened for by a speech recognizer.  defines the syntax for grammar representation intended for use by speech recognizers and other grammar processors so that developers can specify the words and patterns of words to be listened for by a speech recognizer.

SSML (www.w3.org). Speech Synthesis Markup Language Speech Synthesis Markup Language (SSML) is an XML-based markup language for speech synthesis applications. It is a recommendation of the W3C's voice browser working group. SSML is often embedded in VoiceXML scripts to drive interactive telephony systems.  is a rich, XML-based markup language for assisting the generation of synthetic speech in Web and other applications. Its essential role is to give authors of synthesizable content a standard way to control aspects of speech output such as pronunciation, volume, pitch, rate, etc., across different synthesis-capable platforms.

SS7/ISUP (www.ietf.org). Signaling System 7 is an architecture for performing out-of-band signaling in support of the call-establishment, billing, routing and information-exchange functions of the PSTN (public switched telephone network). It identifies functions to be performed by a signaling-system network and a protocol to enable their performance. ISUP (ISDN User Part The ISDN User Part or ISUP is part of the Signaling System #7 which is used to set up telephone calls in Public Switched Telephone Networks. It is specified by the ITU-T as part of the Q.76x series, ANSI (T1.113-YEAR) and Telcordia former Bellcore GR-246 series. ) defines the messages and protocol used in the establishment and tear down of voice and data calls over the PSTN, managing the trunk network on which they rely.

VoiceXML (www.w3.org). VoiceXML (Voice eXtensible Markup Language See XML.

(language, text) Extensible Markup Language - (XML) An initiative from the W3C defining an "extremely simple" dialect of SGML suitable for use on the World-Wide Web.

http://w3.org/XML/.
) is designed for creating audio dialogs that feature synthesized speech, digitized audio, recognition of spoken and DTMF key input, recording of spoken input, telephony and mixed-initiative conversations. Its major goal is to bring the advantages of Web-based development and content delivery to interactive voice response applications.

WAP/WML (www.openmobilealliance.org). Wireless Application Protocol and Wireless Markup Language Wireless Markup Language, based on XML, is a content format for devices that implement the Wireless Application Protocol (WAP) specification, such as mobile phones, and preceded the use of other markup languages now used with WAP, such as XHTML and even standard HTML (which are  refer to a markup language based on XML that is intended for use in specifying content and user interface for narrow band devices, including cellular phones and pagers.

XHTML (www.w3.org). eXtended Hyper Text Markup Language is a family of current and future document types and modules that reproduce, subset and extend HTML 4. The XHTML document types are XML-based and are ultimately designed to work in conjunction with XML-based user agents.

XML (www.w3.org). eXtensible Markup Language is a simple, very flexible text format derived from SGML (Standard Generalized Markup Language (language, text) Standard Generalized Markup Language - (SGML) A generic markup language for representing documents. SGML is an International Standard that describes the relationship between a document's content and its structure. ). Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere.

X + V (www.w3.org). XHTML + Voice brings spoken interaction to standard Web content by integrating a set of mature Web technologies such as XHTML and XML Events with XML vocabularies were developed as part of the W3C Speech Interface Framework. The profile includes voice modules that support speech synthesis, speech dialogs, command and control, speech grammars and the ability to attach voice event handlers.

For information and subscriptions, visit www.TMCnet.com or call 203-852-6800.

RELATED ARTICLE: Definition Du Jour

Speaker-independence

In the earlier days of speech recognition technology, the ultimate user of the speech application had to "train" the system to his or her voice, therefore increasing the potential for success that the system will recognize his or her voice. Developers realized the impracticality of this method and the nuisance of having to continually "retrain" the system for different voices, particularly in enterprise applications. Today, many applications are "speaker-independent," meaning that the system can recognize a multitude of different voices and speech patterns, eliminating the need for constant retraining.

By Dr. K. W. (Bill) Scholz, Ph.D. Unisys Corp.
COPYRIGHT 2004 Technology Marketing Corporation
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Speech-World[TM]
Author:Scholz, K.W. (Bill)
Publication:Customer Interaction Solutions
Geographic Code:1USA
Date:Aug 1, 2004
Words:970
Previous Article:SALT Forum advances mobile content delivery with enhancements to Scalable Vector Graphics specification.(Speech-World[TM])(Speech Application...
Next Article:A view from the outside: defining external call center performance metrics.(Managing by the Numbers)
Topics:



Related Articles
An important message from the publisher of customer inter@ction solutions[TM] magazine. (Publisher's Outlook).(Editorial)
Letters to the editor.(Letter to the Editor)
TMC honors industry visionaries.(New Products)(Brief Article)
TMC launches "Speech-World" Conference in Dallas this November 30-December 2, 2004.
Conference info.(SPEECH-WORLD[TM])
Introducing ... the Boardroom Report[TM]: the CRM, customer interaction and call center industry's view from the top.(Publisher's Outlook)(Editorial)
It's top 50 teleservices & MVP: award time!
Our first-ever award determined by readers.(2006 TMC[R] BEST IN CLASS AWARD)
Customer Inter@ction Solutions celebrates 25 years of excellence.(SPECIAL FOCUS)
The call center and teleservices event of the year celebrates the last quarter century.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles