Printer Friendly
The Free Library
14,715,855 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Voice applications: a reality: tiny screens? Elusive connections? Voice-enable your Web applications so mobile users can access them via phone. It's easier than you think--as long as you take the time to properly plan how you'll integrate Web and voice technologies.


"PRESS 1 FOR ..." You name it. Whether they're interactive voice response (IVR (Interactive Voice Response) An automated telephone information system that speaks to the caller with a combination of fixed voice menus and data extracted from databases in real time. ) or speech-based, telephony-based systems are everywhere. U.S. companies provide their customers with an estimated 50 billion minutes per year of service using touchtone-based IVR systems. This translates to more than $3 billion a year for these systems and the infrastructure to support them.

Similarly, virtually every business today has implemented Web-based systems that provide a far richer set of services and capabilities enabling extensive and inexpensive self-service. But, when it comes to mobile customers and users, Web-based systems aren't always the right answer. Everything from screen size, difficulty entering text data, and the "hands-on/eyes-on" mode of operation create challenges to when and how a customers and employees can use mobile devices to access Web-based services.

Although both Web- and voice-based systems are essential to the mobile economy, their evolution has unfortunately been independent. Until recently, the technology to couple the development of these systems and reuse business logic and data assets hasn't been in place. As a result, enterprises have to deal with high costs and inconsistent service quality between the two channels. This article explores the relationship between Web and voice systems, and shows the possibilities for reducing expenses and the user frustration that comes from treating voice and Web applications as separate entities.

When does voice make sense?

Whether you're considering augmenting Web applications or replacing touch-tone telephony applications, there are several key "qualifying factors" that can help you determine if voice makes sense in your enterprise:

* Do you need to reach customers, partners, or associates when Web or other visual access (including WAP (1) (Wireless Access Point) See access point.

(2) (Wireless Application Protocol) A standard for providing cellular phones, pagers and other handheld devices with secure access to e-mail and text-based Web pages.
, PDA (Personal Digital Assistant) A handheld computer for managing contacts, appointments and tasks. It typically includes a name and address database, calendar, to-do list and note taker, which are the functions in a personal information manager (see PIM). , and smartphone visual access) just "isn't right," but the voice access via a phone is?

* Are your mobile customers or workers seeking hands-free, eyes-free access? Perhaps they're part of a demographic that's less comfortable with or adept at using the Web, or they're in locations where Internet connectivity isn't easily available.

* Does your enterprise manage call centers with live agents (in-sourced or out-sourced) that result in a high cost-per-transaction?

* Are customers and employees dissatisfied with hard-to-navigate, touch-tone applications?

Here are some of the benefits of voice-based applications:

* Voice access to selected applications gives customers and employees anytime, anywhere access via any phone.

* Voice applications dramatically reduce customer and employee service cost when compared to using live agents.

* Many touch-tone based applications cause significant customer dissatisfaction and "zeroing out" to live agents, which drives up transaction costs Transaction Costs

Costs incurred when buying or selling securities. These include brokers' commissions and spreads (the difference between the price the dealer paid for a security and the price they can sell it).
. Voice applications give users convenient, comfortable human-voice interaction with applications to decrease the transaction time and increase user satisfaction.

Serving the Web and voice channels today: separate and unequal

The diagram in figure 1 represents the implementation paradigm for the vast majority of enterprises with existing Web and voice systems.

[FIGURE 1 OMITTED]

In this common scenario, Web applications and voice applications are completely separate application silos: The Web application uses a three-tier model (more on this in the following section); and the voice application is a single application that intermingles the three tiers indiscriminately.

In many cases, this vertical separation of Web from voice creates a situation where the Web side offers a much broader set of applications because of the ability to quickly and cost-effectively build or extend applications. Conversely, the voice side suffers from a closed, proprietary application environment resulting in high costs for application development and deployment. The cost of building voice-oriented applications has discouraged many companies from adding new features or applications on the voice side, creating voice-oriented services distinctly inferior to Web-based services.

Web applications

Web applications have been long implemented using advanced techniques focused on a three-tier model of presentation, business logic, and data. In the three-tier model, you maintain each layer independently (with development independence at each level of the architecture) to simplify systems implementation and maintenance. For example, you call update Web application styles (e.g., graphics, logos, formats, and browser features) without impacting the underlying business logic or data connectivity. Similarly, you can make changes to data or legacy systems with zero customer impact because the presentation layer and business logic remain unchanged. Note that you can deliver the presentation layer of a Web application to a multiplicity of devices--ranging from standard PC browsers to mobile devices including PDAs and smartphones--by appropriately formatting the content for each type of device.

Voice applications

The Web, with its graphical user interfaces graphical user interface (GUI)

Computer display format that allows the user to select commands, call up files, start programs, and do other routine tasks by using a mouse to point to pictorial symbols (icons) or lists of menu choices on the screen as opposed to having to
 (GUIs), has evolved and matured into a highly sophisticated environment offering clear standards and rules for applications development. Voice systems, however, have developed in a fractured, proprietary market, resulting in pockets of proprietary GUI-based tools--you usually can't port applications between IVR systems from different vendors. For traditional IVRs, you can't do it at all. However, some IVRs are now starting to support standards for building applications which, in theory, should make it possible to port applications between platforms. This is still a challenge though because many companies add proprietary extensions that prevent the application from running on other platforms.

Voice systems generally fall two broad categories: Interactive Voice Response (IVR) systems, best known for their menu-driven touch-tone interaction, and standalone speech-based systems, which leverage automatic speech recognition (ASR (Automatic Speech Recognition) Using voice recognition to replace keypad entry for telephone voice menus. Typically used to speak the digits 0 through 9 insted of keying them, ASR systems may be able to recognize a limited vocabulary. See voice recognition and AVSR. ). Some systems support both speech and touch-tone applications. The application "speaks" to the user using either pre-recorded prompts or by automatically generating speech using text-to-speech (TTS (1) See text-to-speech.

(2) (Transaction Tracking System) Software that monitors a transaction until completion. In the event of a hardware or software failure, it ensures that the database is brought back to its former state before the attempt to
) technology. Modern TTS has advanced to often be virtually indistinguishable from a live person. Interaction with a voice application is through a voice user interface (VUI (Voice User Interface) A voice-controlled application on a computer, PDA or smartphone. A VUI is more sophisticated than an interactive voice response (IVR) system. It implies a wide range of commands rather than just voicing "yes" or "no." Contrast with GUI. ), similar in concept to a Web GUI (Graphical User Interface) A graphics-based user interface that incorporates movable windows, icons and a mouse. The ability to resize application windows and change style and size of fonts are the significant advantages of a GUI vs. a character-based interface. . Replacing the keyboard and mouse input in a GUI, VUIs accept spoken or touch-tone input from phone users. While GUIs return information in a visual format, VUIs render information to users via audio output--i.e., recorded prompts and/or dynamically synthesized syn·the·sized  
adj.
1. Relating to or being an instrument whose sound is modified or augmented by a synthesizer.

2. Relating to or being compositions or a composition performed on synthesizers or synthesized instruments.
 speech.

Voice application developers use a markup language markup language

Standard text-encoding system consisting of a set of symbols inserted in a text document to control its structure, formatting, or the relationship among its parts. The most widely used markup languages are SGML, HTML, and XML.
 similar to HTML HTML
 in full HyperText Markup Language

Markup language derived from SGML that is used to prepare hypertext documents. Relatively easy for nonprogrammers to master, HTML is the language used for documents on the World Wide Web.
 and XML XML
 in full Extensible Markup Language.

Markup language developed to be a simplified and more structural version of SGML. It incorporates features of HTML (e.g., hypertext linking), but is designed to overcome some of HTML's limitations.
 already used by Web developers. Using the Voice Extensible Markup Language See XML.

(language, text) Extensible Markup Language - (XML) An initiative from the W3C defining an "extremely simple" dialect of SGML suitable for use on the World-Wide Web.

http://w3.org/XML/.
 (VoiceXML), developers can create voice applications that incorporate various scripting languages A high-level programming, or command, language that is interpreted (translated on the fly) rather than compiled ahead of time. A scripting, or script, language may be a general-purpose programming language or it may be limited to specific functions used to augment the running of an  (e.g., Javascript or Perl) and Java controls to provide additional functionality such as database lookup A data search performed within a predefined table of values (array, matrix, etc.) or within a data file.  or dynamic content based on user input. VoiceXML, now several years old, is the defacto standard being adopted by vendors and users. Microsoft has also proffered a second voice application standard, Speech Application Language Tags For other meanings of the word salt or acronym "SALT", see salt (disambiguation).
Speech Application Language Tags (SALT) is an XML based markup language that is used in HTML and XHTML pages to add voice recognition capabilities to web based applications.
 (SALT), now starting to gain traction since the first products supporting it are just becoming available. VoiceXML is the defacto choice for anyone wanting to build a speech application. Unless an enterprise has a huge investment in using .NET and developing under .NET SDK (Software Developer's Kit) See developer's toolkit and Windows SDK.

SDK - Software Developers Kit (or "Software Development Kit").
, SALT isn't the best alternative.

Despite the emergence of VoiceXML as a markup language for voice systems, it hasn't completely eased the problems. Voice applications maintain a separate and unequal status when compared to Web applications. Most tools for VoiceXML-based application development reinforce the standalone voice system paradigm--through build-up build·up also build-up  
n.
1. The act or process of amassing or increasing: a military buildup; a buildup of tension during the strike.

2.
 of independent data, business logic, and presentation layers--in an attempt to be an all-inclusive, "walled garden Refers to a network or service that restricts its users to its own content. Cable TV and satellite TV are walled gardens, offering a finite number of channels and programs to its subscribers. " development environment. Also, many tools available today don't reinforce a three-tier development model, in fact some actually discourage that model.

Giving voice users equal access to Web content

With a Web extension to voice paradigm, a voice and Web systems architecture is transformed into the scenario illustrated in figure 2. In this architecture, users can create VUIs to existing business logic and data assets within the enterprise, offering new access for employees, customers and business partners without the inherent limitations of visual-only systems and without the expense of separate, independent voice-based systems. The tools to support application development for this type of paradigm are available today from some of the leading companies in the application server space: BEA BEA - Basic programming Environment for interactive-graphical Applications, from Siemens-Nixdorf.  has a plug-in for its WebLogic Workshop development toolkit from its partner SandCherry; IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries)  offers a voice toolkit for Web-sphere studio; and Microsoft has a speech application SDK for Visual Studio .NET A suite of programming languages and development tools from Microsoft that supports the .NET environment. Upon its introduction in 2001, it included Visual C# and .NET versions of Visual Basic and Visual C++. See .NET. .

[FIGURE 2 OMITTED]

By leveraging the vast installed base of Web applications, Web infrastructure, and development community, companies can cost-effectively expand the number of applications available to phones using touch-tone or speech. So, not only are voice applications a reality, they're a smart move.

MOBILE BUSINESS BENEFITS

When It comes to mobile customers and users, Web-based systems aren't always the right answer. Everything from screen size, difficulty entering text data, and the "hands-on/eyes-on" mode of operation create challenges to when and how customers and employees can use mobile devices to access Web-based services.

TAKE AWAYS

Here are the key concepts to understand about phone-based delivery of applications:

1. Voice and touch-tone are essential to complement Web-based services in any multi-channel service delivery strategy. The evolving voice services market represents a significant vehicle for mobile operators and companies to contact customers more often and more effectively.

2. Traditional voice application development methodologies are inherently wasteful and expensive because they involve duplicating development time and costs you've already invested in Web-based services.

3. Web-extension solutions for creating voice applications can reduce development costs by up to 80 percent. By extending Web-based applications--including the data, data connectivity, and business logic--to voice, rather than developing voice applications from scratch using proprietary technologies, companies can focus on just creating new voice user interfaces.

4. Using Web extension can drastically improve ROI (Return On Investment) The monetary benefits derived from having spent money on developing or revising a system. In the IT world, there are more ways to compute ROI than Carter has liver pills (and for those of you who never heard of that expression, it means a lot).  for both enterprise and service provider applications. Using a Web extension methodology to reduce development and maintenance costs can dramatically shorten payback Payback

The length of time it takes to recover the initial cost of a project, without regard to the time value of money.
 time for an application, allowing companies to justify a modest investment for speech-enabled services and offering significant dividends for the business.

Michael Sajor, vice president of worldwide business development for SandCherry, has more than 22 years of experience in business development and management of high-tech companies. Most recently, he served as vice president of business development for multimodal Two or more modes of operation. The term is used to refer to a myriad of functions and conditions in which two or more different methods, processes or forms of delivery are used. On the Web, it refers to asking for something one way and receiving the answer another; for example requesting  platform provider Kirusa, Inc., where he directed the company's marketing, business development, sales, and product management operations. Sajor began his career at AT&T Bell Laboratories and then served in various positions at Lucent Technologies including director of mobile Internet Refers to gaining access to the Internet using a lightweight, handheld device. See Mobile IP, PDA, smartphone and mobile TV.  world markets and product realization for the wireless organization and chief information officer for the company's wireless networks group. Michael has extensive experience working with mobile services and platforms including messaging, multimodality, speech, location-based services See mobile positioning. , presentation technologies, and mediation gateways. http://www.sandcherry.com
COPYRIGHT 2004 Advisor Publications, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Emerging Technologies
Author:Sajor, Michael
Publication:Mobile Business Advisor
Geographic Code:1USA
Date:Sep 1, 2004
Words:1690
Previous Article:Mobile development tools: although much of the challenge of mobile development comes down to good old-fashioned architecture and coding, you can ease...
Next Article:And it lets you make phone calls too!(Mobile Lifestyle Advisor[R])(Brief Article)
Topics:



Related Articles
Deliveries. (High Priority!).(Web services)
The fall lineup: the Zire and Tungsten T offer educators affordability and functionality.
Nokia 9290 communicator: this all-in-one device promises to provide all your mobile communication needs.(Smartphone)
Wi-Fi phones: a reality? The ability for your mobile phone to seamlessly roam between cellular networks and Wi-Fi networks is on the horizon, but...
TMC Labs Innovation Awards, Part I.(TMC[TM] Labs)
Conference sessions.(SPEECH-WORLD[TM])
TCS launches BlackBerry 7250.(TeleCommunication Systems Inc.)
That third screen--in your pocket! Think it's just a cell phone? Sure, you can make a call, but there's so much more in store.(tech talk)(Cell phones)
Mobile technology: one for all: emerging dual-mode phones give students a single device to manage all their voice and data needs.
Enterprise mobility: practice allows insurers to blend convergence, collaboration and management.(Technology)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles