A talk with Nuance's Lynda Kate Smith.The editors of Customer Interaction Solutions[R] recently spoke with Lynda Kate Smith Noun 1. Kate Smith - United States singer noted for her rendition of patriotic songs (1909-1986) Kathryn Elizabeth Smith, Smith , vice president and chief marketing officer of Nuance Communications Nuance Communications is a computer software technology company. Their current worldwide headquarters is in Burlington, Massachusetts in the United States. Nuance provides speech and imaging solutions and is mainly known for its speech recognition and speech synthesis software that (www.nuance.com), to hear her opinions on the present and future of speech technologies. CIS Cis (sĭs), same as Kish (1.) (1) (CompuServe Information Service) See CompuServe. (2) (Card Information S : What needs do speech-based call center technologies serve that traditional touch-tone IVR (Interactive Voice Response) An automated telephone information system that speaks to the caller with a combination of fixed voice menus and data extracted from databases in real time. systems do not? LKS LKS Lakes LKS Landau-Kleffner Syndrome LKS Liver, Kidney, Spleen (organ transplant) LKS Lucky Stores, Inc. LKS Last Known Status (military tracking systems) LKS Linux Kernel Subsystem LKS Linux Kernel Summit : More and more businesses, throughout a range of industries, are recognizing the business benefits of speech versus traditional touch-tone technology. Speech improves customer satisfaction and retention, reduces the overall cost of customer service and increases automation rates, as customers tend to stay within voice-enabled automated systems rather than opting out to a live agent. Analyst research indicates speech boosts customer use of automated systems from 20 to 60 percent, compared with touch-tone. On the business side, voice automation can dramatically reduce the cost of handling calls, delivering a return on investment in just a few months. Analysts at Cahners In-Stat and Giga report the average cost-per-call handled by agents ranges anywhere from $2 to more than $15. With voice automation, that cost is often cut to 20 cents or less per call. [ILLUSTRATION OMITTED] From a caller experience standpoint, speech technology shortens the duration of calls by eliminating on-hold times and providing more streamlined, round-the-clock access to information. Customers get where they want to be with fewer steps and hassles--something very important since today's customers have neither the time nor the patience to wait on hold or navigate through a myriad of touch-tone prompts to get what they need. Speech is intuitive and user-friendly (there's no need to memorize mem·o·rize tr.v. mem·o·rized, mem·o·riz·ing, mem·o·riz·es 1. To commit to memory; learn by heart. 2. Computer Science To store in memory: choices) and is more responsive to customers' needs. In fact, a 2003 Harris Interactive Harris Interactive (NASDAQ: HPOL) is an American market research company that specializes in public opinion research using both telephone and surveys on online panels. The company is the product of a 1996 merger between the Gordon S. Black Company and Louis Harris & Associates. study we commissioned revealed that nine out of 10 surveyed consumers preferred speech recognition to touch-tone. Another benefit of speech is enhanced security. It's been estimated by the Federal Trade Commission that identity theft and fraud affect nearly 10 million people and cost businesses upwards of $60 billion per year. Companies are searching for ways to secure their call centers while reducing costs and promoting customer convenience. Voice authentication (1) Verifying the integrity of a transmitted message. See message integrity, e-mail authentication and MAC. (2) Verifying the identity of a user logging into a network. (voiceprint A sample of a person's voice to be used for voice recognition or security systems. technology) allows businesses to offer secure, personalized per·son·al·ize tr.v. per·son·al·ized, per·son·al·iz·ing, per·son·al·iz·es 1. To take (a general remark or characterization) in a personal manner. 2. To attribute human or personal qualities to; personify. over-the-phone customer service. Many financial institutions have implemented voiceprint technology to verify customers and to reduce I.D. theft and fraud. Before transactions (i.e., checking a bank account balance) are authorized, users' spoken voiceprints are compared to those previously enrolled and on file. The transactions are performed only after an exact match is made, based on the unique characteristics of the caller's voice. With automated voice authentication, more callers remain in the automated system, and agents can focus on addressing high priority customer issues. CIS: At this point, have we seen all of the enterprise and consumer applications for speech, or do you think what we've got thus far is the tip of the iceberg tip of the iceberg n. pl. tips of the iceberg A small evident part or aspect of something largely hidden: afraid that these few reported cases of the disease might only be the tip of the iceberg. ? LKS: What we've seen so far is only the tip of a very large iceberg. In the past year alone, we've seen a tremendous increase in demand for voice automation solutions. Speech is no longer an add-on for big companies with ample discretionary budgets; it's a necessity for any company, large or small, that uses the telephone as a means of communicating with customers. We anticipate that speech applications for enterprises and consumers will evolve, continuing the current momentum. Voice automation is following a similar path that we've seen enterprise software markets travel over the past few years: it's moving from custom applications to packaged applications. Packaged applications speed speech application deployment and time-to-market while increasing customer satisfaction. Nuance's packaged applications leverage the company's best practices in voice automation yet also have the flexibility to meet company-specific business and technology requirements. Packaged applications empower companies to configure, manage and maintain their own applications, serving as an easy-to-deploy alternative to built-from-scratch speech applications. This trend will further drive adoption in all markets. CIS: I've heard it said that the actual code behind speech recognition is some of the most complicated code ever to be written. Can you expound ex·pound v. ex·pound·ed, ex·pound·ing, ex·pounds v.tr. 1. To give a detailed statement of; set forth: expounded the intricacies of the new tax law. 2. on why the creation of the actual engine is so complex? LKS: The combination of speech recognition (the ability to interpret words and commands) coupled with natural language understanding (the capability to conduct conversations between humans and machines) is very complex. Nuance alone has been refining this technology for more than 10 years as an independent entity and for many years prior as a part of the Stanford Research Institute Stanford Research Institute - Former name of SRI International. . There are two reasons for the complexity. The main reason, of course, is the complexity of human speech. It is widely varied, subtle, contextual, accented, and the engine must overcome the variations in spoken language to reach a correct interpretation of the request with a high degree of accuracy. The second challenge is deployment environment. Telephones are used everywhere, from the relative quiet of a home or office, to airports, trains and automobiles. They're both wired and wireless. Ultimately, it's any device with a microphone and a network connection. The speech engine must be able to overcome all of the challenges associated with the deployment environment and, to a further degree of complexity, in the product design. The good news is that both challenges listed above have been overcome by Nuance. Other companies have also attempted to solve these technical challenges, and many have abandoned their efforts over time because they were not successful. Instead, they've chosen to partner with Nuance, take our core technologies and build applications on top of them. CIS: Not very long ago, text-to-speech technologies had a reputation for being highly unreliable. How has this changed and improved in the last several years? LKS: Text-to-speech (TTS (1) See text-to-speech. (2) (Transaction Tracking System) Software that monitors a transaction until completion. In the event of a hardware or software failure, it ensures that the database is brought back to its former state before the attempt to ) is often confused with speech to text (in other words Adv. 1. in other words - otherwise stated; "in other words, we are broke" put differently , dictation) software. And it's true that many people's experiences with these primarily desktop products were poor. These products are "speaker dependent," requiring significant amounts of training to be able to recognize words and phrases Words and Phrases® A multivolume set of law books published by West Group containing thousands of judicial definitions of words and phrases, arranged alphabetically, from 1658 to the present. reliably. Telephony-based speech recognition, like that which Nuance provides, is "speaker independent," requiring no training by the user for accurate recognition. As for TTS, the principal issue has been vocal quality, not accuracy or reliability. This software has primarily been used as part of speech applications that deliver frequently changing information that is difficult and expensive to handle with recorded prompts (i.e., train or plane schedules). Speech recognition is used to understand a caller's request, recorded prompts are used to communicate information that rarely changes, and TTS software is used to speak dynamic information, such as the name and address, for a listing--information that is stored as text in a database. Nuance is continually improving the vocal quality of its TTS product to make it much more human and natural sounding and less mechanical. Recently, several TTS-related developments have contributed to this, including things like SSML SSML Speech Synthesis Markup Language (speech synthesis markup language Speech Synthesis Markup Language (SSML) is an XML-based markup language for speech synthesis applications. It is a recommendation of the W3C's voice browser working group. SSML is often embedded in VoiceXML scripts to drive interactive telephony systems. ) that enable developers to add emphasis and tune pronunciations. CIS: Why has Nuance chosen to focus only on enterprise, and not consumer, products? LKS: Nuance software powers a range of consumer-focused products, including the Personal Calling feature in GM Onstar, voice dialing from Sprint and other wireless carriers, and much more. All these products use Nuance's core expertise in telephony speech recognition, also known as network-based speech. We believe the network approach to offering consumer services Consumer Services refers to the formulation, deformulation, technical consulting and testing of most consumer products, such as food, herbs, beverages, vitamins, pharmaceuticals, cosmetics, hair products, household cleaners, [paints, plastics, metals, waxes, coatings, minerals, is far superior to the "embedded Inserted into. See embedded system. " approach some companies are taking primarily because of lack of processing power in today's devices. This limitation forces companies to go with a speaker-dependent approach to speech recognition. Just as with the desktop dictation example, speaker dependency has many drawbacks. CIS: I find the concept of Nuance's "persona creation services" quite fascinating. Do you have a team of psychologists testing and refining to find out what kind of "artificial voice" customers prefer to hear? Does the preference vary from industry to industry? LKS: Personas are a fun and interesting aspect of Nuance's speech applications, as for callers they are the "face" of a company/brand. Nuance research indicates that callers associate the system voice with specific characteristics--even if the system was not designed to exemplify any particular character--and are attracted to the personality that they deem most similar to their own. Callers are also drawn to specific persona traits (i.e., confident, warm, easy-going eas·y·go·ing also eas·y-go·ing adj. 1. a. Living without undue worry or concern; calm. b. Lax or negligent; careless. c. ). For example, a bank may want its persona to convey authority, security and clarity, so perhaps an ideal persona would be a 60-year-old retired investment banker Investment Banker A person representing a financial institution that is in the business of raising capital for corporations and municipalities. Notes: An investment banker may not accept deposits or make commercial loans. who worked for years on Wall Street. We have a team of Nuance persona consultants who work with partners and customers to develop personas. Nuance persona design and production consultants include experts in creative writing, conversational linguistics, sound design and audio production. The persona consultants use their deep working knowledge of natural language speech recognition applications to define the persona early in the design process, based on the identity of the target user group, user feedback about the task they are completing and the imagery associated with the company's brand. Once the persona is created, we run through a series of steps prior to going live. These include customer usability tests, focus sessions, customer surveys and recognition testing. We then conduct a performance analysis post-launch and make any necessary adjustments. We take this process very seriously, as a company's persona helps establish a consistent, positive experience for users that underscores its branding goals and enhances the specific image conveyed by its speech system. It is our goal to create the best (virtual) ambassador possible to represent a company's brand. CIS: Can you name a few of Nuance's OEM (Original Equipment Manufacturer) The rebranding of equipment and selling it. The term initially referred to the company that made the products (the "original" manufacturer), but eventually became widely used to refer to the organization that buys the products and partners that integrate Nuance technologies into their own products? LKS: Nuance works with a variety of partners to bring speech solutions to market. We do have OEM partners, who bundle our core speech technology into theirs as part of an overall solution they are delivering. These include companies like Avaya, Cisco, Edify ed·i·fy tr.v. ed·i·fied, ed·i·fy·ing, ed·i·fies To instruct especially so as to encourage intellectual, moral, or spiritual improvement. , Intervoice, Nortel and Syntellect, among others. Nuance also has relationships with value-added resellers, who deliver speech solutions to their customers based on Nuance's platform product, NVP NVP Network Voice Protocol NVP Nausea and Vomiting of Pregnancy NVP Name-Value Pair NVP National Vice President NVP Nominal Velocity of Propagation NVP N-Version Programming (multiple functionally equivalent program versions) . Representative partners in this category include Bell Canada Bell Canada Enterprises (TSX: BCE, NYSE: BCE), legally BCE Inc., is a major Canadian telecommunications company. Through its subsidiaries including Bell Canada, Bell Aliant, Northwestel, Télébec, and NorthernTel, it is the incumbent local exchange carrier for , Digital Data Voice, Telus Business Solutions, Versay and WorkForce Technologies, among others. Our ecosystem is further rounded out by a wide range of technology partners who offer complementary products and solutions. If you are interested in purchasing reprints of this article (in either print or HTML HTML in full HyperText Markup Language Markup language derived from SGML that is used to prepare hypertext documents. Relatively easy for nonprogrammers to master, HTML is the language used for documents on the World Wide Web. format), please visit Reprint Management Services online at www.reprintbuyer.com or contact a representative via e-mail at reprints@tmcnet.com or by phone at 800-290-5460. For information and subscriptions, visit www.TMCnet.com or call 203-852-6800. |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion