Synopses & Reviews
Most people have experienced an automated speech-recognition system when calling a company. Instead of prompting callers to choose an option by entering numbers, the system asks questions and understands spoken responses. With a more advanced application, callers may feel as if they're having a conversation with another person. Not only will the system respond intelligently, its voice even has personality.
The Art and Business of Speech Recognition examines both the rapid emergence and broad potential of speech-recognition applications. By explaining the nature, design, development, and use of such applications, this book addresses two particular needs:
- Business managers must understand the competitive advantage that speech-recognition applications provide: a more effective way to engage, serve, and retain customers over the phone.
- Application designers must know how to meet their most critical business goal: a satisfying customer experience.
Author Blade Kotelly illuminates these needs from the perspective of an experienced, business-focused practitioner. Among the diverse applications he's worked on, perhaps his most influential design is the flight-information system developed for United Airlines, about which Julie Vallone wrote in Investor's Business Daily: "By the end of the conversation, you might want to take the voice to dinner."
If dinner is the analogy, this concise book is an ideal first course. Managers will learn the potential of speech-recognition applications to reduce costs, increase customer satisfaction, enhance the company brand, and even grow revenues. Designers, especially those just beginning to work in the voice domain, will learn user-interface design principles and techniques needed to develop and deploy successful applications. The examples in the book are real, the writing is accessible and lucid, and the solutions presented are attainable today.
Automated speech recognition (ASR) applications are poised to grow at a phenomenal pace. These systems allow you to speak naturally on the phone, while a computer system both understands what you say and answers knowledgeably. We all increasingly experience such applications when we telephone airline or financial companies for information. Other industries experimenting with these systems include automotive, security, and consumer goods companies. The key to the success of these systems is the design and development of effective voice user interfaces (VUIs). This is both an art and a science that requires an understanding of language, user needs, business requirements, and digital technology. This book provides an overview of this emerging field. It explains, both for managers and developers/designers new to VUIs, what the issues, challenges, and opportunities are, and gives a clear sense of what a well-designed system requires. Using real-world examples from successful, large-scale systems, it shows how a good speech recognition system can save a company money, increase customer satisfaction, and even grow revenue.
Includes bibliographical references (p. 175-176) and index.
About the Author
is the Creative Director of Interface Design for SpeechWorks International, a leading provider of automated speech-recognition software products and services. In addition to United Airlines, he has worked on applications for Apple Computer, E*TRADE, McKesson, Fidelity Investments IBG, FedEx, and others. A frequent conference speaker and university lecturer, Blade has had his work and ideas featured by The New York Times
, The Washington Post
, The Wall Street Journal
, the BBC, and National Public Radio.
Table of Contents
I. THE BACKGROUND. 1.On Telephones, Touchtones, and Business Needs.
Speech Recognition versus Touchtone Functionality.
Problems with Touchtone, and a Speech Recognition Remedy.
What Kinds of Companies Are Using Speech Recognition?
Why Are Companies Using It?
Speech-Recognition Applications: A Typical Example.
Where We've Been-Where We're Going. 2. Technology Primer: About Speech Recognizers.
What the Recognizer Hears (and the Need for Confirmation)
When the Recognizer Listens.
Why Designing a Speech-Recognition Application Is Challenging.
Where We've Been-Where We're Going. 3. The Psychology of How People Interact with Speech-Recognition Systems.
Ask “Dr.” Blade.
Where We've Been-Where We're Going.
II. THE PROCESS OF DESIGNING SPEECH-RECOGNITION SYSTEMS. 4. Research.
Callers' Objectives and Needs.
Aspects of Research.
Assembling a Requirements Specification.
Where We've Been-Where We're Going. 5. Developing the Design.
Conceptualizing and Brainstorming.
Congruence of Style.
Defining the Call Flow.
Vision Clips/Sample Calls.
The Design Specification-Conveying the Details of the Design.
Constructing a Design Specification.
Following Through on the Initial Design Phase.
Where We've Been-Where We're Going. 6. Writing Effective Prompts.
The Language of Asking Questions.
The Art of Writing Perfect Prompts.
Writing Prompts for Elegance, Speed, and Value.
Getting Callers to Focus on the Essentials.
Some Subtleties of Prompt Writing.
Top Five Good Tenets for Writing Prompts.
Top Five Mistakes When Writing Prompts.
Where We've Been-Where We're Going. 7. Production and Branding.
Notes About Implementation and Programming.
Prompt Creation-Text-to-Speech and Recorded Voices.
Directing Voice Talents.
The Art of Recording Prompts.
Other Thoughts on Directing.
Concatenative Prompt Recording.
Some Metrics and Technical Notes.
Where We've Been-Where We're Going. 8. Usability Testing.
The Value of Usability Testing.
How We Test an Application.
Objectives of Usability Testing.
Preparing for the Test.
The Test Subjects.
How to Get Test Subjects.
The Test Environment.
Types of Tests.
The Test Is Over-Now What?
Interpreting Test Results.
Where We've Been-Where We're Going. 9. Deployment.
The Importance of Multi-Phase Deployment.
The Three Phases of Deployment.
Where We've Been-Where We're Going.
III. APPLIED KNOWLEDGE. 10. Case Studies.
United Airlines: Shortcuts for Frequent Fliers.
United Airlines: Providing Extra Help for Those Who Need It.
Continental Airlines: A Different Approach to Flight Information.
A Top-Five Investment Management Company: Handling Complex Two-Choice Questions.
An Online Brokerage Firm: Managing More Complex Tasks.
An Online Brokerage Firm: Preferences and Other Rarely Used Functions.
A Regional Telephone Company: Dealing with Legal Notices and Disclaimers.
Wildfire: List Navigation.
Wildfire: Small Header, Large Body Lists.
A Top-Five U.S. Bank: Large Header, Small Body Lists.
The “Race Condition”.
FedEx: Scaffolding Prompts.
Amtrak: Implicit Confirmation and the “Ellipses/and” Question Form.
AirTran: Reducing the Information Burden.
Semantics: When “Problem” Was a Problem.
Where We've Been-Where We Must Go. Postscript.
Suggested Reading List.