Speech Applications. How do they work?


What is a VUI?

A voice user interface (VUI) is what the user interacts with when using a speech application.

VUI elements:
- Prompts or system messages: prerecorded or synthesized
- Grammars: define the possible user responses to the prompts
- Dialog logic: defines the system's actions in response to the user's input

Technologies Involved

- Speech recognition: recognition of the user's input, based on a specific grammar
- Speech synthesis: on-demand rendering of system prompts
- Dialog control: implementation of the dialog logic

Handling Speech Input

[Diagram: processing pipeline]
User: "I want to go to Dallas"
Speech Endpointing -> Feature Extraction -> Recognition -> Natural Language Understanding -> Dialog Management -> Speech Synthesis
System: "OK, what time do you want to leave?"

Endpointing

- Detection of the beginning and end of speech
- Find where the utterance starts
- Wait for a sufficiently long pause to indicate the end
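The two endpointing steps above can be sketched with a simple energy threshold. This is only an illustration: the function name, the threshold value, and the pause length are assumptions, and the input is assumed to be one energy value per frame.

```python
def endpoint(frame_energies, threshold=0.1, min_pause_frames=30):
    """Return (start, end) frame indices of the utterance, or None if no speech.

    start is the first frame whose energy exceeds the threshold.
    end is the last speech frame seen before a pause of at least
    min_pause_frames quiet frames (or before the input runs out).
    """
    start = None
    last_speech = None
    quiet_run = 0
    for i, energy in enumerate(frame_energies):
        if energy > threshold:
            if start is None:
                start = i          # beginning of the utterance
            last_speech = i
            quiet_run = 0          # any speech resets the pause counter
        elif start is not None:
            quiet_run += 1
            if quiet_run >= min_pause_frames:
                break              # pause is long enough: utterance has ended
    if start is None:
        return None
    return start, last_speech
```

With 10 ms frames, `min_pause_frames=30` corresponds to a 300 ms pause; a shorter silence between words does not end the utterance.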

Feature Extraction

- Transforms the endpointed utterance into a sequence of feature vectors
- Feature vectors represent measurable characteristics of speech
- Most typical: energy at different frequencies
- Sequence of vectors, each one extracted from a 10 ms frame

[Diagram: Endpointed Utterance -> Feature Extraction -> Feature Vectors]
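As a rough sketch of "energy at different frequencies", the code below splits the signal into 10 ms frames and sums the spectral power in a few equal-width bands per frame. The 8 kHz sample rate, the four bands, and the function name are assumptions made for illustration; production recognizers typically use more elaborate features.

```python
import cmath
import math

def extract_features(samples, sample_rate=8000, frame_ms=10, num_bands=4):
    """Return one feature vector (band energies) per complete 10 ms frame."""
    frame_len = sample_rate * frame_ms // 1000   # 80 samples at 8 kHz
    half = frame_len // 2                        # bins below the Nyquist frequency
    band_size = half // num_bands
    features = []
    # Step through non-overlapping frames; a trailing partial frame is dropped.
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        # Naive DFT power spectrum (fine for short frames, slow for long ones).
        power = []
        for k in range(half):
            acc = sum(x * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n, x in enumerate(frame))
            power.append(abs(acc) ** 2)
        # Sum the power in each band to get the feature vector for this frame.
        vector = [sum(power[b * band_size:(b + 1) * band_size])
                  for b in range(num_bands)]
        features.append(vector)
    return features
```

With these defaults each band covers roughly 1 kHz, so a 1500 Hz tone puts almost all of its energy in the second element of each vector.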

Recognizer

- Determines the words that make up the utterance
- Features to phonemes: acoustic model
- Phonemes to words: pronunciation dictionary
- Grammars

[Diagram: Feature Vectors -> Recognition -> Word String ("I want to go to Dallas")]
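The phonemes-to-words step can be illustrated with a toy lookup. The sketch assumes the acoustic model has already produced a clean phoneme sequence, and the dictionary entries (ARPAbet-style symbols), the grammar, and the function names are all made up for this example. A real recognizer searches over many competing hypotheses with probabilities rather than doing exact matching.

```python
# Hypothetical pronunciation dictionary: word -> phoneme sequence.
PRONUNCIATIONS = {
    "i": ["AY"],
    "want": ["W", "AA", "N", "T"],
    "to": ["T", "UW"],
    "go": ["G", "OW"],
    "dallas": ["D", "AE", "L", "AH", "S"],
    "boston": ["B", "AA", "S", "T", "AH", "N"],
}

# Hypothetical grammar: the complete set of sentences the application accepts.
GRAMMAR = {
    "i want to go to dallas",
    "i want to go to boston",
}

def phonemes_to_words(phonemes):
    """Return every word sequence whose pronunciations concatenate to phonemes."""
    if not phonemes:
        return [[]]
    results = []
    for word, pron in PRONUNCIATIONS.items():
        if phonemes[:len(pron)] == pron:
            for rest in phonemes_to_words(phonemes[len(pron):]):
                results.append([word] + rest)
    return results

def recognize(phonemes):
    """Return the first segmentation allowed by the grammar, or None."""
    for words in phonemes_to_words(phonemes):
        sentence = " ".join(words)
        if sentence in GRAMMAR:
            return sentence
    return None
```

The grammar acts as a filter here: a phoneme sequence that spells valid words but not an accepted sentence is rejected.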

Natural Language Processing

- Assigns meaning to the words that were spoken
- Meaning is expressed as name-value pairs called slots
- Slots are defined for each item relevant to the application

[Diagram: Word String ("I want to go to Dallas") -> Natural Language Processing -> Meaning (destination: Dallas)]
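A minimal sketch of slot filling for the example above, assuming a single `destination` slot and a small hypothetical city list. The pattern and function name are illustrative only.

```python
import re

# Hypothetical list of destinations the application knows about.
CITIES = ("Dallas", "Boston", "Chicago")

# Matches "to <city>" anywhere in the word string, ignoring case.
DESTINATION_PATTERN = re.compile(
    r"\bto\s+(" + "|".join(CITIES) + r")\b", re.IGNORECASE
)

def extract_slots(word_string):
    """Map a recognized word string to name-value pairs (slots)."""
    slots = {}
    match = DESTINATION_PATTERN.search(word_string)
    if match:
        slots["destination"] = match.group(1).title()
    return slots
```

For the word string "I want to go to Dallas" this yields a single slot, `destination` with the value `Dallas`; a word string with no known city yields no slots.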

Dialog Management

Controls the actions of the system:
- Access a database
- Play back information to the user
- Perform a transaction
- Play a prompt requesting additional information

[Diagram: Meaning (destination: Dallas) -> Dialog Management -> Actions/Prompts ("OK, what day do you want to leave?")]
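One simple way to picture this is a slot-driven loop: the dialog manager merges newly understood slots into its state and either prompts for the next missing item or performs the transaction. The slot names, the prompts, and the action tuples below are assumptions for illustration, not part of any particular platform.

```python
# Hypothetical slots the application needs, each with the prompt that asks for it.
REQUIRED_SLOTS = [
    ("destination", "Where do you want to go?"),
    ("day", "OK, what day do you want to leave?"),
]

def next_action(state, new_slots):
    """Update the dialog state and decide what the system does next.

    Returns ("prompt", text) while information is missing, or
    ("transaction", slots) once every required slot has a value.
    """
    state.update(new_slots)            # keep track of application state
    for name, prompt in REQUIRED_SLOTS:
        if name not in state:
            return ("prompt", prompt)  # request additional information
    return ("transaction", dict(state))
```

Starting from an empty state, the meaning `destination: Dallas` leads to the prompt asking for the day, and supplying the day then triggers the transaction.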

Dialog Manager

- The heart of a speech application
- Coordinates all subsystems
- Passes information between them
- Keeps track of application state

[Diagram: the Dialog Manager connected to Capture Voice, ASR, DTMF, Replay Audio, and TTS]

Implementation Issues

Why do we need a speech application?
- Provide information to the user
- Make it easier to access a service anywhere, anytime

Objectives:
- Reusability of existing infrastructure
- Reusability of existing data
- Reusability of existing business logic

Bad Practice Example

"At first, Barpoint programmed nine variations of its site, but within a few months the company's staff was frantically trying to maintain 90 versions. That effort involved every available employee, pulling many away from their regular duties and causing the company's core business to suffer."
(The Industry Standard, 4/2/2001)

The Answer: VoiceXML

- A special-purpose language for describing interactive voice dialog
- Simplifies application development
- Minimizes Internet traffic interaction
- Separates user interaction code from application logic code
- Provides portability
- Provides simplicity
- Supports rapid prototyping and iterative refinement
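To give a feel for what such a dialog description looks like, the sketch below uses Python's standard `xml.etree.ElementTree` to build a small VoiceXML 2.0 document with one form field. The function name, the form id, the prompt text, and the example.com URLs are assumptions made for illustration; the grammar file and the server-side script are not shown.

```python
import xml.etree.ElementTree as ET

def build_destination_form(grammar_url, submit_url):
    """Return a VoiceXML document that asks for a destination and submits it."""
    vxml = ET.Element("vxml", {
        "version": "2.0",
        "xmlns": "http://www.w3.org/2001/vxml",
    })
    form = ET.SubElement(vxml, "form", {"id": "travel"})
    field = ET.SubElement(form, "field", {"name": "destination"})
    # The prompt is what the caller hears.
    prompt = ET.SubElement(field, "prompt")
    prompt.text = "Where do you want to go?"
    # The grammar defines what the caller may say in response.
    ET.SubElement(field, "grammar", {
        "src": grammar_url,
        "type": "application/srgs+xml",
    })
    # Once the field is filled, hand the value to the application logic.
    filled = ET.SubElement(field, "filled")
    ET.SubElement(filled, "submit", {
        "next": submit_url,
        "namelist": "destination",
    })
    return ET.tostring(vxml, encoding="unicode")

# Hypothetical URLs, used only to show the call.
document = build_destination_form(
    "http://example.com/cities.grxml", "http://example.com/book"
)
```

The document only describes the interaction (prompt, grammar, what to submit); the booking logic behind the submit URL lives elsewhere, which is the separation the slide refers to.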

Web Application

[Diagram: an HTML browser on the user's PC connects over the Internet to a web server (HTML, scripts, multimedia files), backed by an application server (business logic) and a database server (DB)]

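Speech Application

[Diagram: callers connect through the telephone network and a gateway to a VoiceXML browser (capture voice, ASR, DTMF, replay audio, TTS); the VoiceXML browser connects over the Internet to a web server (VoiceXML, scripts, grammars, audio files), backed by an application server (business logic) and a database server (DB)]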

Combining Both UIs

[Diagram: a shared application layer (database, app logic) feeds a presentation layer that produces VoiceXML for the VoiceXML browser and HTML for the HTML browser]
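The layering can be sketched as one piece of application logic with two thin presentation functions. Everything here (the timetable data, the function names, and the markup produced) is a hypothetical illustration of the idea, not a prescribed design.

```python
from html import escape

def find_flights(destination):
    """Application layer: hypothetical business logic shared by both UIs."""
    timetable = {"Dallas": ["08:15", "13:40"], "Boston": ["09:30"]}
    return timetable.get(destination, [])

def render_html(destination):
    """Presentation layer for the HTML browser."""
    items = "".join(f"<li>{escape(t)}</li>" for t in find_flights(destination))
    return (f"<html><body><h1>Flights to {escape(destination)}</h1>"
            f"<ul>{items}</ul></body></html>")

def render_voicexml(destination):
    """Presentation layer for the VoiceXML browser."""
    times = find_flights(destination)
    spoken = ", ".join(times) if times else "none"
    return ('<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml">'
            "<form><block>"
            f"<prompt>Flights to {escape(destination)} leave at: "
            f"{escape(spoken)}.</prompt>"
            "</block></form></vxml>")
```

Both renderers call the same `find_flights` function, so a change to the data or business logic is made once and shows up in both interfaces.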

Key Concepts

- Callers access commerce, content, and communications services via voice
- VoiceXML is a language for developing voice-enabled Web sites
- VoiceXML supports verbal menus and forms
- While HTML browsers execute on the user's PC, VoiceXML browsers execute on a speech server
- VUIs are very different from GUIs