EVALUATING DIALOGUE STRATEGIES IN A SPOKEN DIALOGUE SYSTEM FOR
|
|
- Alexander Turner
- 5 years ago
- Views:
Transcription
1 EVALUATING DIALOGUE STRATEGIES IN A SPOKEN DIALOGUE SYSTEM FOR Fernando Farfán, Heriberto Cuayáhuitl and Alberto Portilla Intelligent Systems Research Group Department of Engineering and Technology Universidad Autónoma de Tlaxcala Apartado Postal No. 140 Apizaco, Tlaxcala, México {farfan, hcuayahu, aportilla}@ingenieria.uatx.mx ABSTRACT This paper presents an evaluation of directed dialogue (DD) and mixed initiative (MI) strategies in a spoken language system for . We compare the DD strategy, in which the system controls the dialog, to the MI strategy, in which users can flexibly control the dialog. For evaluating both strategies we used the PARADISE framework, which supports comparisons among dialogue strategies. Our experimental results show that the MI strategy performance surpasses the DD strategy in efficiency and user satisfaction, but needs more work for achieving qualitative measures and task success. KEY WORDS Dialogue strategies, spoken dialogue systems, by phone, speech recognition, and word spotting 1 Introduction In Mexico, automated services by telephone are few and most of these services are IVR-DTMF systems based on touch tone interfaces. Currently, these services tend to be replaced by Spoken Language Systems (SLS), which are more sophisticated than IVR-DTMF systems, provide a broader range of input, reduce call hold time, and generate costs savings. A SLS allows a human computer interaction based on human speech, using speech technology such as automatic speech recognition (ASR) and speech synthesis or Text To Speech (TTS). A SLS can interact with the user by employing a dialogue strategy such as Directed Dialogue (DD) or Mixed Initiative (MI). DD guides the user interaction to the achievement of goals, with the system always taking the initiative in the interaction, while in MI either the system or the user can take the initiative in the interaction. For example, consider the two dialogue strategies exemplified in D1 and D2, where S is the system and U is the user. In fact, the dialogues were in Spanish, but we here give only their English translation. Directed Dialogue Dialogue Strategy (D1) S: Welcome to the system by phone! S: Since this is your first call, I will provide you with some instructions. S: Your messages are presented as a sorted list, so you can say: first, next, previous, or last. If you want to erase a message, say delete, otherwise, if you want to hear a message, say read. To end the call, say good bye. S: Now you will hear your list of messages, remember that if you dont know what to say, you can say instructions. Let s begin! S: Message sent by Armando Rocha, subject my new cell phone number. U: (silence) S: Message sent by Alberto Morales, subject Company meeting. U: Read S: Please, hold on while I get the message. S: The message is: S: Hi, I just want to let you know that the meeting will be next Friday at 5:30. Don t be late. S: That was the message. Please say one of the following choices: repeat, reply, delete or continue. U: Good bye S: Thanks for calling the system by phone. Good bye. Mixed Initiative Dialogue Strategy (D2) S: Welcome to the system by phone! S: Since this is your first call, I will provide you with some instructions. S: If you want to hear a message, you can say I would like to hear the messages of Ricardo Pérez, if you want to reply to a message you can say I want to reply to the message of Ricardo Pérez. S: What would you like to do? U: Hi, I would like to hear Alberto Morales s messages. S: Please, hold on while I get the messages. S: There is just one message and the message is: S: Hi, I just want to let you know that the meeting will be next Friday at 5:30. Don t be late. S: That was the message. What would you like to do? U: Good bye S: Thanks for calling the system by phone. Good bye.
2 I N P U T Spontaneous Continuos Natural Language Word Spotting Discrete Words Human operator Natural Dialogue System Natural Language System IVR-Speech Instruction s Wait For Call true Welcome Prompt NIP OK? false Request NIP Digits IVR-DTMF Menu Directed Dialogue Mixed Initiative DIALOGUE COMPLEXITY Free Flowing New and Unread Action to perform Invalid User Thanks For Calling Figure 1. Spoken Language Systems Classification [6]. Body- Part Header Delete Hang Up At first glance, MI may seem a better strategy than DD, but previous works suggest that there are several difficulties with MI strategy [1-3]. First, MI requires more complex grammars, which could result in high automatic speech recognition (ASR) error rates. Second, MI may require users to learn what the system can understand, due to the fact that the system does not prompt with valid vocabulary. Thus, our motivation in this work is to evaluate both dialogue strategies (DD and MI) in order to contribute to the solution of the problems described above which MI confronts today. This paper presents a comparative analysis of the performance of two dialogue strategies (DD and MI) in a spoken language system for accessing by phone. In this work we used the most sophisticated evaluation framework currently available for spoken language systems, PAR- ADISE [4]. Our experimental framework uses a state of the art speech recognizer, which has the ability to handle out-of-vocabulary words [5]. The following section describes the system design. Section 3 describes our experimental design, where we provide details of the evaluation methodology. In section 4, we describe our experimental results. Finally, in section 5 we provide our conclusions and comment on future directions of this work. 2 System Design Spoken language systems can be classified by dialogue complexity and by the type of speech input provided by the user. A SLS classification is shown in Figure 1. In order to evaluate both dialogue strategies, two systems were developed: a Speech-IVR system and a Natural Language System (NLS). The Speech-IVR system uses the DD strategy where discrete words are used as speech input, while MI uses a NLS with word spotting capabilities and continuous natural language as speech input. Our systems provide the following capabilities: Reply Figure 2. High-Level diagram for DD and MI strategies. User Authentication, Tells the number of new and unread s, Headers and body-part consultation, Deleting and replying messages A high level call flow diagram for our systems is illustrated in Figure 2. To help users to interact with the system, the system instructs them in the first call, after user authentication. Our dialogue manager uses a state machine to implement both dialogue strategies. Most of the states include a DialogModule T M provided by the recognizer, which takes care of the conversation. A DialogModule includes: an initial prompt, timeout prompts, retry prompts, confirmation prompts, a help prompt, and a vocabulary or grammar. We enabled a set of global commands to be reached anytime in the call flow: cancel, help, instructions, and call termination. The system implementing the DD strategy only uses vocabularies. The system implementing the MI strategy uses grammars, which include a set of filler models to the left and right of keywords for modeling out-of-vocabulary words. In this way we were able to recognize any phrase with at least one keyword and at most two keywords (action and name). The communication architecture components provide critical support for our systems. Figure 3 shows the system architecture. The following hardware and software components were employed in the system communication architecture: ASR: SpeechWorks Recognizer 6.5 Second Edition for Mexican Spanish (barge-in enabled)
3 Automatic Speech Recognizer Speech Synthesizer MAXIMIZE USER SATISFACTION PSTN Card Telephony Interface VRU INTERNET/ INTRANET MAXIMIZE TASK SUCCESS MINIMIZE COST MEASURES Storage Resources Kappa EFFICIENCY METRICS QUALITATIVE METRICS Device Access Mail Server Database Elapsed time, user and system turns. Retry, barge-in, cancel, etc. Figure 3. System communication architecture. Figure 4. PARADISE s structure of objectives for dialogue performance [4]. Speech Synthesizer: voice Eloquent 5.0 Mexican male Card Telephony Interface: Dialogic D21H Programming Language: C++ Telephone Line Simulator: Skutch AS-26 Mail Server: VisNetic MailServer Operating System: Windows NT 4.0 Telephone: Telmex Regulated, BE-408 Type 1 PC: Pentium 600 MHZ, 256 MB RAM 3 Experimental Design The experimental setting was similar to that described in [2]; we also applied the PARADISE evaluation framework for comparing dialogue strategies. The overall structure of objectives in PARADISE that provides the basis for estimating a performance function is shown in Figure 4, with user satisfaction at the top level. In this framework, user satisfaction is correlated to user task success and such success has a payment in cost measures. The cost measures can be classified as either efficiency measures or qualitative measures. Efficiency measures are correlated to the user and the system interaction as a normal course of events (the user speaks and the system listens and understands perfectly what the user just said, while the time goes by), but not everything works perfectly, the system can fail in automatic speech recognition and produce a set of retries or timeouts (qualitative measures) when the user does not know exactly what to say by example. To know task success, the coefficient Kappa is calculated from a confusion matrix that summarizes how well a user or system achieves the information goals of a particular task for a set of dialogues instantiating a set of scenarios. Kappa is defined by Equation 1, and it is described in [7]. P(A) is the proportion of times that the Attribute-Value Matrix (AVM) for a dialogue agrees with the AVM for the scenario key, and P(E) is the proportion of times we would expect the AVMs for the dialogues and keys to agree by chance. When agreement is perfect (all task information items are successfully exchanged), then Kappa is 1. When agreement is only at chance, then Kappa is 0. κ = (p(a) p(e))/(1 p(e)) (1) where, p(e) = n i=1 (t i/t ) 2 (2) and, p(a) = ( n i=1 M(i, j))/t (3) As shown in Figure 4, performance also includes a function that combines cost measures. The PARADISE framework represents each cost measure as a function c i that must be minimized. These cost measures and the Kappa coefficient are the basis for formulating the equation for system performance. This system performance is calculated by the following equation. performance = (α N(κ)) i=1 w i N(c i ) (4) Where α is a weight on κ, the cost functions are weighted by w i, and N is a Z score normalization function, defined by the following equation. N(x) = (x x)/σ x (5) For evaluating both dialogue strategies, 50 undergraduate students, with an average age of 21 and who receive on average four s per day, tested our system. Their only previous experience in consulting was through the UNIX pine tool or with the support of a graphic user
4 Table 1. Tasks to be performed by each user. Task Description Goals 1 Armando Rocha has Get Armando s prepared a meeting, so he cellular telephone is expecting you to call number. him. He sent an with his cellular telephone number. 2 Fernando Mata Terrazas has Get wedding s day invited you to his wedding. and hour. Delete the message. 3 Alberto Morales sent an Get meeting s day about a meeting next and hour. Friday and he wanted you Confirm your to confirm your attendance. assistance to the meeting. interface such as HotMail or Yahoo Mail Service. Each student was requested to perform three tasks including five goals, giving us a total of 125 goals per strategy. The tasks are listed in Table 1. It is important to mention that our testers had no previous experience with a SLS, so that our experiments were performed with novice users. Before our testers started a call, they were given the following tips for using the systems, recommended by [8]. If you do not know what to say, you can say help anytime you want. If the system didn t understand you correctly and is performing a wrong task, you can say cancel to go back a previous step. If you keep silent, the system will tell you what to say. You can barge-in a system dialog, so you do not have to wait until it finishes a word or phrase. When you finish your tasks, you can say good bye to end your call. The data collection for evaluation consisted in: The recording of dialogues using the.ulaw audio files. The recording of metrics for each user session. A survey that helped us to know the opinion about the system usage. In this way, we determined a set of objective and subjective metrics. These metrics are: Task Success, Bargein, Timeout, Retry, Elapsed Time, Help, Cancel, System Turns, User Turns, and Mean Recognition Score (MRS). To measure task success, users provided us with the information required by each task. We used an Attribute Table 2. Tasks to be performed by each user. Attribute Day Hour Delete Possible Values Any day from Monday to Sunday 5:30, any other hour Yes, No Value Matrix (AVM) to represent the information retrieved from users s. An example of AVM for task 2 is shown in Table 2. In order to compute user satisfaction with subjective metrics, we performed a survey with the following questions: TTS quality: Was the system easy to understand? ASR quality: In this conversation, did the system understand what you said? Task Ease-of-Use: In this conversation, was it easy to find the message you were looking for? Interaction Pace: Was the pace of interaction with the system appropriate in this conversation? User experience: Did you know what you could say at each point of the dialogue? System response: How often was the system sluggish and slow to reply to you in this conversation? Expected behavior: Did the system work the way you expected? Future use: Based on your current experience with the system, would you use it regularly as another medium for accessing ? The user satisfaction survey allowed users to fill in a web page with 8 questions and multiple choice answers. The possible answers to most of the questions were: almost never, rarely, sometimes, often, almost, and always. Some questions had responses like yes, no or maybe. Every response was mapped to an integer from 1 to 5, with 5 representing the highest score. The survey also included a text field where users were encouraged to introduce comments about our systems. All responses were summed up, resulting in a User Satisfaction measure for each dialogue ranging from 8 to 40, where 8 is the worst case for all questions and 40 represents the best case. 4 Experimental Results For computing the system performance by (4), we computed a Multiple Linear Regression (MLR) over all objective metrics, which produced a set of coefficients (weights) describing the relative contribution of each
5 Table 3. Metrics performance for DD and MI strategies. Metrics Directed Mixed Dialogue Initiative Efficiency Measures System Turns User Turns Elapsed Time (secs) Qualitative Measures MRS Timeout Retry Help Cancel Barge-in Task Success (κ) User Satisfaction factor. We used US (user satisfaction) as the predictor factor. The results of the regression showed that Elapsed Time (ET), Barge-In and Retry were the most significant factors (p < 0.028). Task success (κ) was computed using equation 1, and ET, Barge-In and Retry were taken from the log files of our systems. Using these metrics we computed the performance function with a second MLR for obtaining α and w i, resulting in the following performance function. perf ormance = 4.950(κ) (ET ) (Barge in) 0.325(Retry) A summary of metrics collected from our systems applying the PARADISE framework is listed in Table 3. 1 This table shows that MI is more efficient and if fact better than DD in three measures (System Turns, User Turns and Elapsed time). However, according to the qualitative measures, DD is better in Mean Recognition Score (MRS), Timeout, Retry and Barge-In. Task Success in the DD strategy was also better than in the MI strategy. It is important to mention that the MRS for MI had a good performance (0.82) despite the difficulty of the recognition tasks in this strategy. We attribute good speech recognition performance in MI because of the use of a word spotting technique based on the use of explicit garbage models [5]. Table 4 1 shows that users prefer the MI strategy to the DD strategy. In this table we can observe some relevant positive results for the MI strategy: Task ease-of-use, user expertise, expected behavior and future use. We find these results interesting due to the fact that we have 25 different opinions for each dialogue strategy. Thus, we assume that users may find MI easy to use because they can say the action they want in just one utterance. Even when task 1 The numbers in bold font represent better results compared with the other strategy. Table 4. User Satisfaction Survey Results. Criteria Directed Mixed Dialogue Initiative TTS Quality ASR Quality Task ease Interaction pace User expertise System response Expected behavior Future use User Satisfaction success is lower in MI, users feel confident in what to say and expect from the system. Also, it seems that users liked MI more due to the fact that they say would use it in the future as another medium for reading . These results tell us that in a near future MI could be the dialogue strategy most used by spoken dialogue systems. We consider that the key factors are MRS and User Interface (UI) design. So our hypothesis is that having a friendly and easy to use UI and a robust speech recognizer, the MI strategy can surpass the DD strategy. Finally, our experiments were performed with novice users, and we suspect that with expert users, MI could reach the same scores in task success as the DD strategy. Also, timeouts and retries would be reduced dramatically due to the fact that users would know what to say and as a consequence MRS may be more accurate. 5 Conclusion We presented our experiments comparing two dialogue strategies, directed dialogue (DD) and mixed initiative (MI), in the context of a spoken dialogue system for accessing by phone. Our results showed that the MI dialogue strategy is more efficient, which means that the metrics System Turns, User Turns and Elapsed time score better than the DD strategy. Also, our results showed that the DD strategy is better in qualitative metrics such as Task Success, MRS, Timeout and Retry. At this point, we conclude that there are two key components that may improve the performance in dialogue strategies: A robust speech recognizer and a well designed user interface. Achieving these key components, better results can be obtained for efficiency and qualitative metrics in both dialogue strategies. Furthermore, our results showed that user satisfaction is better in the MI strategy. Thus, combining high performance in efficiency and qualitative metrics with user satisfaction makes the MI strategy a potential dialogue strategy to be most used by spoken dialogue systems. For achieving high scores in MRS, it is necessary to deal robustly with out-of-vocabulary speech. Our immediate future direction consists in a detailed
6 study of out-of-vocabulary speech, which according to our results is a key component for dealing with the MI strategy. Other future directions consist in a revision of the UIs considering the addition of interaction demonstrations and careful design of timeouts and retries, which will be performed taking into account our experimental results, user comments and a speech data analysis. Finally, another important issue would be related to user experience. Our results are for novice users, may be for expert users the preference and performance for MI is even more? 6 Aknowledgement This research was partially supported by SpeechWorks International Inc. with equipment and software licenses. We would like to thank Ben Serridge for his writing revision on this paper. References [1] Danieli, M., and Gerbino, E., Metrics for Evaluating Dialogue Strategies in a Spoken Language System, In Proceedings of AAAI Spring Symposium on Empirical Methods in Discourse Interpretation and Generation, California, USA, 1995, [2] Walker, M. A., Fromer, J., Fabbrizio, D. G., Mestel C., and Hindle, D., What Can I Say? Evaluating a Spoken Language Interface to , In Proceedings of CHI, California, USA, 1998, [3] Walker, M. A., Kamm, C. A, and Litman, D., Towards Developing General Models of Usability with PAR- ADISE, Natural Language Engineering: Special Issue on Best Practice in Spoken Dialogue Systems, [4] Walker, M. A., Litman, D., Kamm, C. A, and Abella, A., PARADISE: A framework for Evaluating Spoken Dialogue Agents, In Proceedings of ACL/EACL, 1997, [5] Cuayáhuitl, H. and Serridge, Out-Of-Vocabulary Word Modeling and Rejection for Spanish Keyword Spotting Systems, In Proceedings of the Mexican International Conference on Artificial Intelligence, LNAI 2313, Mérida, Mexico, 2002, [6] Thomas, B. S. and Joy J., Saying what Comes Naturally, Speech Technology Magazine, March-April [7] Carletta, J. C., Assessing the Reliability of Subjective Coding, Computational linguistics, 22(2), 1996, [8] Telephone Speech Standards Committee, Universal Commands for Telephony-Based Spoken Language Systems, CHI Bulletin, Volume 32, Number 2, April 2000,
EVALUATING COMPETING AGENT STRATEGIES FOR A VOICE AGENT
EVALUATING COMPETING AGENT STRATEGIES FOR A VOICE EMAIL AGENT Marilyn Walker, Donald Hindle, Jeanne Fromer, Giuseppe Di Fabbrizio, Craig Mestel AT&T Labs Research 180 Park Ave, Florham Park, NJ 07932,
More informationEvaluating Spoken Dialogue Systems. Julia Hirschberg CS /27/2011 1
Evaluating Spoken Dialogue Systems Julia Hirschberg CS 4706 4/27/2011 1 Dialogue System Evaluation Key point about SLP. Whenever we design a new algorithm or build a new application, need to evaluate it
More informationDeveloping And Testing General Models Of Spoken Dialogue System Performance
Developing And Testing General Models Of Spoken Dialogue System Performance Marilyn Walker, Candace Kamm, Julie Boland AT& T Labs - Research 180 Park Ave, Florham Park, N.J. 07932, U.S.A walker, cak@research.att.com,boland@louisiana.edu
More informationSR Telephony Applications. Designing Speech-Only User Interfaces
SR Telephony Applications Designing Speech-Only User Interfaces Overview Types of Services Designing Speech-Only Applications Feedback and Latency Prompting Strategies Handling Errors Usability Testing
More informationINFORMATION ACCESS VIA VOICE. dissertation is my own or was done in collaboration with my advisory committee. Yapin Zhong
INFORMATION ACCESS VIA VOICE Except where reference is made to the work of others, the work described in this dissertation is my own or was done in collaboration with my advisory committee. Yapin Zhong
More informationPRACTICAL SPEECH USER INTERFACE DESIGN
; ; : : : : ; : ; PRACTICAL SPEECH USER INTERFACE DESIGN й fail James R. Lewis. CRC Press Taylor &. Francis Group Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Group, an informa
More informationWIRE: A Wearable Spoken Language Understanding System for the Military
WIRE: A Wearable Spoken Language Understanding System for the Military Helen Hastie Patrick Craven Michael Orr Lockheed Martin Advanced Technology Laboratories 3 Executive Campus Cherry Hill, NJ 08002
More informationATTENDANT USER GUIDE
ATTENDANT USER GUIDE NOTICE THIS DOCUMENT IS PROVIDED TO YOU FOR INFORMATIONAL PURPOSES ONLY. The information contained in this document is believed by Mitel Networks to be accurate as of the date of its
More informationNiusha, the first Persian speech-enabled IVR platform
2010 5th International Symposium on Telecommunications (IST'2010) Niusha, the first Persian speech-enabled IVR platform M.H. Bokaei, H. Sameti, H. Eghbal-zadeh, B. BabaAli, KH. Hosseinzadeh, M. Bahrani,
More informationSurVo. Stepping Through the Basics. Version 2.0
Stepping Through the Basics Version 2.0 Contents What is a SurVo?... 3 SurVo: Voice Survey Form... 3 About the Documentation... 3 Ifbyphone on the Web... 3 Setting up a SurVo... 4 Speech/Recording Options...
More informationEnterprise Edge 2.0 Voice Messaging Set Up and Operation Guide
Enterprise Edge 2.0 Voice Messaging Set Up and Operation Guide www.nortelnetworks.com 2000 Nortel Networks Contents Chapter 1 Introduction 13 About Enterprise Edge Voice Messaging 13 Basic Enterprise Edge
More informationIBM DirectTalk Speech Recognition for Windows with ViaVoice Technology Delivers Large Vocabulary Speech Recognition in the Telephony Environment
Software Announcement June 27, 2000 IBM DirectTalk Speech Recognition for Windows with ViaVoice Technology Delivers Large Vocabulary Speech Recognition in the Telephony Environment Overview The DirectTalk
More informationAn Adaptive Approach to Collecting Multimodal Input
An Adaptive Approach to Collecting Multimodal Input Anurag Gupta University of New South Wales School of Computer Science and Engineering Sydney, NSW 2052 Australia akgu380@cse.unsw.edu.au Abstract Multimodal
More informationSpeech Applications. How do they work?
Speech Applications How do they work? What is a VUI? What the user interacts with when using a speech application VUI Elements Prompts or System Messages Prerecorded or Synthesized Grammars Define the
More informationAvaya Unified Messenger Client User Guide
Avaya Unified Messenger Client User Guide Version 5.0 Avaya Inc. 211 Mount Airy Road Basking Ridge, New Jersey 07920 www.avaya.com Your comments are welcome. They can assist us in improving our documentation.
More informationVersion 2.6. SurVo Advanced User s Guide
Version 2.6 SurVo Advanced User s Guide Contents What is a SurVo?...3 SurVo: Voice Survey Form...3 About the Documentation...3 Ifbyphone on the Web...3 Setting up a SurVo...4 Speech/Recording Options...4
More informationVerbio Software Reference
Verbio Software Reference Grammar Manager User's Manual Verbio Technologies, S.L. Verbio Software Reference: Grammar Manager User's Manual Verbio Technologies, S.L. Published September, 2011 Copyright
More informationInteraction Feedback
Interaction Feedback Technical Reference 2018 R2 Last updated 2018-02-09 (See Change Log for summary of changes.) Abstract This user guide documents the PureConnect Customer Feedback Management solution
More informationSpeech Tuner. and Chief Scientist at EIG
Speech Tuner LumenVox's Speech Tuner is a complete maintenance tool for end-users, valueadded resellers, and platform providers. It s designed to perform tuning and transcription, as well as parameter,
More informationVClarity Voice Platform
VClarity Voice Platform VClarity L.L.C. Voice Platform Snap-in Functional Overview White Paper Technical Pre-release Version 2.0 for VClarity Voice Platform Updated February 12, 2007 Table of Contents
More informationPast Performance Evaluation Report (PPE) Ordering Instructions and Frequently Asked Questions Table of Contents
Past Performance Evaluation Report (PPE) Ordering Instructions and Frequently Asked Questions Table of Contents I. OVERVIEW OF PAST PERFORMANCE EVALUATION... 2 What is the Past Performance Evaluation (PPE)
More informationCallPilot Multimedia Messaging
CallPilot Multimedia Messaging User Guide Release 1.0 Standard 1.0 December 1998 P0886140 ii Welcome to CallPilot Multimedia Messaging CallPilot Multimedia Messaging from Nortel Networks is an advanced
More informationWeb Architectures. Goal of Architecture Design. Architecture Design. Single Server Configuration. All basic components installed on same machine
Goal of Architecture Design Web Architectures Performance application must sustain expected workload (max no of concurrent users, no page requests per unit of time, max time for delivering web page to
More informationDevelopment of a Speech-Driven Automatic Telephone Service
Department of Linguistics and Philology Språkteknologiprogrammet (Language Technology Programme) Master s thesis in Computational Linguistics 22 november 2006 Development of a Speech-Driven Automatic Telephone
More informationMRCP. Yandex SS Plugin. Usage Guide. Powered by Universal Speech Solutions LLC
Powered by Universal Speech Solutions LLC MRCP Yandex SS Plugin Usage Guide Revision: 1 Created: November 19, 2018 Last updated: November 19, 2018 Author: Arsen Chaloyan Universal Speech Solutions LLC
More informationCS 160: Evaluation. Outline. Outline. Iterative Design. Preparing for a User Test. User Test
CS 160: Evaluation Professor John Canny Spring 2006 2/15/2006 1 2/15/2006 2 Iterative Design Prototype low-fi paper, DENIM Design task analysis contextual inquiry scenarios sketching 2/15/2006 3 Evaluate
More informationAspire Basic Operation (Quick Reference)
Aspire Basic Operation (Quick Reference) To answer an incoming call: To answer an incoming call at your extension, simply lift the receiver. The phone is programmed to automatically answer an incoming
More informationSpeaker Verification in BeVocal VoiceXML
Speaker Verification in BeVocal VoiceXML Version 1.5 May 2001 BeVocal, Inc. 1380 Bordeaux Drive Sunnyvale, CA 94089 2001. BeVocal, Inc. All rights reserved. 2 SPEAKER VERIFICATION IN BEVOCAL VOICEXML Table
More informationAvaya Unified Messenger Telephone User Interface Online Guide
Avaya Unified Messenger Select from the following Introduction Sending voice messages Getting started Recording prompts and greetings Reviewing messages Personalizing your mailbox Replying to messages
More informationOracle isupplier. Supplier User Guide
Supplier User Guide Table of Contents isupplier Application, Help and Support... 3 Registration... 3 For Help... 3 Overview... 4 User Registration... 5 Login to isupplier... 7 isupplier and Sourcing Responsibility
More informationDARPA Communicator Dialog Travel Planning Systems: The June 2000 Data Collection
DARPA Communicator Dialog Travel Planning Systems: The June 2 Data Collection M. Walker, J. Aberdeen, J. Boland, E. Bratt, J. Garofolo, L. Hirschman, A. Le, S. Lee, S. Narayanan, K. Papineni, B. Pellom,
More informationWhite Paper Subcategory. Overview of XML Communication Technologies
Subcategory Overview of XML Communication Technologies Executive Summary A significant shift has occurred in the communications infrastructures deployed today. This shift is the result of the acceptance
More informationBusline 3.0 VXML. Prepared For: BCT By: Enghouse Transportation. July 16, Page title Busline VXML Approved by 1/8
usline 3.0 VXML Prepared For: CT y: July 16, 2018 usline VXML pproved by 1/8 Multi Language?, but language selection to be made available by CT s PX. buslinevxml.language.message buslinevxml.language.selection
More informationTutorial 1 Creating a Personal Call Flow
Call Director Tutorial 1 Creating a Personal Call Flow This tutorial walks you through the creation of a personal call flow for John, a technician at ACME Widgets. John's requirements are typical of most
More informationMRCP. Kaldi SR Plugin. Usage Guide. Powered by Universal Speech Solutions LLC
Powered by Universal Speech Solutions LLC MRCP Kaldi SR Plugin Usage Guide Revision: 1 Created: February 6, 2018 Last updated: February 6, 2018 Author: Arsen Chaloyan Universal Speech Solutions LLC Overview
More informationBusline 3.0 VXML. Prepared For: BCT By: Enghouse Transportation. December 12, 2016
usline 3.0 VXML Prepared For: CT y: December 12, 2016 2013 Enghouse is a Trademark of Enghouse Systems Limited ll Other Trademarks elong to Their Respective Proprietors Subject to Change without tice usline
More informationCS 160: Evaluation. Professor John Canny Spring /15/2006 1
CS 160: Evaluation Professor John Canny Spring 2006 2/15/2006 1 Outline User testing process Severity and Cost ratings Discount usability methods Heuristic evaluation HE vs. user testing 2/15/2006 2 Outline
More informationConfidence Measures: how much we can trust our speech recognizers
Confidence Measures: how much we can trust our speech recognizers Prof. Hui Jiang Department of Computer Science York University, Toronto, Ontario, Canada Email: hj@cs.yorku.ca Outline Speech recognition
More informationEnd-to-End Evaluation of Multimodal Dialogue Systems - can we Transfer Established Methods?
End-to-End Evaluation of Multimodal Dialogue Systems - can we Transfer Established Methods? Nicole Beringer, Katerina Louka, Victoria Penide-Lopez, Uli Türk Institut für Phonetik und Sprachliche Kommunikation
More informationScript Step Reference Information
Script Step Reference Information This chapter lists all the steps available for use in creating scripts. These steps are accessed using the palette pane (see Using the Palette Pane, page 8). This chapter
More informationHosted Fax Mail. Blue Platform. User Guide
Hosted Fax Mail Blue Platform Hosted Fax Mail User Guide Contents 1 About this Guide... 2 2 Hosted Fax Mail... 3 3 Getting Started... 4 3.1 Logging On to the Web Portal... 4 4 Web Portal Mailbox... 6 4.1
More informationHow accurate is AGNITIO KIVOX Voice ID?
How accurate is AGNITIO KIVOX Voice ID? Overview Using natural speech, KIVOX can work with error rates below 1%. When optimized for short utterances, where the same phrase is used for enrolment and authentication,
More informationCall Center functionality within your reach
Call Center functionality within your reach Page 1 What is agentel? Page 2 AgenTel is a scalable, CTI-based solution for departmental, small-and medium sized call centers. A total call center solution
More informationAutoAttendant User Guide
AutoAttendant User Guide Updated April 2017 Table of Contents AutoAttendant Overview... 3 1.1 Accessing the AutoAttendant Self-care TUI... 3 1.2 AutoAttendant Configuration Menus... 4 1.3 AutoAttendant
More informationMRCP. Google SR Plugin. Usage Guide. Powered by Universal Speech Solutions LLC
Powered by Universal Speech Solutions LLC MRCP Google SR Plugin Usage Guide Revision: 6 Created: May 17, 2017 Last updated: January 22, 2018 Author: Arsen Chaloyan Universal Speech Solutions LLC Overview
More informationDue on: May 12, Team Members: Arpan Bhattacharya. Collin Breslin. Thkeya Smith. INFO (Spring 2013): Human-Computer Interaction
Week 6 Assignment: Heuristic Evaluation of Due on: May 12 2013 Team Members: Arpan Bhattacharya Collin Breslin Thkeya Smith INFO 608-902 (Spring 2013): Human-Computer Interaction Group 1 HE Process Overview
More informationMRCP. Watson SS Plugin. Usage Guide. Powered by Universal Speech Solutions LLC
Powered by Universal Speech Solutions LLC MRCP Watson SS Plugin Usage Guide Revision: 2 Created: June 28, 2018 Last updated: August 10, 2018 Author: Arsen Chaloyan Universal Speech Solutions LLC Overview
More informationZYCOO CooVox Series IP Phone System Extension User Guide. Software version v3.0.1
ZYCOO CooVox Series IP Phone System Extension User Guide Software version v3.0.1 Contents Extension User Web Portal... 1 Extension User Web Login... 1 Web Dial... 3 Inbound Calls... 3 Contacts... 4 Call
More informationCHAPTER 18: CLIENT COMMUNICATION
CHAPTER 18: CLIENT COMMUNICATION Chapter outline When to communicate with clients What modes of communication to use How much to communicate How to benefit from client communication Understanding your
More informationAuthors Martin Eckert Ingmar Kliche Deutsche Telekom Laboratories.
Workshop on speaker biometrics and VoiceXML 3.0 March 5-6, 2009, Menlo Park, CA, US Proposal of an SIV architecture and requirements Authors Martin Eckert (martin.eckert@telekom.de), Ingmar Kliche (ingmar.kliche@telekom.de),
More informationGetting Started. Reliance Communications, Inc. 603 Mission Street Santa Cruz, CA
Getting Started Reliance Communications, Inc. 603 Mission Street Santa Cruz, CA 95060 888-527-5225 www.schoolmessenger.com Index Before you Begin...3 Getting Started...3 Log In...3 The Start Page...3 Help...4
More informationCisco CVP VoiceXML 3.0. Element Specifications
Cisco CVP VoiceXML 3.0 CISCO CVP VOICEXML 3.0 Publication date: 14 January 2005 Copyright (C) 2000-2005 Audium Corporation. All rights reserved. Distributed by Cisco Systems, Inc. under license from Audium
More informationInteractive Voice Response System for College automation
Interactive Voice Response System for College automation Ms. Ayesha Mahamadshafi Attar 1, Ms. ShrutiSudhir Aitavade 2, Ms. PoonamArjunKalkhambkar 3, Ms. SofiyaRiyaj Nadaf 4 1Department of Computer Science
More informationUsability Report. Author: Stephen Varnado Version: 1.0 Date: November 24, 2014
Usability Report Author: Stephen Varnado Version: 1.0 Date: November 24, 2014 2 Table of Contents Executive summary... 3 Introduction... 3 Methodology... 3 Usability test results... 4 Effectiveness ratings
More informationCallPilot Mini Reference Guide
Part No. P0989496 02 CallPilot Mini Reference Guide 2 CallPilot Mini Reference Guide Copyright 2002 Ntel Netwks All rights reserved. 2002. The infmation in this document is subject to change without notice.
More informationEvaluation of a Mixed-Initiative Dialogue Multimodal Interface
Evaluation of a Mixed-Initiative Dialogue Multimodal Interface Baoli Zhao, Tony Allen and Andrzej Bargiela The Nottingham Trent University {baoli.zhao,tony.allen,andrzej.bargiela}@ntu.ac.uk www.ntu.ac.uk
More informationTowards Systematic Usability Verification
Towards Systematic Usability Verification Max Möllers RWTH Aachen University 52056 Aachen, Germany max@cs.rwth-aachen.de Jonathan Diehl RWTH Aachen University 52056 Aachen, Germany diehl@cs.rwth-aachen.de
More informationHuman Computer Interaction Using Speech Recognition Technology
International Bulletin of Mathematical Research Volume 2, Issue 1, March 2015 Pages 231-235, ISSN: 2394-7802 Human Computer Interaction Using Recognition Technology Madhu Joshi 1 and Saurabh Ranjan Srivastava
More informationSOFTWARE DESIGN AND DEVELOPMENT OF MUTIMODAL INTERACTION
SOFTWARE DESIGN AND DEVELOPMENT OF MUTIMODAL INTERACTION Marie-Luce Bourguet Queen Mary University of London Abstract: Key words: The multimodal dimension of a user interface raises numerous problems that
More informationThe Practical Side of Teaching the Elderly Visually Impaired Users to Use the
The Practical Side of Teaching the Elderly Visually Impaired Users to Use the E-Mail Takuya Nishimoto, Masahiro Araki and Yasuhisa Niimi Department of Electronics and Information Science, Faculty of Engineering
More informationHands-Free Internet using Speech Recognition
Introduction Trevor Donnell December 7, 2001 6.191 Preliminary Thesis Proposal Hands-Free Internet using Speech Recognition The hands-free Internet will be a system whereby a user has the ability to access
More informationSpeech Recognition, The process of taking spoken word as an input to a computer
Speech Recognition, The process of taking spoken word as an input to a computer program (Baumann) Have you ever found yourself yelling at your computer, wishing you could make it understand what you want
More informationDesigning an Auto Attendant Script
Designing an Auto Attendant Script This chapter describes the design of an Auto Attendant (AA) script, aa_sample1.aef, which is included with the Cisco Unity Express Script Editor, and contains the following
More informationUsability Metrics for Spoken Language Systems
Usability Metrics for Spoken Language Systems Priyanka Gupta Auburn University 107 Dunstan Hall Dept. of Computer Science & Software Engineering Auburn University, AL, 36849 USA guptapr@auburn.edu Juan
More informationNortel Networks Integrated Call Director
Title page Nortel Networks Communication Server 1000 Release 4.0 Nortel Networks Integrated Call Director User Guide Revision history Revision history July 2004 Standard 1.00. This document is issued
More informationHome Voice Mail. User Guide Frontier Communications Corporation. HVM_UG_0414
Home Voice Mail User Guide 2014 Frontier Communications Corporation. HVM_UG_0414 Basic Standard OnePoint Voice Mail Greeting 5 minutes 5 minutes 5 minutes Busy Greeting Message Storage 30 days 30 days
More informationA Technical Overview: Voiyager Dynamic Application Discovery
A Technical Overview: Voiyager Dynamic Application Discovery A brief look at the Voiyager architecture and how it provides the most comprehensive VoiceXML application testing and validation method available.
More informationGrid4 s SmartCOMM Hosted IP Complete. Auto Attendant User Guide
Grid4 s SmartCOMM Hosted IP Complete Auto Attendant User Guide Table of Contents List of Figures... 4 List of Tables... 5 Issue Status... 6 1. Getting Started... 7 1.1 About This Manual... 7 1.2 Conventions...
More informationWriting a Research Paper
Writing a Research Paper I. Scott MacKenzie 1 Research Paper The final step Research is not finished until the results are published! 2 1 Organization of a Research Paper Title Abstract Body Main sections
More informationVoice Messaging User Guide from Level 3. Updated April Level 3 Communications, LLC. All rights reserved. 1
Voice Messaging User Guide from Level 3 Updated April 2017 Level 3 Communications, LLC. All rights reserved. 1 Table of Contents 1 Introduction... 4 1.1 Voice Mailbox... 4 1.2 Additional Voice Mailbox
More informationUsability Inspection Methods. Overview. Usability Measures. SMD157 Human-Computer Interaction Fall 2003
INSTITUTIONEN FÖR SYSTEMTEKNIK LULEÅ TEKNISKA UNIVERSITET Usability Inspection Methods SMD157 Human-Computer Interaction Fall 2003 Nov-20-03 SMD157, Usability Inspection Methods 1 L Overview Usability
More informationProblem Solving through Programming In C Prof. Anupam Basu Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur
Problem Solving through Programming In C Prof. Anupam Basu Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Lecture - 04 Introduction to Programming Language Concepts
More informationLABORATORY 117. Intorduction to VoiceXML (3)
LABORATORY 117 Intorduction to VoiceXML (3) 1 TAC2000/2000 Delivering Content Collecting User Input Main Topics 2 TAC2000/2000 %$ %$!! 802.16 IP Telephone Lab #" " The element has several attributes bargein
More informationHitachi ID Password Manager Telephony Integration
Hitachi ID Password Manager Telephony Integration 2016 Hitachi ID Systems, Inc. All rights reserved. Contents 1 Introduction 1 2 Functional integration 2 2.1 Self-service password reset....................................
More informationMultilingual Vocal Access to Databases: the Central Role of Ontology Management in VocaBase
Multilingual Vocal Access to Databases: the Central Role of Ontology Management in VocaBase Christophe Dupriez DB Scape Chaussée de Bruxelles, 63 - Waterloo (Belgium) dupriez@dbscape.com - http://www.dbscape.com
More informationConstructing Empirical Models for Automatic Dialog Parameterization
Constructing Empirical Models for Automatic Dialog Parameterization Mikhail Alexandrov 1, Xavier Blanco 1, Natalia Ponomareva 2, and Paolo Rosso 2 1 Universidad Autonoma de Barcelona, Spain 2 Universidad
More informationAn overview of interactive voice response applications
An overview of interactive voice response applications Suneetha Chittamuri Senior Software Engineer IBM India April, 2004 Copyright International Business Machines Corporation 2004. All rights reserved.
More informationUsability. HCI - Human Computer Interaction
Usability HCI - Human Computer Interaction Computer systems optimization for easy access and communication Definition design Basic principles Testing assessment implementation Quality Utility funcionality
More informationVoic Complete User Guide
VoiceMail Complete User Guide Thank you for subscribing to Pioneer VoiceMail service. We re happy you ve chosen Pioneer for your telecommunication needs. In addition to exceptional local and long distance
More informationAccessory HandsFreeLink TM User s Information Manual
Accessory HandsFreeLink TM User s Information Manual A Few Words About Safety Your safety, and the safety of others, is very important. Operating the Accessory HandsFreeLink TM safely is an important responsibility.
More informationMeasuring the Usability of Web-Based Services Exemplified by University Student Information System
To appear in Proc. 15th Annual Business Congress, IMDA2006, June 18-21, 2006, Saraevo, Bosnia and Herzegovina. Measuring the of Web-Based Services Exemplified by University Student Information System Asil
More informationINTRODUCTION TO VOICEXML FOR DISTRIBUTED WEB-BASED APPLICATIONS
ιατµηµατικό Μεταπτυχιακό Πρόγραµµα Σπουδών : Οικονοµική & ιοίκηση Τηλεπικοινωνιακών ικτύων (Νέες υπηρεσίες και τεχνολογίες δικτύων) INTRODUCTION TO VOICEXML FOR DISTRIBUTED WEB-BASED APPLICATIONS Π.Κ Κίκιραs
More informationThe L&S LSS Podcaster s Tutorial for Audacity
The L&S LSS Podcaster s Tutorial for Audacity The L&S LSS Podcaster s Tutorial for Audacity... 1 Audacity Quick Reference... 2 About this tutorial... 3 Some Thoughts Before You Get Started... 3 Do Academic
More informationVoice Dialing Setup Guide
www.babytel.net Contents Introduction... 2 Enabling Voice Dialing... 2 Enabling Voice Dialing for internal users... 2 Enabling Voice Dialing in an Automated Attendant... 2 Automated Attendant Pronunciation
More informationEnterprise Edge 2.0 Voice Messaging Reference Guide
Enterprise Edge 2.0 Voice Messaging Reference Guide www.nortelnetworks.com 2000 Nortel Networks P0911621 Issue 02 Contents Chapter 1 How to use this guide 7 Introduction 7 Conventions and symbols used
More informationMonitoring and Recording
CHAPTER 34 Call centers need to be able to guarantee the quality of customer service that an agent in a call center provides. To protect themselves from legal liability, call centers need to be able to
More informationSprint Messenger SM One to Many Messaging Quick Reference Guide
Sprint Messenger SM One to Many Messaging Quick Reference Guide Quick Reference Guide - Page 1 of 26 Table of contents: 1.0 Introduction...3 2.0 Welcome Screen...4 2.1 Example Setting up a meeting...5
More informationPhonologies The Voice of Technology
Phonologies Media Services Framework Copyright 2004 Phonologies (India) Private Limited Copyright 2001 2004 by Phonologies (India) Private Limited. Phonologies, InterpreXer and Oktopous are trademarks
More informationAuto Attendant Script Example
Auto Attendant Script Example Last Updated: April 28, 2010 This chapter describes how to configure an auto attendant (AA) script. It uses the sample script aa_sample1.aef, which is included with the Cisco
More informationVoice Biometric Integration
Voice Biometric Integration Product Requirements Document, Version 2 November 28, 2016 Team Tres Commas Members: Jonathan Easterman, Sasha Shams, Arda Ungun, Carson Holoien, Vince Nicoara Mentors: Mike
More informationEnterprise Edge Voice Messaging Reference Guide
Enterprise Edge Voice Messaging Reference Guide 1-800-4 NORTEL www.nortelnetworks.com 1999 Nortel Networks P0908524 Issue 01 Contents Chapter 1 How to use this guide 7 Introduction 7 Conventions and symbols
More informationTips and Tricks Working with Scribe Trace Files
Tips and Tricks Working with Scribe Trace Files This document gives some details about how Scribe tracing works, how to enable tracing, and how to read trace files. It also includes some tips and tricks
More informationStaff Reference. INNLINE 2020 Voice Mail
Staff Reference INNLINE 2020 Voice Mail Notice The Innovation voice mail system (also referred to as the System ) is made available (under the terms of the Innovation Technologies Worldwide, Inc. License
More informationCA Auto Attendant Module. Operation Manual v1.3
CA Auto Attendant Module Operation Manual v1.3 Poltys, Inc. 3300 N. Main Street, Suite D, Anderson, SC 29621-4128 +1 (864) 642-6103 www.poltys.com 2013, Poltys Inc. All rights reserved. The information
More informationUSER REFERENCE MANUAL
USER REFERENCE MANUAL for Serenade Telephone User Interface (TUI) OCTEL MESSAGING DIVISION THE POWER OF MESSAGING With voice messaging people don t have to be available and connected at the same time to
More informationSoftware Design and Evaluation by Ergonomics Knowledge and Intelligent Design System (EKIDES)
PsychNology Journal, 2003 Volume 1, Number 4, 378-390 Software Design and Evaluation by Ergonomics Knowledge and Intelligent Design System (EKIDES) Dr.-Ing. Iwona Jastrzebska-Fraczek and Prof. Dr. rer.
More informationCisco Unity Express 8.0 Voic System User s Guide for Advanced Features
Cisco Unity Express 8.0 Voice-Mail System User s Guide for Advanced Features First Published: October 26, 2010 This guide provides information about using some of the advanced voice-mail features of your
More informationPanasonic KX-TVS75, KXTVS75, TVS75, KX-TVS100, KXTVS100, TVS100, KX-TVS200, KXTVS200, TVS200
POWER KX-TVS75 / KX-TVS100 / KX-TVS200 www.voicesonic.com Phone: 877-289-2829 VOICE PROCESSING SYSTEM Voice Processing System Subscriber s Guide Panasonic KX-TVS75, KXTVS75, TVS75, KX-TVS100, KXTVS100,
More informationSMB8000 Interactive Voice Response
Notice Note that when converting this document from its original format to a.pdf file, some minor font and format changes may occur. When viewing and printing this document, we cannot guarantee that your
More informationGuide & User Instructions
Guide & User Instructions Revised 08/2011 726 Grant Street Troy Ohio 45373 877.698.3262 937.335.3887 onecallnow.com support@onecallnow.com America s Largest Message Notification Provider Copyright 2009-2011
More information