Human Computer Interaction Using Speech Recognition Technology

Size: px
Start display at page:

Download "Human Computer Interaction Using Speech Recognition Technology"

Transcription

1 International Bulletin of Mathematical Research Volume 2, Issue 1, March 2015 Pages , ISSN: Human Computer Interaction Using Recognition Technology Madhu Joshi 1 and Saurabh Ranjan Srivastava 2 Department of Computer Engineering, SKIT, Jaipur madhujoshi896@gmail.com, srs@skit.ac.in Abstract recognition technology is a kind of technology that provides a communication between a man and machine. In this paper we describe the speech recognition technology. Here we implement the comparing of input string with the inbuilt dictionary. Here we provide the security feature by asking random questions when we login. We create a database dictionary through which user interact with machine. Here we create Input database and Output database. The Input database contains the various interfaces of machine. When user provides input to the machine by speaking then system will perform matching of this string with the system dictionary. When the string matches then machine recognizes the input string and it performs the output task. Here the user interacts with machine by asking various queries or performing various tasks. Here the machine recognizes the users voice and gives output to user. 1 INTRODUCTION recognition technology is a kind of technology that allows a machine to identify the words that a person speaks into a microphone and convert it to the written text. recognition is thus sometimes referred to as speech-to-text [1]. recognition allows us to provide input to an application with our voice. Just like clicking on mouse, typing on keyboard, or pressing a key on the phone keypad provides input to an application, speech recognition allows us to provide input by talking. In the desktop world, we need a microphone to be able to do this. The speech recognition process is performed by a software component known as the speech recognition engine [2]. The main goal of the speech recognition engine is to process the spoken input and translate it into the text that an application understands. In this paper we describe the main work of the speech recognition engine. Here firstly the user log-in in which the machine ask various random questions. After that when the speech recognition engine recognizes the input then the application can interpret the result of the recognition as a command. This application is a command and control application. Grammar(s) Audio Input Recognition Engine Recognized Text Acoustic model Figure 1.1 The basic working principle of speech recognition engine 2 DATA MODEL The basic data model of internal processing of speech recognition engine is defined by a graphical representation. It can be shown as Received: February 20, 2015 Keywords: component; formatting; recognition; API; User voice; Technology application

2 232 Madhu Joshi, Mr. Saurabh Ranjan Srivastava Signal Processing Unit Comparison Unit Models Word Sequence Search Language Models Recognized Word Sequence Perform output Command Figure 2.1 The data modules of our speech recognition application The voice or speech is firstly processed by the signal processing module. The speech processing module translated the speech waveform into a speech pattern representation. The speech pattern consists of a sequence of feature vectors. The speech pattern is compared with the reference pattern that is stored with class identities. Here when the speech pattern matched then the input voice is recognized. And the respective command is executed and the application performs the output command. The output may be a external interface or response by speaking. The whole processing of our application can be define as Analog to Digital Converter (Sound Card) Recognition Engine User defined Dictionary Security For login procedure User take Action Action Performed by the Machine Add Database External Interface Speaking Figure 2.2 The whole execution procedure of our speech recognition application Here, the voice is converted into digital signals through the analog to digital converter. Firstly the project loaded then the user voice is converted into digital signal which passes through the speech recognition engine. Then the speech recognition engine recognized the voice and the user logins by answering the various random questions. And then the user performs the task by taking action or speaking commands. Now the machine gives the response according to the users command.

3 Human Computer Interaction Using Recognition Technology 233 Here, the user take action by speaking the commands. There are various commands present here like open notepad, open command prompt, open Google and the many more commands are present here. When the user say open command prompt then the voice of the user is recognized by the speech recognition engine and the command prompt will open easily and interactively [2]. Here we create a large recognition vocabulary for interacting the user to machine. We define a user dictionary in which all the commands are present and the user can interact with machine through these commands. Here, we define mathematical expression for speech to text conversion which is performed in our implementation. It is assumed that each utterance consists of a sequence of linguistically meaningful and structured words, and our main goal is convert the spoken signal into the word sequence as accurately as possible [8]. The output of the utterances depends on recognized sequence of words. This task is sometimes known as the speech to text conversion. The following approach shows the word decoding task. Word decoding formulation basically depends on the Bayes decision theory [8]. Here we define this theorm based on our architecture as follows: Let Q = (,,, ) be a set of observations and W = (,, ) be a sequence of words Here, the observation Q is the realization of the sequence of words W where each ε V in the defined dictionary [8]. Here we determine W R that defines the recognized phrase. For finding out W R, speech recognizer implements the maximum a posteriori rule as: W R = max (W Q) By using bayes theorem it can be defined as W R = max (Q W) (W)/ (Q) Here, these (Q W) (W) key quantities take decision for recognizing the word sequence. These parameters are the decision making parameters. Here, the key (Q) is not involved in optimization process [8]. The first benefit is that speech offers a way of issuing commands while allowing hands and eyes to remain free. Operations normally carried out through the direct manipulation modality such as open Calculator, open WordPad etc. Thus multiple actions can be simultaneously carried out. This is particularly useful in cases when hands/eyes are already busy, but other tasks need to be dealt with from time to time; for example, when direct manipulation is used to drive a car[1], speech can be used to control the radio, car phone, and other on-board systems. The second benefit is that users can refer to objects which are not present in their current view of the virtual world; in a direct manipulation interface, actions can only be applied to objects which are visually present. The most observable benefit of speech is naturalness, or more precisely, familiarity. Users are familiar with using English language to act in the world. A central issue in developing a speech interface to virtual worlds is the nature of the relationship between the system and user. Here we define the speech interface through which a ser and system interact to each other very interactively [6]. Performance Most speech recognition engines try very hard to find a match and are usually very forgiving. But it is very important to note that the engine is always returning its best guess for what was said. Performance of our speech recognition engine application can be defined as per the graph representation as: Fig 2.3 Performance evaluation of our application

4 234 Madhu Joshi, Mr. Saurabh Ranjan Srivastava Here the performance graph shows recognition speed or fluency of recognition increases with increases the size of the dictionary. In our application the whole string is recognized in some seconds. There is a trade-off between coverage and accuracy in speech recognition systems: the larger the user vocabulary and grammar, the greater the potential for recognition errors. API In our application, we use the speech application programming interface. The version of speech API which is used by our application is speech API IMPLEMENTATION We implement the human computer interaction using the speech recognition engine. Here the user interact with the machine and can perform the various commands. We show here some snapshot which describes the implementation part of our work. The user login is shown here. The user can login by speaking their name. Here, After the login process the user can interact with machine through the various commands. By using these commands the user can perform various tasks. Here In the above snapshot one command is executed by the user. The user opens calculator by speaking to computer. In the same way user can interact with the machine through various commands by their voice recognition. 4 CONCLUSION We have concluded that by using our project every person can interact with machine through their voice. We have described the human computer interaction application using speech recognition engine. Here we provide the various commands to user for interacting with the machine. The user can perform various tasks through these commands by speaking. ACKNOWLEDGMENT I am very thankful to my co-author who help me for researching on this project.

5 Human Computer Interaction Using Recognition Technology 235 REFERENCES [1] Youhao Yu (2012)_Research on Recognition Technology and Its Application International Conference on Computer Science and Electronics Engineering. [2] Jianliang Meng, Junwei Zhang,Haoquan Zhao (2012) overview of the Recognition Technology, Fourth International Conference on Computational and Information Sciences. [3] Mohammad A. M. Abu Shariah, Raja N. Ainon1, Roziati Zainuddin, Othman O. Khalifa (2007), Human Computer Interaction Using Isolated-Words Recognition Technology, International Conference on Intelligent and Advanced Systems. [4] Kazuyo Tanaka (1998), Next Major Application Systems and key Techniques in Recognition Technology, /9. [5] K. H. Davis, R. Biddulph, and S. Balashek, (1952), Automatic Recognition of Spoken Digits, J. Acoust. Soc. Amer., 24. 6, [6] Rabiner L R, Juang B H (1993), Fundamentals of Recognition, Englewood Cliffs: Prentice Hall. [7] International Workshop on Robot and Human Interactive Communication, IEEE Press, Sept [8] Biing-Hwang Juang And Sadaoki Furui Automatic Recognition and Understanding of Spoken Language A First Step Toward Natural Human Machine Communication, IEEE 2000

Speech Recognizing Robotic Arm for Writing Process

Speech Recognizing Robotic Arm for Writing Process Speech Recognizing Robotic Arm for Writing Process 1 Dhanshri R. Pange, 2 Dr. Anil R. Karwankar 1 M. E. Electronics Student, 2 Professor, Department of Electronics and Telecommunication Govt. Engineering

More information

Intelligent Hands Free Speech based SMS System on Android

Intelligent Hands Free Speech based SMS System on Android Intelligent Hands Free Speech based SMS System on Android Gulbakshee Dharmale 1, Dr. Vilas Thakare 3, Dr. Dipti D. Patil 2 1,3 Computer Science Dept., SGB Amravati University, Amravati, INDIA. 2 Computer

More information

In fact, in many cases, one can adequately describe [information] retrieval by simply substituting document for information.

In fact, in many cases, one can adequately describe [information] retrieval by simply substituting document for information. LµŒ.y A.( y ý ó1~.- =~ _ _}=ù _ 4.-! - @ \{=~ = / I{$ 4 ~² =}$ _ = _./ C =}d.y _ _ _ y. ~ ; ƒa y - 4 (~šƒ=.~². ~ l$ y C C. _ _ 1. INTRODUCTION IR System is viewed as a machine that indexes and selects

More information

Turns your voice into text with up to 99% accuracy. New - Up to a 15% improvement to out-of-the-box accuracy compared to Dragon version 12

Turns your voice into text with up to 99% accuracy. New - Up to a 15% improvement to out-of-the-box accuracy compared to Dragon version 12 Recognition accuracy Turns your voice into text with up to 99% accuracy New - Up to a 15% improvement to out-of-the-box accuracy compared to Dragon version 12 Recognition speed Words appear on the screen

More information

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

International Journal of Modern Trends in Engineering and Research   e-issn No.: , Date: 2-4 July, 2015 International Journal of Modern Trends in Engineering and Research www.ijmter.com e-issn No.:2349-9745, Date: 2-4 July, 2015 Communication media for Blinds Based on Voice Mrs.K.M.Sanghavi 1, Radhika Maru

More information

Computer Speech. by Dick Evans,

Computer Speech. by Dick Evans, Computer Speech by Dick Evans, www.rwevans.com One of the class attendees wanted to know more about talking to the computer and having it talk back to us. Actually, I think the request was for the speech

More information

BEST PRACTICES & CRITICAL SUCCESS FACTORS

BEST PRACTICES & CRITICAL SUCCESS FACTORS FLUENCY DIRECT BEST PRACTICES & CRITICAL SUCCESS FACTORS MICROPHONE USAGE Check the microphone settings to verify the microphone you intend to use is the one selected and that the record buttons are appropriately

More information

DRAGON FOR AMBULATORY CARE PROVIDERS

DRAGON FOR AMBULATORY CARE PROVIDERS DRAGON FOR AMBULATORY CARE PROVIDERS Presented by the IS Training Department, Children s Hospital of The King s Daughters August 2011 INTRODUCTION... 1 OBJECTIVES... 1 DRAGON SETUP... 2 COMPONENTS OF

More information

Fluency Direct FAQ's

Fluency Direct FAQ's September 2013 Fluency Direct FAQ's Version 7.85 1710 Murray Avenue Pittsburgh, PA 412.422.2002 solutions@mmodal.com CONFIDENTIALITY DISCLAIMER All information methods and concepts contained in or disclosed

More information

Speech User Interface for Information Retrieval

Speech User Interface for Information Retrieval Speech User Interface for Information Retrieval Urmila Shrawankar Dept. of Information Technology Govt. Polytechnic Institute, Nagpur Sadar, Nagpur 440001 (INDIA) urmilas@rediffmail.com Cell : +919422803996

More information

Speech Recognition, The process of taking spoken word as an input to a computer

Speech Recognition, The process of taking spoken word as an input to a computer Speech Recognition, The process of taking spoken word as an input to a computer program (Baumann) Have you ever found yourself yelling at your computer, wishing you could make it understand what you want

More information

CMU Sphinx: the recognizer library

CMU Sphinx: the recognizer library CMU Sphinx: the recognizer library Authors: Massimo Basile Mario Fabrizi Supervisor: Prof. Paola Velardi 01/02/2013 Contents 1 Introduction 2 2 Sphinx download and installation 4 2.1 Download..........................................

More information

Voice command system. & Using the voice command. system. NOTE

Voice command system. & Using the voice command. system. NOTE 80 system The voice command system enables the audio, hands-free phone system, etc. to be operated using voice commands. Refer to the Command list F83 for samples of voice commands. s can be used even

More information

Integrate Speech Technology for Hands-free Operation

Integrate Speech Technology for Hands-free Operation Integrate Speech Technology for Hands-free Operation Copyright 2011 Chant Inc. All rights reserved. Chant, SpeechKit, Getting the World Talking with Technology, talking man, and headset are trademarks

More information

Analyzing Mel Frequency Cepstral Coefficient for Recognition of Isolated English Word using DTW Matching

Analyzing Mel Frequency Cepstral Coefficient for Recognition of Isolated English Word using DTW Matching Abstract- Analyzing Mel Frequency Cepstral Coefficient for Recognition of Isolated English Word using DTW Matching Mr. Nitin Goyal, Dr. R.K.Purwar PG student, USICT NewDelhi, Associate Professor, USICT

More information

A NEURAL NETWORK APPLICATION FOR A COMPUTER ACCESS SECURITY SYSTEM: KEYSTROKE DYNAMICS VERSUS VOICE PATTERNS

A NEURAL NETWORK APPLICATION FOR A COMPUTER ACCESS SECURITY SYSTEM: KEYSTROKE DYNAMICS VERSUS VOICE PATTERNS A NEURAL NETWORK APPLICATION FOR A COMPUTER ACCESS SECURITY SYSTEM: KEYSTROKE DYNAMICS VERSUS VOICE PATTERNS A. SERMET ANAGUN Industrial Engineering Department, Osmangazi University, Eskisehir, Turkey

More information

SpeakToText 2.5 Speech Recognition User Manual (Version 2.51)

SpeakToText 2.5 Speech Recognition User Manual (Version 2.51) Making it FUN and EASY to use SPEECH with your COMPUTER! CoolSoft, LLC INTRODUCTION SpeakToText 2.5 Speech Recognition User Manual (Version 2.51) SpeakToText 2.5 Speech Recognition, Version 2.51 is a powerful

More information

10.1 Introduction. Higher Level Processing. Word Recogniton Model. Text Output. Voice Signals. Spoken Words. Syntax, Semantics, Pragmatics

10.1 Introduction. Higher Level Processing. Word Recogniton Model. Text Output. Voice Signals. Spoken Words. Syntax, Semantics, Pragmatics Chapter 10 Speech Recognition 10.1 Introduction Speech recognition (SR) by machine, which translates spoken words into text has been a goal of research for more than six decades. It is also known as automatic

More information

Quick Start Guide MAC Operating System Built-In Accessibility

Quick Start Guide MAC Operating System Built-In Accessibility Quick Start Guide MAC Operating System Built-In Accessibility Overview The MAC Operating System X has many helpful universal access built-in options for users of varying abilities. In this quickstart,

More information

Q.bo Webi User s Guide

Q.bo Webi User s Guide Contents Q.bo Webi reference guide... 2 1.1. Login... 3 1.2. System Check... 3 1.3. Config Wizard... 6 1.4. Teleoperation... 7 1.5. Training... 9 1.6. Questions & Answers... 10 1.7. Voice Recognition...

More information

Hands-Free Internet using Speech Recognition

Hands-Free Internet using Speech Recognition Introduction Trevor Donnell December 7, 2001 6.191 Preliminary Thesis Proposal Hands-Free Internet using Speech Recognition The hands-free Internet will be a system whereby a user has the ability to access

More information

Summary. Speech-Enabling Visual Basic 6 Applications with Microsoft SAPI Microsoft Corporation. All rights reserved.

Summary. Speech-Enabling Visual Basic 6 Applications with Microsoft SAPI Microsoft Corporation. All rights reserved. Speech-Enabling Visual Basic 6 Applications with Microsoft SAPI 5.1 2001 Microsoft Corporation. All rights reserved. Summary Prerequisites You need no previous experience with speech recognition, but you

More information

Voice Command Based Computer Application Control Using MFCC

Voice Command Based Computer Application Control Using MFCC Voice Command Based Computer Application Control Using MFCC Abinayaa B., Arun D., Darshini B., Nataraj C Department of Embedded Systems Technologies, Sri Ramakrishna College of Engineering, Coimbatore,

More information

Voice activated spell-check

Voice activated spell-check Technical Disclosure Commons Defensive Publications Series November 15, 2017 Voice activated spell-check Pedro Gonnet Victor Carbune Follow this and additional works at: http://www.tdcommons.org/dpubs_series

More information

Voice Access to Music: Evolution from DSLI 2014 to DSLI 2016

Voice Access to Music: Evolution from DSLI 2014 to DSLI 2016 Voice Access to Music: Evolution from DSLI 2014 to DSLI 2016 Aidan Kehoe Cork, Ireland akehoe@logitech.com Asif Ahsan Newark, CA, USA aaahsan@logitech.com Amer Chamseddine EPFL Lausanne, Switzerland amer.chamseddine@epfl.ch

More information

The State of Speech Recognition on Mobile

The State of Speech Recognition on Mobile The State of Speech Recognition on Mobile The future won't be like Star Trek. Scott Adams, creator of Dilbert Why do I care about speech rec? = Cape Bretoner + Here's a conversation between two Cape

More information

An overview of interactive voice response applications

An overview of interactive voice response applications An overview of interactive voice response applications Suneetha Chittamuri Senior Software Engineer IBM India April, 2004 Copyright International Business Machines Corporation 2004. All rights reserved.

More information

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency

More information

Master Your Mac. simple ways to tweak, customize, and secure os x

Master Your Mac. simple ways to tweak, customize, and secure os x Master Your Mac simple ways to tweak, customize, and secure os x matt cone 10 Talking to Your Mac You don t need a degree in computer science to know that talking to your computer is one of the ultimate

More information

Polite mode for a virtual assistant

Polite mode for a virtual assistant Technical Disclosure Commons Defensive Publications Series February 21, 2018 Polite mode for a virtual assistant Thomas Deselaers Pedro Gonnet Follow this and additional works at: https://www.tdcommons.org/dpubs_series

More information

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music COS 116 The Computational Universe Laboratory 4: Digital Sound and Music In this lab you will learn about digital representations of sound and music, especially focusing on the role played by frequency

More information

Speech Applications. How do they work?

Speech Applications. How do they work? Speech Applications How do they work? What is a VUI? What the user interacts with when using a speech application VUI Elements Prompts or System Messages Prerecorded or Synthesized Grammars Define the

More information

Software/Hardware Co-Design of HMM Based Isolated Digit Recognition System

Software/Hardware Co-Design of HMM Based Isolated Digit Recognition System 154 JOURNAL OF COMPUTERS, VOL. 4, NO. 2, FEBRUARY 2009 Software/Hardware Co-Design of HMM Based Isolated Digit Recognition System V. Amudha, B.Venkataramani, R. Vinoth kumar and S. Ravishankar Department

More information

Problem Solving through Programming In C Prof. Anupam Basu Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur

Problem Solving through Programming In C Prof. Anupam Basu Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Problem Solving through Programming In C Prof. Anupam Basu Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Lecture - 04 Introduction to Programming Language Concepts

More information

Computer Basics Microsoft Windows CB 200

Computer Basics Microsoft Windows CB 200 Computer Basics Microsoft Windows CB 200 Table of Contents Using Windows... 3 Desktop... 3 Taskbar... 4 The Start menu... 4 The Quick Launch bar... 5 The System Tray... 6 Customization... 6 How to Use

More information

FOMA 905i Application Functions

FOMA 905i Application Functions Terminal Application Speech Recognition Remote Operation FOMA 905i Application Functions An automatic software update function, an enhanced Flash *1 function, a music player usage history transmission

More information

Getting Started with Zoom

Getting Started with Zoom Getting Started with Zoom The Institute of Agriculture has recently purchased a site license for a new cloud-based video conferencing service known as Zoom. If you have ever connected to a GoToMeeting,

More information

The innovating Windows Mobile -based Telematic Solution for the car

The innovating Windows Mobile -based Telematic Solution for the car The innovating Windows Mobile -based Telematic Solution for the car CONTENTS OVERVIEW... 3 The hands-free kit... 3 Message reader... 5 Media player... 6 Road safety... 7 DISPLAY AND BUTTONS ON THE STEERING

More information

y texthelp Read&Write for Google Chrome Quick Reference Guide Docs, Slides and Web read&write - j & Google Docs

y texthelp Read&Write for Google Chrome Quick Reference Guide Docs, Slides and Web read&write - j & Google Docs y texthelp Read&Write for Chrome Quick Reference Guide 12.17 f m El 11 s, Slides and i >» := n i* - j Tool Symbol Where it works How it works Text to Speech Reads text aloud with dual color highlighting

More information

Dragon TV Overview. TIF Workshop 24. Sept Reimund Schmald mob:

Dragon TV Overview. TIF Workshop 24. Sept Reimund Schmald mob: Dragon TV Overview TIF Workshop 24. Sept. 2013 Reimund Schmald reimund.schmald@nuance.com mob: +49 171 5591906 2002-2013 Nuance Communications, Inc. All rights reserved. Page 1 Reinventing the relationship

More information

Niusha, the first Persian speech-enabled IVR platform

Niusha, the first Persian speech-enabled IVR platform 2010 5th International Symposium on Telecommunications (IST'2010) Niusha, the first Persian speech-enabled IVR platform M.H. Bokaei, H. Sameti, H. Eghbal-zadeh, B. BabaAli, KH. Hosseinzadeh, M. Bahrani,

More information

A Kinect Sensor based Windows Control Interface

A Kinect Sensor based Windows Control Interface , pp.113-124 http://dx.doi.org/10.14257/ijca.2014.7.3.12 A Kinect Sensor based Windows Control Interface Sang-Hyuk Lee 1 and Seung-Hyun Oh 2 Department of Computer Science, Dongguk University, Gyeongju,

More information

RLAT Rapid Language Adaptation Toolkit

RLAT Rapid Language Adaptation Toolkit RLAT Rapid Language Adaptation Toolkit Tim Schlippe May 15, 2012 RLAT Rapid Language Adaptation Toolkit - 2 RLAT Rapid Language Adaptation Toolkit RLAT Rapid Language Adaptation Toolkit - 3 Outline Introduction

More information

Interaction Style Categories. COSC 3461 User Interfaces. What is a Command-line Interface? Command-line Interfaces

Interaction Style Categories. COSC 3461 User Interfaces. What is a Command-line Interface? Command-line Interfaces COSC User Interfaces Module 2 Interaction Styles What is a Command-line Interface? An interface where the user types commands in direct response to a prompt Examples Operating systems MS-DOS Unix Applications

More information

Discovering Computers Chapter 5 Input

Discovering Computers Chapter 5 Input Discovering Computers 2009 Chapter 5 Input Chapter 5 Objectives Define input List the characteristics of a keyboard Describe different mouse types and how they work Summarize how various pointing devices

More information

Research on Construction of Road Network Database Based on Video Retrieval Technology

Research on Construction of Road Network Database Based on Video Retrieval Technology Research on Construction of Road Network Database Based on Video Retrieval Technology Fengling Wang 1 1 Hezhou University, School of Mathematics and Computer Hezhou Guangxi 542899, China Abstract. Based

More information

A Smart Power System Weihan Bo, Mi Li, Xi-Ping Peng, Xiang Li, Xin Huang *

A Smart Power System Weihan Bo, Mi Li, Xi-Ping Peng, Xiang Li, Xin Huang * 3rd International Conference on Mechanical Engineering and Intelligent Systems (ICMEIS 2015) A Smart Power System Weihan Bo, Mi Li, Xi-Ping Peng, Xiang Li, Xin Huang * Xi'an Jiaotong-Liverpool University,

More information

Audio involves developing a variety of techniques. In this short course, you will learn the necessary skills to do the following:

Audio involves developing a variety of techniques. In this short course, you will learn the necessary skills to do the following: Garage Band Instructions Tutorial Audio involves developing a variety of techniques. In this short course, you will learn the necessary skills to do the following: Demonstrate Audio editing techniques

More information

Zing Speak Tutorial. By Sandy McCauley January 13, 2012

Zing Speak Tutorial. By Sandy McCauley January 13, 2012 Zing Speak Tutorial By Sandy McCauley January 13, 2012 What is Zing Speak? Zing Speak is a feature in the KNK Zing plug-in, beginning with version 2.0. It allows you to communicate by voice with the computer

More information

SPEAKEASY USER TRAINING

SPEAKEASY USER TRAINING SPEAKEASY USER TRAINING Speakeasy allows users to create personalized voice profiles, which can improve the positive results when you are using speech-to-text. When you create a voice profile, you speak

More information

Practice Test Guidance Document for the 2018 Administration of the AASCD 2.0 Independent Field Test

Practice Test Guidance Document for the 2018 Administration of the AASCD 2.0 Independent Field Test Practice Test Guidance Document for the 2018 Administration of the AASCD 2.0 Independent Field Test Updated October 2, 2018 Contents Practice Test Overview... 2 About the AASCD 2.0 Online Assessment Practice

More information

Car Information Systems for ITS

Car Information Systems for ITS Car Information Systems for ITS 102 Car Information Systems for ITS Kozo Nakamura Ichiro Hondo Nobuo Hataoka, Ph.D. Shiro Horii OVERVIEW: For ITS (intelligent transport systems) car information systems,

More information

Prediction and Selection of Sequence of Actions Related to Voice Activated Computing Systems

Prediction and Selection of Sequence of Actions Related to Voice Activated Computing Systems Technical Disclosure Commons Defensive Publications Series July 03, 2017 Prediction and Selection of Sequence of Actions Related to Voice Activated Computing Systems John D. Lanza Foley & Lardner LLP Follow

More information

Voice Recognition Implementation: Voice recognition software development kit (SDK), downloadable as freeware or shareware.

Voice Recognition Implementation: Voice recognition software development kit (SDK), downloadable as freeware or shareware. 1 General Description: The purpose of this project is to increase the speed and accuracy of name recall in elderly patients by creating an installable software package which will be used as a game. The

More information

Front-end Specification of XISL

Front-end Specification of XISL Front-end Specification of XISL Input Modalities The attributes of element are,, and so on. The attribute specifies the of input, and the attribute specifies the input. Input mode is written in

More information

SVD-based Universal DNN Modeling for Multiple Scenarios

SVD-based Universal DNN Modeling for Multiple Scenarios SVD-based Universal DNN Modeling for Multiple Scenarios Changliang Liu 1, Jinyu Li 2, Yifan Gong 2 1 Microsoft Search echnology Center Asia, Beijing, China 2 Microsoft Corporation, One Microsoft Way, Redmond,

More information

The WordRead Toolbar lets you use WordRead's powerful features at any time without getting in your way.

The WordRead Toolbar lets you use WordRead's powerful features at any time without getting in your way. Welcome to WordRead Welcome to WordRead. WordRead is designed to make it easier for you to do things with your computer by making it speak and making things easier to read. It is closely integrated with

More information

LARGE-VOCABULARY CHINESE TEXT/SPEECH INFORMATION RETRIEVAL USING MANDARIN SPEECH QUERIES

LARGE-VOCABULARY CHINESE TEXT/SPEECH INFORMATION RETRIEVAL USING MANDARIN SPEECH QUERIES LARGE-VOCABULARY CHINESE TEXT/SPEECH INFORMATION RETRIEVAL USING MANDARIN SPEECH QUERIES Bo-ren Bai 1, Berlin Chen 2, Hsin-min Wang 2, Lee-feng Chien 2, and Lin-shan Lee 1,2 1 Department of Electrical

More information

VOICE AND TOUCH BASED INPUT

VOICE AND TOUCH BASED INPUT Technical Disclosure Commons Defensive Publications Series March 13, 2015 VOICE AND TOUCH BASED INPUT Michael Cardosa Follow this and additional works at: http://www.tdcommons.org/dpubs_series Recommended

More information

Speech Tuner. and Chief Scientist at EIG

Speech Tuner. and Chief Scientist at EIG Speech Tuner LumenVox's Speech Tuner is a complete maintenance tool for end-users, valueadded resellers, and platform providers. It s designed to perform tuning and transcription, as well as parameter,

More information

Multioperating Autonomous Robot

Multioperating Autonomous Robot Multioperating Autonomous Robot Sheikh Rafik Manihar, Electronics and Instrumentation, Chhatrapati Shivaji Institute of Technology, Durg ABSTRACT- Conventionally, wireless controlled robots user circuits,

More information

Alexa, what did I do last summer?

Alexa, what did I do last summer? , what did I do last summer? Vladimir Katalov, ElcomSoft SecTor 2018 ElcomSoft Ltd. www.elcomsoft.com 1 Who s Alexa? Amazon Alexa is a virtual assistant developed by Amazon She s 4 years young First appeared

More information

TypeIt ReadIt. Macintosh v 1.7

TypeIt ReadIt. Macintosh v 1.7 TypeIt ReadIt Macintosh v 1.7 1 Table of Contents Page Topic 3 TypeIt ReadIt 4 What s New With Version 1.7 5 System Requirements 6 User Interface 11 Keyboard Shortcuts 12 Printing 2 TypeIt ReadIt TypeIt

More information

Voice Control becomes Natural

Voice Control becomes Natural Voice Control becomes Natural ITU-T FOCUS GROUP CarCom -- SPEECH IN CARS Dr. Udo Haiber Torino, Italy, October 16, 2009 Overview Company What is Natural? Involved Components Focus Change Approach Conclusion

More information

Real-time Talking Head Driven by Voice and its Application to Communication and Entertainment

Real-time Talking Head Driven by Voice and its Application to Communication and Entertainment ISCA Archive Real-time Talking Head Driven by Voice and its Application to Communication and Entertainment Shigeo MORISHIMA Seikei University ABSTRACT Recently computer can make cyberspace to walk through

More information

It is been used to calculate the score which denotes the chances that given word is equal to some other word.

It is been used to calculate the score which denotes the chances that given word is equal to some other word. INTRODUCTION While I was tackling a NLP (Natural Language Processing) problem for one of my project "Stephanie", an open-source platform imitating a voice-controlled virtual assistant, it required a specific

More information

Type your codes into the Username and Password section and click on Login.

Type your codes into the Username and Password section and click on Login. Students guide to the Net Languages platform English for Work Premium Contents 1. How to enter the course... 1 2. How to navigate around the course... 1 3. How to view your progress... 5 4. Internal mail...

More information

Speech Control System for Robot Based on Raspberry Pi

Speech Control System for Robot Based on Raspberry Pi Advanced Materials Research Online: 2013-09-04 ISSN: 1662-8985, Vols. 791-793, pp 663-667 doi:10.4028/www.scientific.net/amr.791-793.663 2013 Trans Tech Publications, Switzerland Speech Control System

More information

ViaTalk. Quick guide. Version:v1.1

ViaTalk. Quick guide. Version:v1.1 ViaTalk Quick guide Version:v1.1 Release:February, 2014 Introduction ViaTalk runs independently without going through any hassle installation. Just connect your smart phone with your computer and launch

More information

NAVIGATION/TELECOMMUNICATION - SERVICE INFORMATION

NAVIGATION/TELECOMMUNICATION - SERVICE INFORMATION 8T - 56 NAVIGATION/TELECOMMUNICATION - SERVICE INFORMATION LX NAVIGATION/TELECOMMUNICATION - SERVICE INFORMATION DESCRIPTION TELECOMMUNICATIONS The hands-free cellular system uses Bluetooth technology

More information

Spoken Term Detection Using Multiple Speech Recognizers Outputs at NTCIR-9 SpokenDoc STD subtask

Spoken Term Detection Using Multiple Speech Recognizers Outputs at NTCIR-9 SpokenDoc STD subtask NTCIR-9 Workshop: SpokenDoc Spoken Term Detection Using Multiple Speech Recognizers Outputs at NTCIR-9 SpokenDoc STD subtask Hiromitsu Nishizaki Yuto Furuya Satoshi Natori Yoshihiro Sekiguchi University

More information

May Read&Write 5 Gold for Mac Beginners Guide

May Read&Write 5 Gold for Mac Beginners Guide May 2012 Read&Write 5 Gold for Mac Beginners Guide Read&Write 5 Gold for Mac INTRODUCTION... 3 SPEECH... 4 SPELLING... 6 PREDICTION... 8 DICTIONARY... 10 PICTURE DICTIONARY... 12 SOUNDS LIKE AND CONFUSABLE

More information

SpeakToText 2.5 Speech Recognition QUICK START GUIDE (Version 2.51)

SpeakToText 2.5 Speech Recognition QUICK START GUIDE (Version 2.51) Making it FUN and EASY to use SPEECH with your COMPUTER! CoolSoft, LLC SpeakToText 2.5 Speech Recognition QUICK START GUIDE (Version 2.51) Important Note: This Quick Start Guide is intended for previous

More information

A Prototype Robot Speech Interface with Multimodal Feedback

A Prototype Robot Speech Interface with Multimodal Feedback Proceedings of the 2002 IEEE Int. Workshop on Robot and Human Interactive Communication Berlin, Germany, Sept. 25-27, 2002 A Prototype Robot Speech Interface with Multimodal Feedback Mathias Haage +, Susanne

More information

Design of a Speech Interface for Augmenting Desktop Accessibility

Design of a Speech Interface for Augmenting Desktop Accessibility Design of a Speech Interface for Augmenting Desktop Accessibility David F. Rodrigues L²F - Spoken Language Systems Laboratory, INESC ID Lisboa / IST R. Alves Redol, 9, 1000-029 Lisboa, Portugal http://l2f.inesc-id.pt/

More information

User Instructions. For WiFi (PRO 2) Manual Version 1. For warranty, service and support, contact:

User Instructions. For WiFi (PRO 2) Manual Version 1. For warranty, service and support, contact: User Instructions For WiFi (PRO 2) Manual Version 1 For warranty, service and support, contact: Installed By: Install Company Name: Installer Phone Number: Installer Email Address: Date of Install: Note:

More information

User Manual. Helios PTT for BlackBerry

User Manual. Helios PTT for BlackBerry User Manual Helios PTT for BlackBerry Technical Support: Tel.: 1 250 762 7540 (8 a.m. to 5 p.m. Pacific time) E-Mail: support@heliosglobaltech.com Version 1.1 Table of contents: 1 Technical Support...

More information

Type your codes into the Username and Password section and click on Login.

Type your codes into the Username and Password section and click on Login. Students guide to the Net Languages platform First Certificate of English Practice Tests Contents 1. How to enter the course... 1 2. How to navigate around the practice test... 1 3. How to view your progress...

More information

GoogleTalk Installation Instructions:

GoogleTalk Installation Instructions: GoogleTalk Installation Instructions: Before you begin: Ensure you have an updated copy of your Operating system including Direct X9.0 or higher. You can download this update free of charge from Microsoft

More information

Authors Martin Eckert Ingmar Kliche Deutsche Telekom Laboratories.

Authors Martin Eckert Ingmar Kliche Deutsche Telekom Laboratories. Workshop on speaker biometrics and VoiceXML 3.0 March 5-6, 2009, Menlo Park, CA, US Proposal of an SIV architecture and requirements Authors Martin Eckert (martin.eckert@telekom.de), Ingmar Kliche (ingmar.kliche@telekom.de),

More information

Chapter 7. Representing Information Digitally

Chapter 7. Representing Information Digitally Chapter 7 Representing Information Digitally Learning Objectives Explain the link between patterns, symbols, and information Determine possible PandA encodings using a physical phenomenon Encode and decode

More information

These are meant to be used as desktop reminders or cheat sheets for using Read&Write Gold. To use. your Print Dialog box as shown

These are meant to be used as desktop reminders or cheat sheets for using Read&Write Gold. To use. your Print Dialog box as shown These are meant to be used as desktop reminders or cheat sheets for using Read&Write Gold. To use them Print as HANDOUTS by setting your Print Dialog box as shown Then Print and Cut up as individual cards,

More information

Digital Audio Basics

Digital Audio Basics CSC 170 Introduction to Computers and Their Applications Lecture #2 Digital Audio Basics Digital Audio Basics Digital audio is music, speech, and other sounds represented in binary format for use in digital

More information

Voice Profile Setup Guide

Voice Profile Setup Guide This document will help a user learn how to create, update, and maintain voice profiles. Understanding the voice profile is an important part in understanding how the ASR Transcription and interaction

More information

User Guide. Parrot MKi9000. English. Parrot MKi9000 User guide 1

User Guide. Parrot MKi9000. English. Parrot MKi9000 User guide 1 User Guide Parrot MKi9000 English Parrot MKi9000 User guide 1 Content Content... 2 Introduction... 4 Installing the Parrot MKi9000... 5 Car stereo with an ISO connector...5 Car stereo with line-in jacks...6

More information

Voice control PRINCIPLE OF OPERATION USING VOICE CONTROL. Activating the system

Voice control PRINCIPLE OF OPERATION USING VOICE CONTROL. Activating the system control PRINCIPLE OF OPERATION control enables operation of the audio and telephone systems without the need to divert your attention from the road ahead in order to change settings, or receive feedback

More information

It s Built In! Accessibility Options in Windows XP and Apple OS X

It s Built In! Accessibility Options in Windows XP and Apple OS X It s Built In! Accessibility Options in Windows XP and Apple OS X Delaware Instructional Technology Conference Joanne Jennings Office of Educational Technology University of Delaware Accessible Technology

More information

TypeIt ReadIt. Windows v 1.7

TypeIt ReadIt. Windows v 1.7 TypeIt ReadIt Windows v 1.7 1 Table of Contents Page Topic 3 TypeIt ReadIt 4 What s New With Version 1.7 5 System Requirements 6 User Interface 11 Keyboard Shortcuts 12 Printing 2 TypeIt ReadIt TypeIt

More information

Web2cToGo: Bringing the Web2cToolkit to Mobile Devices. Reinhard Bacher DESY, Hamburg, Germany

Web2cToGo: Bringing the Web2cToolkit to Mobile Devices. Reinhard Bacher DESY, Hamburg, Germany Web2cToGo: Bringing the Web2cToolkit to Mobile Devices Reinhard Bacher DESY, Hamburg, Germany Outline Introduction to Web2cToolkit New: Web2cToGo project Web2cToGo Web-Desktop Web-Desktop navigation and

More information

Read&Write 9 GOLD Training Guide

Read&Write 9 GOLD Training Guide . Read&Write 9 GOLD Training Guide Revised 29 th Jan 2009 Contents 1. Introduction... 1 2. Getting started... 2 Exercise 1 Logging into the system... 2 Exercise 2 Understanding the toolbar... 2 Exercise

More information

Students are placed in System 44 based on their performance in the Scholastic Phonics Inventory. System 44 Placement and Scholastic Phonics Inventory

Students are placed in System 44 based on their performance in the Scholastic Phonics Inventory. System 44 Placement and Scholastic Phonics Inventory System 44 Overview The System 44 student application leads students through a predetermined path to learn each of the 44 sounds and the letters or letter combinations that create those sounds. In doing

More information

Available Online at

Available Online at ISSN 2320-2602 Volume 2, No.12, December 2013 Nadeem Ahmed International Kanasro et al., International Journal Journal of Advances of Advances in in Computer Computer Science Science and Technology, and

More information

The Grid 2 is accessible to everybody, accepting input from eye gaze, switches, headpointer, touchscreen, mouse, and other options too.

The Grid 2 is accessible to everybody, accepting input from eye gaze, switches, headpointer, touchscreen, mouse, and other options too. The Grid 2-89224 Product Overview The Grid 2 is an all-in-one package for communication and access. The Grid 2 allows people with limited or unclear speech to use a computer as a voice output communication

More information

Say-it: Design of a Multimodal Game Interface for Children Based on CMU Sphinx 4 Framework

Say-it: Design of a Multimodal Game Interface for Children Based on CMU Sphinx 4 Framework Grand Valley State University ScholarWorks@GVSU Technical Library School of Computing and Information Systems 2014 Say-it: Design of a Multimodal Game Interface for Children Based on CMU Sphinx 4 Framework

More information

WFSTDM Builder Network-based Spoken Dialogue System Builder for Easy Prototyping

WFSTDM Builder Network-based Spoken Dialogue System Builder for Easy Prototyping WFSTDM Builder Network-based Spoken Dialogue System Builder for Easy Prototyping Etsuo Mizukami and Chiori Hori Abstract This paper introduces a network-based spoken dialog system development tool kit:

More information

MARATHI TEXT-TO-SPEECH SYNTHESISYSTEM FOR ANDROID PHONES

MARATHI TEXT-TO-SPEECH SYNTHESISYSTEM FOR ANDROID PHONES International Journal of Advances in Applied Science and Engineering (IJAEAS) ISSN (P): 2348-1811; ISSN (E): 2348-182X Vol. 3, Issue 2, May 2016, 34-38 IIST MARATHI TEXT-TO-SPEECH SYNTHESISYSTEM FOR ANDROID

More information

DRAGON NATURALLYSPEAKING 12 FEATURE MATRIX COMPARISON BY PRODUCT EDITION

DRAGON NATURALLYSPEAKING 12 FEATURE MATRIX COMPARISON BY PRODUCT EDITION Recognition Accuracy Turns your voice into text with up to 99% accuracy NEW - Up to a 20% improvement to out-of-the-box accuracy compared to Dragon version 11 Recognition Speed Words appear on the screen

More information

OCR Coverage. Open Court Reading Grade K CCSS Correlation

OCR Coverage. Open Court Reading Grade K CCSS Correlation Grade K Common Core State Standards Reading: Literature Key Ideas and Details RL.K.1 With prompting and support, ask and answer questions about key details in a text. OCR Coverage Unit 1: T70 Unit 2: T271,

More information

BLUETOOTH SYSTEM ALTEA/ALTEA XL/ALTEA FREETRACK/LEON OWNER S MANUAL

BLUETOOTH SYSTEM ALTEA/ALTEA XL/ALTEA FREETRACK/LEON OWNER S MANUAL BLUETOOTH SYSTEM ALTEA/ALTEA XL/ALTEA FREETRACK/LEON OWNER S MANUAL Table of Contents 1 Table of Contents Manual structure.................... 2 Introduction to the Bluetooth system.................................

More information

MITSUBISHI MOTORS NORTH AMERICA, INC. SMARTPHONE LINK DISPLAY AUDIO SYSTEM (SDA) QUICK REFERENCE GUIDE FOR ANDROID USERS

MITSUBISHI MOTORS NORTH AMERICA, INC. SMARTPHONE LINK DISPLAY AUDIO SYSTEM (SDA) QUICK REFERENCE GUIDE FOR ANDROID USERS MITSUBISHI MOTORS NORTH AMERICA, INC. SMARTPHONE LINK DISPLAY AUDIO SYSTEM (SDA) QUICK REFERENCE GUIDE FOR ANDROID USERS SMARTPHONE LINK DISPLAY AUDIO SYSTEM (SDA): ANDROID AUTO SMARTPHONE LINK DISPLAY

More information