Contents. Resumen. List of Acronyms. List of Mathematical Symbols. List of Figures. List of Tables. I Introduction 1

Size: px
Start display at page:

Download "Contents. Resumen. List of Acronyms. List of Mathematical Symbols. List of Figures. List of Tables. I Introduction 1"

Transcription

1 Contents Agraïments Resum Resumen Abstract List of Acronyms List of Mathematical Symbols List of Figures List of Tables VII IX XI XIII XVIII XIX XXII XXIV I Introduction 1 1 Introduction Motivation Scientific and Technological Objectives Document Structure... 5 Bibliography Background Statistical Natural Language Recognition Preprocessing and Feature Extraction Automatic Speech Recognition Features O -line Handwriting Text Recognition Features On-line Handwriting Text Recognition Features Tandem Features Statistical Modelling Morphological Modelling Language Modelling I

2 Advances on the Transcription of Historical Manuscripts Lexicon Modelling Decoding The Viterbi Algorithm Recognition Output Formats Assistive Transcription of Historical Manuscripts Crowdsourcing for Natural Language Processing Tasks Evaluation Measures Natural Language Recognition Evaluation Language Model Evaluation Computer Assisted Transcription Evaluation Multimodal Crowdsourcing Statistical Significance Datasets Historical Manuscript Corpora (O -line Handwriting) Touch Screen Handwriting Corpus (On-line Handwriting): UNIPEN Training Speech Corpus: Albayzin Multimodal (Text - Speech) Corpora Bibliography II Multimodality 41 3 Combining Handwriting and Speech Introduction Hypothesis Combination on Natural Language Recognition Recogniser Output Voting Error Reduction (ROVER) N-best ROVER Lattices Rescoring Our proposal: Bimodal Confusion Network Combination Subnetworks Based Alignment Composing a New Confusion Network Conclusions Bibliography Multimodal Experimental Results Experimental Framework Datasets Features Models II

3 4.1.4 Evaluation Metrics Experimental Setup Experiment 1: Iterative and Non-Iterative Combination Experiments with Cristo Salvador Experiments with Rodrigo Experiment 2: Unimodal and Multimodal Combination Baseline Experiments Unimodal Combination Experiments Multimodal Combination Experiment Di culty of Reaching the Oracle Values Experiment 3: Multimodal Combination Comparative Conclusions and Future Work Bibliography III Interactivity 65 5 Assistive Transcription Computer Assisted Transcription Overview Multimodal Computer Assisted Transcription Multimodal Hypotheses Combination in CATTI Multimodal Hypotheses Correction in CATTI Conclusions Bibliography Interactivity Experimental Results Experimental Framework Datasets Features Models Evaluation Metrics Experimental Setup Experiment 1: Multimodal Hypotheses Combination Experiments with Cristo Salvador Experiments with Rodrigo Experiment 2: Multimodal Hypotheses Correction O -line and On-line HTR Results CATTI and Multimodal CATTI Results Experiment 3: Multimodal Hypotheses Combination and Correction III

4 Advances on the Transcription of Historical Manuscripts Post-Edition Baseline Results CATTI Results Multimodal CATTI Results Conclusions and Future Work Bibliography IV Crowdsourcing 85 7 Collective Collaboration Multimodal Crowdsourcing Framework Language Model Interpolation Multimodal Combination Reliability Verification Lines Selection Client Application for Speech Acquisition Conclusions Bibliography Crowdsourcing Experiments Experimental Conditions Datasets Features Models Evaluation Metrics Experimental Setup Experiment 1: Supervised Multimodal Crowdsourcing Baseline and Framework Adjustment Speaker Ordering ASR Reliability Verification Absence of Speech Utterances Collaborator E ort Optimisation Experiment 2: Unsupervised Multimodal Crowdsourcing Baseline and Framework Adjustment Preliminary Experiments ASR Reliability Verification and Collaboration E ort Collaboration E ort per Line Conclusions and Future Work Bibliography IV

5 V Conclusions and Future Work Conclusions and Future Work Conclusions Scientific Work and Contributions Future Work Bibliography V

Handwritten Text Recognition

Handwritten Text Recognition Handwritten Text Recognition M.J. Castro-Bleda, Joan Pasto Universidad Politécnica de Valencia Spain Zaragoza, March 2012 Text recognition () TRABHCI Zaragoza, March 2012 1 / 1 The problem: Handwriting

More information

Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents

Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents Vicente Bosch vbosch@iti.upv.es Alejandro Hector Toselli ahector@iti.upv.es Enrique Vidal evidal@iti.upv.es Pattern

More information

MLSALT11: Large Vocabulary Speech Recognition

MLSALT11: Large Vocabulary Speech Recognition MLSALT11: Large Vocabulary Speech Recognition Riashat Islam Department of Engineering University of Cambridge Trumpington Street, Cambridge, CB2 1PZ, England ri258@cam.ac.uk I. INTRODUCTION The objective

More information

Handwritten Text Recognition

Handwritten Text Recognition Handwritten Text Recognition M.J. Castro-Bleda, S. España-Boquera, F. Zamora-Martínez Universidad Politécnica de Valencia Spain Avignon, 9 December 2010 Text recognition () Avignon Avignon, 9 December

More information

OnLine Handwriting Recognition

OnLine Handwriting Recognition OnLine Handwriting Recognition (Master Course of HTR) Alejandro H. Toselli Departamento de Sistemas Informáticos y Computación Universidad Politécnica de Valencia February 26, 2008 A.H. Toselli (ITI -

More information

Mathematics Shape and Space: Polygon Angles

Mathematics Shape and Space: Polygon Angles a place of mind F A C U L T Y O F E D U C A T I O N Department of Curriculum and Pedagogy Mathematics Shape and Space: Polygon Angles Science and Mathematics Education Research Group Supported by UBC Teaching

More information

Knowledge-Based Word Lattice Rescoring in a Dynamic Context. Todd Shore, Friedrich Faubel, Hartmut Helmke, Dietrich Klakow

Knowledge-Based Word Lattice Rescoring in a Dynamic Context. Todd Shore, Friedrich Faubel, Hartmut Helmke, Dietrich Klakow Knowledge-Based Word Lattice Rescoring in a Dynamic Context Todd Shore, Friedrich Faubel, Hartmut Helmke, Dietrich Klakow Section I Motivation Motivation Problem: difficult to incorporate higher-level

More information

A Hidden Markov Model for Alphabet Soup Word Recognition

A Hidden Markov Model for Alphabet Soup Word Recognition A Hidden Markov Model for Alphabet Soup Word Recognition Shaolei Feng 1 Nicholas R. Howe 2 R. Manmatha 1 1 University of Massachusetts, Amherst 2 Smith College Motivation: Inaccessible Treasures Historical

More information

Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents

Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents Vicente Bosch Campos vbosch@iti.upv.es Alejandro Héctor Toselli ahector@iti.upv.es Enrique Vidal evidal@iti.upv.es

More information

Semantic Word Embedding Neural Network Language Models for Automatic Speech Recognition

Semantic Word Embedding Neural Network Language Models for Automatic Speech Recognition Semantic Word Embedding Neural Network Language Models for Automatic Speech Recognition Kartik Audhkhasi, Abhinav Sethy Bhuvana Ramabhadran Watson Multimodal Group IBM T. J. Watson Research Center Motivation

More information

Learning The Lexicon!

Learning The Lexicon! Learning The Lexicon! A Pronunciation Mixture Model! Ian McGraw! (imcgraw@mit.edu)! Ibrahim Badr Jim Glass! Computer Science and Artificial Intelligence Lab! Massachusetts Institute of Technology! Cambridge,

More information

Constrained Discriminative Training of N-gram Language Models

Constrained Discriminative Training of N-gram Language Models Constrained Discriminative Training of N-gram Language Models Ariya Rastrow #1, Abhinav Sethy 2, Bhuvana Ramabhadran 3 # Human Language Technology Center of Excellence, and Center for Language and Speech

More information

Overview. Search and Decoding. HMM Speech Recognition. The Search Problem in ASR (1) Today s lecture. Steve Renals

Overview. Search and Decoding. HMM Speech Recognition. The Search Problem in ASR (1) Today s lecture. Steve Renals Overview Search and Decoding Steve Renals Automatic Speech Recognition ASR Lecture 10 January - March 2012 Today s lecture Search in (large vocabulary) speech recognition Viterbi decoding Approximate search

More information

Maximum Likelihood Beamforming for Robust Automatic Speech Recognition

Maximum Likelihood Beamforming for Robust Automatic Speech Recognition Maximum Likelihood Beamforming for Robust Automatic Speech Recognition Barbara Rauch barbara@lsv.uni-saarland.de IGK Colloquium, Saarbrücken, 16 February 2006 Agenda Background: Standard ASR Robust ASR

More information

Automatic Speech Recognition (ASR)

Automatic Speech Recognition (ASR) Automatic Speech Recognition (ASR) February 2018 Reza Yazdani Aminabadi Universitat Politecnica de Catalunya (UPC) State-of-the-art State-of-the-art ASR system: DNN+HMM Speech (words) Sound Signal Graph

More information

FP SIMPLE4ALL deliverable D6.5. Deliverable D6.5. Initial Public Release of Open Source Tools

FP SIMPLE4ALL deliverable D6.5. Deliverable D6.5. Initial Public Release of Open Source Tools Deliverable D6.5 Initial Public Release of Open Source Tools The research leading to these results has received funding from the European Community s Seventh Framework Programme (FP7/2007-2013) under grant

More information

Semantic Video Indexing

Semantic Video Indexing Semantic Video Indexing T-61.6030 Multimedia Retrieval Stevan Keraudy stevan.keraudy@tkk.fi Helsinki University of Technology March 14, 2008 What is it? Query by keyword or tag is common Semantic Video

More information

Imperfect transcript driven speech recognition

Imperfect transcript driven speech recognition Imperfect transcript driven speech recognition Benjamin Lecouteux, Georges Linarès, Pascal Nocera, Jean-François Bonastre To cite this version: Benjamin Lecouteux, Georges Linarès, Pascal Nocera, Jean-François

More information

Interactive Handwritten Text Recognition and Indexing of Historical Documents: the transcriptorum Project

Interactive Handwritten Text Recognition and Indexing of Historical Documents: the transcriptorum Project Interactive Handwritten Text Recognition and ing of Historical Documents: the transcriptorum Project Alejandro H. Toselli ahector@prhlt.upv.es Pattern Recognition and Human Language Technology Reseach

More information

Adaptive Gesture Recognition System Integrating Multiple Inputs

Adaptive Gesture Recognition System Integrating Multiple Inputs Adaptive Gesture Recognition System Integrating Multiple Inputs Master Thesis - Colloquium Tobias Staron University of Hamburg Faculty of Mathematics, Informatics and Natural Sciences Technical Aspects

More information

BMEGUI Tutorial 1 Spatial kriging

BMEGUI Tutorial 1 Spatial kriging BMEGUI Tutorial 1 Spatial kriging 1. Objective The primary objective of this exercise is to get used to the basic operations of BMEGUI using a purely spatial dataset. The analysis will consist in an exploratory

More information

The Essential Guide to Video Processing

The Essential Guide to Video Processing The Essential Guide to Video Processing Second Edition EDITOR Al Bovik Department of Electrical and Computer Engineering The University of Texas at Austin Austin, Texas AMSTERDAM BOSTON HEIDELBERG LONDON

More information

CROSS-REFERENCE TABLE ASME A Including A17.1a-1997 Through A17.1d 2000 vs. ASME A

CROSS-REFERENCE TABLE ASME A Including A17.1a-1997 Through A17.1d 2000 vs. ASME A CROSS-REFERENCE TABLE ASME Including A17.1a-1997 Through A17.1d 2000 vs. ASME 1 1.1 1.1 1.1.1 1.2 1.1.2 1.3 1.1.3 1.4 1.1.4 2 1.2 3 1.3 4 Part 9 100 2.1 100.1 2.1.1 100.1a 2.1.1.1 100.1b 2.1.1.2 100.1c

More information

Summary of Contents LIST OF FIGURES LIST OF TABLES

Summary of Contents LIST OF FIGURES LIST OF TABLES Summary of Contents LIST OF FIGURES LIST OF TABLES PREFACE xvii xix xxi PART 1 BACKGROUND Chapter 1. Introduction 3 Chapter 2. Standards-Makers 21 Chapter 3. Principles of the S2ESC Collection 45 Chapter

More information

Spoken Term Detection Using Multiple Speech Recognizers Outputs at NTCIR-9 SpokenDoc STD subtask

Spoken Term Detection Using Multiple Speech Recognizers Outputs at NTCIR-9 SpokenDoc STD subtask NTCIR-9 Workshop: SpokenDoc Spoken Term Detection Using Multiple Speech Recognizers Outputs at NTCIR-9 SpokenDoc STD subtask Hiromitsu Nishizaki Yuto Furuya Satoshi Natori Yoshihiro Sekiguchi University

More information

6.2 Conceptual Framework for Autonomic Service Compositions

6.2 Conceptual Framework for Autonomic Service Compositions CONTENTS i preliminaries 1 1 introduction 3 1.1 Motivation 6 1.2 Problem Statement 8 1.3 Research Challenges 9 1.4 The Approach 11 1.5 Research Methodology 14 1.6 Thesis Context 16 1.7 Outline 16 2 background

More information

RLAT Rapid Language Adaptation Toolkit

RLAT Rapid Language Adaptation Toolkit RLAT Rapid Language Adaptation Toolkit Tim Schlippe May 15, 2012 RLAT Rapid Language Adaptation Toolkit - 2 RLAT Rapid Language Adaptation Toolkit RLAT Rapid Language Adaptation Toolkit - 3 Outline Introduction

More information

Speech Tuner. and Chief Scientist at EIG

Speech Tuner. and Chief Scientist at EIG Speech Tuner LumenVox's Speech Tuner is a complete maintenance tool for end-users, valueadded resellers, and platform providers. It s designed to perform tuning and transcription, as well as parameter,

More information

A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling For Handwriting Recognition

A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling For Handwriting Recognition A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling For Handwriting Recognition Théodore Bluche, Hermann Ney, Christopher Kermorvant SLSP 14, Grenoble October

More information

Discriminative Training and Adaptation of Large Vocabulary ASR Systems

Discriminative Training and Adaptation of Large Vocabulary ASR Systems Discriminative Training and Adaptation of Large Vocabulary ASR Systems Phil Woodland March 30th 2004 ICSI Seminar: March 30th 2004 Overview Why use discriminative training for LVCSR? MMIE/CMLE criterion

More information

Speech Technology Using in Wechat

Speech Technology Using in Wechat Speech Technology Using in Wechat FENG RAO Powered by WeChat Outline Introduce Algorithm of Speech Recognition Acoustic Model Language Model Decoder Speech Technology Open Platform Framework of Speech

More information

Introduction to PTC Windchill ProjectLink 11.0

Introduction to PTC Windchill ProjectLink 11.0 Introduction to PTC Windchill ProjectLink 11.0 Overview Course Code Course Length TRN-4756-T 8 Hours In this course, you will learn how to participate in and manage projects using Windchill ProjectLink

More information

Lattice Rescoring for Speech Recognition Using Large Scale Distributed Language Models

Lattice Rescoring for Speech Recognition Using Large Scale Distributed Language Models Lattice Rescoring for Speech Recognition Using Large Scale Distributed Language Models ABSTRACT Euisok Chung Hyung-Bae Jeon Jeon-Gue Park and Yun-Keun Lee Speech Processing Research Team, ETRI, 138 Gajeongno,

More information

Speech Recognition. Project: Phone Recognition using Sphinx. Chia-Ho Ling. Sunya Santananchai. Professor: Dr. Kepuska

Speech Recognition. Project: Phone Recognition using Sphinx. Chia-Ho Ling. Sunya Santananchai. Professor: Dr. Kepuska Speech Recognition Project: Phone Recognition using Sphinx Chia-Ho Ling Sunya Santananchai Professor: Dr. Kepuska Objective Use speech data corpora to build a model using CMU Sphinx.Apply a built model

More information

Sensor based pervasive System for user controlled automation

Sensor based pervasive System for user controlled automation International Journal of Embedded Systems, Robotics and Computer Engineering. Volume 1, Number 1 (2015), pp. 23-31 International Research Publication House http://www.irphouse.com Sensor based pervasive

More information

Automatic Speech Recognition on Mobile Devices and over Communication Networks

Automatic Speech Recognition on Mobile Devices and over Communication Networks Zheng-Hua Tan and Berge Lindberg Automatic Speech Recognition on Mobile Devices and over Communication Networks ^Spri inger g< Contents Preface Contributors v xix 1. Network, Distributed and Embedded Speech

More information

TALK project work at UCAM

TALK project work at UCAM TALK project work at UCAM Matt Stuttle August 2005 Web: Email: http://mi.eng.cam.ac.uk/ mns25 mns25@eng.cam.ac.uk Dialogs on Dialogs Overview TALK project Goals Baseline system the Hidden Information State

More information

TABLE OF CONTENTS CHAPTER TITLE PAGE NO NO.

TABLE OF CONTENTS CHAPTER TITLE PAGE NO NO. vi TABLE OF CONTENTS CHAPTER TITLE PAGE NO NO. ABSTRACT iii LIST OF TABLES xiii LIST OF FIGURES xiv LIST OF SYMBOLS AND ABBREVIATIONS xix 1 INTRODUCTION 1 1.1 CLOUD COMPUTING 1 1.1.1 Introduction to Cloud

More information

PHONE-BASED SPOKEN DOCUMENT RETRIEVAL IN CONFORMANCE WITH THE MPEG-7 STANDARD

PHONE-BASED SPOKEN DOCUMENT RETRIEVAL IN CONFORMANCE WITH THE MPEG-7 STANDARD PHONE-BASED SPOKEN DOCUMENT RETRIEVAL IN CONFORMANCE WITH THE MPEG-7 STANDARD NICOLAS MOREAU, HYOUNG GOOK KIM, AND THOMAS SIKORA Communication Systems Group, Technical University of Berlin, Germany [moreau,kim,sikora]@nue.tu-berlin.de

More information

Introduction to Creo Elements/Direct 19.0 Modeling

Introduction to Creo Elements/Direct 19.0 Modeling Introduction to Creo Elements/Direct 19.0 Modeling Overview Course Code Course Length TRN-4531-T 3 Day In this course, you will learn the basics about 3-D design using Creo Elements/Direct Modeling. You

More information

Memory-Efficient Heterogeneous Speech Recognition Hybrid in GPU-Equipped Mobile Devices

Memory-Efficient Heterogeneous Speech Recognition Hybrid in GPU-Equipped Mobile Devices Memory-Efficient Heterogeneous Speech Recognition Hybrid in GPU-Equipped Mobile Devices Alexei V. Ivanov, CTO, Verbumware Inc. GPU Technology Conference, San Jose, March 17, 2015 Autonomous Speech Recognition

More information

Introduction to Windchill PDMLink 10.2 for the Implementation Team

Introduction to Windchill PDMLink 10.2 for the Implementation Team Introduction to Windchill PDMLink 10.2 for the Implementation Team Overview Course Code Course Length TRN-4262-T 2 Days In this course, you will learn how to complete basic Windchill PDMLink functions.

More information

Workshop: Automatisierte Handschriftenerkennung

Workshop: Automatisierte Handschriftenerkennung Workshop: Automatisierte Handschriftenerkennung Joan Andreu Sánchez Pattern Recognition and Human Language Research group (Technical University of Valencia) Günter Mühlberger, Sebastian Colutto, Philip

More information

Tuning. Philipp Koehn presented by Gaurav Kumar. 28 September 2017

Tuning. Philipp Koehn presented by Gaurav Kumar. 28 September 2017 Tuning Philipp Koehn presented by Gaurav Kumar 28 September 2017 The Story so Far: Generative Models 1 The definition of translation probability follows a mathematical derivation argmax e p(e f) = argmax

More information

Mono-font Cursive Arabic Text Recognition Using Speech Recognition System

Mono-font Cursive Arabic Text Recognition Using Speech Recognition System Mono-font Cursive Arabic Text Recognition Using Speech Recognition System M.S. Khorsheed Computer & Electronics Research Institute, King AbdulAziz City for Science and Technology (KACST) PO Box 6086, Riyadh

More information

Large Scale Distributed Acoustic Modeling With Back-off N-grams

Large Scale Distributed Acoustic Modeling With Back-off N-grams Large Scale Distributed Acoustic Modeling With Back-off N-grams Ciprian Chelba* and Peng Xu and Fernando Pereira and Thomas Richardson Abstract The paper revives an older approach to acoustic modeling

More information

Manifold Learning Theory and Applications

Manifold Learning Theory and Applications Manifold Learning Theory and Applications Yunqian Ma and Yun Fu CRC Press Taylor Si Francis Group Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Group, an informa business Contents

More information

Contents. I Introduction to Data Science 1. Preface. List of Tables. List of Figures

Contents. I Introduction to Data Science 1. Preface. List of Tables. List of Figures Contents Preface List of Tables List of Figures xv xix xxi I Introduction to Data Science 1 1 Prologue: Why data science? 3 1.1 What is data science?.............................. 4 1.2 Case study: The

More information

Viterbi Based Alignment between Text Images and their Transcripts

Viterbi Based Alignment between Text Images and their Transcripts Viteri Based Alignment etween Text Images and their Transcripts Alejandro H. Toselli, Verónica Romero and Enrique Vidal Institut Tecnològic d Informàtica Universitat Politècnica de València Camí de Vera

More information

Stochastic Segment Modeling for Offline Handwriting Recognition

Stochastic Segment Modeling for Offline Handwriting Recognition 2009 10th nternational Conference on Document Analysis and Recognition tochastic egment Modeling for Offline Handwriting Recognition Prem Natarajan, Krishna ubramanian, Anurag Bhardwaj, Rohit Prasad BBN

More information

K Nearest Neighbor Wrap Up K- Means Clustering. Slides adapted from Prof. Carpuat

K Nearest Neighbor Wrap Up K- Means Clustering. Slides adapted from Prof. Carpuat K Nearest Neighbor Wrap Up K- Means Clustering Slides adapted from Prof. Carpuat K Nearest Neighbor classification Classification is based on Test instance with Training Data K: number of neighbors that

More information

Document downloaded from: This paper must be cited as:

Document downloaded from:   This paper must be cited as: Document downloaded from: http://hdl.handle.net/1/0 This paper must be cited as: The final publication is available at https://doi.org/.0/s00-01-- Copyright Springer-Verlag Additional Information Neural

More information

LOW-RANK MATRIX FACTORIZATION FOR DEEP NEURAL NETWORK TRAINING WITH HIGH-DIMENSIONAL OUTPUT TARGETS

LOW-RANK MATRIX FACTORIZATION FOR DEEP NEURAL NETWORK TRAINING WITH HIGH-DIMENSIONAL OUTPUT TARGETS LOW-RANK MATRIX FACTORIZATION FOR DEEP NEURAL NETWORK TRAINING WITH HIGH-DIMENSIONAL OUTPUT TARGETS Tara N. Sainath, Brian Kingsbury, Vikas Sindhwani, Ebru Arisoy, Bhuvana Ramabhadran IBM T. J. Watson

More information

Ground-Truth Production in the transcriptorium Project

Ground-Truth Production in the transcriptorium Project 2014 11th IAPR International Workshop on Document Analysis Systems Ground-Truth Production in the transcriptorium Project B. Gatos and G. Louloudis Inst. of Inf. and Telecommunications National Centre

More information

Segmentation free Bangla OCR using HMM: Training and Recognition

Segmentation free Bangla OCR using HMM: Training and Recognition Segmentation free Bangla OCR using HMM: Training and Recognition Md. Abul Hasnat, S.M. Murtoza Habib, Mumit Khan BRAC University, Bangladesh mhasnat@gmail.com, murtoza@gmail.com, mumit@bracuniversity.ac.bd

More information

Discriminative Training of Decoding Graphs for Large Vocabulary Continuous Speech Recognition

Discriminative Training of Decoding Graphs for Large Vocabulary Continuous Speech Recognition Discriminative Training of Decoding Graphs for Large Vocabulary Continuous Speech Recognition by Hong-Kwang Jeff Kuo, Brian Kingsbury (IBM Research) and Geoffry Zweig (Microsoft Research) ICASSP 2007 Presented

More information

Introduction to PTC Windchill PDMLink 11.0 for Heavy Users

Introduction to PTC Windchill PDMLink 11.0 for Heavy Users Introduction to PTC Windchill PDMLink 11.0 for Heavy Users Overview Course Code Course Length TRN-4751-T 16 Hours In this course, you will learn how to complete the day-to-day functions that enable you

More information

D6.1.2: Second report on scientific evaluations

D6.1.2: Second report on scientific evaluations D6.1.2: Second report on scientific evaluations UPVLC, XEROX, JSI-K4A, RWTH, EML and DDS Distribution: Public translectures Transcription and Translation of Video Lectures ICT Project 287755 Deliverable

More information

Advanced Multimodal Machine Learning

Advanced Multimodal Machine Learning Advanced Multimodal Machine Learning Lecture 1.2: Challenges and applications Louis-Philippe Morency Tadas Baltrušaitis 1 Objectives Identify the 5 technical challenges in multimodal machine learning Identify

More information

Open-Vocabulary Recognition of Machine-Printed Arabic Text Using Hidden Markov Models

Open-Vocabulary Recognition of Machine-Printed Arabic Text Using Hidden Markov Models Open-Vocabulary Recognition of Machine-Printed Arabic Text Using Hidden Markov Models Irfan Ahmad 1,2,*, Sabri A. Mahmoud 1, and Gernot A. Fink 2 1 Information and Computer Science Department, KFUPM, Dhahran

More information

Advanced Control Foundation: Tools, Techniques and Applications. Terrence Blevins Willy K. Wojsznis Mark Nixon

Advanced Control Foundation: Tools, Techniques and Applications. Terrence Blevins Willy K. Wojsznis Mark Nixon Advanced Control Foundation: Tools, Techniques and Applications Terrence Blevins Willy K. Wojsznis Mark Nixon Contents Acknowledgments About the Authors vii xvii Foreword xxi Chapter 1 INTRODUCTION 1 Chapter

More information

THE PERFORMANCE of automatic speech recognition

THE PERFORMANCE of automatic speech recognition IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 6, NOVEMBER 2006 2109 Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments Michael L. Seltzer,

More information

Certificate Program. Introduction to Microsoft Excel 2013

Certificate Program. Introduction to Microsoft Excel 2013 Certificate Program We offer online education programs designed to provide the workforce skills necessary to enter a new field or advance your current career. Our Online Career Training Programs in the

More information

SHSAT Review Class Week 3-10/21/2016

SHSAT Review Class Week 3-10/21/2016 SHSAT Review Class Week 3-10/21/2016 Week Two Agenda 1. Going over HW (Test 2) 2. Review of Geometry - Practice set 3. Questions before we leave Test 2 Questions? Ask about any questions you were confused

More information

AAM Guide for Authors

AAM Guide for Authors ISSN: 1932-9466 AAM Guide for Authors Application and Applied Mathematics: An International Journal (AAM) invites contributors from throughout the world to submit their original manuscripts for review

More information

Optimizing Speech Recognition Evaluation Using Stratified Sampling

Optimizing Speech Recognition Evaluation Using Stratified Sampling INTERSPEECH 01 September 1, 01, San Francisco, USA Optimizing Speech Recognition Evaluation Using Stratified Sampling Janne Pylkkönen, Thomas Drugman, Max Bisani Amazon {jannepyl, drugman, bisani}@amazon.com

More information

ACT s College Readiness Standards

ACT s College Readiness Standards Course ACT s College Readiness Standards Select a single piece of data (numerical or nonnumerical) from a simple data presentation (e.g., a table or graph with two or three variables; a food web diagram)

More information

Using Bagging and Boosting Techniques for Improving Coreference Resolution

Using Bagging and Boosting Techniques for Improving Coreference Resolution Informatica 34 (2010) 111 118 111 Using Bagging and Boosting Techniques for Improving Coreference Resolution Smita Vemulapalli Center for Signal and Image Processing (CSIP), School of Electrical and Computer

More information

Annexure I: Contact Details:

Annexure I: Contact Details: Ref: CO/IT-BPR/CSC Date:.09.2017 Annexure I: Contact Details: a) Name of the company b) Company s address in India c) Contact person d) Telephone no. e) Fax f) E-mail address g) Service tax registration

More information

Applications of Machine Translation

Applications of Machine Translation Applications of Machine Translation Index Historical Overview Commercial Products Open Source Software Special Applications Future Aspects History Before the Computer: Mid 1930s: Georges Artsrouni and

More information

Language Resources. Khalid Choukri ELRA/ELDA 55 Rue Brillat-Savarin, F Paris, France Tel Fax.

Language Resources. Khalid Choukri ELRA/ELDA 55 Rue Brillat-Savarin, F Paris, France Tel Fax. Language Resources By the Other Data Center over 15 years fruitful partnership Khalid Choukri ELRA/ELDA 55 Rue Brillat-Savarin, F-75013 Paris, France Tel. +33 1 43 13 33 33 -- Fax. +33 1 43 13 33 30 choukri@elda.org

More information

CSCI 5582 Artificial Intelligence. Today 10/31

CSCI 5582 Artificial Intelligence. Today 10/31 CSCI 5582 Artificial Intelligence Lecture 17 Jim Martin Today 10/31 HMM Training (EM) Break Machine Learning 1 Urns and Balls Π Urn 1: 0.9; Urn 2: 0.1 A Urn 1 Urn 2 Urn 1 Urn 2 0.6 0.3 0.4 0.7 B Urn 1

More information

Integrated exemplar-based template matching and statistical modeling for continuous speech recognition

Integrated exemplar-based template matching and statistical modeling for continuous speech recognition Sun and Zhao EURASIP Journal on Audio, Speech, and Music Processing 2014, 2014:4 RESEARCH Open Access Integrated exemplar-based template matching and statistical modeling for continuous speech recognition

More information

Alternative Clusterings: Current Progress and Open Challenges

Alternative Clusterings: Current Progress and Open Challenges Alternative Clusterings: Current Progress and Open Challenges James Bailey Department of Computer Science and Software Engineering The University of Melbourne, Australia 1 Introduction Cluster analysis:

More information

SCCAP. User Guide: Version 198

SCCAP. User Guide: Version 198 SCCAP User Guide: Version 198 Table of Contents Introduction 3 CHAPTER 1 I. The Welcome Screen 4 II. The Load or Create Case Screen 5 III. Logging onto a Server 6 IV. Creating the Code Template 7 V. Creating

More information

Deep Learning for Computer Vision with MATLAB By Jon Cherrie

Deep Learning for Computer Vision with MATLAB By Jon Cherrie Deep Learning for Computer Vision with MATLAB By Jon Cherrie 2015 The MathWorks, Inc. 1 Deep learning is getting a lot of attention "Dahl and his colleagues won $22,000 with a deeplearning system. 'We

More information

Audio-visual interaction in sparse representation features for noise robust audio-visual speech recognition

Audio-visual interaction in sparse representation features for noise robust audio-visual speech recognition ISCA Archive http://www.isca-speech.org/archive Auditory-Visual Speech Processing (AVSP) 2013 Annecy, France August 29 - September 1, 2013 Audio-visual interaction in sparse representation features for

More information

Introduction to HTK Toolkit

Introduction to HTK Toolkit Introduction to HTK Toolkit Berlin Chen 2003 Reference: - The HTK Book, Version 3.2 Outline An Overview of HTK HTK Processing Stages Data Preparation Tools Training Tools Testing Tools Analysis Tools Homework:

More information

System Administration of PTC Windchill 11.0

System Administration of PTC Windchill 11.0 System Administration of PTC Windchill 11.0 Overview Course Code Course Length TRN-4830-T 16 Hours In this course, you will gain an understanding of how to perform routine Windchill system administration

More information

SPOKEN DOCUMENT CLASSIFICATION BASED ON LSH

SPOKEN DOCUMENT CLASSIFICATION BASED ON LSH 3 st January 3. Vol. 47 No.3 5-3 JATIT & LLS. All rights reserved. ISSN: 99-8645 www.jatit.org E-ISSN: 87-395 SPOKEN DOCUMENT CLASSIFICATION BASED ON LSH ZHANG LEI, XIE SHOUZHI, HE XUEWEN Information and

More information

Part I: Data Mining Foundations

Part I: Data Mining Foundations Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?

More information

Introduction to PTC Windchill PDMLink 11.0 for the Implementation Team

Introduction to PTC Windchill PDMLink 11.0 for the Implementation Team Introduction to PTC Windchill PDMLink 11.0 for the Implementation Team Overview Course Code Course Length TRN-4752-T 16 Hours In this course, you will learn how to complete basic Windchill PDMLink functions.

More information

Applying Supervised Learning

Applying Supervised Learning Applying Supervised Learning When to Consider Supervised Learning A supervised learning algorithm takes a known set of input data (the training set) and known responses to the data (output), and trains

More information

Handwritten word verification by SVM-based hypotheses re-scoring and multiple thresholds rejection

Handwritten word verification by SVM-based hypotheses re-scoring and multiple thresholds rejection Author manuscript, published in "International Conference on Frontiers in Handwriting Recognition (2010)" Handwritten word verification by SVM-based hypotheses re-scoring and multiple thresholds rejection

More information

Oracle Exadata Recipes

Oracle Exadata Recipes Oracle Exadata Recipes A Problem-Solution Approach John Clarke Apress- Contents About the Author About the Technical Reviewer Acknowledgments Introduction xxxiii xxxv xxxvii xxxix Part 1: Exadata Architecture

More information

Table of Contents. Preface... vii Abstract... vii Kurzfassung... x Acknowledgements... xiii. I The Preliminaries 1

Table of Contents. Preface... vii Abstract... vii Kurzfassung... x Acknowledgements... xiii. I The Preliminaries 1 Preface............................................ vii Abstract............................................ vii Kurzfassung.......................................... x Acknowledgements......................................

More information

"Charting the Course... SharePoint 2007 Hands-On Labs Course Summary

Charting the Course... SharePoint 2007 Hands-On Labs Course Summary Course Summary Description This series of 33 hands-on labs allows students to explore the new features of Microsoft SharePoint Server, Microsoft Windows, Microsoft Office, including Microsoft Office Groove,

More information

Preparation Meeting. Recent Advances in the Analysis of 3D Shapes. Emanuele Rodolà Matthias Vestner Thomas Windheuser Daniel Cremers

Preparation Meeting. Recent Advances in the Analysis of 3D Shapes. Emanuele Rodolà Matthias Vestner Thomas Windheuser Daniel Cremers Preparation Meeting Recent Advances in the Analysis of 3D Shapes Emanuele Rodolà Matthias Vestner Thomas Windheuser Daniel Cremers What You Will Learn in the Seminar Get an overview on state of the art

More information

MINIMUM EXACT WORD ERROR TRAINING. G. Heigold, W. Macherey, R. Schlüter, H. Ney

MINIMUM EXACT WORD ERROR TRAINING. G. Heigold, W. Macherey, R. Schlüter, H. Ney MINIMUM EXACT WORD ERROR TRAINING G. Heigold, W. Macherey, R. Schlüter, H. Ney Lehrstuhl für Informatik 6 - Computer Science Dept. RWTH Aachen University, Aachen, Germany {heigold,w.macherey,schlueter,ney}@cs.rwth-aachen.de

More information

VERIFICATION AND VALIDATION FOR QUALITY OF UML 2.0 MODELS

VERIFICATION AND VALIDATION FOR QUALITY OF UML 2.0 MODELS VERIFICATION AND VALIDATION FOR QUALITY OF UML 2.0 MODELS BHUVAN UNHELKAR, PHD WILEY- INTERSCIENCE A Wiley-lnterscience Publication JOHN WILEY & SONS, INC. Contents Figures Foreword Preface Acknowledgments

More information

Contents. Preface xvii Acknowledgments. CHAPTER 1 Introduction to Parallel Computing 1. CHAPTER 2 Parallel Programming Platforms 11

Contents. Preface xvii Acknowledgments. CHAPTER 1 Introduction to Parallel Computing 1. CHAPTER 2 Parallel Programming Platforms 11 Preface xvii Acknowledgments xix CHAPTER 1 Introduction to Parallel Computing 1 1.1 Motivating Parallelism 2 1.1.1 The Computational Power Argument from Transistors to FLOPS 2 1.1.2 The Memory/Disk Speed

More information

Lecture 7: Neural network acoustic models in speech recognition

Lecture 7: Neural network acoustic models in speech recognition CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 7: Neural network acoustic models in speech recognition Outline Hybrid acoustic modeling overview Basic

More information

Making Deep Belief Networks Effective for Large Vocabulary Continuous Speech Recognition

Making Deep Belief Networks Effective for Large Vocabulary Continuous Speech Recognition Making Deep Belief Networks Effective for Large Vocabulary Continuous Speech Recognition Tara N. Sainath 1, Brian Kingsbury 1, Bhuvana Ramabhadran 1, Petr Fousek 2, Petr Novak 2, Abdel-rahman Mohamed 3

More information

Detection of Acoustic Events in Meeting-Room Environment

Detection of Acoustic Events in Meeting-Room Environment 11/Dec/2008 Detection of Acoustic Events in Meeting-Room Environment Presented by Andriy Temko Department of Electrical and Electronic Engineering Page 2 of 34 Content Introduction State of the Art Acoustic

More information

Comparing Natural and Synthetic Training Data for Off-line Cursive Handwriting Recognition

Comparing Natural and Synthetic Training Data for Off-line Cursive Handwriting Recognition Comparing Natural and Synthetic Training Data for Off-line Cursive Handwriting Recognition Tamás Varga and Horst Bunke Institut für Informatik und angewandte Mathematik, Universität Bern Neubrückstrasse

More information

TABLE OF CONTENTS PAGE TITLE NO.

TABLE OF CONTENTS PAGE TITLE NO. TABLE OF CONTENTS CHAPTER PAGE TITLE ABSTRACT iv LIST OF TABLES xi LIST OF FIGURES xii LIST OF ABBREVIATIONS & SYMBOLS xiv 1. INTRODUCTION 1 2. LITERATURE SURVEY 14 3. MOTIVATIONS & OBJECTIVES OF THIS

More information

Discriminative training and Feature combination

Discriminative training and Feature combination Discriminative training and Feature combination Steve Renals Automatic Speech Recognition ASR Lecture 13 16 March 2009 Steve Renals Discriminative training and Feature combination 1 Overview Hot topics

More information

Selection of Best Match Keyword using Spoken Term Detection for Spoken Document Indexing

Selection of Best Match Keyword using Spoken Term Detection for Spoken Document Indexing Selection of Best Match Keyword using Spoken Term Detection for Spoken Document Indexing Kentaro Domoto, Takehito Utsuro, Naoki Sawada and Hiromitsu Nishizaki Graduate School of Systems and Information

More information

arxiv: v1 [cs.cl] 30 Jan 2018

arxiv: v1 [cs.cl] 30 Jan 2018 ACCELERATING RECURRENT NEURAL NETWORK LANGUAGE MODEL BASED ONLINE SPEECH RECOGNITION SYSTEM Kyungmin Lee, Chiyoun Park, Namhoon Kim, and Jaewon Lee DMC R&D Center, Samsung Electronics, Seoul, Korea {k.m.lee,

More information

Improving Transcription of Manuscripts with Multimodality and Interaction

Improving Transcription of Manuscripts with Multimodality and Interaction IberSPEECH 2018 21-23 November 2018, Barcelona, Sain Imroving Transcrition of Manuscrits with Multimodality and Interaction Emilio Granell, Carlos-D. Martínez-Hinarejos, Verónica Romero Pattern Recognition

More information