Minimal-Impact Personal Audio Archives


1 Minimal-Impact Personal Audio Archives
  Dan Ellis, Keansub Lee, Jim Ogle
  Laboratory for Recognition and Organization of Speech and Audio
  Dept. Electrical Eng., Columbia Univ., NY USA
    1. Personal Audio Archives
    2. Segmenting & Clustering
    3. Speech Detection
    4. Repeated Events
    5. Future
  Personal Audio Archives - Ellis, Lee, Ogle p. 1 /18

2 1. Personal Audio Archives
  Easy to record everything you hear
    <2GB / 64 kbps
  Hard to find anything
    how to scan?
    how to visualize?
    how to index?
  Need automatic analysis
  Need minimal impact
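As a rough check on the storage figure above, the sketch below converts a capacity and a bitrate into hours of audio. The slide does not say what period the "<2GB" covers, so the units here (1 GB = 1e9 bytes, 64 kbps = 64,000 bits per second) and the function name `archive_hours` are assumptions made for illustration only.

```python
def archive_hours(capacity_bytes: float, bitrate_bps: float) -> float:
    """Hours of constant-bitrate audio that fit in capacity_bytes."""
    seconds = capacity_bytes * 8 / bitrate_bps
    return seconds / 3600


# Assumed units: 1 GB = 1e9 bytes, 64 kbps = 64,000 bits per second.
print(f"{archive_hours(2e9, 64_000):.1f} hours in 2 GB at 64 kbps")
```

Under those assumptions 2 GB holds roughly 69 hours of 64 kbps audio, which is consistent with the claim that recording everything is cheap.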

3 Information in Audio
  Long-duration recordings contain info on:
    location: type (restaurant, street,...) and specific
    activity: talking, walking, typing
    people: generic (2 males), specific (Chuck & John)
    spoken content... maybe
  but not:
    what people and things looked like
    day/night
    gaze, posture, motion,...

4 Applications
  Automatic appointment-book history
    fills in when & where of movements
  Life statistics
    how long did I spend in meetings this week?
    most frequent conversations
    favorite phrases?
  Retrieving details
    what exactly did I promise?
    privacy issues...
  Nostalgia
  ... or what?

5 2. Segmentation & Clustering
  Top-level structure for long recordings: Where are the major boundaries?
    e.g. for diary application
    support for manual browsing
  Length of fundamental time-frame
    60s rather than 10ms?
    background more important than foreground
    average out uncharacteristic transients
  Perceptually-motivated features
    .. so results have perceptual relevance
    broad spectrum + some detail

6 Features
  [Figure: six panels plotted over time (min) against frequency (Bark): Average Linear Energy, Normalized Energy Deviation, Average Log Energy (dB), Log Energy Deviation (dB), Average Spectral Entropy (bits), Spectral Entropy Deviation (bits)]
  Capture both average and variation
  Capture a little more detail in subbands...
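The feature names above suggest per-subband statistics pooled over a long time-frame. A minimal sketch of that idea follows; the slides do not give the exact definitions, so the formulas here (population standard deviation, dB as 10 log10, entropy of the normalized energy distribution) and the names `energy_stats` and `spectral_entropy` are assumptions rather than the authors' implementation.

```python
import math
from statistics import fmean, pstdev


def energy_stats(energies):
    """Summarize one subband's short-time energies over a long window.

    energies: positive short-time energy values (e.g. one per 10 ms frame)
    collected over a long time-frame such as 60 s.
    """
    mean_lin = fmean(energies)
    db = [10 * math.log10(e) for e in energies]
    return {
        "mean_lin": mean_lin,                      # average linear energy
        "norm_dev": pstdev(energies) / mean_lin,   # deviation / mean
        "mean_db": fmean(db),                      # average log energy (dB)
        "dev_db": pstdev(db),                      # log energy deviation (dB)
    }


def spectral_entropy(bin_energies):
    """Entropy in bits of the energy distribution across spectral bins."""
    total = sum(bin_energies)
    probs = [e / total for e in bin_energies if e > 0]
    return -sum(p * math.log2(p) for p in probs)
```

A flat spectrum across four bins gives 2 bits of entropy, while energy concentrated in one bin gives 0, so the entropy feature separates noise-like from tonal subbands.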

7 BIC Segmentation Results
  Evaluate: 62 hr hand-marked dataset
    8 days, 139 segments, 16 categories
    measure Correct Accept at False Accept = 2%:

    Feature               Correct Accept
    μdB                   80.8%
    μH                    81.1%
    σH/μH                 81.6%
    μdB + σH/μH           84.0%
    μdB + σH/μH + μH      83.6%
    mfcc                  73.6%

  [Figure: Sensitivity vs. Specificity curves for μdB, μH, σH/μH, μdB + σH/μH, and μdB + μH + σH/μH]
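For readers unfamiliar with BIC segmentation, the sketch below scores a candidate boundary by comparing one Gaussian fitted to the whole window against two Gaussians fitted on either side. It is deliberately simplified to a single scalar feature per frame; the talk's features are multi-dimensional, and the penalty weight `lam`, the variance `floor`, and the function names are illustrative assumptions.

```python
import math
from statistics import pvariance


def delta_bic(x, t, lam=1.0, floor=1e-6):
    """BIC gain from splitting the 1-D sequence x at index t.

    Positive values favor a boundary at t. Variances are floored so that
    constant stretches do not produce log(0).
    """
    n, n1, n2 = len(x), t, len(x) - t
    v = max(pvariance(x), floor)
    v1 = max(pvariance(x[:t]), floor)
    v2 = max(pvariance(x[t:]), floor)
    gain = 0.5 * (n * math.log(v) - n1 * math.log(v1) - n2 * math.log(v2))
    penalty = lam * 0.5 * 2 * math.log(n)  # 2 extra parameters: mean, variance
    return gain - penalty


def best_boundary(x, margin=2, lam=1.0):
    """Return (index, score) of the best split, or (None, score) if none is positive."""
    scores = {t: delta_bic(x, t, lam) for t in range(margin, len(x) - margin + 1)}
    t = max(scores, key=scores.get)
    return (t, scores[t]) if scores[t] > 0 else (None, scores[t])
```

Raising `lam` makes the detector more conservative, which is one way to trade Correct Accept against False Accept as in the table above.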

8 Segment Clustering
  Daily activity has lots of repetition:
    Automatically cluster similar segments
    affinity of segments as KL2 distances
  [Figure: labels not recoverable from the transcription]
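KL2 is commonly taken to mean the symmetrized Kullback-Leibler divergence. The sketch below computes it between two segments that are each modeled as a diagonal-covariance Gaussian; the slides do not state the segment model, so the diagonal assumption, the exponential mapping from distance to affinity, and the `scale` parameter are all illustrative choices.

```python
import math


def kl2(mean_p, var_p, mean_q, var_q):
    """Symmetrized KL divergence, KL(p||q) + KL(q||p), for diagonal Gaussians."""
    total = 0.0
    for mp, vp, mq, vq in zip(mean_p, var_p, mean_q, var_q):
        # The log-variance terms of the two directions cancel in the sum.
        total += vp / vq + vq / vp - 2.0 + (mp - mq) ** 2 * (1.0 / vp + 1.0 / vq)
    return 0.5 * total


def affinity(distance, scale=1.0):
    """Map a distance to a similarity in (0, 1]; the mapping is an assumption."""
    return math.exp(-distance / scale)


# Two unit-variance segments whose means differ by one standard deviation.
print(kl2([0.0], [1.0], [1.0], [1.0]))
```

Evaluating `affinity(kl2(...))` for every pair of segments gives a symmetric matrix that a clustering algorithm can consume.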

9 Clustering Results
  Clustering of automatic segments gives anonymous classes
    BIC criterion to choose number of clusters
    make best correspondence to 16 GT clusters
  Frame-level scoring gives ~70% correct
    errors when same place has multiple ambiences

10 3. Speech Detection
  Speech emerges as most interesting content
  Just identifying speech would be useful
    goal is speaker identification / labeling
  Lots of background noise
    conventional Voice Activity Detection inadequate
  Insight: Listeners detect pitch track (melody)
    look for voice-like periodicity in noise
  [Figure: spectrogram of a coffeeshop excerpt, frequency (axis to 4000) vs. time]

11 Voice Periodicity Enhancement
  Noise-robust subband autocorrelation
  Subtract local average
    suppresses steady background, e.g. machine noise
  15 min test set; 88% acc (79% w/o enhancement)
  also for enhancing speech (harmonic filtering)
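The idea of subtracting a local average can be illustrated with a toy example: compute a normalized autocorrelation per frame, then remove, lag by lag, the average over neighboring frames so that periodicity which is always present (a steady hum) cancels while transient periodicity (a voice) remains. This is a full-band simplification with made-up parameters (`half_width`, frame length, test signals), not the subband method from the talk.

```python
import math


def autocorr(frame, max_lag):
    """Energy-normalized autocorrelation r[0..max_lag] of one frame."""
    energy = sum(s * s for s in frame) or 1.0
    return [
        sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag)) / energy
        for lag in range(max_lag + 1)
    ]


def subtract_local_average(acs, half_width=3):
    """Subtract, for each lag, the mean over frames within +/- half_width."""
    out = []
    for k, ac in enumerate(acs):
        lo, hi = max(0, k - half_width), min(len(acs), k + half_width + 1)
        window = acs[lo:hi]
        local = [sum(a[lag] for a in window) / len(window) for lag in range(len(ac))]
        out.append([r - m for r, m in zip(ac, local)])
    return out


# Toy signals: a steady hum (period 20 samples) and one frame that also
# contains a second periodic component (period 8 samples).
hum = [math.sin(2 * math.pi * i / 20) for i in range(200)]
voiced = [h + math.sin(2 * math.pi * i / 8) for i, h in enumerate(hum)]
acs = [autocorr(f, 40) for f in (hum, hum, hum, voiced)]
enhanced = subtract_local_average(acs)
```

After the subtraction, only the last frame keeps a clear positive value at lag 8, the period of the added component, while the hum-only frames do not.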

12 4. Repeating Events
  Recurring sound events can be informative
    indicate similar circumstance...
    but: define event - sound organization
    define recurring event - how similar?
    .. and how to find them - tractable?
  Idea: Use hashing (fingerprints)
    index points to other occurrences of each hash; intersection of hashes points to match - much quicker search
    use a fingerprint insensitive to background?

13 Shazam Fingerprints
  Prominent spectral onsets are landmarks; Use relations {f1, f2, t} as hashes
  [Figure: Phone ring - Shazam fingerprint; frequency axis to 4000]
  intrinsically robust to background noise
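The landmark-and-hash idea can be sketched as follows: pair each spectral peak with a few later peaks, use (f1, f2, time difference) as the hash, store hashes in an inverted index, and look for a time offset on which many matching hashes agree. Peak picking is omitted, and the pairing limits `fan_out` and `max_dt`, the function names, and the toy peak lists are assumptions for illustration, not details of the Shazam system or of the talk.

```python
from collections import Counter, defaultdict


def landmark_hashes(peaks, fan_out=3, max_dt=20):
    """Pair each peak with up to fan_out later peaks.

    peaks: list of (time, freq) tuples sorted by time.
    Returns a list of ((f1, f2, dt), anchor_time) entries.
    """
    out = []
    for i, (t1, f1) in enumerate(peaks):
        paired = 0
        for t2, f2 in peaks[i + 1:]:
            dt = t2 - t1
            if dt > max_dt:
                break
            if dt <= 0:
                continue
            out.append(((f1, f2, dt), t1))
            paired += 1
            if paired == fan_out:
                break
    return out


def build_index(peaks):
    """Inverted index: hash -> list of times at which it occurs."""
    index = defaultdict(list)
    for h, t in landmark_hashes(peaks):
        index[h].append(t)
    return index


def find_offset(index, query_peaks):
    """Return (offset, votes) for the most consistent alignment, or None."""
    votes = Counter()
    for h, tq in landmark_hashes(query_peaks):
        for ta in index.get(h, ()):
            votes[ta - tq] += 1
    return votes.most_common(1)[0] if votes else None


ring = [(0, 10), (3, 40), (5, 22), (9, 31)]            # toy "phone ring" peaks
archive = [(20, 7), (50, 63)] + [(t + 100, f) for t, f in ring] + [(160, 15)]
query = [(0, 10), (2, 55), (3, 40), (5, 22), (9, 31)]  # ring plus one noise peak
print(find_offset(build_index(archive), query))
```

Because each hash depends only on a pair of peaks, an extra background peak in the query adds unmatched hashes without destroying the matching ones, which is the sense in which the fingerprint tolerates background noise.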

14 Exhaustive Search for Repeats
  More selective hashes
    few hits required to confirm match (faster; better precision)
    but less robust to background (reduce recall)
  Works well when exact structure repeats
    recorded music, electronic alerts
    no good for organic sounds, e.g. garage door

15 5. Future: Browsing Tools / Diary interface
  Browsing links to other information (diary, , photos)
    synchronize with note taking? (Stifelman & Arons)
  Release Tools + how to for capture
  [Figure: diary-style browsing interface laid out by date and time of day; labels not recoverable from the transcription]

16 Future: Speech Recognition
  Most audio is too noisy for standard ASR
    actually reassuring for privacy issues
  But... similar to Meeting Recordings
    NIST distant microphone conditions
  Speech enhancement - directional filtering
    2 channels a big improvement over one
    ... use a more special-purpose directional mic?

17 Privacy and Security
  Recordings are controversial
    privacy expectations: speech should be ephemeral?
    Oops button, delayed review (Roy)
    subpoenas... (Golubchik)
  Access to recordings is very sensitive
    .. but preservation is important too
  Approaches
    don't store intelligible audio
      .. but lessens utility - maybe store ASR output?
    split and store on multiple machines - tiered, distributed trust/access protocols
  Big issue!

18 Conclusions
  Personal Audio is easy & cheap to collect
    but is it any use?
  Segmentation/clustering works well
  Voice detection in noise is harder
    prospects for speaker identification
  Hashing to find arbitrary repeating events
  Tools distribution as a goal


Innovative Industrial Solutions, Inc Skyline Drive Russellville, AR Phone (479) Fax (479) In-ear Mic Headset Industrial grade design for all commercial uses Flexible, high grade cable reinforced with Kevlar provides strength 30% Reduction of Noise level/ansi Certified In-ear microphone technology

More information

