Minimal-Impact Personal Audio Archives
Transcription
1 Minimal-Impact Personal Audio Archives
Dan Ellis, Keansub Lee, Jim Ogle
Laboratory for Recognition and Organization of Speech and Audio
Dept. Electrical Eng., Columbia Univ., NY USA
1. Personal Audio Archives
2. Segmenting & Clustering
3. Speech Detection
4. Repeated Events
5. Future
Personal Audio Archives - Ellis, Lee, Ogle p. 1 /18
2 1. Personal Audio Archives
Easy to record everything you hear
  - <2GB / 64 kbps
Hard to find anything
  - how to scan?
  - how to visualize?
  - how to index?
Need automatic analysis
Need minimal impact
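As a rough check on the storage figure above, the arithmetic can be sketched as follows. The function name is hypothetical, and 2 GB is taken as 2 x 10^9 bytes, which is an assumption; the slide does not say what period the 2 GB covers.

```python
def hours_of_audio(capacity_bytes, bitrate_bps):
    """Hours of audio that fit in capacity_bytes at a constant bitrate."""
    bytes_per_second = bitrate_bps / 8  # 64 kbps -> 8000 bytes per second
    return capacity_bytes / bytes_per_second / 3600


# Assumed: 2 GB = 2e9 bytes, 64 kbps = 64,000 bits per second.
print(round(hours_of_audio(2e9, 64_000), 1))  # 69.4
```

At this rate one hour of audio takes 28.8 MB, so 2 GB holds roughly 69 hours.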
3 Information in Audio
Long-duration recordings contain info on:
  - location: type (restaurant, street, ...) and specific
  - activity: talking, walking, typing
  - people: generic (2 males), specific (Chuck & John)
  - spoken content... maybe
but not:
  - what people and things looked like
  - day/night
  - gaze, posture, motion, ...
4 Applications
Automatic appointment-book history
  - fills in when & where of movements
Life statistics
  - how long did I spend in meetings this week?
  - most frequent conversations
  - favorite phrases?
Retrieving details
  - what exactly did I promise?
  - privacy issues...
Nostalgia... or what?
5 2. Segmentation & Clustering
Top-level structure for long recordings: Where are the major boundaries?
  - e.g. for diary application
  - support for manual browsing
Length of fundamental time-frame
  - 60s rather than 10ms?
  - background more important than foreground
  - average out uncharacteristic transients
Perceptually-motivated features
  - .. so results have perceptual relevance
  - broad spectrum + some detail
6 Features
[Figure: six panels plotted as frequency (Bark) against time (minutes): Average Linear Energy, Normalized Energy Deviation, Average Log Energy, Log Energy Deviation, Average Spectral Entropy, Spectral Entropy Deviation; color scales in dB or bits]
Capture both average and variation
Capture a little more detail in subbands...
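The feature names above can be illustrated with a minimal sketch, assuming each long time-frame is summarized from a series of short-time measurements in one subband. The function names, the dB floor, and the choice to compute entropy over the bin energies inside a band are assumptions for illustration, not the authors' exact definitions.

```python
import math


def log_energy_db(energy, floor=1e-10):
    """Energy in dB, floored to avoid log of zero."""
    return 10.0 * math.log10(max(energy, floor))


def spectral_entropy(bin_energies):
    """Entropy in bits of the normalized energy distribution across bins."""
    total = sum(bin_energies)
    if total <= 0:
        return 0.0
    probs = [e / total for e in bin_energies if e > 0]
    return -sum(p * math.log2(p) for p in probs)


def mean_and_deviation(values):
    """Average and standard deviation over one long time-frame."""
    n = len(values)
    mean = sum(values) / n
    dev = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return mean, dev


# Two short-time frames of one subband, each with four bin energies.
frames = [[1.0, 1.0, 1.0, 1.0], [4.0, 0.0, 0.0, 0.0]]
entropies = [spectral_entropy(f) for f in frames]       # flat vs. peaked
mu_h, sigma_h = mean_and_deviation(entropies)
mu_db, _ = mean_and_deviation([log_energy_db(sum(f)) for f in frames])
print(mu_h, sigma_h / mu_h, mu_db)
```

A flat spectrum gives the maximum entropy for its bin count and a single peak gives zero, so the average and its normalized deviation describe how noise-like a band is over the window.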
7 BIC Segmentation Results
Evaluate: 62 hr hand-marked dataset
  - 8 days, 139 segments, 16 categories
  - measure Correct Accept at False Accept = 2%:

Feature               Correct Accept
μdB                   80.8%
μH                    81.1%
σH/μH                 81.6%
μdB + σH/μH           84.0%
μdB + σH/μH + μH      83.6%
mfcc                  73.6%

[Figure: Sensitivity against Specificity for μdB, μH, σH/μH, μdB + σH/μH, and μdB + μH + σH/μH]
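The slide does not spell out the BIC test, so the following is only a minimal sketch of the standard idea, assuming a one-dimensional feature modeled as a Gaussian: a boundary is accepted where two separate Gaussians explain the data better than one, after a penalty for the extra parameters. The names `delta_bic`, `best_boundary`, `margin`, and `penalty_weight` are hypothetical.

```python
import math


def _log_var(xs):
    """Log of the maximum-likelihood variance, floored for stability."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return math.log(max(var, 1e-12))


def delta_bic(xs, t, penalty_weight=1.0):
    """BIC gain from splitting xs at index t (positive favors a boundary)."""
    n = len(xs)
    left, right = xs[:t], xs[t:]
    gain = 0.5 * (n * _log_var(xs)
                  - len(left) * _log_var(left)
                  - len(right) * _log_var(right))
    # A second 1-D Gaussian adds two parameters (mean and variance).
    penalty = 0.5 * penalty_weight * 2 * math.log(n)
    return gain - penalty


def best_boundary(xs, margin=5, penalty_weight=1.0):
    """Return (index, score) of the best split, or (None, score) if none pays."""
    candidates = range(margin, len(xs) - margin + 1)
    score, t = max((delta_bic(xs, t, penalty_weight), t) for t in candidates)
    return (t, score) if score > 0 else (None, score)


# Toy feature track: a quiet setting followed by a louder one.
track = [0.0, 0.1] * 20 + [5.0, 5.1] * 20
print(best_boundary(track)[0])  # 40
```

Raising `penalty_weight` makes the test more conservative, which is one way to trade Correct Accept against False Accept.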
8 Segment Clustering
Daily activity has lots of repetition: Automatically cluster similar segments
  - affinity of segments as KL2 distances
[Figure: segment affinity display; labels not recoverable from the transcript]
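KL2 is the symmetrized Kullback-Leibler divergence. As a minimal sketch, assuming each segment is summarized by a diagonal-covariance Gaussian over its features (an assumption; the slide does not state the segment model), the distance has a closed form. The `affinity` mapping and its `scale` parameter are hypothetical, shown only as one common way to turn a distance into a similarity.

```python
import math


def kl2_diag_gauss(mean_p, var_p, mean_q, var_q):
    """Symmetric KL divergence, KL(p||q) + KL(q||p), for diagonal Gaussians."""
    total = 0.0
    for mp, vp, mq, vq in zip(mean_p, var_p, mean_q, var_q):
        d2 = (mp - mq) ** 2
        # The log-variance terms of the two directions cancel in the sum.
        total += 0.5 * ((vp + d2) / vq + (vq + d2) / vp) - 1.0
    return total


def affinity(distance, scale=1.0):
    """Map a distance to a similarity in (0, 1]; the form is an assumption."""
    return math.exp(-distance / scale)


# Two segments described by per-dimension feature means and variances.
cafe = ([0.0, 2.0], [1.0, 0.5])
street = ([1.0, 2.0], [1.0, 0.5])
d = kl2_diag_gauss(*cafe, *street)
print(d, affinity(d))
```

Identical segments give a distance of zero and an affinity of one, and the measure is symmetric in its two arguments.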
9 Clustering Results
Clustering of automatic segments gives anonymous classes
  - BIC criterion to choose number of clusters
  - make best correspondence to 16 GT clusters
Frame-level scoring gives ~70% correct
  - errors when same place has multiple ambiences
10 3. Speech Detection
Speech emerges as most interesting content
Just identifying speech would be useful
  - goal is speaker identification / labeling
Lots of background noise
  - conventional Voice Activity Detection inadequate
Insight: Listeners detect pitch track (melody)
  - look for voice-like periodicity in noise
[Figure: spectrogram of a coffeeshop excerpt; axes Frequency (tick at 4000) and Time]
11 Voice Periodicity Enhancement
Noise-robust subband autocorrelation
Subtract local average
  - suppresses steady background e.g. machine noise
15 min test set; 88% acc (79% w/o enhancement)
  - also for enhancing speech (harmonic filtering)
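The two steps named on this slide can be sketched in a simplified form, assuming a single band rather than the full subband analysis: a normalized autocorrelation measures how periodic a frame is, and subtracting a running average from a per-frame track removes a steady component. All function names and parameters are hypothetical, and the exact procedure used by the authors is not given in the slides.

```python
import math
import random


def normalized_autocorr(frame, lag):
    """Correlation between a frame and itself delayed by lag samples."""
    a = frame[:len(frame) - lag]
    b = frame[lag:]
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0 else 0.0


def periodicity(frame, min_lag, max_lag):
    """Return (strength, lag) of the strongest peak in the lag range."""
    return max((normalized_autocorr(frame, lag), lag)
               for lag in range(min_lag, max_lag + 1))


def subtract_local_average(values, half_width):
    """Subtract a running mean so steady components drop toward zero."""
    out = []
    for i, v in enumerate(values):
        window = values[max(0, i - half_width): i + half_width + 1]
        out.append(v - sum(window) / len(window))
    return out


# A periodic signal (period 40 samples) versus seeded white noise.
tone = [math.sin(2 * math.pi * n / 40) for n in range(400)]
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(400)]
print(periodicity(tone, 20, 60)[1])   # 40
print(periodicity(tone, 20, 60)[0] > periodicity(noise, 20, 60)[0])  # True
```

A constantly humming machine yields a periodicity track that barely changes over time, so the running-average subtraction flattens it, while intermittent voiced speech stands out.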
12 4. Repeating Events
Recurring sound events can be informative
  - indicate similar circumstance...
  - but: define event: sound organization
  - define recurring event: how similar?
  - .. and how to find them: tractable?
Idea: Use hashing (fingerprints)
  - index points to other occurrences of each hash; intersection of hashes points to match - much quicker search
  - use a fingerprint insensitive to background?
13 Shazam Fingerprints
Prominent spectral onsets are landmarks; Use relations {f1, f2, t} as hashes
[Figure: spectrogram of a phone ring with its Shazam fingerprint; frequency axis tick at 4000]
  - intrinsically robust to background noise
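The hashing idea can be sketched as follows, assuming spectral peaks have already been picked and are given as (time frame, frequency bin) pairs. Pairs of nearby peaks form hashes (f1, f2, dt), an inverted index maps each hash to the times it occurred, and a repeat shows up as many hashes voting for the same time offset. The names `landmark_hashes`, `build_index`, `best_offset`, `fan_out`, and `max_dt` are hypothetical; this is not the actual Shazam implementation.

```python
from collections import Counter, defaultdict


def landmark_hashes(peaks, fan_out=3, max_dt=50):
    """Pair each peak with up to fan_out later peaks: ((f1, f2, dt), t1)."""
    peaks = sorted(peaks)
    hashes = []
    for i, (t1, f1) in enumerate(peaks):
        paired = 0
        for t2, f2 in peaks[i + 1:]:
            dt = t2 - t1
            if dt > max_dt:
                break
            if dt == 0:
                continue
            hashes.append(((f1, f2, dt), t1))
            paired += 1
            if paired == fan_out:
                break
    return hashes


def build_index(peaks):
    """Inverted index from hash to the list of times where it occurred."""
    index = defaultdict(list)
    for h, t in landmark_hashes(peaks):
        index[h].append(t)
    return index


def best_offset(index, query_peaks):
    """Return (offset, votes) for the most consistent time offset, or None."""
    votes = Counter()
    for h, t_query in landmark_hashes(query_peaks):
        for t_ref in index.get(h, ()):
            votes[t_ref - t_query] += 1
    return votes.most_common(1)[0] if votes else None


# A toy event, the same event 100 frames into an archive, and a noisy query.
event = [(0, 10), (5, 20), (12, 15), (20, 30), (33, 12), (40, 25)]
archive = [(60, 40), (70, 22)] + [(t + 100, f) for t, f in event]
query = event + [(3, 99), (18, 77)]  # two spurious background peaks
print(best_offset(build_index(archive), query))
```

Spurious peaks displace some pairs but most hashes survive, which is the sense in which the scheme tolerates background noise; a larger `fan_out` buys more redundancy at the cost of a bigger index.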
14 Exhaustive Search for Repeats
More selective hashes
  - few hits required to confirm match (faster; better precision)
  - but less robust to background (reduce recall)
Works well when exact structure repeats
  - recorded music, electronic alerts
  - no good for organic sounds e.g. garage door
15 5. Future: Browsing Tools / Diary interface
Browsing
  - links to other information (diary, , photos)
  - synchronize with note taking? (Stifelman & Arons)
Release Tools + how to for capture
[Figure: diary-style browsing interface; labels not recoverable from the transcript]
16 Future: Speech Recognition
Most audio is too noisy for standard ASR
  - actually reassuring for privacy issues
But... similar to Meeting Recordings
  - NIST distant microphone conditions
Speech enhancement - directional filtering
  - 2 channels a big improvement over one...
  - use a more special-purpose directional mic?
17 Privacy and Security
Recordings are controversial
  - privacy expectations: speech should be ephemeral?
  - Oops button, delayed review (Roy)
  - subpoenas... (Golubchik)
Access to recordings is very sensitive
  - .. but preservation is important too
Approaches
  - don't store intelligible audio .. but lessens utility - maybe store ASR output?
  - split and store on multiple machines - tiered, distributed trust/access protocols
Big issue!
18 Conclusions
Personal Audio is easy & cheap to collect
  - but is it any use?
Segmentation/clustering works well
Voice detection in noise is harder
  - prospects for speaker identification
Hashing to find arbitrary repeating events
Tools distribution as a goal