A Brief Overview of Audio Information Retrieval. Unjung Nam CCRMA Stanford University
|
|
- Donald Nash
- 6 years ago
- Views:
Transcription
1 A Brief Overview of Audio Information Retrieval Unjung Nam CCRMA Stanford University 1
2 Outline What is AIR? Motivation Related Field of Research Elements of AIR Experiments and discussion Music Classification System December 2000 by Unjung Nam 2
3 What is AIR? Audio Information Retrieval (AIR): Audio Information: Speech, Music, Natural sounds, etc. To develop various methods in order to recognize the audio information Audio Human Speech? Music? Animal Sound? Clock Alarm? : Computer December 2000 by Unjung Nam 3
4 What is AIR? Applications Speech-related retrieval Recognizing and Transcribing the content of Radio programs, Telephone conversations, Recorded Meetings Music-related retrieval Music similarity, Music style classification, Instrument recognition Others audio retrieval applications Alarms, animal sounds, natural sounds, etc. December 2000 by Unjung Nam 4
5 What is AIR? Muscle Fish Audio Retrieval December 2000 by Unjung Nam 5
6 Motivation Multimedia Information stored on computer systems increases due to Internet. Multimedia database is classified/retrieved in a manual process which is often subjective and inaccurate when describing audio. Multimedia database should be handled by the methods of automatic analysis, segmentation, indexing and retrieval. December 2000 by Unjung Nam 6
7 Related Field of Research Automatic Speech Recognition Overview of the process Audio/Video Classification Video Mail Retrieval Computer Vision Image Retrieval, Face Recognition Multimedia Database Management December 2000 by Unjung Nam 7
8 Automatic Speech Recognition: Overview December 2000 by Unjung Nam 8
9 Audio/Video Classification: Video Mail Retrieval December 2000 by Unjung Nam 9
10 Computer Vision: Image Retrieval December 2000 by Unjung Nam 10
11 Computer Vision: Face Recognition December 2000 by Unjung Nam 11
12 Multimedia Database Management: Characteristics of Multimedia Information Retrieval Content based retrieval Automatic Indexing Similarity Matching Similar content may have different representation Data filtering rather than exact matching (data selection) Browsing and Relevance feedback No ideal mathematical model for defining similarity, human feedback is required December 2000 by Unjung Nam 12
13 Elements of AIR Building Classification Database Audio Signal Feature Extraction Feature Classification/Clustering Retrieval of Best Matching Audio Audio Input Feature Extraction Projection to Model Space Find Matching Model Space Classification Model Space Retrieval of Best Match December 2000 by Unjung Nam 13
14 Feature Extraction Time domain feature modules Short-Time Energy and Average Magnitude Short-Time Average Zero-Crossing Rate Linear Prediction Pulse metric, etc. Spectral domain feature models STFT Spectral Centroid Harmony Analysis MFCC Constant Q, etc. December 2000 by Unjung Nam 14
15 Classification/Clustering Methods Deterministic Methods Minimum Distance Classifier k-nearest neighbor (k-nn) Discriminant functions Generalized Discriminators, etc. Statistical Methods Class-related Probability Functions Minimum Error Classification Likelihood-based MAP Classification Approximating a Bayes Classifier Parameterization and Probability Estimation: Hidden Markov Model, etc. December 2000 by Unjung Nam 15
16 Experiments: Music Classification System Genre category File No. Filename Duration MATLAB Graphic notation (secs.) Jazz 1 dance.wav 11 go 2 rocky.wav 13 gx 3 band.wav 46 g+ Pop/Rock 4 queen.wav 3 gd 5 susqnet.wav 16 * 6 pop.wav 26 v 7 latin.wav 26 * 8 latin2.wav reggae.wav 26 d 10 reggae2.wav 26 ^ Classic 11 phantom.wav 6 ro 12 quartet4.wav 10 rv 13 quartet3.wav 7 rx 14 quartet2.wav 8 r* 15 quartet1.wav 10 r^ 16 musicnight.wav 14 r< 17 angel.wav 9 r> 18 piacel.wav 8 r+ 19 piavio.wav 7 rx 20 piavio1.wav 7 rs Test Signal b3.wav 7 k. December 2000 by Unjung Nam 16
17 Experiments: Feature Modules Spectral Centroid Center Gravity of Spectrum: Brightness of a sound Sound File Preprocessing frame STFT Input & N Windowing Number of N Spectral Centroid The individual centroid of a spectral frame is defined as the average frequency weighted by amplitudes, divided by the sum of the amplitudes, or: Spectral Centroid Here, F [k] is the amplitude corresponding to bin k in DFT spectrum. N k = 1 = N k = 1 kf[ k] F[ k] December 2000 by Unjung Nam 17
18 Experiments: Spectral Centroid Jazz Pop/Rock Classic December 2000 by Unjung Nam 18
19 Experiments: Spectral Centroid The weighted average spectral centroid of each frames in 20 sound files. Green: Blue: Red: jazz pop/rock classic pop/rock higher than classical classical fluctuating alot Spectral Centroid December 2000 by Unjung Nam 19
20 Experiments: Feature Modules Short-Time Energy Function Amplitude variation over the time Rhythm and periodicity information Sound File Input Preprocessing & Windowing frame N Number of N Energy Change En = [ x( m) w( n m ] 2 1 ) N m 1, w( x) = 0, 0 x N 1, otherwise. Where x(m) is the discrete time audio signal, n is time index of the short-time energy, and w(m) is a rectangle window December 2000 by Unjung Nam 20
21 Experiments: Short-Time Energy Function Jazz: dance.wav December 2000 by Unjung Nam 21
22 Experiments: Short-Time Energy Function The pop/rock music samples show the most fluctuating energy while classical music samples shows stable energy fluctuation. The energy changes of jazz samples seem to show medium fluctuation. December 2000 by Unjung Nam 22
23 Experiments: Feature Modules Short-Time Average Zero Crossing Rate Zero-Crossing Rate (ZCR) is a measure of how often the signal crosses zero per unit time. occur if successive samples have different signs. The rate at which zero-crossings occur is a simple measure of the frequency content of a signal. w(n) is a rectangle window of length N 1 Zn = sgn 2 where m sgn [ x( m) ] sgn[ x( m 1) ] [ x( n) ] 1, = 1, x( n) 0, x( n) < 0, w( n m), December 2000 by Unjung Nam 23
24 Experiments: Short-Time Average ZCR Classic: quartet4.wav December 2000 by Unjung Nam 24
25 Experiments: Short-Time Average ZCR It doesn t seem to show any indication of classifying the three different music genres. December 2000 by Unjung Nam 25
26 Experiments: Classification/Clustering Methods K-means clustering algorithm K-means cluster analysis programs begin by creating the K clusters according to some arbitrary procedure. The program calculates the means or centroids of each of the clusters. If one of the observations is closer to the centroid of another cluster, then the observation is made a member of that cluster. K-Nearest Neighbour Classifier (KNN) to classify a feature space with a given set of sample data by evaluating the k nearest sample points of each point in the feature space. December 2000 by Unjung Nam 26
27 Experiments: Classification/Clustering Methods K-means clustering KNN classifier December 2000 by Unjung Nam 27
28 Experiments: Classification/Clustering Methods Feature vectors of the 20 input files get extracted in each frame and got plotted. red: classical blue: pop/rock green: jazz The first figure: spectral centroid in x-axis and short-time energy in y axis. The second: shorttime energy in x-axis and short-time ZCR in y- axis.the third figure: 3 dimensional space December 2000 by Unjung Nam 28
29 Experiments: Classification/Clustering Methods The means of feature vectors of 20 music samples are plotted in 3 dimensional space December 2000 by Unjung Nam 29
30 Experiments: Classification/Clustering Methods The feature vectors of the test signal b3.wav gets plotted as black dots in 2 dimensional space. December 2000 by Unjung Nam 30
31 Experiments: Classification/Clustering Methods The nearest neighbours are determined using Euclidean distance. Each mean of the 20 sound samples gets the predicted class labels as an index 1 to 20. Each of the feature vectors of the test signal is assigned to one of 20 means. The test signal in the figure above determined that the number of feature vectors assigned to 13 is the greatest. The following describes the result in MATLAB. Test 13. Quartet3.wav >> classfier ans = class is 13 >> classpoint classpoint = >> % 27 feature vectors assigned to 13 th class December 2000 by Unjung Nam 31
32 Experiments: Discussion Though the test signal and the quartet3.wav are not fell into a same category, they sounded similar in terms of rhythm and tempo information. It seems that this system doesn t effectively classify the timbre information. The number of feature modules are limited in this system. The variance factor of the feature vectors is not considered. Need to experiment with more samples. December 2000 by Unjung Nam 32
Automatic Classification of Audio Data
Automatic Classification of Audio Data Carlos H. C. Lopes, Jaime D. Valle Jr. & Alessandro L. Koerich IEEE International Conference on Systems, Man and Cybernetics The Hague, The Netherlands October 2004
More informationThe Automatic Musicologist
The Automatic Musicologist Douglas Turnbull Department of Computer Science and Engineering University of California, San Diego UCSD AI Seminar April 12, 2004 Based on the paper: Fast Recognition of Musical
More information1 Introduction. 3 Data Preprocessing. 2 Literature Review
Rock or not? This sure does. [Category] Audio & Music CS 229 Project Report Anand Venkatesan(anand95), Arjun Parthipan(arjun777), Lakshmi Manoharan(mlakshmi) 1 Introduction Music Genre Classification continues
More informationMultimedia Database Systems. Retrieval by Content
Multimedia Database Systems Retrieval by Content MIR Motivation Large volumes of data world-wide are not only based on text: Satellite images (oil spill), deep space images (NASA) Medical images (X-rays,
More informationCase-Based Reasoning. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. Parametric / Non-parametric.
CS 188: Artificial Intelligence Fall 2008 Lecture 25: Kernels and Clustering 12/2/2008 Dan Klein UC Berkeley Case-Based Reasoning Similarity for classification Case-based reasoning Predict an instance
More informationCS 188: Artificial Intelligence Fall 2008
CS 188: Artificial Intelligence Fall 2008 Lecture 25: Kernels and Clustering 12/2/2008 Dan Klein UC Berkeley 1 1 Case-Based Reasoning Similarity for classification Case-based reasoning Predict an instance
More informationMusic Genre Classification
Music Genre Classification Matthew Creme, Charles Burlin, Raphael Lenain Stanford University December 15, 2016 Abstract What exactly is it that makes us, humans, able to tell apart two songs of different
More informationText classification II CE-324: Modern Information Retrieval Sharif University of Technology
Text classification II CE-324: Modern Information Retrieval Sharif University of Technology M. Soleymani Fall 2015 Some slides have been adapted from: Profs. Manning, Nayak & Raghavan (CS-276, Stanford)
More informationCHAPTER 8 Multimedia Information Retrieval
CHAPTER 8 Multimedia Information Retrieval Introduction Text has been the predominant medium for the communication of information. With the availability of better computing capabilities such as availability
More informationAudio Classification and Content Description
2004:074 MASTER S THESIS Audio Classification and Content Description TOBIAS ANDERSSON MASTER OF SCIENCE PROGRAMME Department of Computer Science and Electrical Engineering Division of Signal Processing
More informationKernels and Clustering
Kernels and Clustering Robert Platt Northeastern University All slides in this file are adapted from CS188 UC Berkeley Case-Based Learning Non-Separable Data Case-Based Reasoning Classification from similarity
More informationDietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++
Dietrich Paulus Joachim Hornegger Pattern Recognition of Images and Speech in C++ To Dorothea, Belinda, and Dominik In the text we use the following names which are protected, trademarks owned by a company
More informationAvailable online Journal of Scientific and Engineering Research, 2016, 3(4): Research Article
Available online www.jsaer.com, 2016, 3(4):417-422 Research Article ISSN: 2394-2630 CODEN(USA): JSERBR Automatic Indexing of Multimedia Documents by Neural Networks Dabbabi Turkia 1, Lamia Bouafif 2, Ellouze
More informationCSE 573: Artificial Intelligence Autumn 2010
CSE 573: Artificial Intelligence Autumn 2010 Lecture 16: Machine Learning Topics 12/7/2010 Luke Zettlemoyer Most slides over the course adapted from Dan Klein. 1 Announcements Syllabus revised Machine
More informationCS 343: Artificial Intelligence
CS 343: Artificial Intelligence Kernels and Clustering Prof. Scott Niekum The University of Texas at Austin [These slides based on those of Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley.
More informationMachine Learning and Pervasive Computing
Stephan Sigg Georg-August-University Goettingen, Computer Networks 17.12.2014 Overview and Structure 22.10.2014 Organisation 22.10.3014 Introduction (Def.: Machine learning, Supervised/Unsupervised, Examples)
More informationAdaptive Gesture Recognition System Integrating Multiple Inputs
Adaptive Gesture Recognition System Integrating Multiple Inputs Master Thesis - Colloquium Tobias Staron University of Hamburg Faculty of Mathematics, Informatics and Natural Sciences Technical Aspects
More informationDUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING
DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING Christopher Burges, Daniel Plastina, John Platt, Erin Renshaw, and Henrique Malvar March 24 Technical Report MSR-TR-24-19 Audio fingerprinting
More informationCS229 Final Project: Audio Query By Gesture
CS229 Final Project: Audio Query By Gesture by Steinunn Arnardottir, Luke Dahl and Juhan Nam {steinunn,lukedahl,juhan}@ccrma.stanford.edu December 2, 28 Introduction In the field of Music Information Retrieval
More informationPattern Recognition Chapter 3: Nearest Neighbour Algorithms
Pattern Recognition Chapter 3: Nearest Neighbour Algorithms Asst. Prof. Dr. Chumphol Bunkhumpornpat Department of Computer Science Faculty of Science Chiang Mai University Learning Objectives What a nearest
More informationObject Recognition Using Pictorial Structures. Daniel Huttenlocher Computer Science Department. In This Talk. Object recognition in computer vision
Object Recognition Using Pictorial Structures Daniel Huttenlocher Computer Science Department Joint work with Pedro Felzenszwalb, MIT AI Lab In This Talk Object recognition in computer vision Brief definition
More informationClustering & Classification (chapter 15)
Clustering & Classification (chapter 5) Kai Goebel Bill Cheetham RPI/GE Global Research goebel@cs.rpi.edu cheetham@cs.rpi.edu Outline k-means Fuzzy c-means Mountain Clustering knn Fuzzy knn Hierarchical
More informationCSC411/2515 Tutorial: K-NN and Decision Tree
CSC411/2515 Tutorial: K-NN and Decision Tree Mengye Ren csc{411,2515}ta@cs.toronto.edu September 25, 2016 Cross-validation K-nearest-neighbours Decision Trees Review: Motivation for Validation Framework:
More informationClassification: Feature Vectors
Classification: Feature Vectors Hello, Do you want free printr cartriges? Why pay more when you can get them ABSOLUTELY FREE! Just # free YOUR_NAME MISSPELLED FROM_FRIEND... : : : : 2 0 2 0 PIXEL 7,12
More informationCAMCOS Report Day. December 9 th, 2015 San Jose State University Project Theme: Classification
CAMCOS Report Day December 9 th, 2015 San Jose State University Project Theme: Classification On Classification: An Empirical Study of Existing Algorithms based on two Kaggle Competitions Team 1 Team 2
More informationMathematics of Data. INFO-4604, Applied Machine Learning University of Colorado Boulder. September 5, 2017 Prof. Michael Paul
Mathematics of Data INFO-4604, Applied Machine Learning University of Colorado Boulder September 5, 2017 Prof. Michael Paul Goals In the intro lecture, every visualization was in 2D What happens when we
More informationOn Classification: An Empirical Study of Existing Algorithms Based on Two Kaggle Competitions
On Classification: An Empirical Study of Existing Algorithms Based on Two Kaggle Competitions CAMCOS Report Day December 9th, 2015 San Jose State University Project Theme: Classification The Kaggle Competition
More informationAn Introduction to Pattern Recognition
An Introduction to Pattern Recognition Speaker : Wei lun Chao Advisor : Prof. Jian-jiun Ding DISP Lab Graduate Institute of Communication Engineering 1 Abstract Not a new research field Wide range included
More informationk-nn classification & Statistical Pattern Recognition
k-nn classification & Statistical Pattern Recognition Andreas C. Kapourani (Credit: Hiroshi Shimodaira) February 27 k-nn classification In classification, the data consist of a training set and a test
More informationNearest Neighbor Classification
Nearest Neighbor Classification Professor Ameet Talwalkar Professor Ameet Talwalkar CS260 Machine Learning Algorithms January 11, 2017 1 / 48 Outline 1 Administration 2 First learning algorithm: Nearest
More informationMachine Learning. Nonparametric methods for Classification. Eric Xing , Fall Lecture 2, September 12, 2016
Machine Learning 10-701, Fall 2016 Nonparametric methods for Classification Eric Xing Lecture 2, September 12, 2016 Reading: 1 Classification Representing data: Hypothesis (classifier) 2 Clustering 3 Supervised
More informationNearest Neighbor Classification
Nearest Neighbor Classification Charles Elkan elkan@cs.ucsd.edu October 9, 2007 The nearest-neighbor method is perhaps the simplest of all algorithms for predicting the class of a test example. The training
More informationNon-Parametric Modeling
Non-Parametric Modeling CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Introduction Non-Parametric Density Estimation Parzen Windows Kn-Nearest Neighbor
More informationCOSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor
COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality
More informationComparative Analysis of Machine Learning Algorithms for Audio Signals Classification
IJCSNS International Journal of Computer Science and Network Security, VOL.15 No.6, June 2015 49 Comparative Analysis of Machine Learning Algorithms for Audio Signals Classification Poonam Mahana Department
More informationComputer Vision. Exercise Session 10 Image Categorization
Computer Vision Exercise Session 10 Image Categorization Object Categorization Task Description Given a small number of training images of a category, recognize a-priori unknown instances of that category
More informationTWO-STEP SEMI-SUPERVISED APPROACH FOR MUSIC STRUCTURAL CLASSIFICATION. Prateek Verma, Yang-Kai Lin, Li-Fan Yu. Stanford University
TWO-STEP SEMI-SUPERVISED APPROACH FOR MUSIC STRUCTURAL CLASSIFICATION Prateek Verma, Yang-Kai Lin, Li-Fan Yu Stanford University ABSTRACT Structural segmentation involves finding hoogeneous sections appearing
More informationGene Clustering & Classification
BINF, Introduction to Computational Biology Gene Clustering & Classification Young-Rae Cho Associate Professor Department of Computer Science Baylor University Overview Introduction to Gene Clustering
More informationMachine Perception of Music & Audio. Topic 10: Classification
Machine Perception of Music & Audio Topic 10: Classification 1 Classification Label objects as members of sets Things on the left Things on the right There is a set of possible examples Each example is
More informationAccelerometer Gesture Recognition
Accelerometer Gesture Recognition Michael Xie xie@cs.stanford.edu David Pan napdivad@stanford.edu December 12, 2014 Abstract Our goal is to make gesture-based input for smartphones and smartwatches accurate
More informationAutomatic Colorization of Grayscale Images
Automatic Colorization of Grayscale Images Austin Sousa Rasoul Kabirzadeh Patrick Blaes Department of Electrical Engineering, Stanford University 1 Introduction ere exists a wealth of photographic images,
More informationAudio Retrieval Using Multiple Feature Vectors
Audio Retrieval Using Multiple Vectors Vaishali Nandedkar Computer Dept., JSPM, Pune, India E-mail- Vaishu111@yahoo.com Abstract Content Based Audio Retrieval system is very helpful to facilitate users
More informationPattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures
Pattern recognition Classification/Clustering GW Chapter 12 (some concepts) Textures Patterns and pattern classes Pattern: arrangement of descriptors Descriptors: features Patten class: family of patterns
More informationA Comparative Study of Conventional and Neural Network Classification of Multispectral Data
A Comparative Study of Conventional and Neural Network Classification of Multispectral Data B.Solaiman & M.C.Mouchot Ecole Nationale Supérieure des Télécommunications de Bretagne B.P. 832, 29285 BREST
More informationClassification Algorithms in Data Mining
August 9th, 2016 Suhas Mallesh Yash Thakkar Ashok Choudhary CIS660 Data Mining and Big Data Processing -Dr. Sunnie S. Chung Classification Algorithms in Data Mining Deciding on the classification algorithms
More informationSYDE 372 Introduction to Pattern Recognition. Distance Measures for Pattern Classification: Part I
SYDE 372 Introduction to Pattern Recognition Distance Measures for Pattern Classification: Part I Alexander Wong Department of Systems Design Engineering University of Waterloo Outline Distance Measures
More informationExperiments in computer-assisted annotation of audio
Experiments in computer-assisted annotation of audio George Tzanetakis Computer Science Dept. Princeton University en St. Princeton, NJ 844 USA +1 69 8 491 gtzan@cs.princeton.edu Perry R. Cook Computer
More informationMODULE 7 Nearest Neighbour Classifier and its variants LESSON 11. Nearest Neighbour Classifier. Keywords: K Neighbours, Weighted, Nearest Neighbour
MODULE 7 Nearest Neighbour Classifier and its variants LESSON 11 Nearest Neighbour Classifier Keywords: K Neighbours, Weighted, Nearest Neighbour 1 Nearest neighbour classifiers This is amongst the simplest
More informationData Mining. 3.5 Lazy Learners (Instance-Based Learners) Fall Instructor: Dr. Masoud Yaghini. Lazy Learners
Data Mining 3.5 (Instance-Based Learners) Fall 2008 Instructor: Dr. Masoud Yaghini Outline Introduction k-nearest-neighbor Classifiers References Introduction Introduction Lazy vs. eager learning Eager
More information3D Object Recognition using Multiclass SVM-KNN
3D Object Recognition using Multiclass SVM-KNN R. Muralidharan, C. Chandradekar April 29, 2014 Presented by: Tasadduk Chowdhury Problem We address the problem of recognizing 3D objects based on various
More informationPattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures
Pattern recognition Classification/Clustering GW Chapter 12 (some concepts) Textures Patterns and pattern classes Pattern: arrangement of descriptors Descriptors: features Patten class: family of patterns
More informationAnalyzing Vocal Patterns to Determine Emotion Maisy Wieman, Andy Sun
Analyzing Vocal Patterns to Determine Emotion Maisy Wieman, Andy Sun 1. Introduction The human voice is very versatile and carries a multitude of emotions. Emotion in speech carries extra insight about
More information10-701/15-781, Fall 2006, Final
-7/-78, Fall 6, Final Dec, :pm-8:pm There are 9 questions in this exam ( pages including this cover sheet). If you need more room to work out your answer to a question, use the back of the page and clearly
More informationMS1b Statistical Data Mining Part 3: Supervised Learning Nonparametric Methods
MS1b Statistical Data Mining Part 3: Supervised Learning Nonparametric Methods Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Supervised Learning: Nonparametric
More informationOptimizing feature representation for speaker diarization using PCA and LDA
Optimizing feature representation for speaker diarization using PCA and LDA itsikv@netvision.net.il Jean-Francois Bonastre jean-francois.bonastre@univ-avignon.fr Outline Speaker Diarization what is it?
More informationMachine learning Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures
Machine learning Pattern recognition Classification/Clustering GW Chapter 12 (some concepts) Textures Patterns and pattern classes Pattern: arrangement of descriptors Descriptors: features Patten class:
More informationFACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU
FACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU 1. Introduction Face detection of human beings has garnered a lot of interest and research in recent years. There are quite a few relatively
More informationFeature Extractors. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. The Perceptron Update Rule.
CS 188: Artificial Intelligence Fall 2007 Lecture 26: Kernels 11/29/2007 Dan Klein UC Berkeley Feature Extractors A feature extractor maps inputs to feature vectors Dear Sir. First, I must solicit your
More informationIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence COMP307 Machine Learning 2: 3-K Techniques Yi Mei yi.mei@ecs.vuw.ac.nz 1 Outline K-Nearest Neighbour method Classification (Supervised learning) Basic NN (1-NN)
More informationDynamic Time Warping
Centre for Vision Speech & Signal Processing University of Surrey, Guildford GU2 7XH. Dynamic Time Warping Dr Philip Jackson Acoustic features Distance measures Pattern matching Distortion penalties DTW
More informationDETECTING INDOOR SOUND EVENTS
DETECTING INDOOR SOUND EVENTS Toma TELEMBICI, Lacrimioara GRAMA Signal Processing Group, Basis of Electronics Department, Faculty of Electronics, Telecommunications and Information Technology, Technical
More informationRobustness and independence of voice timbre features under live performance acoustic degradations
Robustness and independence of voice timbre features under live performance acoustic degradations Dan Stowell and Mark Plumbley dan.stowell@elec.qmul.ac.uk Centre for Digital Music Queen Mary, University
More informationSupervised vs unsupervised clustering
Classification Supervised vs unsupervised clustering Cluster analysis: Classes are not known a- priori. Classification: Classes are defined a-priori Sometimes called supervised clustering Extract useful
More informationSearch Engines. Information Retrieval in Practice
Search Engines Information Retrieval in Practice All slides Addison Wesley, 2008 Classification and Clustering Classification and clustering are classical pattern recognition / machine learning problems
More informationGenerative and discriminative classification techniques
Generative and discriminative classification techniques Machine Learning and Category Representation 2014-2015 Jakob Verbeek, November 28, 2014 Course website: http://lear.inrialpes.fr/~verbeek/mlcr.14.15
More informationSEMANTIC COMPUTING. Lecture 8: Introduction to Deep Learning. TU Dresden, 7 December Dagmar Gromann International Center For Computational Logic
SEMANTIC COMPUTING Lecture 8: Introduction to Deep Learning Dagmar Gromann International Center For Computational Logic TU Dresden, 7 December 2018 Overview Introduction Deep Learning General Neural Networks
More informationIntroduction to Pattern Recognition Part II. Selim Aksoy Bilkent University Department of Computer Engineering
Introduction to Pattern Recognition Part II Selim Aksoy Bilkent University Department of Computer Engineering saksoy@cs.bilkent.edu.tr RETINA Pattern Recognition Tutorial, Summer 2005 Overview Statistical
More informationLast week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints
Last week Multi-Frame Structure from Motion: Multi-View Stereo Unknown camera viewpoints Last week PCA Today Recognition Today Recognition Recognition problems What is it? Object detection Who is it? Recognizing
More informationNetwork Traffic Measurements and Analysis
DEIB - Politecnico di Milano Fall, 2017 Introduction Often, we have only a set of features x = x 1, x 2,, x n, but no associated response y. Therefore we are not interested in prediction nor classification,
More informationECG782: Multidimensional Digital Signal Processing
ECG782: Multidimensional Digital Signal Processing Object Recognition http://www.ee.unlv.edu/~b1morris/ecg782/ 2 Outline Knowledge Representation Statistical Pattern Recognition Neural Networks Boosting
More informationA Framework for Efficient Fingerprint Identification using a Minutiae Tree
A Framework for Efficient Fingerprint Identification using a Minutiae Tree Praveer Mansukhani February 22, 2008 Problem Statement Developing a real-time scalable minutiae-based indexing system using a
More informationDistribution-free Predictive Approaches
Distribution-free Predictive Approaches The methods discussed in the previous sections are essentially model-based. Model-free approaches such as tree-based classification also exist and are popular for
More information11/2/2017 MIST.6060 Business Intelligence and Data Mining 1. Clustering. Two widely used distance metrics to measure the distance between two records
11/2/2017 MIST.6060 Business Intelligence and Data Mining 1 An Example Clustering X 2 X 1 Objective of Clustering The objective of clustering is to group the data into clusters such that the records within
More informationMACHINE LEARNING: CLUSTERING, AND CLASSIFICATION. Steve Tjoa June 25, 2014
MACHINE LEARNING: CLUSTERING, AND CLASSIFICATION Steve Tjoa kiemyang@gmail.com June 25, 2014 Review from Day 2 Supervised vs. Unsupervised Unsupervised - clustering Supervised binary classifiers (2 classes)
More informationNaïve Bayes for text classification
Road Map Basic concepts Decision tree induction Evaluation of classifiers Rule induction Classification using association rules Naïve Bayesian classification Naïve Bayes for text classification Support
More informationLarge Scale Data Analysis Using Deep Learning
Large Scale Data Analysis Using Deep Learning Machine Learning Basics - 1 U Kang Seoul National University U Kang 1 In This Lecture Overview of Machine Learning Capacity, overfitting, and underfitting
More informationCOMPUTER AND ROBOT VISION
VOLUME COMPUTER AND ROBOT VISION Robert M. Haralick University of Washington Linda G. Shapiro University of Washington A^ ADDISON-WESLEY PUBLISHING COMPANY Reading, Massachusetts Menlo Park, California
More informationWorkshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards
Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards Jürgen Herre for Integrated Circuits (FhG-IIS) Erlangen, Germany Jürgen Herre, hrr@iis.fhg.de Page 1 Overview Extracting meaning
More informationCS178: Machine Learning and Data Mining. Complexity & Nearest Neighbor Methods
+ CS78: Machine Learning and Data Mining Complexity & Nearest Neighbor Methods Prof. Erik Sudderth Some materials courtesy Alex Ihler & Sameer Singh Machine Learning Complexity and Overfitting Nearest
More informationFace identification system using MATLAB
Project Report ECE 09.341 Section #3: Final Project 15 December 2017 Face identification system using MATLAB Stephen Glass Electrical & Computer Engineering, Rowan University Table of Contents Introduction
More informationAnnouncements. CS 188: Artificial Intelligence Spring Classification: Feature Vectors. Classification: Weights. Learning: Binary Perceptron
CS 188: Artificial Intelligence Spring 2010 Lecture 24: Perceptrons and More! 4/20/2010 Announcements W7 due Thursday [that s your last written for the semester!] Project 5 out Thursday Contest running
More informationRecognizing Handwritten Digits Using the LLE Algorithm with Back Propagation
Recognizing Handwritten Digits Using the LLE Algorithm with Back Propagation Lori Cillo, Attebury Honors Program Dr. Rajan Alex, Mentor West Texas A&M University Canyon, Texas 1 ABSTRACT. This work is
More informationA GENERIC SYSTEM FOR AUDIO INDEXING: APPLICATION TO SPEECH/ MUSIC SEGMENTATION AND MUSIC GENRE RECOGNITION
A GENERIC SYSTEM FOR AUDIO INDEXING: APPLICATION TO SPEECH/ MUSIC SEGMENTATION AND MUSIC GENRE RECOGNITION Geoffroy Peeters IRCAM - Sound Analysis/Synthesis Team, CNRS - STMS Paris, France peeters@ircam.fr
More informationA Taxonomy of Semi-Supervised Learning Algorithms
A Taxonomy of Semi-Supervised Learning Algorithms Olivier Chapelle Max Planck Institute for Biological Cybernetics December 2005 Outline 1 Introduction 2 Generative models 3 Low density separation 4 Graph
More informationText-Independent Speaker Identification
December 8, 1999 Text-Independent Speaker Identification Til T. Phan and Thomas Soong 1.0 Introduction 1.1 Motivation The problem of speaker identification is an area with many different applications.
More informationHighlights Extraction from Unscripted Video
Highlights Extraction from Unscripted Video T 61.6030, Multimedia Retrieval Seminar presentation 04.04.2008 Harrison Mfula Helsinki University of Technology Department of Computer Science, Espoo, Finland
More informationA framework for audio analysis
MARSYAS: A framework for audio analysis George Tzanetakis 1 Department of Computer Science Princeton University Perry Cook 2 Department of Computer Science 3 and Department of Music Princeton University
More informationShort Survey on Static Hand Gesture Recognition
Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of
More informationAn Environmental Audio Based Context Recognition System Using Smartphones
University of Twente Master Thesis An Environmental Audio Based Context Recognition System Using Smartphones Author: Gebremedhin T. Abreha Supervisor: Dr. Nirvana Meratnia Committee: Prof. Paul Havinga
More informationCSCI567 Machine Learning (Fall 2014)
CSCI567 Machine Learning (Fall 2014) Drs. Sha & Liu {feisha,yanliu.cs}@usc.edu September 9, 2014 Drs. Sha & Liu ({feisha,yanliu.cs}@usc.edu) CSCI567 Machine Learning (Fall 2014) September 9, 2014 1 / 47
More informationFeature Extractors. CS 188: Artificial Intelligence Fall Some (Vague) Biology. The Binary Perceptron. Binary Decision Rule.
CS 188: Artificial Intelligence Fall 2008 Lecture 24: Perceptrons II 11/24/2008 Dan Klein UC Berkeley Feature Extractors A feature extractor maps inputs to feature vectors Dear Sir. First, I must solicit
More informationImplementation of Speech Based Stress Level Monitoring System
4 th International Conference on Computing, Communication and Sensor Network, CCSN2015 Implementation of Speech Based Stress Level Monitoring System V.Naveen Kumar 1,Dr.Y.Padma sai 2, K.Sonali Swaroop
More informationMultimedia Event Detection for Large Scale Video. Benjamin Elizalde
Multimedia Event Detection for Large Scale Video Benjamin Elizalde Outline Motivation TrecVID task Related work Our approach (System, TF/IDF) Results & Processing time Conclusion & Future work Agenda 2
More informationPractical Image and Video Processing Using MATLAB
Practical Image and Video Processing Using MATLAB Chapter 18 Feature extraction and representation What will we learn? What is feature extraction and why is it a critical step in most computer vision and
More informationUSING OF THE K NEAREST NEIGHBOURS ALGORITHM (k-nns) IN THE DATA CLASSIFICATION
USING OF THE K NEAREST NEIGHBOURS ALGORITHM (k-nns) IN THE DATA CLASSIFICATION Gîlcă Natalia, Roșia de Amaradia Technological High School, Gorj, ROMANIA Gîlcă Gheorghe, Constantin Brîncuși University from
More informationSpectral modeling of musical sounds
Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer
More informationStructured Perceptron. Ye Qiu, Xinghui Lu, Yue Lu, Ruofei Shen
Structured Perceptron Ye Qiu, Xinghui Lu, Yue Lu, Ruofei Shen 1 Outline 1. 2. 3. 4. Brief review of perceptron Structured Perceptron Discriminative Training Methods for Hidden Markov Models: Theory and
More informationAnnouncements. CS 188: Artificial Intelligence Spring Generative vs. Discriminative. Classification: Feature Vectors. Project 4: due Friday.
CS 188: Artificial Intelligence Spring 2011 Lecture 21: Perceptrons 4/13/2010 Announcements Project 4: due Friday. Final Contest: up and running! Project 5 out! Pieter Abbeel UC Berkeley Many slides adapted
More informationReal-time Object Detection CS 229 Course Project
Real-time Object Detection CS 229 Course Project Zibo Gong 1, Tianchang He 1, and Ziyi Yang 1 1 Department of Electrical Engineering, Stanford University December 17, 2016 Abstract Objection detection
More informationSequence representation of Music Structure using Higher-Order Similarity Matrix and Maximum likelihood approach
Sequence representation of Music Structure using Higher-Order Similarity Matrix and Maximum likelihood approach G. Peeters IRCAM (Sound Analysis/Synthesis Team) - CNRS (STMS) 1 supported by the RIAM Ecoute
More information