Intro to Artificial Intelligence

Similar documents
Classification. Vladimir Curic. Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University

COMP 551 Applied Machine Learning Lecture 13: Unsupervised learning

Mathematics of Data. INFO-4604, Applied Machine Learning University of Colorado Boulder. September 5, 2017 Prof. Michael Paul

PARALLEL CLASSIFICATION ALGORITHMS

Unsupervised Learning : Clustering

COMS 4771 Clustering. Nakul Verma

ECG782: Multidimensional Digital Signal Processing

Nonparametric Importance Sampling for Big Data

Overview Citation. ML Introduction. Overview Schedule. ML Intro Dataset. Introduction to Semi-Supervised Learning Review 10/4/2010

Jarek Szlichta

CS839: Probabilistic Graphical Models. Lecture 10: Learning with Partially Observed Data. Theo Rekatsinas

Cluster Analysis: Agglomerate Hierarchical Clustering

K-Nearest Neighbour Classifier. Izabela Moise, Evangelos Pournaras, Dirk Helbing

Unsupervised Learning

KTH ROYAL INSTITUTE OF TECHNOLOGY. Lecture 14 Machine Learning. K-means, knn

CS 2750 Machine Learning. Lecture 19. Clustering. CS 2750 Machine Learning. Clustering. Groups together similar instances in the data sample

Robot Learning. There are generally three types of robot learning: Learning from data. Learning by demonstration. Reinforcement learning

Classification. Vladimir Curic. Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University

Deep Learning for Computer Vision

Machine Learning : Clustering, Self-Organizing Maps

Chapter 3: Supervised Learning

MTTTS17 Dimensionality Reduction and Visualization. Spring 2018 Jaakko Peltonen. Lecture 11: Neighbor Embedding Methods continued

Introduction to Artificial Intelligence

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

劉介宇 國立台北護理健康大學 護理助產研究所 / 通識教育中心副教授 兼教師發展中心教師評鑑組長 Nov 19, 2012

Applying Supervised Learning

CS6716 Pattern Recognition

10/5/2017 MIST.6060 Business Intelligence and Data Mining 1. Nearest Neighbors. In a p-dimensional space, the Euclidean distance between two records,

CIS 520, Machine Learning, Fall 2015: Assignment 7 Due: Mon, Nov 16, :59pm, PDF to Canvas [100 points]

Lecture 3. Oct

Association Rule Mining and Clustering

Contents Machine Learning concepts 4 Learning Algorithm 4 Predictive Model (Model) 4 Model, Classification 4 Model, Regression 4 Representation

Lecture 6 K- Nearest Neighbors(KNN) And Predictive Accuracy

Machine Learning (CSE 446): Practical Issues

CSE4334/5334 DATA MINING

Random Forest A. Fornaser

K-Nearest Neighbors. Jia-Bin Huang. Virginia Tech Spring 2019 ECE-5424G / CS-5824

Introduction to Machine Learning. Xiaojin Zhu

CS4445 Data Mining and Knowledge Discovery in Databases. A Term 2008 Exam 2 October 14, 2008

R (2) Data analysis case study using R for readily available data set using any one machine learning algorithm.

Generative and discriminative classification techniques

Machine Learning in the Wild. Dealing with Messy Data. Rajmonda S. Caceres. SDS 293 Smith College October 30, 2017

10/14/2017. Dejan Sarka. Anomaly Detection. Sponsors

Big Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1

SYDE 372 Introduction to Pattern Recognition. Distance Measures for Pattern Classification: Part I

CS 229 Midterm Review

Machine Learning Lecture 3

CS325 Artificial Intelligence Ch. 20 Unsupervised Machine Learning

IBL and clustering. Relationship of IBL with CBR

Pouya Kousha Fall 2018 CSE 5194 Prof. DK Panda

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X

k-nearest Neighbor (knn) Sept Youn-Hee Han

Lecture 11: Clustering Introduction and Projects Machine Learning

Unsupervised Learning: Clustering

Expectation Maximization (EM) and Gaussian Mixture Models

CAMCOS Report Day. December 9 th, 2015 San Jose State University Project Theme: Classification

Recognition: Face Recognition. Linda Shapiro EE/CSE 576

Machine Learning using MapReduce

Data Preprocessing. Supervised Learning

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

Case-Based Reasoning. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. Parametric / Non-parametric.

CS 188: Artificial Intelligence Fall 2008

Introduction to Supervised Learning

INF 4300 Classification III Anne Solberg The agenda today:

CISC 4631 Data Mining

Machine Learning for OR & FE

ADVANCED CLASSIFICATION TECHNIQUES

Data Mining Classification: Alternative Techniques. Lecture Notes for Chapter 4. Instance-Based Learning. Introduction to Data Mining, 2 nd Edition

Applied Statistics for Neuroscientists Part IIa: Machine Learning

CSE 158. Web Mining and Recommender Systems. Midterm recap

Lecture 27: Review. Reading: All chapters in ISLR. STATS 202: Data mining and analysis. December 6, 2017

Bayes Classifiers and Generative Methods

ADVANCED ANALYTICS USING SAS ENTERPRISE MINER RENS FEENSTRA

Clustering and Dimensionality Reduction. Stony Brook University CSE545, Fall 2017

Data Mining. Lecture 03: Nearest Neighbor Learning

Neural Networks. Single-layer neural network. CSE 446: Machine Learning Emily Fox University of Washington March 10, /10/2017

LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS

CSC 411: Lecture 14: Principal Components Analysis & Autoencoders

MS1b Statistical Data Mining Part 3: Supervised Learning Nonparametric Methods

Lecture 25: Review I

CSC 411: Lecture 14: Principal Components Analysis & Autoencoders

VECTOR SPACE CLASSIFICATION

K- Nearest Neighbors(KNN) And Predictive Accuracy

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering

On Information-Maximization Clustering: Tuning Parameter Selection and Analytic Solution

Using Machine Learning to Optimize Storage Systems

Midterm Examination CS540-2: Introduction to Artificial Intelligence

Data mining. Classification k-nn Classifier. Piotr Paszek. (Piotr Paszek) Data mining k-nn 1 / 20

Introduction to Pattern Recognition Part II. Selim Aksoy Bilkent University Department of Computer Engineering

CS 1675 Introduction to Machine Learning Lecture 18. Clustering. Clustering. Groups together similar instances in the data sample

Voronoi Region. K-means method for Signal Compression: Vector Quantization. Compression Formula 11/20/2013

More Learning. Ensembles Bayes Rule Neural Nets K-means Clustering EM Clustering WEKA

Answer All Questions. All Questions Carry Equal Marks. Time: 20 Min. Marks: 10.

Supervised and Unsupervised Learning (II)

Question 1: knn classification [100 points]

Machine Learning Methods in Visualisation for Big Data 2018

Machine Learning. Classification

A Taxonomy of Semi-Supervised Learning Algorithms

Machine Learning in Biology

Transcription:

Intro to Artificial Intelligence Ahmed Sallam { Lecture 5: Machine Learning ://. } ://..

2 Review Probabilistic inference Enumeration Approximate inference

3 Today What is machine learning? Supervised learning Unsupervised learning

4

5 Machine learning! Give the machine the ability how to solve some problems (e.g. statistical phenomenon) based on incomplete information Here, learning is not only a question of remembering but also of generalization to unseen cases Learn by example

6 Machine learning types Suppose you are building a Face recognition program Supervised learning (Classification): You have a dataset of faces image and other images. The dataset is labeled (classified) into Faces images, and other images. Program task know is to classify new images based on the available dataset. Unsupervised learning (Clustering): You don t have a dataset, and program role know is to cluster similar images for a new given dataset into different group, e.g. it can distinguish that faces are very different from landscapes, which are very different from horses.

7

8 Supervised Classification Referred to as classification or pattern recognition. The goal is to find a functional mapping between the input data (patterns or examples) X, to a class label Y. i.e. Y=f(X) A pattern is described by a set of features. E.g. color of eyes, or distance between the eyes. Pattern recognition task can be viewed as a two dimensional matrix, whose axes are the examples and the features.

9 Pattern classification tasks Data collection and representation. Feature selection and/or feature reduction. Classification.

10 Classification algorithms Techniques for Low dimensional data sets K-Nearest Neighbor Classification Linear Discriminant Analysis Decision Trees Neural Networks Techniques for High dimensional data sets Support Vector Machines Boosting

11 K-Nearest Neighbor Classification K nearest neighbors measured by a distance function. These functions are valid only with continuous variables.

12 KNN Example Consider the following data concerning credit default. Age and Loan are two numerical variables (predictors) and Default is the target. Know we want to classify unknown case ((Age=48 and Loan=$142,000) using Euclidean distance.)!!!!

13 KNN Example Using Euclidean distance: E.g. distance between (48, $142k) and (33, 150k) Sqrt[(48-33)^2 + (142000-150000)^2] = 8000.01 So for all the dataset

14 KNN Example cont. Know we choose the number of neighbors K If K=1 then this case is Y If K=3 then we have 2 Y and 1 N which give us Y.

15 KNN disadvantages Computationally expensive Requires a large memory to store the training data.

16

17 Example suppose you had a basket and it is filled with some fresh fruits your task is to arrange the same type fruits at one place. This time you don't know any thing about that fruits, you are first time seeing these fruits so how will you arrange the same type of fruits. What you will do first you take on fruit and you will select any physical character of that particular fruit. suppose you taken color. Then you will arrange them base on the color, then the groups will be some thing like this. RED COLOR GROUP: apples & cherry fruits. GREEN COLOR GROUP: bananas & grapes. so now you will take another physical character as size, so now the groups will be some thing like this. RED COLOR AND BIG SIZE: apple. RED COLOR AND SMALL SIZE: cherry fruits. GREEN COLOR AND BIG SIZE: bananas. GREEN COLOR AND SMALL SIZE: grapes. Done

18 Techniques and algorithms clustering (e.g., k-means, mixture models, hierarchical clustering) Approaches for learning latent variable models such as Expectation maximization algorithm (EM) Method of moments Blind signal separation techniques