Machine Learning. Lecture Slides for. ETHEM ALPAYDIN The MIT Press, h1p://

Similar documents
Machine Learning. Sourangshu Bhattacharya

D-Separation. b) the arrows meet head-to-head at the node, and neither the node, nor any of its descendants, are in the set C.

Computer Vision Group Prof. Daniel Cremers. 4. Probabilistic Graphical Models Directed Models

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS

Reasoning About Uncertainty

Computer Vision Group Prof. Daniel Cremers. 4. Probabilistic Graphical Models Directed Models

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Part II. C. M. Bishop PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS

CSCE 478/878 Lecture 6: Bayesian Learning and Graphical Models. Stephen Scott. Introduction. Outline. Bayes Theorem. Formulas

FMA901F: Machine Learning Lecture 6: Graphical Models. Cristian Sminchisescu

COMP90051 Statistical Machine Learning

CS 343: Artificial Intelligence

Machine Learning

Machine Learning

CS 343: Artificial Intelligence

Bayesian Machine Learning - Lecture 6

Bayesian Networks. A Bayesian network is a directed acyclic graph that represents causal relationships between random variables. Earthquake.

Machine Learning

Computer Vision Group Prof. Daniel Cremers. 4a. Inference in Graphical Models

Lecture 4: Undirected Graphical Models

Lecture 9: Undirected Graphical Models Machine Learning

Probabilistic Graphical Models

Lecture 5: Exact inference. Queries. Complexity of inference. Queries (continued) Bayesian networks can answer questions about the underlying

Sequence Labeling: The Problem

Inference. Inference: calculating some useful quantity from a joint probability distribution Examples: Posterior probability: Most likely explanation:

CS 188: Artificial Intelligence

A New Approach For Convert Multiply-Connected Trees in Bayesian networks

Graphical Models. David M. Blei Columbia University. September 17, 2014

Lecture 3: Conditional Independence - Undirected

Recall from last time. Lecture 4: Wrap-up of Bayes net representation. Markov networks. Markov blanket. Isolating a node

Directed Graphical Models

Chapter 8 of Bishop's Book: Graphical Models

Graphical models are a lot like a circuit diagram they are written down to visualize and better understand a problem.

Workshop report 1. Daniels report is on website 2. Don t expect to write it based on listening to one project (we had 6 only 2 was sufficient

3 : Representation of Undirected GMs

Lecture 5: Exact inference

Graphical Models & HMMs

Graphical Models. Dmitrij Lagutin, T Machine Learning: Basic Principles

The Basics of Graphical Models

COMP90051 Statistical Machine Learning

2. Graphical Models. Undirected pairwise graphical models. Factor graphs. Bayesian networks. Conversion between graphical models. Graphical Models 2-1

A note on the pairwise Markov condition in directed Markov fields

Graphical Models. Pradeep Ravikumar Department of Computer Science The University of Texas at Austin

Introduction to Probabilistic Graphical Models

2. Graphical Models. Undirected graphical models. Factor graphs. Bayesian networks. Conversion between graphical models. Graphical Models 2-1

Statistical Techniques in Robotics (STR, S15) Lecture#06 (Wednesday, January 28)

Machine Learning Lecture 16

Graphical Models and Markov Blankets

Learning Bayesian Networks (part 3) Goals for the lecture

Chapter 2 PRELIMINARIES. 1. Random variables and conditional independence

Collective classification in network data

Stat 5421 Lecture Notes Graphical Models Charles J. Geyer April 27, Introduction. 2 Undirected Graphs

Bayesian Networks Inference

Introduction to Graphical Models

STA 4273H: Statistical Machine Learning

CS242: Probabilistic Graphical Models Lecture 3: Factor Graphs & Variable Elimination

COS 513: Foundations of Probabilistic Modeling. Lecture 5

Today. Logistic Regression. Decision Trees Redux. Graphical Models. Maximum Entropy Formulation. Now using Information Theory

Introduction to information theory and coding - Lecture 1 on Graphical models

A Brief Introduction to Bayesian Networks AIMA CIS 391 Intro to Artificial Intelligence

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning

Junction tree propagation - BNDG 4-4.6

Directed Graphical Models (Bayes Nets) (9/4/13)

EXPLORING CAUSAL RELATIONS IN DATA MINING BY USING DIRECTED ACYCLIC GRAPHS (DAG)

Probabilistic Graphical Models

Graphical Models Reconstruction

Machine Learning

6 : Factor Graphs, Message Passing and Junction Trees

Causality in Communication: The Agent-Encapsulated Bayesian Network Model

Dependency detection with Bayesian Networks

Graphical Models Part 1-2 (Reading Notes)

Machine Learning

Conditional Random Fields and beyond D A N I E L K H A S H A B I C S U I U C,

Computer vision: models, learning and inference. Chapter 10 Graphical Models

Scene Grammars, Factor Graphs, and Belief Propagation

A Probabilistic Relaxation Framework for Learning Bayesian Network Structures from Data

Conditional Random Fields - A probabilistic graphical model. Yen-Chin Lee 指導老師 : 鮑興國

Scene Grammars, Factor Graphs, and Belief Propagation

Bayesian Classification Using Probabilistic Graphical Models

OSU CS 536 Probabilistic Graphical Models. Loopy Belief Propagation and Clique Trees / Join Trees

Introduction to Bayesian networks

Research Article Structural Learning about Directed Acyclic Graphs from Multiple Databases

Computational Intelligence

Bayes Net Learning. EECS 474 Fall 2016

4 Factor graphs and Comparing Graphical Model Types

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning

Modeling and Reasoning with Bayesian Networks. Adnan Darwiche University of California Los Angeles, CA

ALTW 2005 Conditional Random Fields

Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science Algorithms for Inference Fall 2014

BAYES-LIN: An object-oriented environment for Bayes linear local computation

Warm-up as you walk in

ECE521 W17 Tutorial 10

Machine Learning

Probabilistic Graphical Models

STAT 598L Probabilistic Graphical Models. Instructor: Sergey Kirshner. Exact Inference

1 : Introduction to GM and Directed GMs: Bayesian Networks. 3 Multivariate Distributions and Graphical Models

ECE521 Lecture 18 Graphical Models Hidden Markov Models

Using a Model of Human Cognition of Causality to Orient Arcs in Structural Learning

Belief propagation in a bucket-tree. Handouts, 275B Fall Rina Dechter. November 1, 2000

Probabilistic Graphical Models

Transcription:

Lecture Slides for INTRODUCTION TO Machine Learning ETHEM ALPAYDIN The MIT Press, 2010 alpaydin@boun.edu.tr h1p://www.cmpe.boun.edu.tr/~ethem/i2ml2e

CHAPTER 16: Graphical Models

Graphical Models Aka Bayesian networks, probabilisoc networks Nodes are hypotheses (random vars) and the probabilioes corresponds to our belief in the truth of the hypothesis Arcs are direct influences between hypotheses The structure is represented as a directed acyclic graph (DAG) The parameters are the condioonal probabilioes in the arcs (Pearl, 1988, 2000; Jensen, 1996; Lauritzen, 1996) 3

Causes and Bayes Rule causal diagnos<c Diagnos<c inference: Knowing that the grass is wet, what is the probability that rain is the cause? 4

CondiOonal Independence X and Y are independent if P(X,Y)=P(X)P(Y) X and Y are condioonally independent given Z if or P(X,Y Z)=P(X Z)P(Y Z) P(X Y,Z)=P(X Z) Three canonical cases: Head- to- tail, Tail- to- tail, head- to- head 5

Case 1: Head- to- Head P(X,Y,Z)=P(X)P(Y X)P(Z Y) P(W C)=P(W R)P(R C)+P(W ~R)P(~R C) 6

Case 2: Tail- to- Tail P(X,Y,Z)=P(X)P(Y X)P(Z X) 7

Case 3: Head- to- Head P(X,Y,Z)=P(X)P(Y)P(Z X,Y) 8

Causal vs DiagnosOc Inference Causal inference: If the sprinkler is on, what is the probability that the grass is wet? P(W S) = P(W R,S) P(R S) + P(W ~R,S) P(~R S) = P(W R,S) P(R) + P(W ~R,S) P(~R) = 0.95 0.4 + 0.9 0.6 = 0.92 Diagnos<c inference: If the grass is wet, what is the probability that the sprinkler is on? P(S W) = 0.35 > 0.2 P(S) P(S R,W) = 0.21 Explaining away: Knowing that it has rained decreases the probability that the sprinkler is on. 9

Causes Causal inference: P(W C) = P(W R,S) P(R,S C) + P(W ~R,S) P(~R,S C) + P(W R,~S) P(R,~S C) + P(W ~R,~S) P(~R,~S C) and use the fact that P(R,S C) = P(R C) P(S C) Diagnos<c: P(C W ) =? 10

ExploiOng the Local Structure P (F C) =? 11

ClassificaOon diagnos<c Bayes rule inverts the arc: P (C x ) 12

Naive Bayes Classifier Given C, x j are independent: p(x C) = p(x 1 C) p(x 2 C)... p(x d C) 13

Hidden Markov Model as a 14

15

Linear Regression 16

d- SeparaOon A path from node A to node B is blocked if a) The direc5ons of edges on the path meet head- to- tail (case 1) or tail- to- tail (case 2) and the node is in C, or b) The direc5ons of edges meet head- to- head (case 3) and neither that node nor any of its descendants is in C. If all paths are blocked, A and B are d- separated (condi5onally independent) given C. BCDF is blocked given C. BEFG is blocked by F. BEFD is blocked unless F (or G) is given. 17

Belief PropagaOon (Pearl, 1988) Chain: 18

Trees 19

Polytrees How can we model P(X U 1,U 2,...,U k ) cheaply? 20

JuncOon Trees If X does not separate E + and E -, we convert it into a juncoon tree and then apply the polytree algorithm Tree of moralized, clique nodes 21

Undirected Graphs: Markov In a Markov random field, dependencies are symmetric, for example, pixels in an image In an undirected graph, A and B are independent if removing C makes them unconnected. PotenOal funcoon ψ c (X c ) shows how favorable is the parocular configuraoon X over the clique C The joint is defined in terms of the clique potenoals 22

Factor Graphs Define new factor nodes and write the joint in terms of them 23

Learning a Graphical Model Learning the condioonal probabilioes, either as tables (for discrete case with small number of parents), or as parametric funcoons Learning the structure of the graph: Doing a state- space search over a score funcoon that uses both goodness of fit to data and some measure of complexity 24

Influence Diagrams decision node chance node u<lity node 25