Reasoning About Uncertainty


- Graphical representation of causal relations (examples)
- Graphical models
- Inference in graphical models (introduction)

Example 1: Icy Roads (Jensen, 1996)

Example 2: Wet Grass (Jensen, 1996)

Example 3: Burglary/Earthquake (Jensen, 1996)

Graph Connections
- Serial connection
- Diverging connection
- Converging connection

Serial Connection

Diverging Connection

Converging Connection

Conditional Dependence (Jensen, 1996)

Graphical Models
- Directed graphs
- Undirected graphs
- Chain graphs

Directed Graphs
- Represent causality
- Encode independence and conditional independence
- An ordered pair $(V, D)$: $V$ is a set of vertices; $D$ is a set of directed edges $X \to Y$ between vertices in $V$
- Directed cycle (loop); acyclic and cyclic graphs
- A path is a sequence of edges (each with endpoints); acyclic, cyclic and directed paths

Directed Graphs (cont.): Examples

D-Separation
Two variables A and B in a DAG are d-separated if, for all paths between A and B, there is an intermediate variable V such that either
- the connection is serial or diverging and the state of V is known, or
- the connection is converging and neither V nor any of V's descendants have received evidence.

Examples of d-separation (Jensen, 1996)
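The d-separation definition above can be checked mechanically. The sketch below does not enumerate paths as the definition does. Instead it uses the equivalent moralised ancestral graph criterion: keep only the ancestors of the variables involved, connect co-parents, drop edge directions, and test ordinary graph separation. The graph, the `Call` node and the function names are illustrative assumptions, not taken from the slides.

```python
from collections import deque
from itertools import combinations


def ancestors(parents, nodes):
    """Return `nodes` together with all of their ancestors."""
    seen, stack = set(nodes), list(nodes)
    while stack:
        for p in parents.get(stack.pop(), ()):
            if p not in seen:
                seen.add(p)
                stack.append(p)
    return seen


def d_separated(parents, a, b, evidence):
    """True if a and b are d-separated by `evidence` in the DAG.

    `parents` maps each node to a list of its parents (roots may be omitted).
    Assumes a and b are not themselves in `evidence`.
    """
    evidence = set(evidence)
    keep = ancestors(parents, {a, b} | evidence)
    # Moralise the ancestral subgraph: link every node to its parents
    # and "marry" each pair of parents that share a child.
    adj = {v: set() for v in keep}
    for child in keep:
        ps = list(parents.get(child, ()))
        for p in ps:
            adj[child].add(p)
            adj[p].add(child)
        for p, q in combinations(ps, 2):
            adj[p].add(q)
            adj[q].add(p)
    # Breadth-first search from a that never steps onto an evidence node.
    seen, queue = {a}, deque([a])
    while queue:
        v = queue.popleft()
        if v == b:
            return False
        for w in adj[v]:
            if w not in seen and w not in evidence:
                seen.add(w)
                queue.append(w)
    return True


# Hypothetical burglary/earthquake-style DAG (structure assumed for illustration).
parents = {
    "Alarm": ["Burglary", "Earthquake"],  # converging connection at Alarm
    "Radio": ["Earthquake"],              # Earthquake diverges to Alarm and Radio
    "Call": ["Alarm"],                    # a descendant of Alarm
}
```

With this graph, `d_separated(parents, "Burglary", "Earthquake", [])` holds, while adding evidence on `Alarm` or on its descendant `Call` opens the converging connection, as the second clause of the definition predicts.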

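Examples of d-separation (Jensen, 1996)

Directed Global Markov Property
The directed global Markov property, i.e., d-separation ($\models_{DS}$), is defined as:
$DG \models_{DS} X \perp Y \mid Z$ if X and Y are d-separated by Z in DG.

Example
DG1: $A \perp C \mid B$; $A \perp C \mid \{B, D\}$; $B \perp D \mid C$; $B \perp D \mid A$
DG2: $A \perp C \mid D$; $A \perp C \mid \{B, D\}$; $B \perp D \mid C$; $B \perp D \mid A$
DG3: $A \perp D \mid C$; $B \perp A \mid C$; $B \perp D \mid \{C, A\}$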

Undirected Graphs
An undirected graph (UG) is also known as a Markov random field.
An ordered pair $(V, U)$: $V$ is a set of vertices; $U$ is a set of undirected edges $X - Y$ between vertices.
For disjoint sets of vertices $\mathbf{X}$, $\mathbf{Y}$ and $\mathbf{Z}$ ($\mathbf{Z}$ may be empty), if there is no path from a variable $X \in \mathbf{X}$ to a variable $Y \in \mathbf{Y}$ that does not include some variable in $\mathbf{Z}$, then $\mathbf{X}$ and $\mathbf{Y}$ are said to be separated by $\mathbf{Z}$.

Undirected Graphs (cont.): Examples
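Separation in an undirected graph, as defined above, reduces to a reachability test: remove the vertices in Z and check whether any vertex of Y can still be reached from X. A minimal sketch follows; the four-cycle used as input is a hypothetical graph, not one of the slide figures, and `separated` is an illustrative name.

```python
from collections import deque


def separated(edges, xs, ys, zs):
    """True if every path from a vertex in xs to a vertex in ys passes through zs.

    `edges` is an iterable of undirected edges (u, v); xs, ys, zs are
    assumed to be pairwise disjoint sets of vertices.
    """
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    xs, ys, zs = set(xs), set(ys), set(zs)
    seen, queue = set(xs), deque(xs)
    while queue:
        v = queue.popleft()
        if v in ys:
            return False  # found a path that avoids zs
        for w in adj.get(v, ()):
            if w not in seen and w not in zs:
                seen.add(w)
                queue.append(w)
    return True


# Hypothetical four-cycle A - B - D - C - A.
edges = [("A", "B"), ("B", "D"), ("D", "C"), ("C", "A")]
```

In this cycle, A and D are separated by {B, C} but not by {B} alone, because the path A - C - D avoids B.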

Undirected Global Markov Property
The undirected global Markov property, i.e., separation ($\models_{U}$), is defined as:
$UG \models_{U} X \perp Y \mid Z$ if X and Y are separated by Z in UG.

Example
UG1: $A \perp D \mid C$; $A \perp D \mid \{B, C\}$; $B \perp D \mid C$; $A \perp B \mid C$; $A \perp B \mid \{C, D\}$
UG2: $A \perp C \mid B$; $A \perp D \mid \{C, B\}$; $A \perp D \mid B$; $A \perp D \mid C$; $D \perp B \mid C$

Chain Graphs
A chain graph (CG), admitting both directed and undirected edges, generalises graphical models based on directed or undirected graphs.
A chain graph has no partially directed cycles, where a partially directed cycle, g, is a sequence of n distinct edges $E_1, \dots, E_n$ ($n \ge 3$) with endpoints $X_i, X_{i+1}$ respectively, such that:
i. $X_1 = X_{n+1}$,
ii. $\forall i$ ($1 \le i \le n$) either $X_i - X_{i+1}$ or $X_i \to X_{i+1}$, and
iii. $\exists j$ ($1 \le j \le n$) such that $X_j \to X_{j+1}$.

Chain Graphs (cont.): Examples
(a) CG1 (b) CG2
(a) graphs containing partially directed cycles and (b) chain graphs

Lauritzen-Wermuth-Frydenberg (LWF) Global Markov Property
The LWF global Markov property for chain graphs ($\models_{LWF}$) is then defined as:
$CG \models_{LWF} X \perp Y \mid Z$ if X is separated from Y by Z in the undirected moral graph based on CG.

Example
CG1: $A \perp D \mid C$; $A \perp D \mid \{B, C\}$; $B \perp D \mid C$; $A \perp B \mid C$; $A \perp B \mid \{C, D\}$
CG2: $A \perp D \mid C$; $A \perp D \mid \{C, B\}$; $A \perp C \mid B$; $A \perp D \mid B$; $D \perp B \mid C$

Chain Graphs (cont.)
A complex in CG is a subgraph of the following form: $X \to V_1 - \dots - V_n \leftarrow Y$ ($n \ge 1$).
Moralisation is needed for deriving the Markov property for chain graphs.
Moralisation is achieved by adding the undirected edge $X - Y$ to a complex.
A moral graph is the undirected graph created by moralising all complexes in CG, and then replacing all directed edges with undirected edges.

Exact Inference in Graphical Models
Directed graphs
- DAG, Bayesian network, belief network
- Representation of the joint probability using prior and conditional probabilities
- Edges (arcs) and vertices (nodes)
- Causality
- Inference is represented graphically

Exact Inference in DAGs: Example 1, Bayes' theorem
$$P(X)\,P(Y \mid X) = P(X, Y) = P(Y)\,P(X \mid Y)$$

Example 1: Bayes' theorem (cont.) (Revision)
Bayes' theorem combines known (and observed) probabilities to compute an unknown probability of interest:
$$P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)}$$
Bayes' theorem can be formulated as:
$$\text{posterior} = \frac{\text{likelihood} \times \text{prior}}{\text{evidence}}$$
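Bayes' theorem in the posterior = likelihood × prior / evidence form can be tried on a small discrete model X → Y. The sketch below multiplies likelihood by prior for each state of X and normalises by the evidence, obtained by summing those products over X. The variable names and all probability values are made up for illustration and do not come from the slides.

```python
def posterior(prior, likelihood, y):
    """Return P(X | Y = y) as a dict over the states of X.

    prior:      {x: P(X = x)}
    likelihood: {x: {y: P(Y = y | X = x)}}
    """
    # likelihood x prior for each state of X (the unnormalised posterior)
    joint = {x: likelihood[x][y] * prior[x] for x in prior}
    # evidence P(Y = y), obtained by summing the joint over X
    evidence = sum(joint.values())
    return {x: joint[x] / evidence for x in joint}


# Hypothetical numbers: X is the road state, Y is whether a crash is observed.
prior = {"icy": 0.7, "not_icy": 0.3}
likelihood = {
    "icy": {"crash": 0.8, "no_crash": 0.2},
    "not_icy": {"crash": 0.1, "no_crash": 0.9},
}
post = posterior(prior, likelihood, "crash")
```

Here the evidence is 0.8 × 0.7 + 0.1 × 0.3 = 0.59, so the posterior for `icy` is 0.56 / 0.59.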

Example 1: Bayes' theorem (cont.)
Consider the first (left) model and assume that we are given $P(X)$, $P(Y \mid X)$ and the observation $Y = y$. We are interested in the posterior probability $P(X \mid Y = y)$. Employing the sum and product rules of probability, we derive the evidence (marginal distribution) $P(Y) = \sum_x P(Y \mid X = x)\,P(X = x)$, and then use Bayes' theorem to calculate:
$$P(X \mid Y = y) = \frac{P(Y = y \mid X)\,P(X)}{P(Y = y)}$$

Exact Inference in DAGs: Example 2, conditional independence
We assume the structure $Z - X - Y$ for arbitrary edges and are interested in computing $P(Y \mid Z = z)$ given $P(Z, X)$, $P(X)$ and $P(Y, X)$. We can first derive the joint probability:
$$P(Z, X) = P(Z \mid X)\,P(X)$$
$$P(Y, X) = P(Y \mid X)\,P(X)$$
$$P(X, Y, Z) = \frac{P(Z, X)\,P(Y, X)}{P(X)}$$

Example 2: Conditional independence (cont.)
and then extract:
$$P(Y \mid Z, X) = \frac{P(X, Y, Z)}{P(Z, X)} = \frac{P(Y \mid X)\,P(X)\,P(Z \mid X)\,P(X)}{P(X)\,P(Z, X)} = P(Y \mid X)$$

Example 2: Conditional independence (cont.)
The joint probability of different possible models can be factorised according to similar guidelines. For example, $Z - X - Y$ can take different forms, each factorised differently:
$Z \leftarrow X \to Y$: $P(X, Y, Z) = P(X)\,P(Y \mid X)\,P(Z \mid X)$
$Z \to X \to Y$: $P(X, Y, Z) = P(Y \mid X)\,P(X \mid Z)\,P(Z)$
$Z \leftarrow X \leftarrow Y$: $P(X, Y, Z) = P(X \mid Y)\,P(Z \mid X)\,P(Y)$
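The conditional independence used in Example 2 can be verified numerically. The sketch below builds the joint for the common-cause form Z ← X → Y from its factorisation, checks that P(Y | Z, X) equals P(Y | X), and computes the query P(Y | Z = z) by summing X out of the joint. The binary conditional probability tables and helper names are hypothetical, chosen only for illustration.

```python
from itertools import product

# Hypothetical tables for Z <- X -> Y, all variables binary.
p_x = {0: 0.6, 1: 0.4}
p_y_given_x = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.3, 1: 0.7}}    # p_y_given_x[x][y]
p_z_given_x = {0: {0: 0.8, 1: 0.2}, 1: {0: 0.25, 1: 0.75}}  # p_z_given_x[x][z]

# Joint from the factorisation P(X, Y, Z) = P(X) P(Y | X) P(Z | X).
joint = {
    (x, y, z): p_x[x] * p_y_given_x[x][y] * p_z_given_x[x][z]
    for x, y, z in product((0, 1), repeat=3)
}


def marginal(joint, keep):
    """Sum out every variable whose position is not in `keep` (0=X, 1=Y, 2=Z)."""
    out = {}
    for assignment, p in joint.items():
        key = tuple(assignment[i] for i in keep)
        out[key] = out.get(key, 0.0) + p
    return out


p_xz = marginal(joint, (0, 2))  # P(X, Z)
p_yz = marginal(joint, (1, 2))  # P(Y, Z)
p_z = marginal(joint, (2,))     # P(Z)


def p_y_given_zx(y, z, x):
    """P(Y = y | Z = z, X = x) = P(X, Y, Z) / P(Z, X)."""
    return joint[(x, y, z)] / p_xz[(x, z)]


def p_y_given_z(y, z):
    """P(Y = y | Z = z), with X summed out of the joint."""
    return p_yz[(y, z)] / p_z[(z,)]
```

For every combination of values, `p_y_given_zx(y, z, x)` agrees with `p_y_given_x[x][y]` up to rounding, whereas `p_y_given_z` does depend on `z`: Y and Z are dependent until X is observed.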