STAT 598L Probabilistic Graphical Models. Instructor: Sergey Kirshner. Exact Inference

Similar documents
Probabilistic Graphical Models

Lecture 5: Exact inference. Queries. Complexity of inference. Queries (continued) Bayesian networks can answer questions about the underlying

Lecture 5: Exact inference

Part II. C. M. Bishop PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS

Probabilistic Graphical Models

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning

Ch9: Exact Inference: Variable Elimination. Shimi Salant, Barak Sternberg

CS242: Probabilistic Graphical Models Lecture 3: Factor Graphs & Variable Elimination

STA 4273H: Statistical Machine Learning

Markov Random Fields

CS 343: Artificial Intelligence

These notes present some properties of chordal graphs, a set of undirected graphs that are important for undirected graphical models.

CS 188: Artificial Intelligence

STAT 598L Learning Bayesian Network Structure

Computer Vision Group Prof. Daniel Cremers. 4a. Inference in Graphical Models

Lecture 11: May 1, 2000

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS

Bayesian Networks Inference (continued) Learning

Bayesian Networks Inference

Computer Vision Group Prof. Daniel Cremers. 4. Probabilistic Graphical Models Directed Models

Search Algorithms for Solving Queries on Graphical Models & the Importance of Pseudo-trees in their Complexity.

Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science Algorithms For Inference Fall 2014

Machine Learning. Sourangshu Bhattacharya

D-Separation. b) the arrows meet head-to-head at the node, and neither the node, nor any of its descendants, are in the set C.

6 : Factor Graphs, Message Passing and Junction Trees

V,T C3: S,L,B T C4: A,L,T A,L C5: A,L,B A,B C6: C2: X,A A

Bayesian Networks, Winter Yoav Haimovitch & Ariel Raviv

Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science Algorithms For Inference Fall 2014

Probabilistic Graphical Models

Bayesian Networks. A Bayesian network is a directed acyclic graph that represents causal relationships between random variables. Earthquake.

Recitation 4: Elimination algorithm, reconstituted graph, triangulation

Inference Complexity As Learning Bias. The Goal Outline. Don t use model complexity as your learning bias

Lecture 13: May 10, 2002

Computer vision: models, learning and inference. Chapter 10 Graphical Models

Modeling and Reasoning with Bayesian Networks. Adnan Darwiche University of California Los Angeles, CA

Speeding Up Computation in Probabilistic Graphical Models using GPGPUs

EE512 Graphical Models Fall 2009

OSU CS 536 Probabilistic Graphical Models. Loopy Belief Propagation and Clique Trees / Join Trees

Junction tree propagation - BNDG 4-4.6

STAT 598L Probabilistic Graphical Models. Instructor: Sergey Kirshner. Introduction

FMA901F: Machine Learning Lecture 6: Graphical Models. Cristian Sminchisescu

Lecture 9: Undirected Graphical Models Machine Learning

ECE521 Lecture 21 HMM cont. Message Passing Algorithms

CS 664 Flexible Templates. Daniel Huttenlocher

Local Probabilistic Models: Context-Specific CPDs. Sargur Srihari

Lecture 3: Conditional Independence - Undirected

An efficient approach for finding the MPE in belief networks

Belief propagation in a bucket-tree. Handouts, 275B Fall Rina Dechter. November 1, 2000

Bayes Net Learning. EECS 474 Fall 2016

An Effective Upperbound on Treewidth Using Partial Fill-in of Separators

CS242: Probabilistic Graphical Models Lecture 2B: Loopy Belief Propagation & Junction Trees

Exact Inference: Elimination and Sum Product (and hidden Markov models)

Av. Prof. Mello Moraes, 2231, , São Paulo, SP - Brazil

Computer Vision Group Prof. Daniel Cremers. 4. Probabilistic Graphical Models Directed Models

Workshop report 1. Daniels report is on website 2. Don t expect to write it based on listening to one project (we had 6 only 2 was sufficient

Parallel Exact Inference on the Cell Broadband Engine Processor

Scene Grammars, Factor Graphs, and Belief Propagation

Junction Trees and Chordal Graphs

Escola Politécnica, University of São Paulo Av. Prof. Mello Moraes, 2231, , São Paulo, SP - Brazil

Scene Grammars, Factor Graphs, and Belief Propagation

Conflict Graphs for Combinatorial Optimization Problems

Graphical Models. Pradeep Ravikumar Department of Computer Science The University of Texas at Austin

Class2: Constraint Networks Rina Dechter

2. Graphical Models. Undirected graphical models. Factor graphs. Bayesian networks. Conversion between graphical models. Graphical Models 2-1

Core Membership Computation for Succinct Representations of Coalitional Games

Data Structures and Algorithms

Minimal Dominating Sets in Graphs: Enumeration, Combinatorial Bounds and Graph Classes

Machine Learning Lecture 16

Graph Theory. Probabilistic Graphical Models. L. Enrique Sucar, INAOE. Definitions. Types of Graphs. Trajectories and Circuits.

ECE521 W17 Tutorial 10

Semi-Independent Partitioning: A Method for Bounding the Solution to COP s

Mean Field and Variational Methods finishing off

Topological parameters for time-space tradeoff

Chordal Graphs: Theory and Algorithms

Probabilistic Partial Evaluation: Exploiting rule structure in probabilistic inference

Treewidth and graph minors

EE512A Advanced Inference in Graphical Models

CS270 Combinatorial Algorithms & Data Structures Spring Lecture 19:

1 Format. 2 Topics Covered. 2.1 Minimal Spanning Trees. 2.2 Union Find. 2.3 Greedy. CS 124 Quiz 2 Review 3/25/18

NP-Hardness. We start by defining types of problem, and then move on to defining the polynomial-time reductions.

Learning factorizations in estimation of distribution algorithms using affinity propagation

A General Scheme for Automatic Generation of Search Heuristics from Specification Dependencies. Kalev Kask and Rina Dechter

Tools for Scalable Data Mining

Abstract. 2 Background 2.1 Belief Networks. 1 Introduction

Probabilistic Graphical Models

Chordal graphs MPRI

Kernelization Upper Bounds for Parameterized Graph Coloring Problems

Decision Problems. Observation: Many polynomial algorithms. Questions: Can we solve all problems in polynomial time? Answer: No, absolutely not.

LOCAL STRUCTURE AND DETERMINISM IN PROBABILISTIC DATABASES. Theodoros Rekatsinas, Amol Deshpande, Lise Getoor

GRAPHICAL MODELS FOR UNCERTAIN DATA

Learning Probabilistic Relational Models

Mean Field and Variational Methods finishing off

CSE 332 Spring 2014: Midterm Exam (closed book, closed notes, no calculators)

Learning Bounded Treewidth Bayesian Networks

10708 Graphical Models: Homework 2

Introduction to Algorithms May 14, 2003 Massachusetts Institute of Technology Professors Erik Demaine and Shafi Goldwasser.

Bayesian Classification Using Probabilistic Graphical Models

COMP260 Spring 2014 Notes: February 4th

Interaction Between Input and Output-Sensitive

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning

Transcription:

STAT 598L Probabilistic Graphical Models Instructor: Sergey Kirshner Exact Inference

What To Do With Bayesian/Markov Network? Compact representation of a complex model, but Goal: efficient extraction of information i.e., posing queries

Types of Queries Maximum a posterior (MAP) hypothesis Difficulty Intelligence Grade SAT Letter

Types of Queries Maximum a posterior (MAP) hypothesis Most probable explanation (MPE) Difficulty Intelligence Grade SAT Letter

Types of Queries Maximum a posterior (MAP) hypothesis Most probable explanation (MPE) Probability of evidence Partition function Difficulty Intelligence Grade SAT Letter

Types of Queries Maximum a posterior (MAP) hypothesis Most probable explanation (MPE) Probability of evidence Partition function Prior and/or posterior marginals Focus on probabilities Will translate to hypotheses Difficulty Intelligence Grade SAT Letter

How Difficult Is Answering Queries? Consider full probability table (assume n binary variables) Simple marginal Find P(X i ) Complexity: O(2 n-1 ) Can we do this faster? In the general case, no (more on this later) What if the joint distribution has structure?

Rearranging Terms in the Sum Difficulty Intelligence Grade SAT Letter

Complexity? Rearranging Terms Exponential in the size of the largest factor In turn, depends on the order of the summation Also depends on whether all of the variables need to be summed over Bad news Finding optimal order is NP-hard

All I Learned in This Course Was From Watching the Trees Graphs without loops Directed trees Implies at most one parent for each node Polytrees Possibly more than one parent for some nodes x 1 x 1 x 1 x 2 x 6 x2 x 6 x2 x 6 x 3 x 5 x 3 x 5 x 3 x 5 x4 x4 x4

Why Trees? Because efficient exact inference reduces to inference over trees

Chain: Tree, Only Simpler! x 1 x 2 x 3 x 4 x 5

Sum-Product Method for Chains Computational complexity? O(n Val(X max ) 2 ) Storage complexity? Same x 1 x 3 x 4 x 5 x 2

Sum-Product for Trees x 1 x 3 x 4 x 5 x 2

Sum-Product Method (Variable Elimination) Pick an order by selecting a node as root Start at the end of the ordering Sum out the child and update (product) the factor for the parent Continue until no variables left to sum Can be done in the opposite direction Sum out the parents rather than children

What To Do for General Graphs? Think decomposable Junction trees

Pick ordering Variable Elimination Bayesian networks: topological ordering For each node in the ordering A Compute the product of all factors involving A Sum out A Obtain a factor involving variables in all factors with A Complexity Depends on the ordering

What Happens to the Network? Difficulty Intelligence Chordal Grade SAT Letter

Induced Graph Nodes = nodes of the original Bayesian/Markov network Edges = pairs of variables in the same factor at any stage of variable elimination Induced graph is chordal Every factor is a clique Difficulty Intelligence Every maximum clique is Grade SAT an intermediate factor Letter

Induced Width Complexity of variable elimination Exponential in the size of the largest clique of the induced graph Induced width: size of the largest clique of the induced graph 1 Optimization: minimize induced width Original graph is chordal easy! General case NP-hard!

Chordal Graphs H is induced graph? H is chordal graph Chordal graphs admit ordering that do not introduce extra edges Proof: While there are vertices Pick a simplicial vertex and eliminate the corresponding variable

Finding Orderings Chordal graphs: maximum cardinality Pick a vertex with the largest number of neighbors Eliminate it Does not add edges (homework) How well does it do for the general case?

General Greedy Strategies Current ordering O=empty While O H Select a node v according to a heuristic h(h,o) Add the node to the ordering, O={O,{v}} How to choose a heuristic h? Problem is NP-hard, so none is optimal for all graphs Examples: Min-neighbors number of neighbors in the current graph Min-weight domain cardinality of its neighbors Min-fill number of edges to add if eliminated Weighted-min-fill sum of weights of edges added if eliminated

Comparison of Ordering Heuristics

Conditioning What if some variables are known? Don t add to the induced width Idea: treat the values of some variables as known Solve the elimination for each value Aggregate Coherence Difficulty Intelligence Grade SAT Letter Job

Local Probability Model What if a CPD has a compact representation Noisy-or Decision tree/graph Rule-based

Noisy-or Can be made more efficient! L F 1 F k X 1 X k Y