1 gR2002 Peter Spirtes Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

1 Learning Causal Structure from Observational and Experimental Data Richard Scheines Carnegie Mellon University.
A Tutorial on Learning with Bayesian Networks
Discovering Cyclic Causal Models by Independent Components Analysis Gustavo Lacerda Peter Spirtes Joseph Ramsey Patrik O. Hoyer.
Topic Outline Motivation Representing/Modeling Causal Systems
Bayesian Network and Influence Diagram A Guide to Construction And Analysis.
Logistic Regression Psy 524 Ainsworth.
Outline 1)Motivation 2)Representing/Modeling Causal Systems 3)Estimation and Updating 4)Model Search 5)Linear Latent Variable Models 6)Case Study: fMRI.
BAYESIAN NETWORKS. Bayesian Network Motivation  We want a representation and reasoning system that is based on conditional independence  Compact yet.
Introduction of Probabilistic Reasoning and Bayesian Networks
Challenges posed by Structural Equation Models Thomas Richardson Department of Statistics University of Washington Joint work with Mathias Drton, UC Berkeley.
1 Automatic Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour Dept. of Philosophy & CALD Carnegie Mellon.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
1 Learning Entity Specific Models Stefan Niculescu Carnegie Mellon University November, 2003.
Mediating Between Causes and Probabilities: the Use of Graphical Models in Econometrics Alessio Moneta Max Planck Institute of Economics, Jena, and Sant’Anna.
Ambiguous Manipulations
Causal Models, Learning Algorithms and their Application to Performance Modeling Jan Lemeire Parallel Systems lab November 15 th 2006.
Causal Modeling for Anomaly Detection Andrew Arnold Machine Learning Department, Carnegie Mellon University Summer Project with Naoki Abe Predictive Modeling.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
CAUSAL SEARCH IN THE REAL WORLD. A menu of topics  Some real-world challenges:  Convergence & error bounds  Sample selection bias  Simpson’s paradox.
Bayes Net Perspectives on Causation and Causal Inference
1 Tetrad: Machine Learning and Graphcial Causal Models Richard Scheines Joe Ramsey Carnegie Mellon University Peter Spirtes, Clark Glymour.
Summary of the Bayes Net Formalism David Danks Institute for Human & Machine Cognition.
A Brief Introduction to Graphical Models
Causal Inference and Graphical Models Peter Spirtes Carnegie Mellon University.
Bayesian Learning By Porchelvi Vijayakumar. Cognitive Science Current Problem: How do children learn and how do they get it right?
Alternative statistical modeling of Pharmacokinetics and Pharmacodynamics A collaboration between Aalborg University and Novo Nordisk A/S Claus Dethlefsen.
1 Causal Data Mining Richard Scheines Dept. of Philosophy, Machine Learning, & Human-Computer Interaction Carnegie Mellon.
Nov. 13th, Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour, and many others Dept. of Philosophy & CALD Carnegie Mellon.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Bayesian Networks for Data Mining David Heckerman Microsoft Research (Data Mining and Knowledge Discovery 1, (1997))
Bayesian Classification. Bayesian Classification: Why? A statistical classifier: performs probabilistic prediction, i.e., predicts class membership probabilities.
Methodological Problems in Cognitive Psychology David Danks Institute for Human & Machine Cognition January 10, 2003.
Physics Fluctuomatics (Tohoku University) 1 Physical Fluctuomatics 12th Bayesian network and belief propagation in statistical inference Kazuyuki Tanaka.
Ch 8. Graphical Models Pattern Recognition and Machine Learning, C. M. Bishop, Revised by M.-O. Heo Summarized by J.W. Nam Biointelligence Laboratory,
Course files
Penn State - March 23, The TETRAD Project: Computational Aids to Causal Discovery Peter Spirtes, Clark Glymour, Richard Scheines and many others.
Learning With Bayesian Networks Markus Kalisch ETH Zürich.
I271B The t distribution and the independent sample t-test.
METU Informatics Institute Min720 Pattern Classification with Bio-Medical Applications Lecture notes 9 Bayesian Belief Networks.
CHAPTER 6 Naive Bayes Models for Classification. QUESTION????
Exploratory studies: you have empirical data and you want to know what sorts of causal models are consistent with it. Confirmatory tests: you have a causal.
Dependency Networks for Collaborative Filtering and Data Visualization UAI-2000 발표 : 황규백.
Lecture 2: Statistical learning primer for biologists
The Visual Causality Analyst: An Interactive Interface for Causal Reasoning Jun Wang, Stony Brook University Klaus Mueller, Stony Brook University, SUNY.
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Classification COMP Seminar BCB 713 Module Spring 2011.
Variable selection in Regression modelling Simon Thornley.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
1 Day 2: Search June 14, 2016 Carnegie Mellon University Center for Causal Discovery.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Graduate School of Information Sciences, Tohoku University
Read R&N Ch Next lecture: Read R&N
Irina Rish IBM T.J.Watson Research Center
Markov Properties of Directed Acyclic Graphs
CSCI 5822 Probabilistic Models of Human and Machine Learning
Read R&N Ch Next lecture: Read R&N
A Short Tutorial on Causal Network Modeling and Discovery
Center for Causal Discovery: Summer Short Course/Datathon
Causal Data Mining Richard Scheines
Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour,
Extra Slides.
Graduate School of Information Sciences, Tohoku University
Graduate School of Information Sciences, Tohoku University
BN Semantics 3 – Now it’s personal! Parameter Learning 1
Searching for Graphical Causal Models of Education Data
Graduate School of Information Sciences, Tohoku University
Read R&N Ch Next lecture: Read R&N
Graduate School of Information Sciences, Tohoku University
Chapter 14 February 26, 2004.
Presentation transcript:

1 gR2002 Peter Spirtes Carnegie Mellon University

2 Graphs often given causal interpretation Graphs can be used to represent both causal hypotheses and probability distributions Graphs can be used to represent both causal hypotheses and probability distributions  e.g. in a directed acyclic graph (DAG) A  B means A is a direct cause of B  DAG also represents a set of distributions sharing conditional independence relations Causal interpretation is common in social science applications (structural equation modelling) Causal interpretation is common in social science applications (structural equation modelling) Causal representation of genetic regulatory networks Causal representation of genetic regulatory networks

3 TETRAD Dedicated to search for causal models under a variety of different assumptions about what is known Dedicated to search for causal models under a variety of different assumptions about what is known  Has several different kinds of graphs, depending upon background assumptions  Has a number of different kinds of search strategies  Allows some explicit representation of background knowledge  Has some modules for calculating equivalence class of given graph  Recently developed graphical interface  Should have module for calculating effects of interventions

4 The causal interpretation of graphical models suggests several unusual operations Calculation of effect of manipulation Calculation of effect of manipulation Calculation of equivalence class (aid to calculation of effect of manipulation) Calculation of equivalence class (aid to calculation of effect of manipulation)

5 Kinds of graphical models in TETRAD Directed acyclic graphs (discrete, normal) Directed acyclic graphs (discrete, normal) Directed cyclic graphs (normal) Directed cyclic graphs (normal) Pattern Pattern Mixed ancestral graphs (normal) Mixed ancestral graphs (normal) Partial ancestral graphs Partial ancestral graphs

6 Difference between calculation of manipulation versus conditioning In conditioning, the result depends only upon the joint distribution and the event conditioned on, In conditioning, the result depends only upon the joint distribution and the event conditioned on, In manipulating, the results depend upon the joint distribution, the event manipulated, and the causal relations among the variables In manipulating, the results depend upon the joint distribution, the event manipulated, and the causal relations among the variables  This means that locating alternative good models is essential for correct prediction of manipulation

7 P(Lung Cancer = yes|Smoking = yes) = ¾ Conditioning

8 Manipulating Smoking – First Step

9 P(Lung Cancer = yes||Smoking = yes) = ¾ = Manipulating Smoking – After waiting P(Lung Cancer = yes|Smoking = yes) = ¾

10 Calculation of effect of manipulation When there are no latent variables and structure is known - simple When there are no latent variables and structure is known - simple When there are latent variables and the structure is known (Pearl 2001) When there are latent variables and the structure is known (Pearl 2001) When the structure is partially known (SGS 2001) When the structure is partially known (SGS 2001)

11 Calculation of Effect of Manipulation – Equivalence Class A B C D G 1 G 1 A B C D G 2 G 2 A B C D Pattern Pattern G 1 and G 2 represent the same distribution, agree on the effect on D of manipulating B, disagree about the effect on A of manipulating B

12 Calculation of Effect of Manipulation – Equivalence Classes A B C D G 1 G 1 A B C D G 2 G 2 A B C D Pattern Pattern Pattern represents the equivalence class of DAGs if there are no latent variables. PAG represents the equivalence class of DAGs if there might be latent variables. A B C D PAG PAG o o o o

13 Edge types in different graphs    o  o o  o o  o o  o combinations of edges subject to varying constraints combinations of edges subject to varying constraints

14 The Statistical Theory for some graphical models is only partially worked out MAGs and PAGs MAGs and PAGs  know how to parameterize in linear cases  may not be a unique maximum likelihood estimate  PAG – not known how to efficiently determine if arbitrary combination of edges is PAG

15 Specific searches Assuming no latent variables or cycles Assuming no latent variables or cycles  Hill climbing – BIC, posterior probability (normal, discrete)  Constraint based – PC (normal, discrete)  Combined Assuming no cycles Assuming no cycles  Hill climbing – BIC (normal)  Constraint based – FCI (normal, discrete)

16 Other features Estimate parameters - DAGs (discrete, normal) Estimate parameters - DAGs (discrete, normal) Representation of background knowledge Representation of background knowledge Find equivalence class of given DAG (no latents, possibly cyclic) Find equivalence class of given DAG (no latents, possibly cyclic) Graphical interface Graphical interface Should have module to calculate effects of manipulations Should have module to calculate effects of manipulations  Known structure, no latents  Known structure, latent variables  Partially known structure

17 As a probabilistic model graphical models require usual operations As a probabilistic model, it requires the usual set of procedures As a probabilistic model, it requires the usual set of procedures  Search  Estimation  Testing  Scoring  Conditioning

18 Summary The causal interpretation of graphical models offers an opportunity to provide functionality not found in most other kinds of models (e.g. predicting affects of manipulations) The causal interpretation of graphical models offers an opportunity to provide functionality not found in most other kinds of models (e.g. predicting affects of manipulations)

19 Summary Added functionality, different domains and different background knowledge require a variety of different kinds of graphical models Added functionality, different domains and different background knowledge require a variety of different kinds of graphical models  desirability of flexibility in graphical representation  desirability of allowing each type to inherit as much as possible from more general representations

20 Summary Because of the need to locate good alternative models Because of the need to locate good alternative models  Search plays a very important role (score-based, constraint-based, and combinations)  Calculating equivalence classes is essential  Collection and representation of background knowledge to guide search is very important