CAUSES AND COUNTERFACTUALS OR THE SUBTLE WISDOM OF BRAINLESS ROBOTS.

Slides:



Advertisements
Similar presentations
Department of Computer Science
Advertisements

What is Science?.
RELATED CLASS CS 262 Z – SEMINAR IN CAUSAL MODELING CURRENT TOPICS IN COGNITIVE SYSTEMS INSTRUCTOR: JUDEA PEARL Spring Quarter Monday and Wednesday, 2-4pm.
Holland on Rubin’s Model Part II. Formalizing These Intuitions. In the 1920 ’ s and 30 ’ s Jerzy Neyman, a Polish statistician, developed a mathematical.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Causality Inferences. Objectives: 1. To understand the concept of risk factors and outcome in a scientific way. 2. To understand and comprehend each and.
Inferences The Reasoning Power of Expert Systems.
1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (
CAUSES AND COUNTERFACTUALS Judea Pearl University of California Los Angeles (
1 THE SYMBIOTIC APPROACH TO CAUSAL INFERENCE Judea Pearl University of California Los Angeles (
Culture and psychological knowledge: A Recap
Causality, Mechanisms and Modularity: Structural Models in Econometrics Damien Fennell Centre for the Philosophy of Natural and Social Science London School.
COMMENTS ON By Judea Pearl (UCLA). notation 1990’s Artificial Intelligence Hoover.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
Modality Counterfactuals. Possible Worlds A statement is something that is true or false. A set of statements is consistent iff all of the members could.
Judea Pearl University of California Los Angeles CAUSAL REASONING FOR DECISION AIDING SYSTEMS.
Kent Where causal dualism comes from Monika Koeppl Causality, Cognition and the Constitution of Scientific Phenomena Department of Philosophy University.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Causation The Constant Conjunction Theory. Two Questions First Question: What are the causal relata? Events Things Agents Examples The eruption of Mt.
CS Bayesian Learning1 Bayesian Learning. CS Bayesian Learning2 States, causes, hypotheses. Observations, effect, data. We need to reconcile.
Introduction, Acquiring Knowledge, and the Scientific Method
Introduction To Science
Discrete Math 6A Max Welling. Recap 1. Proposition: statement that is true or false. 2. Logical operators: NOT, AND, OR, XOR, ,  3. Compound proposition:
1 CAUSAL INFERENCE: MATHEMATICAL FOUNDATIONS AND PRACTICAL APPLICATIONS Judea Pearl University of California Los Angeles (
Chapter 4 Principles of Quantitative Research. Answering Questions  Quantitative Research attempts to answer questions by ascribing importance (significance)
Philosophy and the Scientific Method Dr Keith Jones.
CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (
Signals and Systems March 25, Summary thus far: software engineering Focused on abstraction and modularity in software engineering. Topics: procedures,
Lecture 2 – Boolean Algebra Lecturer: Amy Ching Date: 21 st Oct 2002.
Judea Pearl University of California Los Angeles ( THE MATHEMATICS OF CAUSE AND EFFECT.
1-8 An Introduction to Equations
Judea Pearl University of California Los Angeles ( THE MATHEMATICS OF CAUSE AND EFFECT.
Variables Tutorial 3c variable A variable is any symbol that can be replaced with a number to solve a math problem. An open sentence has at least one.
THE MATHEMATICS OF CAUSE AND EFFECT: With Reflections on Machine Learning Judea Pearl Departments of Computer Science and Statistics UCLA.
CMPF144 FUNDAMENTALS OF COMPUTING THEORY Module 5: Classical Logic.
An Examination of Science. What is Science Is a systematic approach for analyzing and organizing knowledge. Used by all scientists regardless of the field.
Logic. What is logic? Logic (from the Ancient Greek: λογική, logike) is the use and study of valid reasoning. The study of logic features most prominently.
Propositional Calculus CS 270: Mathematical Foundations of Computer Science Jeremy Johnson.
1 Bayesian Decision Theory Foundations for a unified theory.
EDUC 502: Introduction to Research August 29, 2005 Dr. Groth Note: This will be ed to your SU account after class tonight.
Chapter 6 Chapter 6 Research Design and Causation.
CAUSES AND COUNTERFACTIALS IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (
Research Design and Methods. Causal Inference  What is causal inference “…learning about CAUSAL effects from the data observed.” (KKV, 8)  Different.
Science As a Process!!!!!. Science A process through which nature is studied, discovered, and understood. A process through which nature is studied, discovered,
THE MATHEMATICS OF CAUSE AND COUNTERFACTUALS Judea Pearl University of California Los Angeles (
Lecture 4 Boolean Algebra. Logical Statements °A proposition that may or may not be true: Today is Monday Today is Sunday It is raining °Compound Statements.
Aim: What factors must we consider to make an experimental design?
HYPOTHESIS TESTING Null Hypothesis and Research Hypothesis ?
Repetition I  Causes of biases in contingency and causal judgments:  Pattern recognition capabilities.  Belief in the law of small numbers.  Hot hand.
Introduction to EMS Research Osama A Samarkandi, PhD, RN BSc, GMD, BSN, MSN, NIAC EMS 423; EMS Research and Evidence Based Practice.
Causality and Identification in Structural Econometrics Causality and Probability in the Sciences University of Kent, Canterbury September Damien.
What is Science? If some was ask to you: What is science? How would you respond? Body of Knowledge Method/Process A Way of Thinking or Constructing Reality.
Variable selection in Regression modelling Simon Thornley.
Chapter 2. Signals and Linear Systems
EVALUATING A MIDDLE SCHOOL MATH M.ED. PROFESSIONAL DEVELOPMENT PROGRAM. Knowledge, Pedagogy, Practice or Student Achievement:
Counterfactuals.
The Scientific Method 7M Science.
Propositional Calculus: Boolean Algebra and Simplification
Judea Pearl University of California Los Angeles
Chen Avin Ilya Shpitser Judea Pearl Computer Science Department UCLA
A MACHINE LEARNING EXERCISE
University of California
Computer Science and Statistics
CAUSAL INFERENCE IN STATISTICS
The Scientific Method.
Conditionals: If __________ [then] __________ statements
From Propensity Scores And Mediation To External Validity
Variables and Equations
THE MATHEMATICS OF PROGRAM EVALUATION
CAUSAL REASONING FOR DECISION AIDING SYSTEMS
Presentation transcript:

CAUSES AND COUNTERFACTUALS OR THE SUBTLE WISDOM OF BRAINLESS ROBOTS

ANTIQUITY TO ROBOTICS “I would rather discover one causal relation than be King of Persia” Democritus (430-380 BC) Development of Western science is based on two great achievements: the invention of the formal logical system (in Euclidean geometry) by the Greek philosophers, and the discovery of the possibility to find out causal relationships by systematic experiment (during the Renaissance). A. Einstein, April 23, 1953 The second quote is from Albert Einstein who, a year before his death, attributes the progress of Western Science to two fortunate events: The invention of formal logic by the Greek geometers, and the Galilean idea that causes could be discovered by experiments. As I have demonstrated earlier, experimental science have not fully benefited from the power of formal methods --- formal mathematics was used primarily for analyzing passive observations, under fixed boundary conditions, while the design of new experiments, and the transitions between boundary conditions, have been managed entirely by the unaided human intellect. The development of autonomous agents and intelligent robots requires a new type of analysis, in which the DOING component of science enjoys the benefit of formal mathematics, side by side with its observational component, a tiny glimpse of such analysis I have labored to uncover in this lecture. I am convinced that the meeting of these two components will eventually bring about another scientific revolution, perhaps equal in impact and profoundness to the one that took place during the renaissance. AI will be the major player in this revolution, and I hope each of you take part in seeing it off the ground.

David Hume (1711–1776) The modern study of causation begins with the Scottish philosopher David Hume. Hume has introduced to philosophy three revolutionary ideas that, today, are taken for granted by almost everybody, not only philosophers.

Analytical vs. empirical claims Causal claims are empirical HUME’S LEGACY Analytical vs. empirical claims Causal claims are empirical All empirical claims originate from experience. He made a sharp distinction between analytical and empirical claims --- the former are product of thoughts, the latter matter of fact. 2. He classified causal claims as empirical, rather than analytical. 3. He identified the source of all empirical claims with human experience, namely sensory input. Putting (2) and (3) together have left philosophers baffled, for over two centuries, over two major riddles:

THE TWO RIDDLES OF CAUSATION What empirical evidence legitimizes a cause-effect connection? What inferences can be drawn from causal information? and how? What gives us in AI the audacity to hope that today, after 2 centuries of philosophical debate, we can say something useful on this topic, is the fact that, for us, the question of causation is not purely academic.

We must build machines that make sense of what goes on in their environment, so they can recover when things do not turn out exactly as expected.

The Art of Causal Mentoring “Easy, man! that hurts!” The Art of Causal Mentoring And we must build machines that understand causal talk, when we have the time to teach them what we know about the world. Because the way WE COMMUNICATE about the world is through this strange language called causation. This pressure to build machines that both learn about and reason with cause and effect, something that David Hume did not experience, now casts new light on the riddles of causation, colored with engineering flavor.

OLD RIDDLES IN NEW DRESS How should a robot acquire causal information from the environment? How should a robot process causal information received from its creator-programmer? I will not touch on the first riddle, because David Heckerman covered this topic on Tuesday evening, both eloquently and comprehensively. I want to discuss primarily the second problem: How we go from facts coupled with causal premises to conclusions that we could not obtain from either component alone. On the surface, the second problem sounds trivial, take in the causal rules, apply them to the facts, and derive the conclusions by standard logical deduction. But it is not as trivial as it sounds. The exercise of drawing the proper conclusions from causal inputs has met with traumatic experiences in AI.

CAUSATION AS A PROGRAMMER'S NIGHTMARE Input: “If the grass is wet, then it rained” “if we break this bottle, the grass will get wet” Output: “If we break this bottle, then it rained” One of my favorite example is the following: (Wet grass example on slide).

CAUSATION AS A PROGRAMMER'S NIGHTMARE (Cont.) ( Lin, 1995) Input: A suitcase will open iff both locks are open. The right lock is open Query: What if we open the left lock? Output: The right lock might get closed. Another troublesome example, which I first saw in Lin's paper of IJCAI-95 goes like that: (Suitcase Example on slide) In these two examples, the strange output is derived from solid logical principles, chaining in the first, constraint-satisfaction in the second, yet, we feel that there is a missing ingredient there which the computer did not quite grasp, and that it has to do with causality. Evidently there is some valuable information conveyed by causal vocabulary which is essential for correct understanding of the input. What is it that information? And what is that magic logic that should permit a computer to select the right information, and what is the semantics behind such logic ? It is this sort of questions that I would like to address in my talk this evening, because I know that many people in this community are dealing with such questions, and have made promising proposals for answering them. Most notably are people working in qualitative physics, troubleshooting, planning under uncertainty, modeling behavior of physical systems, constructing theories of action and change, and perhaps even those working in natural language understanding, because our language is loaded with causal expressions. Since 1990, I have examined many (though not all) of these proposals, together with others that have been suggested by philosophers and economists, and I have extracted from them a small set of basic principles which I would like to share with you tonight. I am now convinced, that the entire story of causality unfolds from just three basic principles:

BRAINLESS FIRST DISCOVERY: PHYSICS DESERVES A NEW ALGEBRA Scientific Equations (e.g., Hooke’s Law) are non-algebraic e.g., Length (Y) equals a constant (2) times the weight (X) Y := 2X Correct notation: Y = 2X X = 1 Y = 2 The solution X = 1 Process information Had X been 3, Y would be 6. If we raise X to 3, Y would be 6. Must “wipe out” X = 1.

MATHEMATICAL EXTRAPOLATION: THE WORLD AS A COLLECTION OF SPRINGS Definition: A structural causal model is a 4-tuple V,U, F, P(u), where V = {V1,...,Vn} are endogeneas variables U = {U1,...,Um} are background variables F = {f1,..., fn} are functions determining V, vi = fi(v, u) P(u) is a distribution over U P(u) and F induce a distribution P(v) over observable variables e.g.,

ORACLE FOR COUNTERFACTUALS FAMILIAR CAUSAL MODEL ORACLE FOR COUNTERFACTUALS X Y Z Here is a causal model we all remember from high-school -- a circuit diagram. There are 4 interesting points to notice in this example: (1) It qualifies as a causal model -- because it contains the information to confirm or refute all action, counterfactual and explanatory sentences concerned with the operation of the circuit. For example, anyone can figure out what the output would be like if we set Y to zero, or if we change this OR gate to a NOR gate or if we perform any of the billions combinations of such actions. (2) Logical functions (Boolean input-output relation) is insufficient for answering such queries (3)These actions were not specified in advance, they do not have special names and they do not show up in the diagram. In fact, the great majority of the action queries that this circuit can answer have never been considered by the designer of this circuit. (4) So how does the circuit encode this extra information? Through two encoding tricks: 4.1 The symbolic units correspond to stable physical mechanisms (i.e., the logical gates) 4.2 Each variable has precisely one mechanism that determines its value. INPUT OUTPUT

BRAINLESS SECOND DISCOVERY: COUNTERFACTUALS ARE EMBARRASINGLY SIMPLE Definition: The sentence: “Y would be y (in situation u), had X been x,” denoted Yx(u) = y, means: The solution for Y in a mutilated model Mx, (i.e., the equations for X replaced by X = x) with input U=u, is equal to y. The Fundamental Equation of Counterfactuals:

BRAINLESS SECOND DISCOVERY: COUNTERFACTUALS ARE EMBARRASINGLY SIMPLE Definition: The sentence: “Y would be y (in situation u), had X been x,” denoted Yx(u) = y, means: The solution for Y in a mutilated model Mx, (i.e., the equations for X replaced by X = x) with input U=u, is equal to y. Joint probabilities of counterfactuals: In particular:

THE STRUCTURAL MODEL PARADIGM Joint Distribution Data Generating Model Q(M) (Aspects of M) Data M Inference M – Invariant strategy (mechanism, recipe, law, protocol) by which Nature assigns values to variables in the analysis. “Think Nature, not experiment!”

THE PUZZLE OF COUNTERFACTUAL CONSENSUS Indicative: “If Oswald didn’t kill Kennedy, someone else did,” Subjunctive: “If Oswald hadn’t killed Kennedy, someone else would have.” (Adams 1975)

THE PUZZLING UBIQUITY OF COUNTERFACTUALS Hume’s Definition of “cause”: We may define a cause to be an object followed by another, and where all the objects, similar to the first, are followed by objects similar to the second, Or, in other words, where, if the first object had not been, the second never had existed (Hume 1748/1958, sec. VII). Lewis’s Definition of “cause”: “A has caused B” if “B would not have occurred if it were not for A (Lewis 1986). Why not define counterfactuals in terms of causes? (Pearl 2000)

STRUCTURAL AND SIMILARITY-BASED COUNTERFACTUALS w A B Lewis’s account (1973): The counterfactual “B if it were A” is true in a world w just in case B is true in all the closest A-worlds to w. Structural account (1995): “Y would be y if X were x” is true in situation u just in case

S1: “IF OSWALD DIDN’T KILL KENNEDY, SOMEONE ELSE DID” P (SE) true P (SE) = P (SE) OS true false P (SE) = 1 Prior knowledge OS M SE K true Oswald killed Kennedy K M SE OS M OS Realizing Oswald did not kill Kennedy K M SE OS true false M OS = true

S2: “IF OSWALD HADN’T KILLED KENNEDY, SOMEONE ELSE WOULD HAVE?” P (SE) K M OS SE true M OS = true P (SE) = P (SE) M SE M OS K SE OS M true After learning Oswald killed Kennedy M OS SE OS K Oswald refraining from killing Prior knowledge

S2: “IF OSWALD HADN’T KILLED KENNEDY, SOMEONE ELSE WOULD HAVE?” P (SE) K SE OS true M = true P (SE) = P (SE) false P (SE) = P (SE) true M SE M OS M SE M OS OS SE OS SE K K After learning Oswald killed Kennedy Oswald refraining from killing Prior knowledge

BRAINLESS THIRD DISCOVERY: HIGH SCHOOL COUNTERFACTUALS CAN BE USEFUL Solidify and unify (all?) other approaches to causation (e.g., PO, SEM, DA, Prob., SC) Demystify enigmatic notions and weed out myths and misconceptions (e.g., ignorability, exogeneity, exchangeability, confounding, mediation, attribution, regret) Algorithmitize causal inference tasks (e.g., covariate-selection, identification, c-equivalence, effect-restoration, experimental integration, sufficiency) Resolve lingering puzzles

CONCLUSIONS If Oswald had not used counterfactuals, brainless would have. Much of modern thinking is owed to brainless robots. I compute, therefore I think.