1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University.

Slides:



Advertisements
Similar presentations
Design of Experiments Lecture I
Advertisements

A Tutorial on Learning with Bayesian Networks
Bayesian Networks CSE 473. © Daniel S. Weld 2 Last Time Basic notions Atomic events Probabilities Joint distribution Inference by enumeration Independence.
Bridgette Parsons Megan Tarter Eva Millan, Tomasz Loboda, Jose Luis Perez-de-la-Cruz Bayesian Networks for Student Model Engineering.
1 Some Comments on Sebastiani et al Nature Genetics 37(4)2005.
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
PROBABILITY. Uncertainty  Let action A t = leave for airport t minutes before flight from Logan Airport  Will A t get me there on time ? Problems :
1 Knowledge Engineering for Bayesian Networks. 2 Probability theory for representing uncertainty l Assigns a numerical degree of belief between 0 and.
Introduction of Probabilistic Reasoning and Bayesian Networks
Artificial Intelligence Chapter 19 Reasoning with Uncertain Information Biointelligence Lab School of Computer Sci. & Eng. Seoul National University.
Knowledge Engineering for Bayesian Networks
CPSC 322, Lecture 26Slide 1 Reasoning Under Uncertainty: Belief Networks Computer Science cpsc322, Lecture 27 (Textbook Chpt 6.3) March, 16, 2009.
Data Mining Cardiovascular Bayesian Networks Charles Twardy †, Ann Nicholson †, Kevin Korb †, John McNeil ‡ (Danny Liew ‡, Sophie Rogers ‡, Lucas Hope.
1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University.
1 Bayesian Networks and Causal Modelling Ann Nicholson School of Computer Science and Software Engineering Monash University.
1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University.
Data Mining Cardiovascular Bayesian Networks Charles Twardy †, Ann Nicholson †, Kevin Korb †, John McNeil ‡ (Danny Liew ‡, Sophie Rogers ‡, Lucas Hope.
1 Using Bayesian networks for Water Quality Prediction in Sydney Harbour Ann Nicholson Shannon Watson, Honours 2003 Charles Twardy, Research Fellow School.
M.I. Jaime Alfonso Reyes ´Cortés.  The basic task for any probabilistic inference system is to compute the posterior probability distribution for a set.
Parameterising Bayesian Networks: A Case Study in Ecological Risk Assessment Carmel A. Pollino Water Studies Centre Monash University Owen Woodberry, Ann.
Knowledge Engineering a Bayesian Network for an Ecological Risk Assessment (KEBN-ERA) Owen Woodberry Supervisors: Ann Nicholson Kevin Korb Carmel Pollino.
Part 2 of 3: Bayesian Network and Dynamic Bayesian Network.
UNIVERSITY OF SOUTH CAROLINA Department of Computer Science and Engineering CSCE 580 Artificial Intelligence Ch.6 [P]: Reasoning Under Uncertainty Sections.
1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University.
Representing Uncertainty CSE 473. © Daniel S. Weld 2 Many Techniques Developed Fuzzy Logic Certainty Factors Non-monotonic logic Probability Only one.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 7: Expert Systems and Artificial Intelligence Decision Support.
Ai in game programming it university of copenhagen Welcome to... the Crash Course Probability Theory Marco Loog.
5/25/2005EE562 EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS Lecture 16, 6/1/2005 University of Washington, Department of Electrical Engineering Spring 2005.
Knowledge Engineering a Bayesian Network for an Ecological Risk Assessment (KEBN-ERA) Owen Woodberry Supervisors: Ann Nicholson Kevin Korb Carmel Pollino.
CPSC 422, Lecture 14Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 14 Feb, 4, 2015 Slide credit: some slides adapted from Stuart.
1 Bayesian Networks Chapter ; 14.4 CS 63 Adapted from slides by Tim Finin and Marie desJardins. Some material borrowed from Lise Getoor.
CPSC 322, Lecture 24Slide 1 Reasoning under Uncertainty: Intro to Probability Computer Science cpsc322, Lecture 24 (Textbook Chpt 6.1, 6.1.1) March, 15,
Intelligent Tutoring Systems Traditional CAI Fully specified presentation text Canned questions and associated answers Lack the ability to adapt to students.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
The Bayesian Web Adding Reasoning with Uncertainty to the Semantic Web
Reasoning with Bayesian Networks. Overview Bayesian Belief Networks (BBNs) can reason with networks of propositions and associated probabilities Useful.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 26 of 41 Friday, 22 October.
Aprendizagem Computacional Gladys Castillo, UA Bayesian Networks Classifiers Gladys Castillo University of Aveiro.
Bayesian Networks for Data Mining David Heckerman Microsoft Research (Data Mining and Knowledge Discovery 1, (1997))
Bayesian Statistics and Belief Networks. Overview Book: Ch 13,14 Refresher on Probability Bayesian classifiers Belief Networks / Bayesian Networks.
Introduction to Bayesian Networks
An Introduction to Artificial Intelligence Chapter 13 & : Uncertainty & Bayesian Networks Ramin Halavati
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 28 of 41 Friday, 22 October.
Learning With Bayesian Networks Markus Kalisch ETH Zürich.
Cognitive Computer Vision Kingsley Sage and Hilary Buxton Prepared under ECVision Specific Action 8-3
Review: Bayesian inference  A general scenario:  Query variables: X  Evidence (observed) variables and their values: E = e  Unobserved variables: Y.
CH751 퍼지시스템 특강 Uncertainties in Intelligent Systems 2004 년도 제 1 학기.
Software Engineering1  Verification: The software should conform to its specification  Validation: The software should do what the user really requires.
Clinical Decision Support 1 Historical Perspectives.
1 CMSC 671 Fall 2001 Class #20 – Thursday, November 8.
1 Probability FOL fails for a domain due to: –Laziness: too much to list the complete set of rules, too hard to use the enormous rules that result –Theoretical.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 24 of 41 Monday, 18 October.
Introduction on Graphic Models
CSE 473 Uncertainty. © UW CSE AI Faculty 2 Many Techniques Developed Fuzzy Logic Certainty Factors Non-monotonic logic Probability Only one has stood.
CPSC 322, Lecture 26Slide 1 Reasoning Under Uncertainty: Belief Networks Computer Science cpsc322, Lecture 27 (Textbook Chpt 6.3) Nov, 13, 2013.
Dependency Networks for Inference, Collaborative filtering, and Data Visualization Heckerman et al. Microsoft Research J. of Machine Learning Research.
Chapter 12. Probability Reasoning Fall 2013 Comp3710 Artificial Intelligence Computing Science Thompson Rivers University.
Artificial Intelligence Chapter 19 Reasoning with Uncertain Information Biointelligence Lab School of Computer Sci. & Eng. Seoul National University.
Knowledge Engineering for Bayesian Networks
RESEARCH APPROACH.
Artificial Intelligence
Artificial Intelligence Chapter 19
CAP 5636 – Advanced Artificial Intelligence
A Short Tutorial on Causal Network Modeling and Discovery
CAP 5636 – Advanced Artificial Intelligence
CS 188: Artificial Intelligence
CS 188: Artificial Intelligence Fall 2007
CS 188: Artificial Intelligence
Biointelligence Lab School of Computer Sci. & Eng.
Class #16 – Tuesday, October 26
Presentation transcript:

1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University

2 Overview l Representing uncertainty l Introduction to Bayesian Networks »Syntax, semantics, examples l The knowledge engineering process l Case Study: Intelligent Tutoring l Summary of other BN research l Open research questions

3 Sources of Uncertainty l Ignorance l Inexact observations l Non-determinism l AI representations »Probability theory »Dempster-Shafer »Fuzzy logic

4 Probability theory for representing uncertainty l Assigns a numerical degree of belief between 0 and 1 to facts »e.g. “it will rain today” is T/F. »P(“it will rain today”) = 0.2 prior probability (unconditional) l Posterior probability (conditional) »P(“it wil rain today” | “rain is forecast”) = 0.8 l Bayes’ Rule: P(H|E) = P(E|H) x P(H) P(E)

5 Bayesian networks l Directed acyclic graphs l Nodes: random variables, »R: “it is raining”, discrete values T/F »T: temperature, cts or discrete variable »C: colour, discrete values {red,blue,green} l Arcs indicate dependencies (can have causal interpretation)

6 Bayesian networks l Conditional Probability Distribution (CPD) –Associated with each variable –probability of each state given parent states “Jane has the flu” “Jane has a high temp” “Thermometer temp reading” X Flu Y Te Q Th Models causal relationship Models possible sensor error P(Flu=T) = 0.05 P(Te=High|Flu=T) = 0.4 P(Te=High|Flu=F) = 0.01 P(Th=High|Te=H) = 0.95 P(Th=High|Te=L) = 0.1

7 BN inference l Evidence: observation of specific state l Task: compute the posterior probabilities for query node(s) given evidence. Th Y Flu Te Diagnostic inference Th Flu Y Te Causal inference Intercausal inference Te FluTB Flu Mixed inference Th Flu Te

8 BN software l Commerical packages: Netica, Hugin, Analytica (all with demo versions) l Free software: Smile, Genie, JavaBayes, … html l Example running Netica software

9 Decision networks l Extension to basic BN for decision making »Decision nodes »Utility nodes l EU(Action) =  p(o|Action,E) U(o) o »choose action with highest expect utility l Example

10 Elicitation from experts l Variables »important variables? values/states? l Structure »causal relationships? »dependencies/independencies? l Parameters (probabilities) »quantify relationships and interactions? l Preferences (utilities)

11 Expert Elicitation Process l These stages are done iteratively l Stops when further expert input is no longer cost effective l Process is difficult and time consuming. l Current BN tools »inference engine »GUI l Next generation of BN tools? BN EXPERT BN TOOLS Domain EXPERT

12 Knowledge discovery l There is much interest in automated methods for learning BNS from data »parameters, structure (causal discovery) l Computationally complex problem, so current methods have practical limitations »e.g. limit number of states, require variable ordering constraints, do not specify all arc directions l Evaluation methods

13 The knowledge engineering process 1. Building the BN »variables, structure, parameters, preferences »combination of expert elicitation and knowledge discovery 2. Validation/Evaluation »case-based, sensitivity analysis, accuracy testing 3. Field Testing »alpha/beta testing, acceptance testing 4. Industrial Use »collection of statistics 5. Refinement »Updating procedures, regression testing

14 Case Study: Intelligent tutoring l Tutoring domain: primary and secondary school students’ misconceptions about decimals l Based on Decimal Comparison Test (DCT) »student asked to choose the larger of pairs of decimals »different types of pairs reveal different misconceptions l ITS System involves computer games involving decimals l This research also looks at a combination of expert elicitation and automated methods ( UAI2001 )

15 Expert classification of Decimal Comparison Test (DCT) results “apparent expert” “longer is larger” “shorter is larger” H = high (all correct or only one wrong) L = low (all wrong or only one correct)

16 The ITS architecture Adaptive Bayesian Network Decimal comparison test (optional) Inputs Computer Games Generic BN model of student Information about student e.g. age (optional) Hidden number Flying photographer Decimaliens …. Number between Student Item Answer Item Answer Classroom diagnostic test results (optional) Classroom Teaching Activities Report on student Answer Item type New game  Diagnose misconception  Predict outcomes  Identify most useful information Sequencing tactics  Select next item type  Decide to present help  Decide change to new game  Identify when expertise gained Teacher System Controller Module Answers Help Feedback Help

17 Expert Elicitation l Variables »two classification nodes: fine and coarse (mut. ex.) »item types: (i) H/M/L (ii) 0-N l Structure »arcs from classification to item type »item types independent given classification l Parameters »careless mistake (3 different values) »expert ignorance: - in table (uniform distribution)

18 Expert Elicited BN

19 Evaluation process l Case-based evaluation »experts checked individual cases »sometimes, if prior was low, ‘true’ classification did not have highest posterior (but usually had biggest change in ratio) l Adaptiveness evaluation »priors changes after each set of evidence l Comparison evaluation »Differences in classification between BN and expert rule »Differences in predictions between different BNs

20 Comparison evaluation l Development of measure: same classification, desirable and undesirable re-classification l Use item type predictions l Investigation of effect of item type granularity and probability of careless mistake

21 Comparison: expert BN vs rule UndesirableDesirableSame

22 Results Undes. Desir. Same varying prob. of careless mistake varying granularity of item type: 0-N and H/M/L

23 Investigation by Automated methods l Classification (using SNOB program, based on MML) l Parameters l Structure (using CaMML)

24 Results

25 Another Case Study: Seabreeze prediction l 2000 Honours project, joint with Bureau of Meteorology ( with Russell Kennett and Kevin Korb, PAKDD’2001 paper, TR) l BN network built based on existing simple expert rule l Several years data available for Sydney seabreezes l CaMML ( Wallace and Korb, 1999 ) and Tetrad-II ( Spirtes et al. 1993) programs used to learn BNs from data l Comparative analysis showed automated methods gave improved predictions.

26 Other BN-related projects l DBNS for discrete monitoring (PhD, 1992) l Approximate BN inference algorithms based on a mutual information measure for relevance ( with Nathalie Jitnah, ICONIP97, ECSQARU97, PRICAI98,AI99) l Plan recognition: DBNs for predicting users actions and goals in an adventure game ( with David Albrecht, Ingrid Zukerman, UM97,UMUAI1999,PRICAI2000 ) l Bayesian Poker ( with Kevin Korb, UAI’99, honours students )

27 Other BN-related projects (cont.) l DBNs for ambulation monitoring and fall diagnosis ( with biomedical engineering, PRICAI’96 ) l Autonomous aircraft monitoring and replanning ( with Ph.D. student Tim Wilkin, PRICAI2000 ) l Ecological risk assessment ( 2003 honours project with Water Studies Centre) l Writing a textbook! ( with Kevin Korb ) Bayesian Artificial Intelligence

28 Open Research Questions l Methodology for combining expert elicitation and automated methods »expert knowledge used to guide search »automated methods provide alternatives to be presented to experts l Evaluation measures and methods »may be domain depended l Improved tools to support elicitation »e.g. visualisation of d-separation l Industry adoption of BN technology