Presentation is loading. Please wait.

Presentation is loading. Please wait.

Software Engineering Laboratory1 Introduction of Bayesian Network 4 / 20 / 2005 CSE634 Data Mining Prof. Anita Wasilewska 105269827 Hiroo Kusaba.

Similar presentations


Presentation on theme: "Software Engineering Laboratory1 Introduction of Bayesian Network 4 / 20 / 2005 CSE634 Data Mining Prof. Anita Wasilewska 105269827 Hiroo Kusaba."— Presentation transcript:

1 Software Engineering Laboratory1 Introduction of Bayesian Network 4 / 20 / 2005 CSE634 Data Mining Prof. Anita Wasilewska 105269827 Hiroo Kusaba

2 Software Engineering Laboratory2 References [1] D. Heckerman: “ A Tutorial on Learning with Bayesian Networks ”, In “ Learning in Graphical Models ”, ed. M.I. Jordan, The MIT Press, 1998. [2] http://www.cs.huji.ac.il/~nir/Nips01-Tutorial/http://www.cs.huji.ac.il/~nir/Nips01-Tutorial/ [3]Jiawei Han: ” Data Mining Concepts and Techniques ”,ISBN 1-53860-489-8 [4] Whittaker, J.: Graphical Models in Applied Multivariate Statistics, John Wiley and Sons (1990)

3 Software Engineering Laboratory3 Contents Brief introduction Review  A little review of probability  Bayes theorem Bayesian Classification Steps of using Bayesian Network

4 Software Engineering Laboratory4 Random variables X, Y, Xi, Θ Capitals Condition (or value) of a variable x, y, xi, θ small Set of a variable X, Y, Xi, Θ in Capital bold Set of a condition (or value) x, y, xi, θ small bold P(x/a) : Probability that an event x occurs (or happens) under the condition of a

5 Software Engineering Laboratory5 What is Bayesian Network ? Network which express the dependencies among the random variables Each node has posterior probability which depends on the previous random variable The whole network also express the joint probability distribution from all of the random variables Pa is parent(s) of a node i

6 Software Engineering Laboratory6 How is it used ? Bayesian Learning  Estimating dependencies between the random variables from the actual data Bayesian Inference  When some of the random variables are defined it calculate the other probabilities  Patiants condition as a random variable, from the condition it predicts the desease

7 Software Engineering Laboratory7 What is so good about it? Conditional independencies and graphical expression capture structure of many real-world distributions. [1] Learned model can be used for many tasks Supports all the features of probabilistic learning  Model selection criteria  Dealing with missing data and hidden variables

8 Software Engineering Laboratory8 Example of Bayesian Network  Structure of a network  Conditional Probability  X,Y,Z are random variables which takes either 0 or 1  p(X), p(Y|X), p(Z|Y) XYZ XYP(Y|X) 000.1 010.9 100.2 110.8 YZP(Z|Y) 000.3 010.7 100.4 110.6 XP(X) 00.5 1

9 Software Engineering Laboratory9 Example of Bayesian Network 2 What is the Joint probability of P(X, Y, Z)?  P(X, Y, Z) = P(X)*P(Y|X)*P(Z|Y) XYZP(X,Y,Z) 0000.015 0010.035 0100.180 0110.270 XYZP(X,Y,Z) 1000.030 1010.070 1100.160 1110.240

10 Software Engineering Laboratory10 A little Review of probability 1 Probability : How likely is it that an event will happen? Sample Space S  Element of S: elementary event  An event A is a subset of S P(A) ≧ 0 P(S) = 1

11 Software Engineering Laboratory11 A little review of probability 2 Discrete probability distribution  P(A) = Σ s∈ A P (s) Conditional probability distribution  P(A|B) = P(A, B) / P(B) If the events are independent  P(A, B) = P(A)*P(B) Bayes Theorem B A

12 Software Engineering Laboratory12 Bayes Theorem

13 Software Engineering Laboratory13 Example of Bayes Theorem You are about to be tested for a rare desease. How worried should you be if the test result is positive ? Accuracy of the Test is P(T) = 85% Chance of Infection P(I) = 0.01% What is P(I / not T) http://www.gametheory.net/Mike/applets/Bayes/ Bayes.html

14 Software Engineering Laboratory14 Bayesian Classification Suppose that there are m classes, Given an unknown data sample, x the Bayesian classifier assigns an unknown sample x to the class c if and only if

15 Software Engineering Laboratory15 We have to maximize In order to reduce computation class conditional independence is made

16 Software Engineering Laboratory16 Example of Bayesian Classification in the text book[3] Customer under 30 and income is “ medium ” and student and credit rating is “ fair ”, which category does the customer belongs? Buy or not.

17 Software Engineering Laboratory17 Bayesian Network Network which express the dependencies among the random variables The whole network also express the joint probability distribution from all of the random variables Pa is parent(s) of a node i XYZ Pa i are a subset

18 Software Engineering Laboratory18 Steps to apply Bayesian Network Step1 Create a Bayesian Belief Network  Include all the variables that are important in your system  Use causal knowledge to guide the connections made in the graph  Use your prior knowledge to specify the conditional distributions Step2 Calculate the p(x i |pa i ) for your goal

19 Software Engineering Laboratory19 Example from [1] Example to make a BN from the prior knowledge BN to find a credit card fraud  Define random variables  Fraud(F):Probability that owner is a fraud  Gas(G):Bought a gas in 24 hours  Jewelry(J):Bought a jewelry in 24 hours  Age(A):Age of owner of the card  Sex(S):Gender of the owner of the card

20 Software Engineering Laboratory20 Give orders to random variables Define dependencies, but you have to be careful. F GJ SA F G JS A

21 Software Engineering Laboratory21 Next topic Training with Bayesian Network  Bayes Inference  If the training data is complete  If the training data is missing  Network Evaluation

22 Software Engineering Laboratory22 Thank you for listening.


Download ppt "Software Engineering Laboratory1 Introduction of Bayesian Network 4 / 20 / 2005 CSE634 Data Mining Prof. Anita Wasilewska 105269827 Hiroo Kusaba."

Similar presentations


Ads by Google