Presentation is loading. Please wait.

Presentation is loading. Please wait.

Machine Learning with Weka Cornelia Caragea Thanks to Eibe Frank for some of the slides.

Similar presentations


Presentation on theme: "Machine Learning with Weka Cornelia Caragea Thanks to Eibe Frank for some of the slides."— Presentation transcript:

1 Machine Learning with Weka Cornelia Caragea Thanks to Eibe Frank for some of the slides

2 Outline Weka: A Machine Learning Toolkit Preparing Data Building Classifiers Interpreting Results

3 WEKA: the software Machine learning/data mining software written in Java (distributed under the GNU Public License) Used for research, education, and applications Main features: Comprehensive set of data pre-processing tools, learning algorithms and evaluation methods Graphical user interfaces (incl. data visualization) Environment for comparing learning algorithms

4 WEKA: versions There are several versions of WEKA: WEKA: “book version” compatible with description in data mining book WEKA: “GUI version” adds graphical user interfaces (book version is command-line only) WEKA: “development version” with lots of improvements

5 WEKA: resources API Documentation, Tutorials, Source code. API DocumentationTutorials WEKA mailing list Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations Weka-related Projects: Weka-Parallel - parallel processing for Weka Weka-Parallel RWeka - linking R and Weka RWeka YALE - Yet Another Learning Environment YALE Many others…

6 Weka: web site http://www.cs.waikato.ac.nz/ml/weka/

7 WEKA: launching java -jar weka.jar

8 Outline Weka: A Machine Learning Toolkit Preparing Data Building Classifiers Interpreting Results

9 @relation heart-disease-simplified @attribute age numeric @attribute sex { female, male} @attribute chest_pain_type { typ_angina, asympt, non_anginal, atyp_angina} @attribute cholesterol numeric @attribute exercise_induced_angina { no, yes} @attribute class { present, not_present} @data 63,male,typ_angina,233,no,not_present 67,male,asympt,286,yes,present 67,male,asympt,229,yes,present 38,female,non_anginal,?,no,not_present... WEKA only deals with “flat” files

10 @relation heart-disease-simplified @attribute age numeric @attribute sex { female, male} @attribute chest_pain_type { typ_angina, asympt, non_anginal, atyp_angina} @attribute cholesterol numeric @attribute exercise_induced_angina { no, yes} @attribute class { present, not_present} @data 63,male,typ_angina,233,no,not_present 67,male,asympt,286,yes,present 67,male,asympt,229,yes,present 38,female,non_anginal,?,no,not_present... WEKA only deals with “flat” files

11 Explorer: pre-processing the data Data can be imported from a file in various formats: ARFF, CSV, C4.5, binary Data can also be read from a URL or from an SQL database (using JDBC) Pre-processing tools in WEKA are called “filters” WEKA contains filters for: Discretization, normalization, resampling, attribute selection, transforming and combining attributes, …

12 Import Datasets into WEKA

13

14

15

16

17

18

19

20

21

22

23 Outline Weka: A Machine Learning Toolkit Preparing Data Building Classifiers Interpreting Results

24 Explorer: building “classifiers” Classifiers in WEKA are models for predicting nominal or numeric quantities Implemented learning schemes include: Decision trees and lists, instance-based classifiers, support vector machines, multi-layer perceptrons, logistic regression, Bayes’ nets, … “Meta”-classifiers include: Bagging, boosting, stacking, error-correcting output codes, locally weighted learning, …

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41 Outline Machine Learning Software Preparing Data Building Classifiers Interpreting Results

42 Understanding Output

43 Decision Tree Output (1)

44 Decision Tree Output (2)

45 To Do List Try Decision Tree, Naïve Bayes, and Logistic Regression classifiers on a different Weka dataset Use various parameters


Download ppt "Machine Learning with Weka Cornelia Caragea Thanks to Eibe Frank for some of the slides."

Similar presentations


Ads by Google