Classiﬁcation Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.

Classiﬁcation Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology

Data Mining Resources on the Web 1. A comprehensive site for many resources of KDD http://www.kdnuggets.com/ 2. tutorial type articles on currently hot topics http://www.sigkdd.org/ 3. The KDD Cup(1997~2010) http://www.sigkdd.org/kddcup/index.php 4, UCI Dataset http://archive.ics.uci.edu/ml/ 5. Conferences, Journals, and Organizations SIGKDD,ICDM,SIGMOD,SDM,PAKDD IEEE Transactions on Knowledge and Data Engineering Data Mining Group

Tools Clementine Clementine is a platform of data mining developed by ISL (Integral Solutions Limited) company. SPSS company integrated and developed Clementine after purchasing the ISL company in 1999. Now Clementine has become another highlight of SPSS company. Merger and acquisition of IBM and SPSS happened in 2010 It is a data mining and text analytics workbench used to build predictive models. It has a visual interface which allows users to leverage statistical and data mining algorithms without programming. data miningtext analytics predictive models

Tools Clementine

Workflow1

Dataset1 1.Led7 1.attribute#1, attribute#2, ….. attribute#7, label 2.3200 instance 3.All attribute values are either 0 or 1 4.Whether the corresponding light is on or not for the decimal digit

Load the file

Operations

Partitions

View the model

Model analysis

View model

Dataset2 Listing of attributes: label: >50K, <=50K. Age, workclass, fnlwgt, education, education-num, marital-status, occupation, relationship, race, sex, capital-gain, capital-loss, hours-per-week, native-country

Setting

Partitions

C5.0 Analysis

CHAID Analysis

Data cleaning

Partition Flow

C5.0 and CHAID

Programming – Use C4.5 or Bayes classifier – Dataset

Programming Compare your result with the tool.

Classiﬁcation Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology

Classiﬁcation Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.

Similar presentations

Presentation on theme: "Classiﬁcation Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Classiﬁcation Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.

Similar presentations

Presentation on theme: "Classiﬁcation Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology."— Presentation transcript:

Similar presentations

About project

Feedback