1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent.

Slides:



Advertisements
Similar presentations
The Robert Gordon University School of Engineering Dr. Mohamed Amish
Advertisements

A SOFTWARE TOOL DEVELOPED FOR THE CLASSIFICATION OF REMOTE SENSING SPECTRAL REFLECTANCE DATA Abdullah Faruque School of Computing & Software Engineering.
1.Data categorization 2.Information 3.Knowledge 4.Wisdom 5.Social understanding Which of the following requires a firm to expend resources to organize.
Assessing Evolutionary Algorithms to Data Mining Problems Jesús Alcalá 1II Jornadas de software libreGranada, October 22nd, 2010.
Rutgers Components Phase 2 Principal investigators –Paul Kantor, PI; Design, modelling and analysis –Kwong Bor Ng, Co-PI - Fusion; Experimental design.
Department of Computer Science, University of Waikato, New Zealand Eibe Frank WEKA: A Machine Learning Toolkit The Explorer Classification and Regression.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
DATA WAREHOUSING.
Clementine Server Clementine Server A data mining software for business solution.
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Business Driven Technology Unit 2
Building Knowledge-Driven DSS and Mining Data
Presenter: Teng-Chih Yang Professor: Ming-Puu Chen Date: 10/ 28/ 2009 Data mining in course management systems: Moodle case study and tutorial Romero,
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
Introduction to machine learning
Paper on Best implemented scientific concept for E-Governance Virtual Machine By Nitin V. Choudhari, DIO,NIC,Akola By Nitin V. Choudhari, DIO,NIC,Akola.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
Chapter 5 Using SAS ® ETL Studio. Section 5.1 SAS ETL Studio Overview.
Data Mining Techniques
 The Weka The Weka is an well known bird of New Zealand..  W(aikato) E(nvironment) for K(nowlegde) A(nalysis)  Developed by the University of Waikato.
What is R By: Wase Siddiqui. Introduction R is a programming language which is used for statistical computing and graphics. “R is a language and environment.
Repository Method to suit different investment strategies Alma Lilia Garcia & Edward Tsang.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Understanding Data Analytics and Data Mining Introduction.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
I. Pribela, M. Ivanović Neum, Content Automated assessment Testovid system Test generator Module generators Conclusion.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
An intro to programming. The purpose of writing a program is to solve a problem or take advantage of an opportunity Consists of multiple steps:  Understanding.
Data Profiling
Testovid - an environment for testing almost any aspect of student assignments I. Pribela, S. Tošić, M. Ivanović, Z. Budimac Risan, September 2007.
Presented by Tienwei Tsai July, 2005
Appendix: The WEKA Data Mining Software
Introduction to MDA (Model Driven Architecture) CYT.
Programming language A programming language is an artificial language designed to communicate instructions to a machine,languageinstructionsmachine particularly.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Weka: a useful tool in data mining and machine learning Team 5 Noha Elsherbiny, Huijun Xiong, and Bhanu Peddi.
Weka: Experimenter and Knowledge Flow interfaces Neil Mac Parthaláin
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
For ITCS 6265/8265 Fall 2009 TA: Fei Xu UNC Charlotte.
Artificial Neural Network Building Using WEKA Software
FEMA-MES SYSTEM. VERSION 2.0 ON LIVE LINUX PLATFORM Wacław PRZYBYŁO Jarosław KALINOWSKI.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Advanced Analytics on Hadoop Spring 2014 WPI, Mohamed Eltabakh 1.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
Near Real-Time Verification At The Forecast Systems Laboratory: An Operational Perspective Michael P. Kay (CIRES/FSL/NOAA) Jennifer L. Mahoney (FSL/NOAA)
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
Objectives: Terminology Components The Design Cycle Resources: DHS Slides – Chapter 1 Glossary Java Applet URL:.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.
Application of Data Mining Techniques on Survey Data using R and Weka
Books Visualizing Data by Ben Fry Data Structures and Problem Solving Using C++, 2 nd edition by Mark Allen Weiss MATLAB for Engineers, 3 rd edition by.
Level 1-2 Trigger Data Base development Current status and overview Myron Campbell, Alexei Varganov, Stephen Miller University of Michigan August 17, 2000.
In part from: Yizhou Sun 2008 An Introduction to WEKA Explorer.
1 SBM411 資料探勘 陳春賢. 2 Lecture I Class Introduction.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
IBM - CVUT Student Research Projects Plugin and script generator for WebSphere Jakub Řezníček Tomáš Turek
WEKA: A Practical Machine Learning Tool WEKA : A Practical Machine Learning Tool.
Introduction to Machine Learning, its potential usage in network area,
SNS COLLEGE OF TECHNOLOGY
LAND COVER CLASSIFICATION WITH THE IMPACT TOOL
INTRODUCING Adams/CHASSIS
Waikato Environment for Knowledge Analysis
Research Areas Christoph F. Eick
Machine Learning with Weka
Assoc. Prof. Dr. Syed Abdul-Rahman Al-Haddad
Lecture 10 – Introduction to Weka
MAPO: Mining and Recommending API Usage Patterns
Data Mining CSCI 307, Spring 2019 Lecture 7
Presentation transcript:

1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent Systems AYRNA GRSI

2 1. INTRODUCTION 2. KEEL 3. EXPERIMENTAL EXAMPLE 4. CONCLUSIONS AND FURTHER WORK KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems

3 Introduction Evolutionary Algorithms (EAs) requires a certain programming expertise along with considerable time and effort to write a computer program for implementing algorithms that often are sophisticated.

4 Introduction In the last few years, many software tools have been developed to reduce this task. We develop a non-commercial Java software tool named KEEL (Knowledge Extraction based on Evolutionary Learning).

5 Introduction This tool can offer several advantages: It includes a big library with EAs algorithms based on different paradigms (Pittsburgh, Michigan, IRL and GCCL) and simplifies their integration with different pre-processing techniques. It extends the range of possible users applying EAs. This can be used on any machine with Java.

6 1. INTRODUCTION 2. KEEL 3. EXPERIMENTAL EXAMPLE 4. CONCLUSIONS AND FURTHER WORK KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems

7 KEEL is a software tool to assess EAs for DM problems including regression, classification, clustering, pattern mining and so on. KEEL allows us to perform a complete analysis of any learning model in comparison to existing ones, including a statistical test module for comparison. Moreover, KEEL has been designed with a double goal: research and educational. KEEL : Functionality

8 EAs are presented in predicting models, pre-processing and postprocessing. It includes data pre-processing algorithms proposed in specialized literature: data transformation, discretization, instance selection and feature selection. It contains a statistical library for analyzing results Some algorithms have been developed by using Java Class Library for Evolutionary Computation (JCLEC). It provides a user-friendly graphical interface in which experimentations containing multiple data sets and algorithms connected among themselves can be easily performed. KEEL also allows creating experiments in on-line mode, aiming an educational support in order to learn the operation of the algorithm included. KEEL : Main features

9 It is integrated by three main blocks: Data Management. Design of Experiments (off-line module). Educational Experiments (on-line module). KEEL : Blocks

10 This part is made up of a set of tools that can be used to build new data to export and import data in other formats data edition and visualization to apply transformations and partitioning to data. etc. KEEL : Data Management

11 It is a Graphical User Interface that allows the design of experiments for solving different machine learning problems. Once the experiment is designed, it generates the directory structure and files required for running them in any local machine with Java. KEEL : Design of experiments Graphic Design Directory Structure and xml-based scripts Execute in a remote machine

12 The experiments are graphically modeled. They represent a multiple connection among data, algorithms and analysis/visualization modules. Aspects such as type of learning, validation, number of runs and algorithm’s parameters can be easily configured. Once the experiment is created, KEEL generates a script-based program which can be run in any machine with JAVA Virtual Machine installed in it. KEEL : Design of experiments

13 Similar structure to the design of experiments This allows for the design of experiments that can be run step- by-step in order to display the learning process of a certain model by using the software tool for educational purposes. Results and analysis are shown in on-line mode. KEEL : Educational Module

14 1. INTRODUCTION 2. KEEL 3. EXPERIMENTAL EXAMPLE 4. CONCLUSIONS AND FURTHER WORK KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems

15 Type of learning: Classification Methods considered: SLAVE algorithm (Clas-Fuzzy-Slave) and Chi et al. algorithm with rule weights (Clas-Fuzzy-Chi-RW). Type of validation: 10-folder cross-validation model. SLAVE has been run 5 times per data partition (a total of 50 runs). Statistical Analysis: Wilcoxon test (Stat-Clas-Wilcoxon) Experimental example

16 Experimental example 12 problems for classification:

17 Experimental example Average Results: (Vis-Clas-Tabular) Statistical Results: (Stat-Clas-Wilcoxon)

18 1. INTRODUCTION 2. KEEL 3. EXPERIMENTAL EXAMPLE 4. CONCLUSIONS AND FURTHER WORK KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems

19  KEEL relieves researchers of much technical work and allows them to focus on the analysis of their new models in comparison with the existing ones  The tool enables researchers with a basic knowledge of evolutionary computation to apply EAs to their work. Concluding Remarks

20  A new set of EAs and a test tool that will allow us to apply parametric and non-parametric tests on any set of data  Data visualization tools for the on-line and offline modules.  A data set repository that includes the data set partitions and algorithm results on these data sets, the KEEL-dataset Future work

21 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent Systems AYRNA GRSI