Chaotic Mining: Knowledge Discovery Using the Fractal Dimension Daniel Barbara George Mason University Information and Software Engineering Department.

Slides:



Advertisements
Similar presentations
Mining Frequent Patterns Using FP-Growth Method Ivan Tanasić Department of Computer Engineering and Computer Science, School of Electrical.
Advertisements

A distributed method for mining association rules
Data Mining Techniques Association Rule
Rule Discovery from Time Series Presented by: Murali K. Kadimi.
10 -1 Lecture 10 Association Rules Mining Topics –Basics –Mining Frequent Patterns –Mining Frequent Sequential Patterns –Applications.
Graduate : Sheng-Hsuan Wang
C van Ingen, D Agarwal, M Goode, J Gupchup, J Hunt, R Leonardson, M Rodriguez, N Li Berkeley Water Center John Hopkins University Lawrence Berkeley Laboratory.
Data Mining Techniques Cluster Analysis Induction Neural Networks OLAP Data Visualization.
Multi-dimensional Sequential Pattern Mining
Mining Sequential Patterns Dimitrios Gunopulos, UCR.
Chapter 16 Parallel Data Mining 16.1From DB to DW to DM 16.2Data Mining: A Brief Overview 16.3Parallel Association Rules 16.4Parallel Sequential Patterns.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
CS 590M Fall 2001: Security Issues in Data Mining Lecture 5: Association Rules, Sequential Associations.
Multi-Scale Analysis for Network Traffic Prediction and Anomaly Detection Ling Huang Joint work with Anthony Joseph and Nina Taft January, 2005.
September, 13th gR2002, Vienna PAOLO GIUDICI Faculty of Economics, University of Pavia Research carried out within the laboratory: Statistical.
Core Text Mining Operations 2007 년 02 월 06 일 부산대학교 인공지능연구실 한기덕 Text : The Text Mining Handbook pp.19~41.
Summarization of Frequent Pattern Mining. What is FPM? Why being frequent is so important? Application of FPM Decision make/Business Software Debugging.
Mining Negative Rules in Large Databases using GRD Dhananjay R Thiruvady Supervisor: Professor Geoffrey Webb.
Data Mining By Archana Ketkar.
Application of Apriori Algorithm to Derive Association Rules Over Finance Data Set Presented By Kallepalli Vijay Instructor: Dr. Ruppa Thulasiram.
Research Project Mining Negative Rules in Large Databases using GRD.
Pattern Recognition Lecture 20: Data Mining 3 Dr. Richard Spillman Pacific Lutheran University.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Enterprise systems infrastructure and architecture DT211 4
Chapter 11 Managing Knowledge. Dimensions of Knowledge.
Time series analysis and Sequence Segmentation
CS 349: Market Basket Data Mining All about beer and diapers.
Information Retrieval from Data Bases for Decisions Dr. Gábor SZŰCS, Ph.D. Assistant professor BUTE, Department Information and Knowledge Management.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
CS525: Big Data Analytics Machine Learning on Hadoop Fall 2013 Elke A. Rundensteiner 1.
Ch5 Mining Frequent Patterns, Associations, and Correlations
1 Apriori Algorithm Review for Finals. SE 157B, Spring Semester 2007 Professor Lee By Gaurang Negandhi.
Chapter 1 Introduction to Data Mining
Association Rule Mining in Peer-to-Peer Systems Ran Wolff Assaf Shcuster Department of Computer Science Technion I.I.T. Haifa 32000,Isreal.
Outline Knowledge discovery in databases. Data warehousing. Data mining. Different types of data mining. The Apriori algorithm for generating association.
4/8/2002Copyright Daniel Barbara Clustering by impact Daniel Barbará George Mason University ISE Dept. (joint work with.
 Frequent Word Combinations Mining and Indexing on HBase Hemanth Gokavarapu Santhosh Kumar Saminathan.
DATA MINING Using Association Rules by Andrew Williamson.
HEMANTH GOKAVARAPU SANTHOSH KUMAR SAMINATHAN Frequent Word Combinations Mining and Indexing on HBase.
Overview Definition of Apriori Algorithm
Mining Sequential Patterns © Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 Slides are adapted from Introduction to Data Mining by Tan, Steinbach,
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Optimization of Association Rules Extraction Through Exploitation of Context Dependent Constraints Arianna Gallo, Roberto Esposito, Rosa Meo, Marco Botta.
1. ABSTRACT Information access through Internet provides intruders various ways of attacking a computer system. Establishment of a safe and strong network.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Rule discovery from time series Authors: Guatam Das, King-Ip Lin, Heikki Mannila, Gopal Renganathan, Pedhraic Smyth Presented By: Tom Gradel.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Data Mining - Introduction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Data Mining Functionalities
Data Mining.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Data Mining Generally, (Sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it.
Association rule mining
Byung Joon Park, Sung Hee Kim
An Enhanced Support Vector Machine Model for Intrusion Detection
Waikato Environment for Knowledge Analysis
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Targeted Association Mining in Time-Varying Domains
I don’t need a title slide for a lecture
Association Rule Mining
Data Science introduction.
Security and Trustworthiness in Cloud Computing
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Time Series Forecasting Accelerator
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Presentation transcript:

Chaotic Mining: Knowledge Discovery Using the Fractal Dimension Daniel Barbara George Mason University Information and Software Engineering Department By Dhruva Gopal

Fractals  What are fractals  Property of a fractal  Self Similarity

Uses of fractals  Geologic activity  Planetary orbits  Weather  Fluid flow  databases

Fractal Dimensions  Number of possible dimensions?  Fractal dimension computation  D q = 1/(q-1)*(log  i p i q )/(log r)  Hausdorff dimension  Information dimension  Correlation dimension

Examples  Event Anomalies in time series  Self similarity in association rules  Analyzing patterns in datacubes  Incremental clustering

Event Anomalies  Time series  Stock price changes  TCP connection occurrence  Example  Half open TCP connections  Network Spoofing

Methodology  Half open connections are self similar  Collect data points every  seconds  Moving window of k *  (k is an integer)  Fractal dimension will show a drastic decrease in case of spoofing  Other applications of fractals with time series  Password port in FTP service

Self Similarity in Association Rules  Parameters associated with a rule  Support  Confidence  Distribution of these transactions???  Seasonal  Promotional  Regular

Fractals in Association rules  Compute Fractal dimension of a k- itemset while computing its support  Information about the fractal dimension should be kept for use when computing k+1th itemset

Analyzing Patterns in datacubes  Patterns  Null cells (no aggregate)  Compute fractal dimension of null cells  Drastic changes imply anomalous trends

Incremental Clustering  Clustering algorithms are needed to deal with large datasets  Extended K means algorithm  Use a variation of extended K means algorithm using fractal dimensions for deciding point membership

Conclusions  Fractals are powerful parameters used to uncover anomalous patterns in the databases  Paper discusses techniques that can be used, but none are implemented.

References  Fast Discovery of Association rules,R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, A.I. Verkamo  John Sarraille and P. DiFalco, FD3,    