Our New Progress on Frequent/Sequential Pattern Mining We develop new frequent/sequential pattern mining methods Performance study on both synthetic and.

Slides:



Advertisements
Similar presentations
Frequent Itemset Mining Methods. The Apriori algorithm Finding frequent itemsets using candidate generation Seminal algorithm proposed by R. Agrawal and.
Advertisements

Mining Frequent Patterns Using FP-Growth Method Ivan Tanasić Department of Computer Engineering and Computer Science, School of Electrical.
1 Multi-way Algorithm for Cube Computation CPS Notes 8.
1 Top 10 Algorithms in Data Mining Xindong Wu ( 吴信东 ) Department of Computer Science University of Vermont, USA; Hong Kong Polytechnic University; 合肥工业大学计算机与信息学院.
LOGO Association Rule Lecturer: Dr. Bo Yuan
10 -1 Lecture 10 Association Rules Mining Topics –Basics –Mining Frequent Patterns –Mining Frequent Sequential Patterns –Applications.
Chase Repp.  knowledge discovery  searching, analyzing, and sifting through large data sets to find new patterns, trends, and relationships contained.
MapReduce-based Closed Frequent Itemset Mining with Efficient Redundancy Filtering Su-Qi Wang ∗, Yu-Bin Yang ∗, Guang-Peng Chen ∗, Yang Gao ∗ and Yao Zhang†
Rakesh Agrawal Ramakrishnan Srikant
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Cube Explorer: Online Exploration of Data Cubes Jiawei Han, Jianyong Wang, Guozhu Dong, Jian Pei, Ke Wang.
Multi-dimensional Sequential Pattern Mining
Sequential Pattern Mining
Data Mining Association Rules Yao Meng Hongli Li Database II Fall 2002.
1 Association Rule Mining Instructor Qiang Yang Slides from Jiawei Han and Jian Pei And from Introduction to Data Mining By Tan, Steinbach, Kumar.
Sequence Databases & Sequential Patterns
Mining Sequential Patterns Dimitrios Gunopulos, UCR.
Data Mining: Concepts and Techniques 1 Mining Sequence Patterns in Transactional Databases CS240B --UCLA Notes by Carlo Zaniolo Based on those by J. Han.
The UNIVERSITY of Kansas EECS 800 Research Seminar Mining Biological Data Instructor: Luke Huan Fall, 2006.
Summarization of Frequent Pattern Mining. What is FPM? Why being frequent is so important? Application of FPM Decision make/Business Software Debugging.
Association Rule Mining (Some material adapted from: Mining Sequential Patterns by Karuna Pande Joshi)‏
Pattern-growth Methods for Sequential Pattern Mining: Principles and Extensions Jiawei Han (UIUC) Jian Pei (Simon Fraser Univ.)
1 1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 6 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
Performance and Scalability: Apriori Implementation.
DBMiner 2.0 Adnan Rahi Prabhat Vivekanandan. Brief History of DBMiner Technology Inc. Research on data mining since International reputation and.
August 20, 2015Data Mining: Concepts and Techniques1 Chapter 5: Mining Frequent Patterns, Association and Correlations Basic concepts and a road map Efficient.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Association Rule Mining COMP Seminar GNET 713 BCB Module Spring 2007.
What Is Sequential Pattern Mining?
Storytelling and Clustering for Cellular Signaling Pathways M. Shahriar Hossain, Monika Akbar, Nicholas F. Polys Department of Computer Science, Virginia.
1 Apriori Algorithm Review for Finals. SE 157B, Spring Semester 2007 Professor Lee By Gaurang Negandhi.
October 2, 2015 Data Mining: Concepts and Techniques 1 Data Mining: Concepts and Techniques — Chapter 8 — 8.3 Mining sequence patterns in transactional.
October 6, 2015Data Mining: Concepts and Techniques1 Data Mining: Concepts and Techniques — Slides for Textbook — — Chapter 6 — ©Jiawei Han and Micheline.
AR mining Implementation and comparison of three AR mining algorithms Xuehai Wang, Xiaobo Chen, Shen chen CSCI6405 class project.
Takeaki Uno Tatsuya Asai Yuzo Uchida Hiroki Arimura
1 Multi-dimensional Sequential Pattern Mining Helen Pinto, Jiawei Han, Jian Pei, Ke Wang, Qiming Chen, Umeshwar Dayal ~From: 10th ACM Intednational Conference.
Pattern-Growth Methods for Sequential Pattern Mining Iris Zhang
Lecture 11 Sequential Pattern Mining MW 4:00PM-5:15PM Dr. Jianjun Hu CSCE822 Data Mining and Warehousing University.
Mining Frequent Patterns without Candidate Generation : A Frequent-Pattern Tree Approach 指導教授:廖述賢博士 報 告 人:朱 佩 慧 班 級:管科所博一.
Jian Pei Jiawei Han Behzad Mortazavi-Asl Helen Pinto ICDE’01
HOW TO HAVE A GOOD PAPER Tran Minh Quang. What and Why Do We Write? Letter Proposal Report for an assignments Research paper Thesis ….
Panagiotis Papapetrou, George Kollios, Stan Sclaroff, Dimitrios Gunopulos Department of Computer Science Boston University University of California, Riverside.
CloSpan: Mining Closed Sequential Patterns in Large Datasets Xifeng Yan, Jiawei Han and Ramin Afshar Proceedings of 2003 SIAM International Conference.
Fast and Memory Efficient Mining of Frequent Closed Itemsets Claudio Lucchese Salvatore Orlando Raffaele Perego DB group seminar Presenter: Leonidas.
PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth Jiawei Han, Jian Pei, Helen Pinto, Behzad Mortazavi-Asl, Qiming Chen,
1 Top Down FP-Growth for Association Rule Mining By Ke Wang.
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 6 —
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Jian Pei and Runying Mao (Simon Fraser University)
TITLE What should be in Objective, Method and Significant
Association rule mining
©Jiawei Han and Micheline Kamber
CS6220: Data Mining Techniques
Data Mining: Concepts and Techniques
Jiawei Han Department of Computer Science
©Jiawei Han and Micheline Kamber
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 6 —
به نام خداوند جان و خرد الگوکاوي در پايگاه‌هاي تراکنش بسيار بزرگ با استفاده از رويکرد تقسيم وحل Frequent Pattern Mining on Very Large Transaction Databases.
A Linear Method for Deviation Detection in Large databases
Chapter 5: Mining Frequent Patterns, Association and Correlations
Frequent-Pattern Tree
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 6 —
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 6 —
Association Rule Mining
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 6 —
Closed Itemset Mining CSCI-7173: Computational Complexity & Algorithms, Final Project - Spring 16 Supervised By Dr. Tom Altman Presented By Shahab Helmi.
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 6 —
Presentation transcript:

Our New Progress on Frequent/Sequential Pattern Mining We develop new frequent/sequential pattern mining methods Performance study on both synthetic and real data sets shows that our methods outperform conventional ones in wide margins

Mining Complete Set of Frequent Patterns on T10I4D100k

Mining Complete Set of Frequent Patterns on T25I20D100k

Mining Complete Set of Frequent Patterns on Connect-4

Mining Sequential Patterns on C10T4S16I4

Mining Sequential Patterns on C10T8S8I8

Scalability of Mining Sequential Patterns on C10-100T8S8I8

Scalability of Mining Sequential Patterns on C10-100T4S16I4

Why Prefix Is Faster Than GSP? Dataset C10T4S16I4Dataset C10T8S8I8

Mining Frequent Closed Itemsets on T25I20D100k

Mining Frequent Closed Itemsets on Connect-4

Mining Frequent Closed Itemsets on Pumsb

References R. Agarwal, C. Aggarwal, and V. V. V. Prasad. A tree projection algorithm for generation of frequent itemsets. In Journal of Parallel and Distributed Computing (Special Issue on High Performance Data Mining), (to appear), R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc Int. Conf. Very Large Data Bases, pages , Santiago, Chile, September J. Han, J. Pei, B. Mortazavi-Asl, Q. Chen, U. Dayal, and M. Hsu. FreeSpan: Frequent pattern-projected sequential pattern mining. In Proc. KDD'2000, Boston, August J. Han, J. Pei, and Y. Yin. Mining Frequent Patterns without Candidate Generation, Proc. SIGMOD’2000, Dallas, TX, May J. Pei, J. Han, H. Pinto, Q. Chen, U. Dayal, and M. Hsu. PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth, submitted for publication R. Srikant and R. Agrawal. Mining sequential patterns: Generalizations and performance improvements. In Proc. 5th Int. Conf. Extending Database Technology (EDBT), pages , Avignon, France, March N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal. Discovering frequent closed itemsets for association rules. In Proc. ICDT’99, Israel, January M.J. Zaki and C. Hsiao. ChARM: An efficient algorithm for closed association rule mining. In Proc. KDD'2000, Boston, August 2000.

DBMiner Version 2.5 (Beta) DBMiner Technology Inc. B.C. Canada

What we had for DBMiner 2.0… Association module on data cubes Classification module on data cubes Clustering module on data cubes OLAP browser 3D Cube browser

What we will do in DBMiner 2.5… Keep the existing association module and classification module in version 2.0 Change the existing clustering module Add new visual classification module both on SQL server and OLAP Add new sequential pattern modules on SQL server using FP algorithm

What we have done… We have incorporated the existing association module and added OLAP browser Module We have added the visual classification module We have changed the existing clustering module We have added the sequential pattern module We are still in the development stage

Association module on data cubes

New sequential pattern module on SQL Server

New visual classification module on data cubes

New clustering module on data cubes