We think you have liked this presentation. If you wish to download it, please recommend it to your friends in any social system. Share buttons are a little bit lower. Thank you!
Presentation is loading. Please wait.
Published bySean Schroeder
Modified over 2 years ago
© 2002 page 1 Data Mining Tools For ZLE Copying and Use Restrictions: Material under this presentation is the Intellectual Property of HP Corporation and Genus Software. Any use of the this material, in part or whole, except in context of Genus Data Mining Integrator and Data Mart Builder, without written permission from HP and Genus is prohibited.
© 2002 page 2 agenda data mining in ZLE solutions ZLE data mining toolkit toolkit demonstration agenda
© 2002 page 3 title text Meta Group process of identifying and/or extracting previously unknown, non-trivial, unanticipated, important information from large sets of data Gartner Group process of discovering meaningful new correlations, patterns and trends by sifting through large amounts of data stored in repositories, using pattern recognition technologies, statistical and mathematical techniques
© 2002 page 4 title text role –determine most effective responses to business events ZLE facilitates mining by providing –a rich, integrated, current data source –an integrated operational environment into which models can be deployed data mining helps to realize the full business value of a ZLE system
© 2002 page 5 derive attributes identify and define business opportunity create case set deploy model profile data transform data assess performance train models typically about 75% of process ZLE data mining process understand the opportunity –identify and define business opportunity prepare data –profile and understand data –derive attributes –transform data –create case set build models –train models –assess model performance use models –deploy model –monitor model performance monitor model performance
© 2002 page 6 agenda data mining in ZLE solutions ZLE data mining toolkit toolkit demonstration agenda
© 2002 page 7 the ZLE data mining toolkit goal: –provide tools that facilitate ZLE data mining –reduce process cycle times dramatically three tools being developed by Genus Software: –data preparation –data transfer –model deployment partners: Genus, MicroStrategy, SAS product names: –Genus Mining Integrator for NonStop SQL (all three tools) –Genus Mart Builder for NonStop SQL (first two tools only)
© 2002 page 8 part of Genus toolkit ZLE data mining analytical cycle Data Store (NonStop SQL) Data Preparation (profiling/transforming data) Model Deployment (written to DB tables) Data Transfer (fast parallel streams) Mining Mart (Tru64/Windows) Scoring Engine Rules Engine Agg. Engine Interaction Manager Real-Time Scoring (using the Recommender) part of ZDK 3 Modeling (SAS Enterprise Miner) available from SAS
© 2002 page 9 agenda data mining in ZLE solutions ZLE data mining toolkit toolkit demonstration agenda
© 2002 page 10 toolkit demonstration credit card fraud detection example opportunity: use ZLE data store data to predict, in real-time, which credit card purchases are likely to be fraudulent use tools to: –build a case set table with one row describing each purchase –transfer table to SAS server for modeling –deploy predictive model to ZLE data store –execute model in real-time to make fraud predictions steps described, including many tool screen shots
© 2002 page 11 based on the MicroStrategy (MSI) Business Intelligence toolset, leverages GUI, logical data model support, SQL generation, etc. uses NonStop SQL/MX DBMS, leverages sampling, TRANSPOSE, statistical functions, … custom tool developed by Genus using MSI SDK for NonStop SQL operations and functionality not supported by MSI tools toolkit data preparation solution
© 2002 page 12 two main ZLE data preparation tasks 1.profile tables –column names and types –partitioning information, attributes, key structure, … –column values 2.transform source tables –derive new attributes –aggregate to appropriate level –clean data –pivot –combine to form case set
© 2002 page 13 the MicroStrategy desktop
© 2002 page 14 MSI profile report: fraud vs. billing state
© 2002 page 15 NonStop SQL/MX sampling source table sampling –insert into CustSamp select * from Cust sample random 1 percent clusters of 10 blocks union select * from Cust where CardNo in (select CardNo from FrdFlg) enables interactive and exploratory data prep cleanly integrated into SQL performed efficiently in DP2 easily accessible through Genus tool
© 2002 page 16 creating a materialized sample table using the Genus Data Mart Builder
© 2002 page 17 identifying source and sample method
© 2002 page 18 specifying materialized sample table
© 2002 page 19 transforming source data Billions of Purchase s Millions of Accounts Purchase StoreAccount Purchase History Item Summary Fraud Aggregate and Pivot
© 2002 page 20 result: a case set for modeling Hundreds of Attributes One Row Per Purchase Mix of Fraud and No-Fraud Purchases
© 2002 page 21 MSI Datamart report summarizing items
© 2002 page 22 data transfer tool Data StoreMining Mart NonStop SQL/MX ASCII filesSAS data set data transfer tool task: transfer case set from data store to mining mart coordinator –design HTML HTTP JDBC Web browser client Web server Web App. receive SAS import transferreceive SAS import transferreceive SAS import transfer receive SAS import transfer
© 2002 page 23 data transfer specification screen
© 2002 page 24 transfer monitoring
© 2002 page 25 modeling in SAS enterprise miner
© 2002 page 26 body copy model export score converter node generates Java model code reporter node exports code and HTML report to project directory
© 2002 page 27 NonStop SQL/MX Data Store SAS Open Metadata server File/SAS server SAS Enterpris e Miner Mining Mart model deployment tool task –copy model information to a ZLE Data Store Model export/registration –design HTML HTTP JDBC access Web browser client File/registry access Web Server Web App.
© 2002 page 28 starting the model deployment tool
© 2002 page 29 connecting to a Data Store
© 2002 page 30 a list of models in the Data Store
© 2002 page 31 viewing a deployed model
© 2002 page 32 selecting a SAS report directory
© 2002 page 33 viewing available reports
© 2002 page 34 viewing an Enterprise Miner report
© 2002 page 35 deploying a model
© 2002 page 36 deployment confirmation
© 2002 page 37 real-time scoring using the Recommender Scoring Engine Aggregation Engine Rules Engine Model Aggregates Model Scores Deployed Models Business Rules Aggregate Definitions Offers / Advice Customer Data Interaction Manager
© 2002 page 38 how to get the data mining tools Product Names –Genus Mining Integrator for NonStop SQL (Data Preparation, Data Transfer, and Model Deployment tools) –Genus Mart Builder for NonStop SQL (first two tools only) Can be ordered through HP, support provided by Genus Availability: calendar Q For more information, contact (Product (Program
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
CSS Data Warehousing for BS(CS) Lecture 1-2: DW & Need for DW Khurram Shahzad Department of Computer Science.
1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
© 2005 by Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 7 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
© 2005 by Prentice Hall 1 Chapter 11: Data Warehousing Modern Database Management 7 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
1 Data Warehousing Denis Manley Enterprise Systems FT228/3.
© Copyright GlobalLogic Connect. Collaborate. Innovate. Emerging Trends and Technologies in BI Sunil K Singh January, 2011 GlobalLogic.
1 Automating Business Rule Adaptation for Web-Based Information Systems Mei Tian 1, Ajit Singh 2, Kumaraswamy Ponnambalam 1, Raymond Lee 2 1 Systems Design.
Preface IIntroduction Objectives I-2 Course Overview I-3 1Introducing the Java and Oracle Platforms Objectives 1-2 What Is Java? 1-3 Key Benefits of Java.
2 Copyright © 2005, Oracle. All rights reserved. J2EE Overview.
Page 1 R Copyright © 1997 by Rational Software Corporation Analysis and Design with UML.
14 Copyright © 2005, Oracle. All rights reserved. System Management Tasks and Techniques.
Intelligence Through Learning from Data Monash University Semester 1, March 2006.
1 GREY BOX TESTING Web Apps & Networking Session 10 Boris Grinberg
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 7 Slide 1 Chapter 7 System Models.
1 © 2008 OpenLink Software, All rights reserved. Virtuoso Product Family Orri Erling - Program Manager, Virtuoso.
Lazo ILIJOSKI ASEBA AML Anti-Money Laundering Solution.
Adding Value to Your e-business with IBM Tivoli Performance & Availability Solutions Manage Your Technology Master Your Business Customer Name Speaker.
1 Database Systems: Design, Implementation, and Management CHAPTER 10 Distributed Database Management System.
Chapter 2 Overview of the Data Mining Process 1. Introduction Data Mining – Predictive analysis Tasks of Classification & Prediction Core of Business.
Using Advanced Information Technology to Increase Performance McGraw-Hill/Irwin Contemporary Management, 5/e Copyright © 2008 The McGraw-Hill Companies,
1 Blackboard Sales Presentation. 2 Agenda Introduction –The Company and the Community Why Blackboard –Product Strategy What Blackboard Provides –Product.
IT203 Unit 2: Gather Information and Define Requirements Gathering Information Copyright © 2012 Pearson Education, Inc. Publishing as Prentice HallChapter2.1.
© 2008 IBM Corporation 1 InfoSphere Integration and Product Roadmap Keith Kohl Program Director, Product Management.
1 OLTP Compared With OLAP OLTPOn Line Transaction Processing – OLTP –Maintains a database that is an accurate model of some real- world enterprise. Supports.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 2: Operating-System Structures.
© 2016 SlidePlayer.com Inc. All rights reserved.