Oracle Data Mining Update and Xerox Application Charlie Berger Sr. Director of Product Management, Life Sciences and Data Mining

Slides:



Advertisements
Similar presentations
Introduction to OWB(Oracle Warehouse Builder)
Advertisements

Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
Pentaho Open Source BI Goldwin. Pentaho Overview Pentaho is the commercial open source software for Business Pentaho is the commercial open source software.
Data Manager Business Intelligence Solutions. Data Mart and Data Warehouse Data Warehouse Architecture Dimensional Data Structure Extract, transform and.
PolyAnalyst Data and Text Mining tool Your Knowledge Partner TM www
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Advance Analytics Capabilities
When to use Data Mining. Introduction An important question that should be answered before you commence any data mining project is whether data mining.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
WebSphere -DB2 Integration Web Browser Web Server (Apache) WebSphere –JSP/Servlet/EJB DB2 JDBC, SQL HTTP.
Microsoft SQL Server x 46% 900+ For Hosting Service Providers
Altosoft Copyright ® 2012 altosoft.com8/3/2012 Sandy Follin, Sr. Account Executive Steve Schrader, Sr. Sales Engineer.
Business Intelligence components Introduction. Microsoft® SQL Server™ 2005 is a complete business intelligence (BI) platform that provides the features,
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
® IBM Software Group © IBM Corporation IBM Information Server Metadata Management.
NOT AN OFFICIAL UNCTAD RECORD. Financial Insights With Oracle Business Intelligence Charles Gichobi Technology Manager – E. & Southern Africa
Data Warehouse Tools and Technologies - ETL
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Oracle10g for Data Warehousing Jiangang Luo
Application Express 4.1 New Features Hilary Farrell, Principal Member of Technical Staff, Oracle.
Copyright © 2006, SAS Institute Inc. All rights reserved. Enterprise Guide 4.2 : A Primer SHRUG : Spring 2010 Presented by: Josée Ranger-Lacroix SAS Institute.
1.
M icrosoft Data Warehousing - SQL Server State of the Technology Presentation by Sujata Angara Nakul Johri Sang Ho Park.
An Evaluation of A Commercial Data Mining Suite Oracle Data Mining Presented by Emily Davis Supervisor: John Ebden.
1 Copyright © 2004, Oracle. All rights reserved. Introduction to Oracle Forms Developer and Oracle Forms Services.
Multiple Examples of tumor tissue (public data from Whitehead/MIT) SVM Classification of Multiple Tumor Types DNA Microarray Data Oracle Data Mining 78.25%
DAT336 SQL Server “Yukon” – The Future of Business Intelligence Jason Carlson Product Unit Manager SQL Server Microsoft Corporation Brian Welcker Microsoft.
Life Sciences Integrated Demo Joyce Peng Senior Product Manager, Life Sciences Oracle Corporation
Data Profiling
PO320: Reporting with the EPM Solution Keshav Puttaswamy Program Manager Lead Project Business Unit Microsoft Corporation.
Turning Audio Search and Speech Analytics into Business Intelligence.
Information Builders’ Teradata User Group Adam Cohen Eric Greisdorf.
Data Mining with Oracle using Classification and Clustering Algorithms Presented by Nhamo Mdzingwa Supervisor: John Ebden.
Oracle Application Express. Program Agenda Oracle Application Express Overview Use Cases Key Features Packaged Applications Packaging Pricing Call to.
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Data Mining with Oracle using Classification and Clustering Algorithms Proposed and Presented by Nhamo Mdzingwa Supervisor: John Ebden.
Managing Knowledge in Business Intelligence Systems Dr. Jan Mrazek.
Life Cycle Management Using Oracle 9i Warehouse Builder Anissa Stevens Avanco International, Inc Mark Van De Wiel Oracle.
Carey Probst Technical Director Technology Business Unit - OLAP Oracle Corporation.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Consul- ting Services Outsour- cing Services Techno- logy Services Local Profes- sional Services Competence Centers Business Intelligence WebTech SAP.
© 2007 IBM Corporation IBM Information Management Accelerate information on demand with dynamic warehousing April 2007.
An Evaluation of Commercial Data Mining Proposed and Presented by Emily Davis Supervisor: John Ebden.
Reporting & Analytics Stephen Chan Senior Solution Consultant.
SQL Server 2008 Analysis Services. END USER TOOLS & PERFORMANCE MANAGEMENT APPS Excel PerformancePoint Server BI PLATFORM SQL Server Reporting Services.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
CS 157B: Database Management Systems II April 10 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Oracle OLAP Option Bud Endress Director of Product Management, OLAP.
Chapter 2 Data, Text, and Web Mining. Data Mining Concepts and Applications  Data mining (DM) A process that uses statistical, mathematical, artificial.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
1 Copyright © 2007, Oracle. All rights reserved. Installing and Setting Up the Warehouse Builder Environment.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Knowledge Discovery in a DBMS Data Mining Computing models and finding patterns in large databases current major challenge in database systems & large.
Oracle Advanced Analytics
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Introduction to Computing Lecture # 13
IBM Tivoli Web Site Analyzer Training Document
Delivering Business Insight with SQL Server 2005
Operationalize your data lake Accelerate business insight
Data Warehousing and Data Mining
TechEd /28/ :48 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered.
Supporting End-User Access
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Intro to Machine Learning
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Analytics, BI & Data Integration
6/17/ :03 AM © 2004 Microsoft Corporation. All rights reserved.
Customer 360.
David Gilmore & Richard Blevins Senior Consultants April 17th, 2012
Dynamics 365 Customer Insights
Presentation transcript:

Oracle Data Mining Update and Xerox Application Charlie Berger Sr. Director of Product Management, Life Sciences and Data Mining Oracle Corporation Raj Minhas Research Scientist Xerox Corporation Session id: 40262

Copyright © 2003 Oracle Corporation Agenda  Oracle Data Mining Update  ODM/DM4J Demonstration  Xerox Application

Copyright © 2003 Oracle Corporation Oracle Business Intelligence Vision DatabaseEngine Data Integration Engine OLAP Engine Mining Engine  Multiple databases  Multiple servers  Multiple engines  Proprietary interfaces  Complex environment  Slow conversion of data to information Is to change this …

Copyright © 2003 Oracle Corporation Oracle Business Intelligence Vision  Single database  Single server  Standard interfaces  Simplified environment  Fast conversion of data to information Data Warehousing ETL OLAP Data Mining Oracle 10g DB Into this …

Copyright © 2003 Oracle Corporation What is Oracle Data Mining?  Oracle Data Mining (ODM) sifts through massive amounts of data to find hidden patterns and information — valuable information that can help you better understand your customers and anticipate their behavior  ODM insights can be revealing, significant, and valuable e.g. – Predict which customers are likely to churn – Discover what factors are involved with a certain disease – Identify fraudulent behavior

Copyright © 2003 Oracle Corporation  Data mining embedded in Oracle10g Database – Simplifies process, eliminates data movement, speeds analysis, deployment and delivers security and scalability  Build models and applications simultaneously – Build and evaluate models and automatically generate Java code  Enhance applications with predictions and insights – For example, build churn prediction applications and enable call centers with greater customer insight Oracle Data Mining Overview & Differentiating Features Data Mining

Copyright © 2003 Oracle Corporation Oracle Data Mining Business Intelligence Applications  CEOs can ask… – How can I target the “right customers” to maximize profits?  Managers can answer… – Which customers are likely to be interested in which offers and why?  Call Reps can… – Suggest the right “offer” for the customer Information Producers Information Consumers  Data Miners can… – Discover patterns and insights hidden in the data Oracle Data Mining

Copyright © 2003 Oracle Corporation Information Consumers Key factors that influence customers likely to purchase a product Customers sorted in likelihood to purchase a product

Copyright © 2003 Oracle Corporation Oracle11i CRM Application CRM / Data Mining Integration  Marketing analysts can design targeted campaigns without becoming data mining experts – Build models, score lists – Discover patters & make predictions Data mining increases effectiveness of targeted campaigns

Copyright © 2003 Oracle Corporation 10g Oracle Data Mining Wide range of data mining algorithms  Feature Selection – Attribute Importance  Supervised learning (classification & prediction) – Naïve Bayes – Adaptive Bayes Networks – Support Vector Machines  Unsupervised learning (clustering and associations) – Association Rules – Orthogonal Clustering – Enhanced k-means Cluster  Feature Extraction – Non Negative Matrix Factorization Data Mining

Copyright © 2003 Oracle Corporation 10g Additional Features  Text Mining – Ability to combine structured data and unstructured data  ODM API – Java – PL/SQL  Scoring engine  Similarity Searches – BLAST (Life sciences: genes and proteins) Data Mining

Copyright © 2003 Oracle Corporation Oracle Data Mining/DM4J Demonstration

Copyright © 2003 Oracle Corporation DM4J2 New Features  Access Data – Import flat file to db wizard  Visualize Data – Data snapshot – Standard summary statistics – Attribute level histograms  Transform Data – Create View / Table – Random and Stratified sampling – Aggregation – Computed column – Normalization – Discretization – Table Splits – Filtering – Recode  Modeling – Building models – Testing models – Applying (scoring) models – Visualize results  Deploy Models/Results – Generate transformation code (PL/SQL) – View and generate transformation lineage – Generate model code (Java) – Integrate with Oracle tools – JDeveloper – Oracle Warehouse Builder – Discoverer

Copyright © 2003 Oracle Corporation Oracle Data Mining Enabling Data Mining Applications DM4J GUI add-ins provides wizards for building and evaluating models

Copyright © 2003 Oracle Corporation Oracle Data Mining Enabling Data Mining Applications Data analysts can build and review data mining models

Copyright © 2003 Oracle Corporation Oracle Data Mining Enabling Data Mining Applications  Comprehensive GUI for preparing data, building models, evaluating results and deploying models DM4J provides features to transform and prepare the data

Copyright © 2003 Oracle Corporation Oracle Data Mining Enabling Data Mining Applications  Automated, scheduled, and event-driven business intelligence applications can can be easily integrated into enterprise applications DM4J automatically generates the Java code

Copyright © 2003 Oracle Corporation Multiple Examples of tumor tissue (public data from Whitehead/MIT) Oracle 10g SVM Classification of Multiple Tumor Types DNA Microarray Data Oracle Data Mining 78.25% accuracy Green=Correct Red=Errors We feed multiple cancer types data into the Oracle DB: 16,063 genes, 144 cancer patients and 10 samples per class. We mine the data using Support Vector Machines and create the confusion matrix

Copyright © 2003 Oracle Corporation Oracle 10g SVM Classification of Multiple Tumor Types 78.25% accuracy Green=Correct Red=Errors Oracle Data Mining’s SVM models are able to accurately predict the multi-class tumor problem with 78.25% accuracy.

Copyright © 2003 Oracle Corporation TDWI “Andrew Braunberg, a senior analyst with research firm TDWI suggests that DM4J should simplify the job of data analysts. Before Oracle released DM4J, Braunberg notes, analysts who used ODM had to write out all of the Java code that was required to build their predictive models. “This was a time-consuming process that slowed model development and deployment.” With DM4J, Braunberg notes, Java code is automatically written as data analysts build their predictive models. Moreover, developers or data analysts can re-use this code in other Java-based applications. As a result, he anticipates, DM4J will “enhance analysts’ ability to create predictive models using Oracle Data Mining.”” TDWI Brief: Oracle Data Mining gets GUI; IBM and Cognos' BI partnership April 9, By Stephen Swoyer

Copyright © 2003 Oracle Corporation Benefits of Oracle’s Approach Oracle Data Mining FeatureBenefit Data Mining algorithms embedded in database  Eliminates data movement and security exposure  Fastest: Data  Information Wide range of data mining algorithms  Supports most data mining problems Runs on multiple platforms  Applications may be developed and deployed Built on Oracle Technology  Grid, RAC, integrated BI,…  SQL & PL/SQL available  Leverage existing skills

A Q & Q U E S T I O N S A N S W E R S