Download presentation
Presentation is loading. Please wait.
Published byDenis Fitzgerald Modified over 9 years ago
2
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305
3
Please Raise Your Hand If You’ve Ever… Attended a Statistics Lecture ?? Got a Statistics Degree ?? Used SQL Server Data Mining ??
4
Agenda Data Mining – What is it? Data Mining – How do we do it? Demonstrations Visualisation Reporting ETL Application Q&A
5
Data Mining – What Is It? According to Encarta Noun “Search for Hidden Information” “The locating of previously unknown patterns and relationships within data” Server-Driven Discovery Uses a combination of statistics, probability analysis and database technologies
6
DM Enables Predictive Analysis Predictive Analysis PresentationExplorationDiscovery Passive Interactive Proactive Business Insight Canned reporting Ad-hoc reporting OLAP Data mining Role of Software
7
Business Scenarios Forecasting salesChurn AnalysisDetecting fraud or invalid dataTargeting promotionsCross-sellingDetermine Business Drivers
8
Our End-to-End BI Offering END USER TOOLS AND PERFORMANCE MANAGEMENT APPLICATIONS BI PLATFORM (RDBMS, ETL, OLAP, Reporting) DELIVERY Mainframe/ Departmental Systems The Big Picture SQL Server Reporting Services SQL Server Analysis Services SQL Server DBMS SQL Server Integration Services
9
Our End-to-End BI Offering END USER TOOLS AND PERFORMANCE MANAGEMENT APPLICATIONS BI PLATFORM (RDBMS, ETL, OLAP, Reporting) DELIVERY The Big Picture SQL Server Analysis Services
10
SQL Server™ 2008 Data Mining Key Drivers Keep Development Simple Retain Full Suite of Algortihms Manage Large Volumes Allow for Integration
11
SQL Server™ 2008 Algorithms Microsoft Naïve Bayes Quick and approachable algorithm Used for classification Microsoft Decision Trees Popular data mining technique Used for classification, regression and association Microsoft Linear Regression Finds the best possible straight line through a series of points Used for prediction analysis
12
SQL Server™ 2008 Algorithms Continued Microsoft Neural Network More sophisticated than Decision Trees and Naïve Bayes, this algorithm can explore extremely complex scenarios Used for classification and regression tasks Microsoft Logistic Regression A particular case of the Neural Network algorithm Microsoft Clustering Finds natural groupings inside data Supports segmentation and anomaly detection tasks
13
SQL Server™ 2008 Algorithms Continued Microsoft Sequence Clustering Groups a sequence of discrete events into natural groups based on similarity Microsoft Time Series Used to predict future values from a time series Has been improved in SQL Server 2008 to produce more accurate long-term forecasts Microsoft Association Rules Commonly supports market basket analysis to learn what products are purchased together
14
Data Mining Algorithm Usage What is your task? Predict Variable Naïve Bayes Decision Trees Neural Network Logistic Regression Predict Value Decision Trees Linear Regression Neural Network Logistic Regression Marketing Cluster Clustering Forecast Value Time Series Associate Association Rules Decision Trees
15
Data Mining Process Define the Problem Data Preperation Model Validation Accuracy Reliability Usefulness Model Visualisation
16
Describing the Data Mining Process Design time Process time Query time Mining Model
17
Describing the Data Mining Process Design time Process time Query time Mining Model Training Data Data Mining Engine
18
Data Mining Visualization
19
Model Creation + Processing
20
Describing the Data Mining Process Design time Process time Query time Mining Model Training Data Data Mining Engine
21
Describing the Data Mining Process Design time Process time Query time Data Mining Engine Data to Predict Predicted Data Mining Model
22
Predicting the Future
23
Data Mining for the Developer
25
Related Content Breakout Sessions Using MDX for Enhanced Scorecards and Dashboards (BIN 307) Required Slide Speakers, please list the Breakout Sessions, TLC Interactive Theaters and Labs that are related to your session. Any queries, please check with your Track Owner. Required Slide Speakers, please list the Breakout Sessions, TLC Interactive Theaters and Labs that are related to your session. Any queries, please check with your Track Owner.
26
Track Resources www.sqlserverdatamining.com www.microsoft.com/sql twitter.com/gavinrr Required Slide Track Owners to provide guidance. Please address any queries to your track owners. Required Slide Track Owners to provide guidance. Please address any queries to your track owners.
27
www.microsoft.com/teched Sessions On-Demand & Community http://microsoft.com/technet Resources for IT Professionals http://microsoft.com/msdn Resources for Developers www.microsoft.com/learning Microsoft Certification & Training Resources Resources Required Slide Speakers, TechEd 2009 is not producing a DVD. Please announce that attendees can access session recordings at TechEd Online. Required Slide Speakers, TechEd 2009 is not producing a DVD. Please announce that attendees can access session recordings at TechEd Online.
28
Required Slide Complete a session evaluation and enter to win! 10 pairs of MP3 sunglasses to be won
29
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION. Required Slide
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.