Presentation is loading. Please wait.

Presentation is loading. Please wait.

Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.

Similar presentations


Presentation on theme: "Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305."— Presentation transcript:

1

2 Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305

3 Please Raise Your Hand If You’ve Ever… Attended a Statistics Lecture ?? Got a Statistics Degree ?? Used SQL Server Data Mining ??

4 Agenda Data Mining – What is it? Data Mining – How do we do it? Demonstrations Visualisation Reporting ETL Application Q&A

5 Data Mining – What Is It? According to Encarta Noun “Search for Hidden Information” “The locating of previously unknown patterns and relationships within data” Server-Driven Discovery Uses a combination of statistics, probability analysis and database technologies

6 DM Enables Predictive Analysis Predictive Analysis PresentationExplorationDiscovery Passive Interactive Proactive Business Insight Canned reporting Ad-hoc reporting OLAP Data mining Role of Software

7 Business Scenarios Forecasting salesChurn AnalysisDetecting fraud or invalid dataTargeting promotionsCross-sellingDetermine Business Drivers

8 Our End-to-End BI Offering END USER TOOLS AND PERFORMANCE MANAGEMENT APPLICATIONS BI PLATFORM (RDBMS, ETL, OLAP, Reporting) DELIVERY Mainframe/ Departmental Systems The Big Picture SQL Server Reporting Services SQL Server Analysis Services SQL Server DBMS SQL Server Integration Services

9 Our End-to-End BI Offering END USER TOOLS AND PERFORMANCE MANAGEMENT APPLICATIONS BI PLATFORM (RDBMS, ETL, OLAP, Reporting) DELIVERY The Big Picture SQL Server Analysis Services

10 SQL Server™ 2008 Data Mining Key Drivers Keep Development Simple Retain Full Suite of Algortihms Manage Large Volumes Allow for Integration

11 SQL Server™ 2008 Algorithms Microsoft Naïve Bayes Quick and approachable algorithm Used for classification Microsoft Decision Trees Popular data mining technique Used for classification, regression and association Microsoft Linear Regression Finds the best possible straight line through a series of points Used for prediction analysis

12 SQL Server™ 2008 Algorithms Continued Microsoft Neural Network More sophisticated than Decision Trees and Naïve Bayes, this algorithm can explore extremely complex scenarios Used for classification and regression tasks Microsoft Logistic Regression A particular case of the Neural Network algorithm Microsoft Clustering Finds natural groupings inside data Supports segmentation and anomaly detection tasks

13 SQL Server™ 2008 Algorithms Continued Microsoft Sequence Clustering Groups a sequence of discrete events into natural groups based on similarity Microsoft Time Series Used to predict future values from a time series Has been improved in SQL Server 2008 to produce more accurate long-term forecasts Microsoft Association Rules Commonly supports market basket analysis to learn what products are purchased together

14 Data Mining Algorithm Usage What is your task? Predict Variable Naïve Bayes Decision Trees Neural Network Logistic Regression Predict Value Decision Trees Linear Regression Neural Network Logistic Regression Marketing Cluster Clustering Forecast Value Time Series Associate Association Rules Decision Trees

15 Data Mining Process Define the Problem Data Preperation Model Validation Accuracy Reliability Usefulness Model Visualisation

16 Describing the Data Mining Process Design time Process time Query time Mining Model

17 Describing the Data Mining Process Design time Process time Query time Mining Model Training Data Data Mining Engine

18 Data Mining Visualization

19 Model Creation + Processing

20 Describing the Data Mining Process Design time Process time Query time Mining Model Training Data Data Mining Engine

21 Describing the Data Mining Process Design time Process time Query time Data Mining Engine Data to Predict Predicted Data Mining Model

22 Predicting the Future

23 Data Mining for the Developer

24

25 Related Content Breakout Sessions Using MDX for Enhanced Scorecards and Dashboards (BIN 307) Required Slide Speakers, please list the Breakout Sessions, TLC Interactive Theaters and Labs that are related to your session. Any queries, please check with your Track Owner. Required Slide Speakers, please list the Breakout Sessions, TLC Interactive Theaters and Labs that are related to your session. Any queries, please check with your Track Owner.

26 Track Resources www.sqlserverdatamining.com www.microsoft.com/sql twitter.com/gavinrr Required Slide Track Owners to provide guidance. Please address any queries to your track owners. Required Slide Track Owners to provide guidance. Please address any queries to your track owners.

27 www.microsoft.com/teched Sessions On-Demand & Community http://microsoft.com/technet Resources for IT Professionals http://microsoft.com/msdn Resources for Developers www.microsoft.com/learning Microsoft Certification & Training Resources Resources Required Slide Speakers, TechEd 2009 is not producing a DVD. Please announce that attendees can access session recordings at TechEd Online. Required Slide Speakers, TechEd 2009 is not producing a DVD. Please announce that attendees can access session recordings at TechEd Online.

28 Required Slide Complete a session evaluation and enter to win! 10 pairs of MP3 sunglasses to be won

29 © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION. Required Slide


Download ppt "Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305."

Similar presentations


Ads by Google