Introduction to SQL Server Data Mining
Nick Ward, SQL Server & BI Product Specialist, Microsoft Australia

1 Introduction to SQL Server Data Mining
  Nick Ward, SQL Server & BI Product Specialist, Microsoft Australia

2 Agenda
  What is Data Mining?
  Why use Data Mining?
  Data Mining Tasks
  Data Mining Process
  SQL Server 2005 Data Mining
  Demonstration: SQL Server 2005 Data Mining
  Discussion

3 What is Data Mining? ?

4 What is not Data Mining?
  Ad-Hoc Query
  Event Notifications
  Multidimensional Analysis/Slice Dice
  Statistics
  OLAP
  Canned or ad-hoc reports

5 What is Data Mining?
  “Data mining is the semi-automatic extraction of patterns, changes, associations, anomalies, and other statistically significant structures from large data sets.” (R. Grossman)
  Also known as:
    Machine Learning
    Predictive Analytics

6 Why Data Mining?
  [Chart; surviving labels: Disk, Processor, Time]

7 Types of Analysis
  Query-Reporting-Analysis: “What happened?”
    Simple Reports
    Key Performance Indicators
    OLAP Cubes (Slice/Dice)
  Real-Time: “What is happening?”
    Events/Triggers
  Data Mining: “What will happen?” “How/why did this happen?”

8 Data Mining Tasks
  Explores Your Data
  Finds Patterns
  Performs Predictions

9 Data Mining Tasks
  [Diagram]
  Training: Training Data -> DM Engine -> Mining Model
  Prediction: Mining Model + Data To Predict -> DM Engine -> Predicted Data
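
The two stages in this diagram (training data becomes a mining model, then the model is applied to data to predict) can be sketched in a few lines of Python. This is only an illustration of the flow, not how the SQL Server DM engine works: the `train` and `predict` functions, the column names, and the four sample rows are all made up for the example, and the "model" is just a table of label counts per column value.

```python
from collections import Counter, defaultdict


def train(training_rows, target):
    """Build a toy 'mining model': label counts for each (column, value) pair."""
    model = defaultdict(Counter)
    for row in training_rows:
        for column, value in row.items():
            if column != target:
                model[(column, value)][row[target]] += 1
    return dict(model)


def predict(model, row):
    """Sum the label counts for the row's known values and return the top label."""
    votes = Counter()
    for column, value in row.items():
        votes.update(model.get((column, value), {}))
    return votes.most_common(1)[0][0] if votes else None


# Hypothetical training data: did a customer respond to a mailing?
training_data = [
    {"age_band": "18-30", "owns_home": "no", "responded": "yes"},
    {"age_band": "18-30", "owns_home": "yes", "responded": "yes"},
    {"age_band": "51+", "owns_home": "yes", "responded": "no"},
    {"age_band": "51+", "owns_home": "no", "responded": "no"},
]

model = train(training_data, target="responded")               # training stage
prediction = predict(model, {"age_band": "18-30", "owns_home": "no"})  # prediction stage
print(prediction)
```

The point of the split is that the model is built once from historical rows and can then be applied to any number of new rows that lack the target column.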

10 Customer Examples
  ComputerFleet (Australia): Predict when hired equipment will be returned
  Sanford Securities (Australia): Data mining automation
  Clait Health Services: Identify patients likely to suffer deteriorating health for pro-active treatment
  AIM Healthcare: Identify billing errors, duplicate payments, etc. to minimize costs

11 Data Mining Tasks
  Classification
  Estimation
  Segmentation
  Association
  Forecasting
  Text Analysis

12 Data Mining Tasks: Classification
  What type of membership card should I offer?
  Which customers will respond to my mailing?
  Is this transaction fraudulent?
  Will I lose this customer?
  Will this product be defective?
  Why is my system failing?
  Which patients' health will degrade?

13 Data Mining Tasks: Estimation
  How much revenue will I get from this customer?
  How long will this asset be in service?
  What is the mean time to failure?
  What is the particle density of this fluid?
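
Estimation questions ask for a number rather than a category. As a minimal sketch, assuming a single numeric input and a straight-line relationship, ordinary least squares can be written by hand. The `fit_line` function and the sample figures below are hypothetical; this does not describe any particular SQL Server algorithm's implementation.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for one input variable."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up data: years as a customer vs. annual revenue (in thousands).
years = [1, 2, 3, 4]
revenue = [3, 5, 7, 9]

slope, intercept = fit_line(years, revenue)
estimate_year_5 = slope * 5 + intercept
print(slope, intercept, estimate_year_5)
```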

14 Data Mining Tasks: Segmentation
  Describe my customers
  How can I differentiate my customers?
  How can I organize my data in a manner that makes sense?
  Is this record an outlier?

15 Data Mining Tasks: Association
  What items are bought together?
  Which services are used together?
  What products should I recommend to my customers?
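
The question "What items are bought together?" comes down to counting co-occurrences across transactions. The sketch below counts item pairs per basket; the `pair_counts` function and the baskets are invented for illustration, and a real association-rules algorithm would go further (support and confidence thresholds, larger itemsets).

```python
from collections import Counter
from itertools import combinations


def pair_counts(baskets):
    """Count how many baskets contain each unordered pair of items."""
    counts = Counter()
    for basket in baskets:
        # sorted(set(...)) removes duplicates and gives each pair a stable order
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return counts


# Hypothetical shopping baskets.
baskets = [
    ["bread", "milk"],
    ["bread", "milk", "eggs"],
    ["milk", "eggs"],
    ["bread", "milk", "butter"],
]

counts = pair_counts(baskets)
top_pair, top_count = counts.most_common(1)[0]
print(top_pair, top_count)
```

The most frequent pairs are natural candidates for recommendations: a customer who has one item of the pair can be offered the other.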

16 Data Mining Tasks: Forecasting
  What are projected revenues for all products?
  What are inventory levels next month?
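
Forecasting predicts the next values of a series from its history. One of the simplest possible baselines is a moving average of the most recent periods, sketched below. The function name, window size, and monthly figures are assumptions for the example; this is a swapped-in baseline, not the Time Series algorithm that ships with SQL Server.

```python
def moving_average_forecast(history, window=3):
    """Forecast the next value as the mean of the last `window` observations."""
    if window <= 0:
        raise ValueError("window must be positive")
    if len(history) < window:
        raise ValueError("not enough history for the chosen window")
    recent = history[-window:]
    return sum(recent) / window


# Made-up monthly inventory levels.
monthly_units = [100, 110, 120, 130]

forecast = moving_average_forecast(monthly_units, window=3)
print(forecast)
```

A moving average lags behind trends, as the rising series above shows, which is one reason dedicated time-series algorithms exist.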

17 Data Mining Tasks: Text Analysis
  Analysis of unstructured data
    Finds key terms and phrases in text
    Conversion to structured data
    Feed into other algorithms
      Classification
      Segmentation
      Association
  How do I handle call center data?
  How can I classify mail?
  What can I do with web feedback?
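
The text-analysis steps listed here (find key terms, convert to structured data, feed into other algorithms) can be sketched as a term-count table. Everything below is illustrative: the stop-word list, the vocabulary, and the feedback strings are made up, and the tokenizer is deliberately crude compared with a real term-extraction component.

```python
import re
from collections import Counter

# A tiny, hypothetical stop-word list; real lists are much longer.
STOP_WORDS = {"the", "a", "is", "my", "and", "to", "was", "not"}


def extract_terms(text):
    """Lowercase the text, split it into words, and count non-stop-word terms."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(word for word in words if word not in STOP_WORDS)


def to_structured(documents, vocabulary):
    """Turn each document into a row of term counts, one column per vocabulary term."""
    rows = []
    for document in documents:
        terms = extract_terms(document)
        rows.append([terms[term] for term in vocabulary])
    return rows


# Made-up call-center feedback.
feedback = [
    "The printer is broken and the printer is slow",
    "My invoice was not correct",
]
vocabulary = ["printer", "invoice", "broken"]

rows = to_structured(feedback, vocabulary)
print(rows)
```

Once each document is a fixed-width row of numbers, it can be passed to classification, segmentation, or association techniques like any other structured data.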

18 Data Mining Process: CRISP-DM (www.crisp-dm.org)
  Business Understanding
  Data Understanding
  Data Preparation
  Modeling
  Evaluation
  Deployment
  [Diagram labels: “Putting Data Mining to Work”, “Doing Data Mining”, Data]

19

20 Value of Data Mining
  [Chart: Relative Business Value against Usability (Easy to Difficult) for SQL Server 2005: Reports (Static), Reports (Adhoc), OLAP, Data Mining; additional label: Business Knowledge]

21 Data Mining Process: CRISP-DM (www.crisp-dm.org)
  Business Understanding
  Data Understanding
  Data Preparation
  Modeling
  Evaluation
  Deployment
  [Diagram labels: “Putting Data Mining to Work”, “Doing Data Mining”, Data]

22 Data Mining User Interface
  SQL Server BI Development Studio
    Creation and exploration environment
    Data Mining projects inside Visual Studio solutions with related projects
    Source Control Integration
  SQL Server Management Studio
    Single place for management of all SQL Server technologies
    Manage, Browse, and Query Data Mining Models

23 Data Mining

24 Data Mining Algorithms
  Classification
  Estimation
  Segmentation
  Association
  Forecasting
  Text Analysis

25 Data Mining Algorithms: Classification
  Decision Trees
  Neural Nets
  Naïve Bayes
  Logistic Regression

26 Data Mining Algorithms: Estimation
  Decision Trees
  Neural Nets
  Logistic Regression
  Linear Regression

27 Data Mining Algorithms: Segmentation
  Clustering
  Sequence Clustering

28 Data Mining Algorithms: Association
  Association Rules
  Decision Trees

29 Data Mining Algorithms: Forecasting
  Time Series

30 Data Mining Algorithms: Text Analysis
  Integration Services
    Term Extraction Transform
    Term Lookup Transform

31 Data Mining Programmability
  DMX Query Interface
    OLEDB, ADO, ADO.Net, ADOMD.Net, XMLA
      Dim cmd As ADOMD.Command
      Dim reader As ADOMD.DataReader
      cmd.Connection = conn
      Set reader = cmd.ExecuteReader("Select Predict(Gender)…")
  Data Mining Object Model
    Analysis Management Objects (AMO)
  ADOMD.Net, Server ADOMD.Net
    Direct access to Mining content
    CLR User Defined Procedures execute on the server
  Expandability
    Plug-In Algorithms
    Plug-In Viewers

32 Session Summary
  Data Mining is the automatic extraction of information from data for descriptive or predictive purposes
  Data Mining addresses a wide variety of problems
  SQL Server 2005 contains a full-featured set of data mining tools and APIs for the creation and deployment of data mining solutions

33 Next Steps
  1) SQL Server website: http://www.microsoft.com/sql
  2) Virtual labs
  3) Data Mining Tutorial
  4) Find more info at: http://www.sqldatamining.com
  5) Ask Questions: news:microsoft.public.sqlserver.datamining

