Presentation is loading. Please wait.

Presentation is loading. Please wait.

IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.

Similar presentations


Presentation on theme: "IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis."— Presentation transcript:

1 IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB –Precalculated Aggregates –Hierarchical Awareness Creating the OLAP DB –The Data Source View –The Cube Fact & Dimension Tables The Time Dimension Measures Using the Cube

2 IMS 6217: Data Warehousing / Business Intelligence Part 3 2 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Analysis Services—Why we Care Three levels of data sources for analysis –All three are separate copies of the data –Each has advantages/disadvantages & purposes Source data –OLTP relational databases –Other source data including external Data Warehouse –Source data integrated –Still a relational database Analysis Services (OLAP) Database

3 IMS 6217: Data Warehousing / Business Intelligence Part 3 3 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Analysis Services—Why we Care (cont.) Data Warehouse Strengths –Integrated data –'Scrubbed' data –Shortened relationship paths→Simpler queries –Optimized for queries rather than throughput Data Warehouse Limitations –Still a relational database –Performance lags when querying and summing millions of records

4 IMS 6217: Data Warehousing / Business Intelligence Part 3 4 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu The OLAP / Analysis Services Approach In their simplest forms OLAP databases have a logical structure similar to the star or snowflake schema we saw in the DW –Fact tables –Dimension tables Data storage structure is wildly different from relational DBMS Fact/Dimension tables are stored in 'cubes' –Multi-dimensional (not just three) relationships between fact and dimension tables –Preprocessed aggregates stored in DB

5 IMS 6217: Data Warehousing / Business Intelligence Part 3 5 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu The OLAP / Analysis Services Approach (cont.) Recall that fact tables contain –Keys that indicate dimensionality of the data –Measures that contain values of interest We will design Cubes based around a single fact table –Other approaches acceptable including multi-fact table cubes

6 IMS 6217: Data Warehousing / Business Intelligence Part 3 6 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu The OLAP / Analysis Services Approach (cont.) The OLAP engine is aware of –The relationship between the values in the dimensional key columns and the measures in the fact table Every sale is for one –Time Key –Customer Key –etc –Hierarchies in dimensional tables Country→State →City Year →Month →Date

7 IMS 6217: Data Warehousing / Business Intelligence Part 3 7 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Precalculated Aggregates The OLAP Engine precalculates aggregates along dimensions in the fact table If querying total sales value by customer and product this value may already be stored for each combination of customer and product –Aggregates are calculated and stored during processing of the cube (later)

8 IMS 6217: Data Warehousing / Business Intelligence Part 3 8 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Intelligent Hierarchies OLAP intelligently uses precalculated aggregates to total on hierarchies If aggregates are already calculated for sales by product by customer… … sales by product by country use the precalculated aggregate rather than querying the detail data There are special tools for establishing hierarchical relationships among time dimension components Relationships in snowflake schema will be automatically detected Others can be established at design time

9 IMS 6217: Data Warehousing / Business Intelligence Part 3 9 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Creating the OLAP DB Create the OLAP DB from the DW –They can also be created directly from source data Use Business Intelligence Development Studio to design, create, and load the OLAP DB –The Visual Studio project contains the definitions needed to design, create, and load –The project also creates the Analysis Services DB SQL Server & Analysis Services must both be running

10 IMS 6217: Data Warehousing / Business Intelligence Part 3 10 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Creating the OLAP DB (cont.) Steps –Create Analysis Services Project with connection(s) –Create Data Source View to define data to be loaded –Generate OLAP DB –Load OLAP DB OLAP DB available for use –Direct browsing –Serving via Analysis Services server –Reporting Services –Excel

11 IMS 6217: Data Warehousing / Business Intelligence Part 3 11 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Creating the OLAP DB New Business Intelligence Project Type is Analysis Services Manage project file locations

12 IMS 6217: Data Warehousing / Business Intelligence Part 3 12 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Create Data Source(s) for the Project Create a data source just as we did for the data warehouse load project Point the data source to the data warehouse DB –Create new connection if necessary –Select "Default" Impersonation Information if DW DB does not require login

13 IMS 6217: Data Warehousing / Business Intelligence Part 3 13 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Create the Data Source View The Data Source View (DSV) is a map from the source data (data warehouse in our case) to the OLAP DB –May include data transformations –Allows fact data and dimensions to be identified –Allows hierarchies to be established Special tools for time hierarchies Create new DSV in Solution Explorer –Set Data Source

14 IMS 6217: Data Warehousing / Business Intelligence Part 3 14 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Create Data Source View (cont.) Select fact table to be loaded Select dimension tables –Use Add Related Tables button –Manually select –Include hierarchical tables as necessary from snowflake schema Name DSV when all tables are selected

15 IMS 6217: Data Warehousing / Business Intelligence Part 3 15 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Create Data Source View (cont.) DSV template is created from the selected tables Template may be modified –Add calculated columns Ready to add new cube when DSV is complete

16 IMS 6217: Data Warehousing / Business Intelligence Part 3 16 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Create Cube Create new cube from Solution Explorer Select Build cube from data source Select the DSV that was created to be the basis for the new cube

17 IMS 6217: Data Warehousing / Business Intelligence Part 3 17 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Build Cube—Confirm Fact & Dimension Tables Confirm suggested fact and dimension tables Wizard frequently misidentifies dimension tables as fact tables –Just check Be sure to identify the Time Dimension Table

18 IMS 6217: Data Warehousing / Business Intelligence Part 3 18 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Build Cube—Map Time Dimension Columns Time has built in hierarchies Map the Time Dim columns to the predefined time hierarchical concepts –Not all will be mapped

19 IMS 6217: Data Warehousing / Business Intelligence Part 3 19 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Build Cube—Identify Measures Uncheck spurious columns that will not be used as measures in the fact table Next step detects hierarchies –No operator choices

20 IMS 6217: Data Warehousing / Business Intelligence Part 3 20 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Build Cube—Review Hierarchies The hierarchies screen allows you to review, delete, & modify hierarchies

21 IMS 6217: Data Warehousing / Business Intelligence Part 3 21 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Building the Cube—Name the Cube Give the cube a meaningful name Default is the same as the DSV which should probably not be used Click Finish to build the cube design

22 IMS 6217: Data Warehousing / Business Intelligence Part 3 22 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Building the Cube—Reviewing Cube Design

23 IMS 6217: Data Warehousing / Business Intelligence Part 3 23 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Building the Cube—Reviewing Cube Design The cube structure tab will show the cube design in a way that looks much like the DSV Any changes will be reflected –Calculated columns –Renamed columns

24 IMS 6217: Data Warehousing / Business Intelligence Part 3 24 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Processing the Cube The cube must be processed before it can be used –Select Process… from the Cube menu –On first run you will be prompted to build and deploy the project first –Select Run from the Process Cube dialog –This may take some time—this is where the data is being loaded into the OLAP DB and initial aggregations created

25 IMS 6217: Data Warehousing / Business Intelligence Part 3 25 Dr. Lawrence West, Management Dept., University of Central Florida lwest@bus.ucf.edu Demonstrations Browser Reporting Services Excel Access to OLAP DB Do Your Own –Build a Cube around the Adventure Works DW Internet Sales Fact Table


Download ppt "IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis."

Similar presentations


Ads by Google