Presentation is loading. Please wait.

Presentation is loading. Please wait.

Data Warehousing. Databases support: Transaction Processing Systems –operational level decision –recording of transactions Decision Support Systems –tactical.

Similar presentations


Presentation on theme: "Data Warehousing. Databases support: Transaction Processing Systems –operational level decision –recording of transactions Decision Support Systems –tactical."— Presentation transcript:

1 Data Warehousing

2 Databases support: Transaction Processing Systems –operational level decision –recording of transactions Decision Support Systems –tactical and strategic decision making –analysis of historical records

3 Can one database support both? RDBMS TPS DSS

4 Can one database support both? RDBMS TPS DSS Yes… but at a cost in performance. low concurrency large reads significant aggregation high concurrency small transactions limited aggregation

5 The Solution… Production Database (OLTP) TPSDSS Data Warehouse Extract, Transport & Transformation Load

6 OLTP vs DW Characteristics OLTP DatabaseData Warehouse High Read/Write ConcurrencyPrimarily Read Only Highly Normalized Highly Denormalized Limited Transaction HistoryMassive Transaction History Very Detailed DataDetailed and Summarized Data Limited External DataSignificant External Data

7 Data Marts (3-tier approach) Production Database (OLTP) DSS Data Warehouse ETL Data Mart A Data Mart B Data Mart C DSS Transformation & Limitation External Data Sources

8 Data Marts (bottom-up approach) Production Database (OLTP) DSS Data Mart A Data Mart B Data Mart C DSS External Data Sources External Data Sources External Data Sources ETL

9 Multi-dimensional (Sales) Data 70556035 40905030 801106025 Soda Diet Soda Lime Soda Orange Soda California Utah Arizona March 1 March 2 March 3

10 Cube Operations Cube (group by option) Slice (implement in Oracle with where clause) Dice (implement in Oracle with where clause) Drill Down (implemented in report writers) Roll-up (group by option) Pivot (not implemented by Oracle (but by Access))

11 Cube Data Example Create table sales ( Item varchar2(20), State varchar2(20), Amount number(6), Day date); Insert into Sales values('Soda','California',80,'01-Mar-2004'); Insert into Sales values('Diet Soda','California',110,'01-Mar-2004'); …

12 Examine these queries Select * from sales; Select Item, State, sum(amount) from sales group by Item, State; Select Item, State, sum(amount) from sales group by Rollup(Item, State); Select State, Item, sum(amount) from sales group by Rollup(State, Item); Select State, Item, sum(amount) from sales group by Cube(State, Item);

13 Materialized Views Materialized views are schema objects that can be used to summarize, precompute, replicate, and distribute data. They are suitable in various computing environments such as data warehousing, decision support, and distributed or mobile computing: In data warehouses, materialized views are used to precompute and store aggregated data such as sums and averages. Materialized views in these environments are typically referred to as summaries because they store summarized data. Cost-based optimization can use materialized views to improve query performance by automatically recognizing when a materialized view can and should be used to satisfy a request. The optimizer transparently rewrites the request to use the materialized view. Queries are then directed to the materialized view and not to the underlying detail tables or views. In distributed environments, materialized views are used to replicate data at distributed sites and synchronize updates done at several sites with conflict resolution methods. The materialized views as replicas provide local access to data that otherwise has to be accessed from remote sites. In mobile computing environments, materialized views are used to download a subset of data from central servers to mobile clients, with periodic refreshes from the central servers and propagation of updates by clients back to the central servers.

14 Create Materialized View (partial syntax)

15 Materialized View refresh_clause

16 MV Example Create Materialized View MVcustomer REFRESH start with sysdate Next sysdate+(1/24) AS Select customerID,lastname,firstname, phone from customers;

17 RDBMS Star Schema Sales SalesNO SalesUnits SalesDollars SalesCost Store StoreID Manager Street City Zip Item ItemID Name UnitPrice Brand Category Customer CustID Name Phone Street City Day DayID DayOfMonth Month Year DayOfWeek ItemID CustID StoreID DayID


Download ppt "Data Warehousing. Databases support: Transaction Processing Systems –operational level decision –recording of transactions Decision Support Systems –tactical."

Similar presentations


Ads by Google