Presentation is loading. Please wait.

Presentation is loading. Please wait.

Data Warehousing (Kimball, Ch.2-4) Dr. Vairam Arunachalam School of Accountancy, MU.

Similar presentations


Presentation on theme: "Data Warehousing (Kimball, Ch.2-4) Dr. Vairam Arunachalam School of Accountancy, MU."— Presentation transcript:

1 Data Warehousing (Kimball, Ch.2-4) Dr. Vairam Arunachalam School of Accountancy, MU

2 Sep. 9, 1999Dr. Vairam Arunachalam2 Grocery Store case terminology n SKUs – Stock-keeping units n UPCs – Universal Product codes n POS system – Point of Sale system n Promotions – TPRs, ads in newspapers, newspaper inserts, displays (shelf displays and end-aisle displays), coupons n Promotion dimension – lift, baseline sales, time shifting, cannibalization, growing the market

3 Sep. 9, 1999Dr. Vairam Arunachalam3 The Grocery Store n Steps in the Design Process: –Choose a business process to model (e.g., daily item movement) –Choose the grain of the business process (e.g., SKU by store by promotion by day) –Choose the dimensions applicable to the fact table (e.g., time, product, store, promotion) –Choose the measured facts (e.g., dollar sales, unit sales, dollar cost, customer count)

4 Sep. 9, 1999Dr. Vairam Arunachalam4 Salient Principles –The data warehouse almost always demands data expressed at the lowest possible grain of each dimension… –A careful grain statement determines the dimensionality of the fact table. –The number of sales transaction line items in a business can be estimated by dividing the gross revenue of the business by the average price of a sales item.

5 Sep. 9, 1999Dr. Vairam Arunachalam5 Salient Principles (contd.) –The fact table in a dimensional schema is naturally highly normalized –Efforts to normalize any of the tables in a dimensional database solely in order to save disk space are a waste of time –The dimension tables must not be normalized but should remain as flat tables. (Because?…)

6 Sep. 9, 1999Dr. Vairam Arunachalam6 Salient Principles (contd.) –Most data warehouses need an explicit time dimension table even though the primary time key may be an SQL date-valued object. (Because?…) –Drilling down in a data warehouse is adding row headers from the dimension tables. Drilling up is subtracting row headers. –The product dimension is one of the primary dimensions in nearly every data warehouse.

7 Sep. 9, 1999Dr. Vairam Arunachalam7 Normalization review n 1NF: no repeating groups; primary key defined n 2NF: non-key domains functionally dependent on entire primary key n 3NF: no dependencies between non-key domains

8 Sep. 9, 1999Dr. Vairam Arunachalam8 Other Issues n Database sizing n Domain transfer – design variations n Additive vs. semi- (or non-additive) dimensions

9 Sep. 9, 1999Dr. Vairam Arunachalam9 The Warehouse n Inventory Models: –The Inventory Snapshot model –Delivery Status model –Transaction model

10 Sep. 9, 1999Dr. Vairam Arunachalam10 Inventory Snapshot Model –Fig. 3.2 –Gross Margin Return on Inventory (GMROI) = [(Qty Ship)*(Value at LSP – Value at Cost)] / [(Daily Avg Qty)*(Value at LSP)]

11 Sep. 9, 1999Dr. Vairam Arunachalam11 Delivery Status Model –Steps: n Received n Inspected n Placed into inventory n Authorized to sell n Picked from inventory n Boxed n Shipped

12 Sep. 9, 1999Dr. Vairam Arunachalam12 Delivery Status Model (contd.) –Exception Conditions: n Failed inspection n Returned to vendor n Damaged in handling n Lost n Returned from customer n Returned to inventory n Written off n Refunded –Fig 3.3

13 Sep. 9, 1999Dr. Vairam Arunachalam13 Transaction Model –Includes: n Receive shipment line item n Place SKU into inspection hold n Release SKU from inspection hold n Place SKU into inspection failed with reason n Mark SKU for return to vendor with reason n Place SKU in bin n Authorize SKU for sale n Pick SKU from bin

14 Sep. 9, 1999Dr. Vairam Arunachalam14 Transaction Model (contd.) –Includes: n Package SKU for shipment n Ship SKU to customer n Bill customer n Receive SKU from customer with reason n Return SKU to inventory from customer return n Remove SKU from inventory with reason –Fig. 3.4

15 Sep. 9, 1999Dr. Vairam Arunachalam15 Transaction Model (contd.) –Sample queries: n How many times have we placed a product into an inventory bin on the same day we have picked the product from the same bin at a different time? n What is the clustering in time of customer returns of a particular SKU? n How many separate shipments did we receive from vendor X and when did we get them? n On which SKUs have we had more than one round of QA inspection failures that caused the return of the product to the vendor?

16 Sep. 9, 1999Dr. Vairam Arunachalam16 Transaction Model (contd.) –Transplant context (e.g.,FedEx) –Compare models

17 Sep. 9, 1999Dr. Vairam Arunachalam17 Salient principles –All measures that record a static level (such as…) are inherently nonadditive across time. However, in these cases the measure may be usefully aggregated across time by averaging over the number of time periods. –Document control numbers (such as…) usually are presented as degenerate dimensions (i.e., dimension keys with no corresponding dimension table) in fact tables where the grain of the table is the document itself or a line item in the document.

18 Sep. 9, 1999Dr. Vairam Arunachalam18 Salient principles (contd.) n Exceptions to absolute additivity in the fact table can be made where the additive measures are more conveniently delivered in a view. Examples include computed time spans from a large number of date fields, as well as extended monetary amounts derived from units costs and prices. In such a case, it is important to have all users access the view instead of the underlying table. (Tie to 3NF)

19 Sep. 9, 1999Dr. Vairam Arunachalam19 Shipments n The ideal shipments fact table (Fig. 4.1) n Typical customer ship-to dimension (Fig. 4.2) n Typical deal dimension (Fig. 4.3) n Typical ship mode dimension (Fig. 4.4)


Download ppt "Data Warehousing (Kimball, Ch.2-4) Dr. Vairam Arunachalam School of Accountancy, MU."

Similar presentations


Ads by Google