Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.

Similar presentations


Presentation on theme: "1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed."— Presentation transcript:

1 1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed data are stored, not structured for decision making. Historical and long-term decision making data Saved for a pre-determined period of time, usually related to long- term decision making, structured for decision making. Contains data that will support decisions of strategic importance. Referred to as a “data warehouse”. Archival data Saved for a pre-determined period of time, used to track transactions for audit, not structured for decision making.

2 2

3 3

4 4 CutGlass data storage requirements Operational needs. What are examples of questions management needs to be able to answer to handle daily operations effectively? Decision support needs. What are examples of questions management needs to be able to answer to manage the organization effectively on a short and long-term basis? Governmental or auditing regulation needs. What types of questions might be relevant for this type of organization?

5 5 Operational data Includes: Master data (also called reference data): Customer, employee, material, item, supplier/vendor, etc. Transaction data: Order, orderline, purchase order, invoice, shipment, etc. Must store both master and transaction data. Must store changes to both master and transaction data.

6 6 Problems with operational data Not integrated. “Dirty”: Incomplete. Not accurate. Inconsistent. The meaning of the data is not fully defined and/or understood by all stakeholders.

7 7 Archival data Examples of archived data: Emergency dispatch calls. Credit card transactions. Accounts payable transactions. Tax-related data. Does not usually have to be accessed quickly. Can store data on tape or other cheaper, less accessible media. Must have procedures for extracting, transforming and loading (ETL) data as necessary. Archive database design is usually a copy of the transaction database design.

8 8 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics of a data warehouse? What are the components of a data warehousing system? How is a data warehouse created? How is a data warehouse accessed?

9 Compare and Contrast TPS and DSS

10 We use data to answer management questions TPS Questions What proposals have no decision as of today? Which work orders are overdue as of today? How much material was purchased for work order 10010? What are the labor costs currently for work order 10010? Data Warehouse Questions What are the characteristics of the proposals that weren’t accepted by clients? Which work orders were over cost estimates for the last 3 years? Are our proposal estimates improving (closer to actual) over the last 3 years? How about the last year? 10

11 Operational vs. Data Warehouse databases

12 So, can one database support both transaction processing and decision support applications? Yes No

13 A Business Intelligence “System” A business intelligence system encompasses all processes, hardware and software necessary to extract data, transform it, integrate it, store it, and provide information. The information is then made effective and accessible to users to support decision making. Sounds like just another information system... 13 So what makes it different?

14 14 Big Data!

15 15

16 16 Components of a business intelligence/data warehousing system Data store. Extraction/transformation/loading processes. End user query tools. End user visualization tools.

17 What is a data warehouse? A data warehouse is a database designed to support a decision support system. A data warehouse is: Integrated: It is a centralized, consolidated database integrating data from an entire organization. Subject-oriented: Data warehouse data are organized around key subjects. The data are usually arranged by topic, such as customers, products, suppliers, etc. Time-variant: Data in the warehouse contain a time dimension so that they may be used as a historical aggregation. Non-volatile: Once data enter, they seldom leave. Data are appended rather than overwritten. Data are updated in batches.

18 18 Issues in creating a data warehouse How to get accurate and complete data? How to consolidate data? Differing data meanings. Differing storage mechanisms. Differing data formats.

19

20 Data mart extraction data warehouse 20

21 Two-tier data warehouse architecture

22 Three-tier data warehouse architecture

23 23 Issues in designing a data warehouse Must have a predefined subject focus. Has the potential to be very large – must define the “grain” or granularity level of storage. Will always have a dimension of time. May contain derived data. May be a summary of data, rather than each detailed transaction. Does not always adhere to standard normalization rules.

24 Potential Data Mart for TEC 24

25 25 Accessing a data warehouse Visualization tools. Graphical. Spreadsheet format - usually Excel look-and-feel. Beyond the spreadsheet using discovery tools. Example: http://www.gapminder.org/ http://www.gapminder.org/ Dashboard. Examples: http://www.dundas.com/dashboard/online-examples/ http://www.dundas.com/dashboard/online-examples/ Query tools. OLAP: Online analytical processing. Data mining: Artificial intelligence based query methods.

26 26 Online analytical processing Provides multi-dimensional data analysis techniques. Works primarily with data aggregation. Provides advanced statistical analysis. Supports access to very large databases. Provides enhanced query optimization algorithms. Lots of acronyms: OLAP, ROLAP, MOLAP, HOLAP. Can be add-ons to existing products, example is Excel. Can have their own user interfaces.

27 OLAP vs. Data Mining questions

28 28 Data mining Data mining tools: analyze the data; uncover patterns hidden in the data; form computer models based on the findings; and use the models to predict business behavior. Proactive tools. Based on artificial intelligence software such as decision trees, neural networks, fuzzy logic systems, inductive nets and classification networking.


Download ppt "1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed."

Similar presentations


Ads by Google