Principles Operational v Analytical Systems Data Warehousing & Data Mining Sheffield Hallam University 1.

Slides:



Advertisements
Similar presentations
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Advertisements

The 4 T’s of Test Automation:
Introduction to Product Family Engineering. 11 Oct 2002 Ver 2.0 ©Copyright 2002 Vortex System Concepts 2 Product Family Engineering Overview Project Engineering.
ASYCUDA Overview … a summary of the objectives of ASYCUDA implementation projects and features of the software for the Customs computer system.
Relational Database and Data Modeling
Chapter 18 Methodology – Monitoring and Tuning the Operational System Transparencies © Pearson Education Limited 1995, 2005.
Yong Choi School of Business CSU, Bakersfield
Chapter 10: Designing Databases
Chapter 6 Data Design.
Creating Tables. 2 home back first prev next last What Will I Learn? List and provide an example of each of the number, character, and date data types.
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
Introduction to Databases
CHAPTER 8 INFORMATION IN ACTION
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 15-1 David M. Kroenke Database Processing Chapter 15 Business Intelligence.
Chapter 13 The Data Warehouse
C6 Databases.
Prepare Book Review The Game! By: Michael Crain, MSIM Chapter 6 Foundations of Business Intelligence Create Knowledge Remember Be a Winner The Game!
Lecture-7/ T. Nouf Almujally
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Accounting System Design
Managing Data Resources
Database Management: Getting Data Together Chapter 14.
Getting Started (Excerpts) Chapter One DAVID M. KROENKE’S DATABASE CONCEPTS, 2 nd Edition.
Introduction to Database Management
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Copyright © 2014 Pearson Education, Inc. 1 It's what you learn after you know it all that counts. John Wooden Key Terms and Review (Chapter 6) Enhancing.
What is Business Intelligence? Business intelligence (BI) –Range of applications, practices, and technologies for the extraction, translation, integration,
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
Database Systems – Data Warehousing
Chapter 5 Lecture 2. Principles of Information Systems2 Objectives Understand Data definition language (DDL) and data dictionary Learn about popular DBMSs.
 Introduction Introduction  Purpose of Database SystemsPurpose of Database Systems  Levels of Abstraction Levels of Abstraction  Instances and Schemas.
6-1 DATABASE FUNDAMENTALS Information is everywhere in an organization Information is stored in databases –Database – maintains information about various.
The McGraw-Hill Companies, Inc Information Technology & Management Thompson Cats-Baril Chapter 3 Content Management.
STORING ORGANIZATIONAL INFORMATION— DATABASES CIS 429—Chapter 7.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Chapter 7: Database Systems Succeeding with Technology: Second Edition.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Dimu' Rumpak © 2009 by Prentice Hall 1 Getting Started Didimus Rumpak, M.Si. Database Concepts Chapter 1 1.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
CISB594 – Business Intelligence
CHAPTER 3 DATABASES AND DATA WAREHOUSES. 2 OPENING CASE STUDY Chrysler Spins a Competitive Advantage with Supply Chain Management Software Chapter 2 –
Datawarehouse A sneak preview. 2 Data Warehouse Approach An old idea with a new interest: Cheap Computing Power Special Purpose Hardware New Data Structures.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Warehouse. Group 5 Kacie Johnson Summer Bird Washington Farver Jonathan Wright Mike Muchane.
Data resource management
Data Warehouses and OLAP Data Management Dennis Volemi D61/70384/2009 Judy Mwangoe D61/73260/2009 Jeremy Ndirangu D61/75216/2009.
Chapter 3 Databases and Data Warehouses: Building Business Intelligence Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
McGraw-Hill/Irwin ©2009 The McGraw-Hill Companies, All Rights Reserved CHAPTER 6 DATABASES AND DATA WAREHOUSES CHAPTER 6 DATABASES AND DATA WAREHOUSES.
Foundations of Business Intelligence: Databases and Information Management.
Fundamentals of Information Systems, Sixth Edition Chapter 3 Database Systems, Data Centers, and Business Intelligence.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
I am Xinyuan Niu I am here because I love to give presentations. Data Warehousing.
Managing Data Resources File Organization and databases for business information systems.
Getting started with Accurately Storing Data
Intro to MIS – MGS351 Databases and Data Warehouses
Databases and Information Management
Fundamentals & Ethics of Information Systems IS 201
Chapter 13 The Data Warehouse
Fundamentals of Information Systems
Databases and Data Warehouses Chapter 3
Accounting System Design
MANAGING DATA RESOURCES
An Introduction to Data Warehousing
Accounting System Design
Presentation transcript:

Principles Operational v Analytical Systems Data Warehousing & Data Mining Sheffield Hallam University 1

A customer walks into a bank, and speaks to a cashier. The cashier uses his/her computer to answer the queries / fulfil the actions. List five typical queries / actions 1. __________________________________ 2. __________________________________ 3. __________________________________ 4. __________________________________ 5. __________________________________ 2

What similarities can you spot about the nature of the data involved in your five queries / actions? Hint: volume, up-to-dateness (“currency”), level of detail l List four similarities 1. __________________________________ 2. __________________________________ 3. __________________________________ 4. __________________________________ 3

What general characteristics do you think can be stated about the IT application that the cashier uses to handle the customer queries / actions? Hint: Complete the following sentence: The application is oriented towards l List four characteristics 1. __________________________________ 2. __________________________________ 3. __________________________________ 4. __________________________________ 4

Operational Systems 1.High volume of transactions 2.Small processing per transaction 3.Frequent updating of data 4.Data is always current 5.Transaction driven 6.Predictable query types 7.Static structure 8.Content varies 9.High accuracy 10.High availability 11.Mature support Table 1: Attributes of an Operational System 5

Explanation of points 7 & 8 on the previous slide How often does the underlying design of the bank’s database have to change (tables, relationships, integrity rules etc)? Just thinking about the current data (not the archive data), how much bigger /smaller will the bank’s volume of data be in one year’s time? Just thinking about the current data, how much of that data will have different values in one year’s time? Do you understand why we said “Static structure; content varies” ? 6

Operational systems are generally based on Relational Database systems. § Very highly optimised towards fast writing / retrieval of small items of data (eg Oracle, DB2, SQL-Server, MySQL = very long established, huge $$ research investments) § Highly optimised towards using Relationships to fetch related data (eg Customer Name, and current balance) These reasons make Relational Databases extremely quick Operational Systems 7

Operational systems are generally based on Relational Database systems (cont...) §The database itself can enforce Referential Integrity (eg cannot delete customer name & address if they still have an account open) Relational design: Only store each data item in one place (eg if customer changes address, only one copy to change) These reasons make Operational applications much easier to write. Operational Systems 8

The literature tends to use the term On-Line Transactional Processing (OLTP) for what we have described as “operational” systems. Transactional = On-line : This term is a bit historic... originally most systems processed batches of data non-interactively (cheques are one of the few batch-oriented systems left now). Now systems all tend to be on-line / interactive Since the industry calls it OLTP, we will too. 9

Management have different needs to that of the operational side of the business. Managers are much more concerned with trends and totals and are generally not so concerned with the finer details. What they want are reporting systems that: l Give quick access to summaries of data l Have data structures that are business oriented l Allow users to explore the data l Give them control over report writing Management Reporting 10

The manager of the bank wants to analyse the effectiveness of last month’s business List five typical queries 1. __________________________________ 2. __________________________________ 3. __________________________________ 4. __________________________________ 5. __________________________________ Nb: TRY to stick with just data drawn from the Cashier system. But you will find this hard. Later we will see that it is a feature of Analytical system that they integrate data from many sources 11

What similarities can you spot about the nature of the data involved in your five queries? Hint: volume, up-to-dateness (“currency”), level of detail l List four similarities 1. __________________________________ 2. __________________________________ 3. __________________________________ 4. __________________________________ 12

What general characteristics do you think can be stated about analytical applications? Hint: Complete the following sentence: The application is oriented towards l List four characteristics 1. __________________________________ 2. __________________________________ 3. __________________________________ 4. __________________________________ 13

Analytical Systems 1.Small volume of transactions 2.Often huge processing per transaction 3.Data output level is summary 4.Data routinely added to, but infrequently changed 5.Analysis driven 6.Flexible results structure 7.'Fairly accurate' better than no result 8.Medium availability 9.Requires different database tools Table 2: Attributes of an Analytical System 14

 Data is stored in structures easy for business users to understand (not constrained by Relational rules)  Data held in duplicate if this makes access easier/quicker (eg can hold summary/totals of data too)  Out-of-date data held (with timestamp) as well as new (allows examination of historical trends) These sort of systems are referred to as OLAP systems or On-Line Analytical Processing Systems. On-line Analytical Processing (OLAP) The Management Data is better stored in a Data Warehouse. 15

16 The diagram below shows how Data is converted into Business Intelligence InformationKnowledge Structure Analyse Apply DataIntelligence The Process Flow

The diagram below shows the relationships between the various components: Database Data Warehouse OLTP OLAP SQL Server, Oracle, Ingres, Informix, DB/2 etc SAS Warehouse Administrator, SQL Server etc Application Software, SAP, SQL Server, Oracle, Spreadsheets etc SQL Server Analysis Services, Crystal Analysis, Oracle Discovery etc Cleansing/Staging

Database Data Warehouse OLTP OLAP SQL Server, Oracle, Ingres, Informix, DB/2 etc SAS Warehouse Administrator, SQL Server etc Application Software, SAP, SQL Server, Oracle, Spreadsheets etc SQL Server Analysis Services, Crystal Analysis, Oracle Discovery etc Cleansing/Staging Structure Analyse Apply InformationKnowledgeDataIntelligence

A Data Warehouse is... "... where data is specifically structured for query and analysis performance and ease-of-use" Kimball, 2002 Definition 19

Data Warehouses offer the flexibility needed to cope with the Management demands. A major issue is that often there are many differing OLTP systems and other data storage media. The Data Warehouse offers the opportunity to gather these together into one system with a unified structure. Because data is stored in a simplified aggregated format it allow reports to be written by staff who have a lesser computing background. Why a Data Warehouse? 20

OLAP versus OLTP If a report is designed to obtain information from an OLTP system it will generally be: Slow to produce the answer Complicated to write Slow down the operational system Need complicated formulas for grouping data 21

OperationalAnalytical DataIndividual ItemsSummarised Data RelationsSimple ChainsComplex/Unknown TimePresentPast AccessRecord-at-a-timeMany records ApproachSupport a transactionExplore a domain ImplementationRelationalFollow the module! Summary 22