Trends in Data Warehousing. CHAPTER OBJECTIVES  Review the continued growth in data warehousing  Learn how data warehousing is becoming mainstream 

Slides:



Advertisements
Similar presentations
Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
Advertisements

By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Technical BI Project Lifecycle
Management Information Systems, Sixth Edition
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
Managing Data Resources
Chapter 3 Database Management
The Hierarchy of Data Bit (a binary digit): a circuit that is either on or off Byte: 8 bits Character: each byte represents a character; the basic building.
1 Week 4 Decision Support System (DSS)/ Intelligent DSS.
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
Components and Architecture CS 543 – Data Warehousing.
Organizing Data & Information
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Chapter 14 The Second Component: The Database.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 3-1 Chapter 3 Decision Support Systems:
Introduction to Building a BI Solution 권오주 OLAPForum
The Premier Software Usage Analysis and Reporting Toolset Maximizing Value for Software Users.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Chapter 1: The Database Environment
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Lecture-8/ T. Nouf Almujally
Chapter 2: Business Intelligence Capabilities
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
What is Business Intelligence? Business intelligence (BI) –Range of applications, practices, and technologies for the extraction, translation, integration,
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
1.Knowledge management 2.Online analytical processing 3. 4.Supply chain management 5.Data mining Which of the following is not a major application.
Database Systems – Data Warehousing
Chapter 5 Lecture 2. Principles of Information Systems2 Objectives Understand Data definition language (DDL) and data dictionary Learn about popular DBMSs.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Fundamentals of Information Systems, Fifth Edition
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
MANAGEMENT SUPPORT SYSTEMS II 7. Business Intelligence.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Preparation NAME::ASMAA ALASY Supervision A::RASHA ATALLAH.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Data Warehouse. Group 5 Kacie Johnson Summer Bird Washington Farver Jonathan Wright Mike Muchane.
Architecture of Decision Support System
By N.Gopinath AP/CSE. There are 5 categories of Decision support tools, They are; 1. Reporting 2. Managed Query 3. Executive Information Systems 4. OLAP.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
INNOV-10 Progress® Event Engine™ Technical Overview Prashant Thumma Principal Software Engineer.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Building Dashboards SharePoint and Business Intelligence.
DATA RESOURCE MANAGEMENT
Decision Support Systems: An Overview by Dr.S.Sridhar,Ph.D., RACI(Paris),RZFM(Germany),RMR(USA),RIEEEProc. web-site :
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
1 Database Systems, 8 th Edition 1 Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: –How business intelligence.
Learning Objectives Understand the concepts of Information systems.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
Managing Data Resources File Organization and databases for business information systems.
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Chapter 3 Decision Support Systems: An Overview
Popular Database Management Systems
Chapter 13 Business Intelligence and Data Warehouses
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehouse.
MANAGING DATA RESOURCES
Data Warehousing and Data Mining
Analysis models and design models
Tutorial 7 – Integrating Access With the Web and With Other Programs
Types and Importance of Information systems
Chapter 3 Database Management
Chapter 3 Decision Support Systems: An Overview
Presentation transcript:

Trends in Data Warehousing

CHAPTER OBJECTIVES  Review the continued growth in data warehousing  Learn how data warehousing is becoming mainstream  Discuss several major trends, one by one  Grasp the need for standards and review the progress

Continued Growth in Data Warehousing  Data Warehousing is Becoming Mainstream  Data Warehouse Expansion  Vendor Solutions and Products Significant Trends 1. Multiple Data Types 2. Data Visualization 3. Parallel Processing 4. Query Tools 5. Browser Tools 6. Data Fusion 7. Agent Technology

Continued Growth in Data Warehousing In every industry across the board, from retail chain stores to financial institutions, from manufacturing enterprises to government departments, from airline companies to utility businesses, data warehousing is revolutionizing the way people perform business analysis and make strategic decisions. Significant Trends Let us separate out the significant trends and discuss each briefly. Be prepared to visit each trend, one by one—every one has a serious impact on data warehousing. As we walk through each trend, try to understand its significance and be sure that you perceive its relevance to your company’s data warehouse. Be prepared to answer the question: What must you do to take advantage of the trend in your data warehouse?

1- Multiple Data Types Traditionally, companies included structured data, mostly numeric, in their data warehouses. From this point of view, decision support systems were divided into two camps: Data warehousing dealt with structured data Knowledge management involved unstructured data. For example, most marketing data consists of structured data in the form of numeric values. Marketing data also contains unstructured data in the form of images. Let us say a decision maker is performing an analysis to find the top selling product types. The decision maker arrives at a specific product type in the course of the analysis. He or she would now like to see images of the products in that type to make further decisions. How can this be made possible? Companies are realizing there is a need to integrate both structured and unstructured data in their data warehouses. What are the types of data we call unstructured data? Figure 3-4 shows the different types of data that need to be integrated in the data warehouse to support decision making more effectively.

Figure 3-4 Data warehouse: multiple data types.

Adding Unstructured Data. Some vendors are addressing the inclusion of unstructured data, especially text and images, by treating such multimedia data as just another data type. These are defined as part of the relational data and stored as binary large objects (BLOBs) up to 2 GB in size. User-defined functions (UDFs) are used to define these as user-defined types (UDTs). Searching Unstructured Data. You have enhanced your data warehouse by adding unstructured data. Is there anything else you need to do? Of course, without the ability to search unstructured data, integration of such data is of little value. Vendors are now providing new search engines to find the information the user needs from unstructured data. Query by image content is an example of a search mechanism for images. The product allows you to pre-index images based on shapes, colors, and textures.

2- Data Visualization When a user queries your data warehouse and expects to see results only in the form of output lists or spreadsheets, your data warehouse is already outdated. You need to display results in the form of graphics and charts as well. Every user now expects to see the results shown as charts. Visualization of data in the result sets boosts the process of analysis for the user, especially when the user is looking for trends over time. Data visualization helps the user to interpret query results quickly and easily.

Major Visualization Trends. In the last few years, three major trends have shaped the direction of data visualization software. 1) More Chart Types:  Most data visualizations are in the form of some standard chart type. The numerical results are converted into a pie chart, a scatter plot, or another chart type. 2) Interactive Visualization:  Visualizations are no longer static. Dynamic chart types are themselves user interfaces. Your users can review a result chart, manipulate it, and then see newer views online. 3) Visualization of Complex and Large Result Sets:  You users can view a simple series of numeric result points as a rudimentary pie or bar chart. But newer visualization software can visualize thousands of result points and complex data structures.

Visualization Types.  Visualization software now supports a large array of chart types.  The current needs of users vary enormously.  The business users demand pie and bar charts.  The technical and scientific users need scatter plots and constellation graphs.  Analysts looking at spatial data need maps and other three- dimensional representations.  Executives and managers, who need to monitor performance metrics, like digital dashboards that allow them to visualize the metrics as speedometers, thermometers, or traffic lights.

Advanced Visualization Techniques. The most remarkable advance in visualization techniques is the transition from static charts to dynamic interactive presentations. Chart Manipulation. A user can rotate a chart or dynamically change the chart type to get a clearer view of the results. With complex visualization types such as scatter plots, a user can select data points with a mouse and then move the points around to clarify the view. Drill Down. The visualization first presents the results at the summary level. The user can then drill down the visualization to display further visualizations at subsequent levels of detail. Advanced Interaction. These techniques provide a minimally invasive user interface. The user simply double clicks a part of the visualization and then drags and drops representations of data entities. Or, the user simply right clicks and chooses options from a menu. Visual query is the most advanced of user interaction features.

3- Parallel Processing You know that the data warehouse is a user-centric and query-intensive environment. Your users will constantly be executing complex queries to perform all types of analyses. Each query would need to read large volumes of data to produce result sets. Analysis, usually performed interactively, requires the execution of several queries, one after the other, by each user. If the data warehouse is not tuned properly for handling large, complex, simultaneous queries efficiently, the value of the data warehouse will be lost. A task is divided into smaller units and these smaller units are executed concurrently. Parallel Processing Hardware Options: In a parallel processing environment, you will find these characteristics: multiple CPUs, memory modules, one or more server nodes, and high-speed communication links between interconnected nodes.

Parallel Processing Software Implementation. You may choose the appropriate parallel processing hardware configuration for your data warehouse. Parallel processing software must be capable of performing the following steps:  Analyzing a large task to identify independent units that can be executed in parallel  Identifying which of the smaller units must be executed one after the other  Executing the independent units in parallel and the dependent units in the proper sequence  Collecting and consolidating the results returned by the smaller units

Parallel processing options Database vendors usually provide two options for parallel processing: parallel server option parallel query option parallel server option and parallel query option. The parallel server option allows each hardware node to have its own separate database instance, and enables all database instances to access a common set of underlying database files. The parallel query option supports key operations such as query processing, data loading, and index creation to be parallelized. In summary, advantages when you adopt parallel processing in your data warehouse:  Performance improvement for query processing, data loading, and index creation  Scalability, allowing the addition of CPUs and memory modules without any changes to the existing application  Fault tolerance so that the database would be available even when some of the parallel processors fail  Single logical view of the database even though the data may reside on the disks of multiple nodes

In a data warehouse, if there is one set of functional tools that are most significant, it is the set of query tools. The success of your data warehouse depends on your query tools. Because of this, data warehouse vendors have improved query tools during the past few years. the following functions for which vendors have greatly enhanced their query tools.  Flexible presentation—Easy to use and able to present results online and on reports in many different formats.  Aggregate awareness—Able to recognize the existence of summary or aggregate tables and automatically route queries to the summary tables  Crossing subject areas—Able to cross over from one subject data mart to another automatically.  Multiple heterogeneous sources—Capable of accessing heterogeneous data sources on different platforms.  Integration—Integrate query tools for online queries, batch reports, and data extraction for analysis.  Overcoming SQL limitations—Provide SQL extensions to handle requests that cannot usually be done through standard SQL. 4- Query Tools

5- Browser Tools Some recent trends in enhancements to browser tools:  Tools are extensible to allow definition of any type of data.  Inclusion of open APIs (application program interfaces).  Provision of several types of browsing functions.  Allowing users to browse the catalog (data dictionary or metadata).  Applying Web browsing and search techniques to browse through the information catalogs.

6 - Data Fusion Data fusion is a technology dealing with the merging of data from disparate sources. It has a wider scope and includes real-time merging of data from instruments and monitoring systems.

7- Agent Technology A software agent is a program that is capable of performing a predefined programmable task on behalf of the user. For example, on the Internet, software agents can be used to sort and filter out according to rules defined by the user. Within the data warehouse, software agents are beginning to be used to alert the users of predefined business conditions. They are also beginning to be used extensively in conjunction with data mining and predictive modeling techniques.

CHAPTER SUMMARY  Data warehousing is becoming mainstream with the spread of high-volume data warehouses and the rapid increase in the number of vendor products.  To be effective, modern data warehouses need to store multiple types of data: structured and unstructured, including documents, images, audio, and video.  Data visualization deals with displaying information in several types of visual forms: text, numerical arrays, spreadsheets, charts, graphs, and so on. Tremendous progress has been made in data visualization.  Data warehouse performance may be improved by using parallel processing with appropriate hardware and software options.  It is critical to adapt data warehousing to work with ERP packages, knowledge management, and customer relationship systems.  Data warehousing industry is seriously seeking agreed-upon standards for metadata and OLAP. The end is perhaps in sight.