Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.

Slides:



Advertisements
Similar presentations
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Advertisements

Introduction to Databases
Management Information Systems, Sixth Edition
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
File Systems and Databases
Chapter 17 Designing Databases
Organizing Data & Information
1 Data & Database Development. 2 Data File Bit Byte Field Record File Database Entity Attribute Key field Key file management concepts include:
Data Mining – Intro.
Chapter 10: Architectural Design
A Comparsion of Databases and Data Warehouses Name: Liliana Livorová Subject: Distributed Data Processing.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
XML, distributed databases, and OLAP/warehousing The semantic web and a lot more.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 1 Chapter 1 - Introduction: Databases and Database Users - Outline Types of Databases and.
CSI315CSI315 Web Development Technologies Continued.
Dr.S.Sridhar,Ph.D., RACI(Paris),RZFM(Germany),RMR(USA),RIEEEProc.
Data Warehouse & Data Mining
Systems analysis and design, 6th edition Dennis, wixom, and roth
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Introduction: Databases and Database Users
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
1 Data Warehouses BUAD/American University Data Warehouses.
Data Warehousing.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Data resource management
Chapter 13 Designing Databases Systems Analysis and Design Kendall & Kendall Sixth Edition.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Chapter 25: Distributed Databases. Definitions Distributed Database – a collection of of multiple logically interrelated databases distributed over a.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Foundations of Business Intelligence: Databases and Information Management.
IS 320 Notes for April 15, Learning Objectives Understand database concepts. Use normalization to efficiently store data in a database. Use.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Advanced Database Concepts
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
Distributed Database Management Systems. Reading Textbook: Ch. 1, Ch. 3 Textbook: Ch. 1, Ch. 3 For next class: Ch. 4 For next class: Ch. 4 FarkasCSCE.
Lecture On Introduction (DBMS) By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Research Directions in Databases Technological Education Institution of Larisa in collaboration with Staffordshire University Larisa Dr. Theodoros.
Postgraduate Module Enterprise Database Systems Technological Educational Institution of Larisa in collaboration with Staffordshire University Larisa
Chapter 1 Overview of Databases and Transaction Processing.
Christoph F. Eick: Final Words COSC Topics Covered in COSC 3480  Data models (ER, Relational, XML)  Using data models; learning how to store real.
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
Dr.S.Sridhar,Ph.D., RACI(Paris),RZFM(Germany),RMR(USA),RIEEEProc.
Databases and DBMSs Todd S. Bacastow January 2005.
Data Mining – Intro.
Chapter 13 The Data Warehouse
Chapter 5 Data Management
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehouse.
Chapter 2 Database Environment Pearson Education © 2009.
Data, Databases, and DBMSs
MANAGING DATA RESOURCES
Data Warehouse and OLAP
Database management concepts
Topics Covered in COSC 6340 Data models (ER, Relational, XML (short))
File Systems and Databases
Topics Covered in COSC 6340 Data models (ER, Relational, XML)
Database management concepts
Introduction of Week 9 Return assignment 5-2
Chapter 17 Designing Databases
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment Pearson Education © 2009.
Data Warehouse and OLAP
Presentation transcript:

Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008

Advanced Topics in Databases Active Databases Temporal and Spatial Database Deductive Databases Distributed Databases XML Data Mining Data Warehousing OLAP Mobile Databases Multimedia Databases GIS Genome Data Management

Active Databases Active Behavior = ability to react to events dimensions: event condition action execution model management Application Areas composite objects integrity constraints, business rules derived data

Temporal Databases A temporal database is a database management system with built-in time aspects, e.g. a temporal data model and a temporal version of structured query language. More specifically the temporal aspects usually include valid- time and transaction-time. These attributes go together to form bitemporal data. Valid time denotes the time period during which a fact is true with respect to the real world. Transaction time is the time period during which a fact is stored in the database.

Spatial Databases offers spatial data types supports spatial indexing find all cities in Bavaria supports spatial joins for each river, find all cities within 50 KM managing space  large collections of simple geometric objects 2D: geography (GIS), VLSI design 3D: astronomy, brain maps, molecules

Deductive Databases logic programming + persistence prolog  datalog facts and rules inference engine

Distributed Databases data and processing (server) reside on multiple computers transparent replication/distribution increased reliability and availability improved performance easier expansion federated database system shared global schema multidatabase system interactively constructs shared schema

Distributed Transactions ensuring transaction properties is difficult, since multiple machines are involved 2 phase commit: phase 1: coordinator sends “prepare to commit” to all participants participants force-write all logs participants indicate “ready to commit” or “cannot commit” phase 2: if all participants are ready to commit, coordinator sends commit command if any participant cannot commit, coordinator sends abort command

Structured, semi-structured, unstructured data structured data: fits a predefined format relation schema Java class semi-structured data: structure is flexible, but can be described at any particular time structure is embedded in the data aka self-describing data unstructured data: no discernable structure raw text

XML XML has become popular for dealing with semi- structured data creates a hierarchical structure schema may be embedded with data or separate allows for dynamic databases where structure changes more quickly than can be handled with schema modifications persistence and queries can be specialized for XML structures XPATH, XQUERY

XML Example

Data Mining KDD: Knowledge Discovery in Databases 1. data selection 2. data cleansing 3. enrichment 4. data transformation or encoding 5. data mining 6. reporting

Data Mining Mining discovers association rules sequential patterns classification hierarchies patterns within time series clustering Goals predicition identification classification optimization

Data Warehousing Data Warehouses are very large multi-dimensional (i.e. temporal) not transactional the result of KDD step 4

OLAP Online Analytic Processing analysis of complex data from a data warehouse makes us of the multi-dimensional structure of the warehouse provides the tools to knowledge workers

Mobile Databases Issues: data distribution and replication transaction models query processing – how to handle incomplete or unavailable information? recovery and fault tolerance security

Multimedia Databases Applications: repositories presentation collaboration issues modeling: complex objects with “hidden” semantics indexing storage: generally very large objects, not suitable for storage as records in files queries and retrievals: what do we match, what should be returned

GIS Specialization of spatial database systems also has temporal aspects highly dependent on complex range queries

Genome Data Management data characteristics highly complex highly variable rapidly changing schema multiple representations of same data generally read-only most users are not database savvy context dependent representation of complex queries is important