CS6530 Graduate-level Database Systems Prof. Feifei Li.

Slides:



Advertisements
Similar presentations
1 Introduction to Database Systems CSE444 Instructor: Scott Vandenberg University of Washington Winter 2000.
Advertisements

Database Management Systems 1 Ramakrishnan & Gehrke Introduction to Database Systems Chapter 1 Instructor: Mirsad Hadzikadic.
Chapter 1 Instructor: Murali Mani Database Management Systems.
1 541: Database Systems S. Muthu Muthukrishnan. 2 Some Data Collections I Have Played With….  Wireless call detail records.  U. S. Patents.  AskJeeves.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Introduction to Database Systems Chapter 1 Instructor: Wang-Chien Lee
Database: A collection of related data [Elmasri]. A database represents some aspect of real world called “miniworld” [Elmasri] or “enterprise” [Ramakrishnan].
Introduction to Database Systems Ch. 1, Ch. 2 Mr. John Ortiz Dept. of Computer Science University of Texas at San Antonio.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1 Instructor: Deborah Strahman
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Introduction to Database Systems Chapter 1 Instructor: Johannes Gehrke
CMPT 354, Simon Fraser University, Fall 2008, Martin Ester 1 Database Systems I Introduction.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1.
On Database Systems.
1 Introduction to Database Systems Ref. Ramakrishnan & Gehrke Chapter 1.
1 CENG 302 Introduction to Database Management Systems Nihan Kesim Çiçekli URL:
Rutgers University CS541: Database Systems Spring 2008 Computer Science Department Rutgers University.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1 Instructor: Ethan Jackson
CSCD34 - Data Management Systems,- A. Vaisman1 CSC D34 - Data Management Systems Instructor: Alejandro Vaisman University of Toronto.
CSC343H – Introduction to Databases
CSC 411/511: DBMS Design 1 1 Dr. Nan WangCSC411_L1_Overview 1 Overview of Database Systems (Chapter 1)
Database Management Systems 1 Introduction to Database Systems Instructor: Xintao Wu Ramakrishnan & Gehrke.
COP Introduction to Database Systems Prof. Feifei Li.
 DATABASE DATABASE  DATABASE ENVIRONMENT DATABASE ENVIRONMENT  WHY STUDY DATABASE WHY STUDY DATABASE  DBMS & ITS FUNCTIONS DBMS & ITS FUNCTIONS 
1 CS862 - Advanced Database Systems Sang H. Son
Database Management Systems 1 Ramakrishnan & Gehrke Introduction to Database Systems Chpt 1 Instructor: Xintao Wu.
Database Management Systems 1 Ramakrishnan & Gehrke Introduction to Database Systems Chpt 1 Instructor: Weichao Wang.
Database Management Systems
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1 Overview of Database Systems.
Database Organization and Design
DATABASE MANAGEMENT SYSTEMS TERM B. Tech II/IT II Semester UNIT-I PPT SLIDES Text Books: (1) DBMS by Raghu Ramakrishnan (2) DBMS by Sudarshan and.
INFS614, Dr. Brodsky, GMU1 Database Management Systems INFS 614 Instructor: Professor Alex Brodsky
ICS 321 Fall 2009 Introduction to Database Systems Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa.
Introduction to Database Systems1. 2 Basic Definitions Mini-world Some part of the real world about which data is stored in a database. Data Known facts.
1 What Is a DBMS?  A very large, integrated collection of data.  Models real-world enterprise.  Entities (e.g., students, courses)  Relationships (e.g.,
 TERMINOLOGY TERMINOLOGY DATA INFORMATION  NEED OF INFORMATION NEED OF INFORMATION  QUALITIES OF INFORMATION QUALITIES OF INFORMATION  FILE SYSTEM.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Introduction to Database Systems Chapter 1 Instructor: Johannes Gehrke
-ebru a.s ATTRIBUTE: Description of entities For employee entity number, name, deptno, age, adr, salary..etc are attributes. RECORD: Stores whole.
CpSc 8620: DBMS Design Introduction. 2 Attribution Materials and lecture notes in this course are adapted from various sources, including the authors.
1 Geog 357: Data models and DBMS. Geographic Decision Making.
1 CS462- Database Systems Sang H. Son
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Why do you learn database?? Chapter 0.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1.
BBM 371 – Data Management Lecture 3: Basic Concepts of DBMS Prepared by: Ebru Akçapınar Sezer, Gönenç Ercan.
SQL IMPLEMENTATION & ADMINISTRATION Introduction to Databases & SQL.
1 CENG 351 CENG 351 Introduction to Data Management and File Structures Department of Computer Engineering METU.
1 Introduction to Data Management Lecture #1 (Course “Trailer”) Instructor: Chen Li.
Database Management Systems 1 Ramakrishnan & Gehrke Introduction to Database Systems Chpt 1 Instructor: Xin Zhang.
1 Database Management Systems Introduction Instructor: Oliver Schulte
COP Introduction to Database Structures
DATABASE MANAGEMENT SYSTEMS
Introduction to Database Systems Chapter 1
CS4222 Principles of Database System
DATABASE MANAGEMENT SYSTEMS
Database Management Systems Chapter 1
Introduction to Database Systems
Instructor: Elke Rundensteiner
Database Management Systems Chapter 1
Database Management Systems Introduction
Introduction to Database Systems
Database Management Systems Introduction
Overview of Database Systems Chpt 1
Instructor: Murali Mani
Database Management Systems Chapter 1
Database Management Systems
Database Management Systems CSE594
Sang H. Son CS6750: Database Systems The slides for this text are organized into chapters. This lecture covers Chapter 1. Chapter 1: Introduction.
Introduction to Database Systems
Database Management Systems Chapter 1
Data Independence Applications insulated from how data is structured and stored. Logical data independence: Protection from changes in logical structure.
Is the WWW a DBMS? = Fairly sophisticated search available
Introduction to Database Systems Chpt 1
Presentation transcript:

CS6530 Graduate-level Database Systems Prof. Feifei Li

What Is a Database System? Database: a very large, integrated collection of data. Models a real-world enterprise – Entities (e.g., teams, games) – Relationships (e.g., The Patriots is playing in The Superbowl) – More recently, also includes active components (e.g. “business logic”) A Database Management System (DBMS) is a software system designed to store, manage, and facilitate access to databases.

Why Study Databases?? Shift from computation to information –always true for corporate computing –Web made this point for personal computing –more and more true for scientific computing Need for DBMS has exploded in the last years –Corporate: retail swipe/clickstreams, “customer relationship mgmt”, “supply chain mgmt”, “data warehouses”, etc. –Scientific: digital libraries, Human Genome project, NASA Mission to Planet Earth, physical sensors, grid physics network DBMS encompasses much of CS in a practical discipline –OS, languages, theory, AI, multimedia, logic –Yet traditional focus on real-world apps ?

What’s the intellectual content? representing information –data modeling languages and systems for querying data –complex queries with real semantics* –over massive data sets concurrency control for data manipulation –controlling concurrent access –ensuring transactional semantics reliable data storage –maintain data semantics even if you pull the plug * semantics: the meaning or relationship of meanings of a sign or set of signs

OS Support for Data Management Data can be stored in RAM –this is what every programming language offers! –RAM is fast, and random access –Isn’t this heaven? Every OS includes a File System –manages files on a magnetic disk –allows open, read, seek, close on a file –allows protections to be set on a file –drawbacks relative to RAM?

Database Management Systems What more could we want than a file system? –Simple, efficient ad hoc 1 queries –concurrency control –recovery –benefits of good data modeling S.M.O.P. 2 ? Not really… –as we’ll see this semester –in fact, the OS often gets in the way! 1 ad hoc: formed or used for specific or immediate problems or needs 2 SMOP: Small Matter Of Programming

Describing Data: Data Models A data model is a collection of concepts for describing data. A schema is a description of a particular collection of data, using a given data model. The relational model of data is the most widely used model today. –Main concept: relation, basically a table with rows and columns. –Every relation has a schema, which describes the columns, or fields.

Levels of Abstraction Views describe how users see the data. Conceptual schema defines logical structure Physical schema describes the files and indexes used. Physical Schema Conceptual Schema View 1View 2View 3 DB Users

Example: University Database Conceptual schema: – Students(sid: string, name: string, login: string, age: integer, gpa:real) – Courses(cid: string, cname:string, credits:integer) – Enrolled(sid:string, cid:string, grade:string) Physical schema: –Relations stored as unordered files. –Index on first column of Students. External Schema (View): –Course_info(cid:string,enrollment:integer)

Concurrency Control Concurrent execution of user programs: key to good DBMS performance. –Disk accesses frequent, pretty slow –Keep the CPU working on several programs concurrently. Interleaving actions of different programs: trouble! –e.g., account-transfer & print statement at same time DBMS ensures such problems don’t arise. –Users/programmers can pretend they are using a single-user system. (called “Isolation”) –Thank goodness! Don’t have to program “very, very carefully”.

Structure of a DBMS A typical DBMS has a layered architecture. The figure does not show the concurrency control and recovery components. Each system has its own variations. The book shows a somewhat more detailed version. Query Optimization and Execution Relational Operators Files and Access Methods Buffer Management Disk Space Management DB These layers must consider concurrency control and recovery

FYI: A text search engine Less “system” than DBMS –Uses OS files for storage –Just one access method –One hardwired query regardless of search string Typically no concurrency or recovery management –Read-mostly –Batch-loaded, periodically –No updates to recover –OS a reasonable choice Smarts: text tricks –Search string modifier (e.g. “stemming” and synonyms) –Ranking Engine (sorting the output, e.g. by word or document popularity) The Access Method Buffer Management Disk Space Management DB OS The Query Search String Modifier Simple DBMS } Ranking Engine

Advantages of a DBMS Data independence Efficient data access Data integrity & security Data administration Concurrent access, crash recovery Reduced application development time So why not use them always? –Expensive/complicated to set up & maintain –This cost & complexity must be offset by need –General-purpose, not suited for special-purpose tasks (e.g. text search!) –Most suitable for structured data, what about unstructured data? –Cluster of commodity machines: parallel database systems are hard to scale.

Summary DBMS used to maintain, query large datasets. –can manipulate data and exploit semantics Other benefits include: –recovery from system crashes, –concurrent access, –quick application development, –data integrity and security. In this course we will explore: How to be a sophisticated user of DBMS technology