Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lectures 2&3: Introduction to SQL. Lecture 2: SQL Part I Lecture 2.

Similar presentations


Presentation on theme: "Lectures 2&3: Introduction to SQL. Lecture 2: SQL Part I Lecture 2."— Presentation transcript:

1 Lectures 2&3: Introduction to SQL

2 Lecture 2: SQL Part I Lecture 2

3 Today’s Lecture 1.SQL introduction & schema definitions ACTIVITY: Table creation 2.Basic single-table queries ACTIVITY: Single-table queries! 3.Multi-table queries ACTIVITY: Multi-table queries! 3 Lecture 2

4 1. SQL Introduction & Definitions 4 Lecture 2 > Section 1

5 What you will learn about in this section 1.What is SQL? 2.Basic schema definitions 3.Keys & constraints intro 4.ACTIVITY: CREATE TABLE statements 5 Lecture 2 > Section 1

6 SQL Motivation Dark times 5 years ago. Are databases dead? Now, as before: everyone sells SQL Pig, Hive, Impala “Not-Yet-SQL?” Lecture 2 > Section 1 > SQL

7 7 Basic SQL Lecture 2 > Section 1 > SQL

8 SQL Introduction SQL is a standard language for querying and manipulating data SQL is a very high-level programming language This works because it is optimized well! Many standards out there: ANSI SQL, SQL92 (a.k.a. SQL2), SQL99 (a.k.a. SQL3), …. Vendors support various subsets NB: Probably the world’s most successful parallel programming language (multicore?) SQL stands for Structured Query Language SQL stands for Structured Query Language Lecture 2 > Section 1 > SQL

9 9 SQL is a… Data Definition Language (DDL) Define relational schemata Create/alter/delete tables and their attributes Data Manipulation Language (DML) Insert/delete/modify tuples in tables Query one or more tables – discussed next! Lecture 2 > Section 1 > SQL

10 10 Tables in SQL PNamePriceManufacturer Gizmo$19.99GizmoWorks Powergizmo$29.99GizmoWorks SingleTouch$149.99Canon MultiTouch$203.99Hitachi Product A relation or table is a multiset of tuples having the attributes specified by the schema Let’s break this definition down Lecture 2 > Section 1 > Definitions

11 11 Tables in SQL PNamePriceManufacturer Gizmo$19.99GizmoWorks Powergizmo$29.99GizmoWorks SingleTouch$149.99Canon MultiTouch$203.99Hitachi Product A multiset is an unordered list (or: a set with multiple duplicate instances allowed) List: [1, 1, 2, 3] Set: {1, 2, 3} Multiset: {1, 1, 2, 3} i.e. no next(), etc. methods! Lecture 2 > Section 1 > Definitions

12 12 Tables in SQL PNamePriceManufacturer Gizmo$19.99GizmoWorks Powergizmo$29.99GizmoWorks SingleTouch$149.99Canon MultiTouch$203.99Hitachi Product An attribute (or column) is a typed data entry present in each tuple in the relation NB: Attributes must have an atomic type in standard SQL, i.e. not a list, set, etc. Lecture 2 > Section 1 > Definitions

13 13 Tables in SQL PNamePriceManufacturer Gizmo$19.99GizmoWorks Powergizmo$29.99GizmoWorks SingleTouch$149.99Canon MultiTouch$203.99Hitachi Product A tuple or row is a single entry in the table having the attributes specified by the schema Also referred to sometimes as a record Lecture 2 > Section 1 > Definitions

14 14 Tables in SQL PNamePriceManufacturer Gizmo$19.99GizmoWorks Powergizmo$29.99GizmoWorks SingleTouch$149.99Canon MultiTouch$203.99Hitachi Product Lecture 2 > Section 1 > Definitions The number of tuples is the cardinality of the relation The number of attributes is the arity of the relation

15 15 Data Types in SQL Atomic types: Characters: CHAR(20), VARCHAR(50) Numbers: INT, BIGINT, SMALLINT, FLOAT Others: MONEY, DATETIME, … Every attribute must have an atomic type Hence tables are flat Why? Lecture 2 > Section 1 > Definitions

16 16 Table Schemas The schema of a table is the table name, its attributes, and their types: A key is an attribute whose values are unique; we underline a key Product(Pname: string, Price: float, Category: string, Manufacturer: string) Lecture 2 > Section 1 > Definitions Product(Pname: string, Price: float, Category: string, Manufacturer: string)

17 Key constraints A key is an implicit constraint on which tuples can be in the relation i.e. if two tuples agree on the values of the key, then they must be the same tuple! 1. Which would you select as a key? 2. Is a key always guaranteed to exist? 3. Can we have more than one key? A key is a minimal subset of attributes that acts as a unique identifier for tuples in a relation Lecture 2 > Section 1 > Keys & constraints Students(sid:string, name:string, gpa: float)

18 NULL and NOT NULL To say “don’t know the value” we use NULL NULL has (sometimes painful) semantics, more detail later sidnamegpa 123Bob3.9 143JimNULL Say, Jim just enrolled in his first class. In SQL, we may constrain a column to be NOT NULL, e.g., “name” in this table Students(sid:string, name:string, gpa: float) Lecture 2 > Section 1 > Keys & constraints

19 2. Single-table queries 19 Lecture 2 > Section 2

20 What you will learn about in this section 1.The SFW query 2.Other useful operators: LIKE, DISTINCT, ORDER BY 3.ACTIVITY: Single-table queries 20 Lecture 2 > Section 2

21 21 Simple SQL Query: Selection PNamePriceCategoryManufacturer Gizmo$19.99GadgetsGizmoWorks Powergizmo$29.99GadgetsGizmoWorks SingleTouch$149.99PhotographyCanon MultiTouch$203.99HouseholdHitachi PNamePriceCategoryManufacturer Gizmo$19.99GadgetsGizmoWorks Powergizmo$29.99GadgetsGizmoWorks Lecture 2 > Section 2 > SFW SELECT * FROM Product WHERE Category = ‘Gadgets’ Selection is the operation of filtering a relation’s tuples on some condition

22 22 Simple SQL Query: Projection PNamePriceCategoryManufacturer Gizmo$19.99GadgetsGizmoWorks Powergizmo$29.99GadgetsGizmoWorks SingleTouch$149.99PhotographyCanon MultiTouch$203.99HouseholdHitachi PNamePriceManufacturer Gizmo$19.99GizmoWorks Powergizmo$29.99GizmoWorks Lecture 2 > Section 2 > SFW SELECT Pname, Price, Manufacturer FROM Product WHERE Category = ‘Gadgets’ Projection is the operation of producing an output table with tuples that have a subset of their prior attributes

23 23 A Few Details SQL commands are case insensitive: Same: SELECT, Select, select Same: Product, product Values are not: Different: ‘Seattle’, ‘seattle’ Use single quotes for constants: ‘abc’ - yes “abc” - no Lecture 2 > Section 2 > SFW

24 24 LIKE: Simple String Pattern Matching s LIKE p: pattern matching on strings p may contain two special symbols: % = any sequence of characters _ = any single character SELECT * FROM Products WHERE PName LIKE ‘%gizmo%’ Lecture 2 > Section 2 > Other operators

25 25 DISTINCT: Eliminating Duplicates SELECT DISTINCT Category FROM Product SELECT DISTINCT Category FROM Product Versus SELECT Category FROM Product SELECT Category FROM Product Category Gadgets Photography Household Category Gadgets Photography Household Lecture 2 > Section 2 > Other operators

26 26 ORDER BY: Sorting the Results SELECT PName, Price, Manufacturer FROM Product WHERE Category=‘gizmo’ AND Price > 50 ORDER BY Price, PName SELECT PName, Price, Manufacturer FROM Product WHERE Category=‘gizmo’ AND Price > 50 ORDER BY Price, PName Lecture 2 > Section 2 > Other operators Ties are broken by the second attribute on the ORDER BY list, etc. Ordering is ascending, unless you specify the DESC keyword.

27 3. Multi-table queries 27 Lecture 2 > Section 3

28 What you will learn about in this section 1.Foreign key constraints 2.Joins: basics 3.Joins: SQL semantics 4.ACTIVITY: Multi-table queries 28 Lecture 2 > Section 3

29 Foreign Key constraints student_id alone is not a key- what is? sidnamegpa 101Bob3.2 123Mary3.8 student_idcidgrade 123564A 123537A+ StudentsEnrolled We say that student_id is a foreign key that refers to Students Lecture 2 > Section 3 > Foreign Keys Students(sid: string, name: string, gpa: float) Enrolled(student_id: string, cid: string, grade: string) Students(sid: string, name: string, gpa: float) Enrolled(student_id: string, cid: string, grade: string) Suppose we have the following schema: And we want to impose the following constraint: ‘Only bona fide students may enroll in courses’ i.e. a student must appear in the Students table to enroll in a class

30 Declaring Foreign Keys Lecture 2 > Section 3 > Foreign Keys Students(sid: string, name: string, gpa: float) Enrolled(student_id: string, cid: string, grade: string) CREATE TABLE Enrolled( student_id CHAR(20), cid CHAR(20), grade CHAR(10), PRIMARY KEY (student_id, cid), FOREIGN KEY (student_id) REFERENCES Students ) Students(sid: string, name: string, gpa: float) Enrolled(student_id: string, cid: string, grade: string) CREATE TABLE Enrolled( student_id CHAR(20), cid CHAR(20), grade CHAR(10), PRIMARY KEY (student_id, cid), FOREIGN KEY (student_id) REFERENCES Students )

31 Foreign Keys and update operations DBA chooses (syntax in the book) Lecture 2 > Section 3 > Foreign Keys Students(sid: string, name: string, gpa: float) Enrolled(student_id: string, cid: string, grade: string) Students(sid: string, name: string, gpa: float) Enrolled(student_id: string, cid: string, grade: string) What if we insert a tuple into Enrolled, but no corresponding student? INSERT is rejected (foreign keys are constraints)! What if we delete a student? 1.Disallow the delete 2.Remove all of the courses for that student 3.SQL allows a third via NULL (not yet covered)

32 32 Keys and Foreign Keys PNamePriceCategoryManufacturer Gizmo$19.99GadgetsGizmoWorks Powergizmo$29.99GadgetsGizmoWorks SingleTouch$149.99PhotographyCanon MultiTouch$203.99HouseholdHitachi Product Company CNameStockPriceCountry GizmoWorks25USA Canon65Japan Hitachi15Japan What is a foreign key vs. a key here? Lecture 2 > Section 3 > Foreign Keys

33 33 Joins Ex: Find all products under $200 manufactured in Japan; return their names and prices. SELECT PName, Price FROM Product, Company WHERE Manufacturer = CName AND Country=‘Japan’ AND Price <= 200 SELECT PName, Price FROM Product, Company WHERE Manufacturer = CName AND Country=‘Japan’ AND Price <= 200 Lecture 2 > Section 3 > Joins: Basics Product(PName, Price, Category, Manufacturer) Company(CName, StockPrice, Country) Product(PName, Price, Category, Manufacturer) Company(CName, StockPrice, Country) Note: we will often omit attribute types in schema definitions for brevity, but assume attributes are always types

34 34 Joins Ex: Find all products under $200 manufactured in Japan; return their names and prices. SELECT PName, Price FROM Product, Company WHERE Manufacturer = CName AND Country=‘Japan’ AND Price <= 200 SELECT PName, Price FROM Product, Company WHERE Manufacturer = CName AND Country=‘Japan’ AND Price <= 200 Lecture 2 > Section 3 > Joins: Basics A join between tables returns all unique combinations of their tuples which meet some specified join condition Product(PName, Price, Category, Manufacturer) Company(CName, StockPrice, Country) Product(PName, Price, Category, Manufacturer) Company(CName, StockPrice, Country)

35 35 Joins Several equivalent ways to write a basic join in SQL: SELECT PName, Price FROM Product, Company WHERE Manufacturer = CName AND Country=‘Japan’ AND Price <= 200 SELECT PName, Price FROM Product, Company WHERE Manufacturer = CName AND Country=‘Japan’ AND Price <= 200 Lecture 2 > Section 3 > Joins: Basics SELECT PName, Price FROM Product JOIN Company ON Manufacturer = Cname AND Country=‘Japan’ WHERE Price <= 200 SELECT PName, Price FROM Product JOIN Company ON Manufacturer = Cname AND Country=‘Japan’ WHERE Price <= 200 A few more later on… Product(PName, Price, Category, Manufacturer) Company(CName, StockPrice, Country) Product(PName, Price, Category, Manufacturer) Company(CName, StockPrice, Country)


Download ppt "Lectures 2&3: Introduction to SQL. Lecture 2: SQL Part I Lecture 2."

Similar presentations


Ads by Google