Download presentation
Presentation is loading. Please wait.
Published byJuliet Daniel Modified over 9 years ago
1
Retrievals & Projections Objectives of the Lecture : To consider retrieval actions from a DB; To consider using relational algebra for defining relations; To consider the Project operator and its use in SQL retrievals.
2
Using a Relational DB 2. specifying the action to be taken with it/them. 1 2 3 4 A 9 10 11 12 C 5 6 7 8 5 6 7 8 B There are two parts to using a DB : 1 2 3 4 A 9 10 11 12 C 5 6 7 8 5 6 7 8 B 1. defining the relation(s) to be used;
3
Actions on a DB l Actions are expressed in SQL statements. l The keyword or key phrase that starts an SQL statement determines what the statement’s action is. Examples : Create (Table) Alter (Table) Insert (Into), Delete (From) Update, Commit and Rollback. l Data retrieval is another action. It is the raison d’etre of any DB. There would be little point in a DB whose data could not be retrieved. l The SQL keyword that starts a retrieval statement is Select.
4
Retrieval Actions l The SQL Select statement must comprise the following two phrases : Select ……. From ……. ; Optionally it may contain other phrases. l Retrievals are sometimes known as Queries, because the data retrieved can be thought of as the answer to a question. Example :- “Which employees are married ?” Data from the DB provides the answer. l On this course, the DBMS is set up so as to always retrieve data to our computer screen, so that we can see it. (However SQL Select does permit retrievals to other locations).
5
Defining Relations for Actions l Relational algebra or relational calculus languages are used to define a relation that is to be acted on. l A defined relation may be : l a part of a DB relation, l A merger of 2 or more DB relations, l part of a ‘merged’ relation. l Relational algebra / calculus language can provide the great power and flexibility, together with conceptual simplicity, needed to define ‘relations-to-be-acted-on’. l SQL is a mixture of relational algebra and calculus plus other ‘ad hocery’ needlessly complex.
6
Relational Algebra (1) l The SQL language is based on a mixture of relational algebra, relational calculus, and ad hoc peculiarities of its own. l Relational algebra is simpler than SQL. l Therefore for simplicity, this course follows the tradition of using relational algebra to define the relation to be retrieved. It is then expressed in SQL for execution. l Relational algebra consists of : l a number of relational operators; a monadic operator operates on one operand, i.e.one relation, to produce a single relation as a result; a dyadic operator operates on two operands to produce a single result relation. l A way of combining relational operators together to form a relational expression.
7
Relational Algebra (2) l “Closure Under the Algebra” In 3 - ( ( 5 6 ) + 2 ) ( 5 6 ) results in a number, 30; so the next calculation is (30 + 2), giving 32; the last calculation is (3 - 32), giving -29. Each operator generates another number, i.e. closure. Similarly each relational operator generates another relation. In both cases, arbitrarily complex formulae can be built up. Relational algebra is based on the same concepts as arithmetic algebra. l Intuitively useful operators : Examples :
8
Designing Retrievals l Designing a retrieval means defining the relation to be retrieved, since it is only necessary to prefix the SQL definition with Select for it to be retrieved. l As each relational algebra operator encapsulates an intuitively useful concept, with a universally accepted name for referencing it, the following method is used to learn how to write queries : l Learn a number of individual algebra operators and how to express them in SQL. l Learn how to combine operators to create powerful expressions, and how to write the expressions in SQL. l SQL uses a standard fixed format for writing expressions. This is straightforward for simple and reasonable queries, but can be constraining for complex queries.
9
ACBD 4 6 4 3 4 6 R 5 3 8 9 5 1 1 2 1 7 2 2 4 7 5 6 8 2 B 5 3 8 9 5 1 Example of Projection (1) B 5 3 8 9 5 1 R Project[ B, D ] D 4 7 5 6 8 2 D 4 7 5 6 8 2
10
R Project[ A, C ] ACBD 4 6 4 3 4 6 R 5 3 8 9 5 1 1 2 1 7 2 2 4 7 5 6 8 2 A 4 6 4 3 4 6 Example of Projection (2) A 4 6 4 3 4 6 R Project[ A, C ] C 1 2 1 7 2 2 C 1 2 1 7 2 2 416262413 4 7 2 62
11
Definition of ‘Project’ 1. Creates a new relation containing only the specified attributes. 2. Any would-be duplicate tuples are removed, to ensure the result is a set of tuples. 3. The result’s attribute names are those of the operand’s attribute names specified in the parameter to the Project operation. If the specified attribute(s) include a candidate key, then there can be no duplicate tuples to remove.
12
Characteristics of Projection l Don’t need to know about any constraints on the operand relation in order to be able to do a projection. l Boundary Cases : l If all the attributes are specified, then result operand l If no attributes are specified, then result nullary relation i.e. result has no attributes. l In principle, the unwanted attributes can be specified instead, i.e. those to be removed. Example : in relation R( A, B, C, D ) R Project[ ~ B, D ] R Project[ A, C ]
13
SQL : Projection Principles : l Put the operand relation’s name in the From phrase. l Put the attributes to be projected out in the Select phrase. l This SQL statement also retrieves the newly created relation from the DB. Examples : SQL equivalent of the 2 example projections on relation ‘R’ :- SelectB, DSelectA, C FromR ;FromR ;
14
SQL : Duplicate Rows l SQL does not remove duplicate rows automatically. Problem ! l SQL must be instructed to remove duplicates, by inserting the Distinct keyword between the Select keyword and the list of attribute names. Examples :- Select Distinct B,D FromR ; Select Distinct A, C FromR ; This does a real projection; the earlier version did not. As there were no duplicate tuples, Distinct is strictly unnecessary; but it is still a correct statement.
15
SQL : Removing Duplicate Rows l The reason why SQL is not designed to remove duplicate tuples automatically is because, originally, this was thought to need too much computing power. l This issue is now contentious : 1.Computers are now much more powerful. 2.Much more serious performance optimisation problems can be created by retaining duplicates than by removing them. Advice : l Always initially insert Distinct, unless a candidate key in the result ensures no problems of duplicates. l IF performance cannot then be made fast enough THEN decide whether performance or lack of duplicates is more important, and choose the preferred option.
16
SQL : Projection Characteristics l To project out all the attributes in a relation, use * in the Select phrase. Example :- Select * From R ; The result is the same as relation ‘R’. (Thus a retrieval of a whole table is in fact a retrieval of a projection of all the attributes). l Nullary relations. These are not permitted in SQL. If nothing is put between the Select and From keywords, a syntax error will be issued. l SQL does not allow the specification of unwanted attributes in the Select phrase.
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.