Presentation is loading. Please wait.

Presentation is loading. Please wait.

Fundamentals/ICY: Databases 2012/13 Week 4 John Barnden Professor of Artificial Intelligence School of Computer Science University of Birmingham, UK.

Similar presentations


Presentation on theme: "Fundamentals/ICY: Databases 2012/13 Week 4 John Barnden Professor of Artificial Intelligence School of Computer Science University of Birmingham, UK."— Presentation transcript:

1 Fundamentals/ICY: Databases 2012/13 Week 4 John Barnden Professor of Artificial Intelligence School of Computer Science University of Birmingham, UK

2 Reminder of Week 3

3 ENTITIES, RELATIONSHIPS & ATTRIBUTES (Introduction)

4 Entities uBasically, entities are just things of the “important types” that we judged above to merit tables. So we had entity types such as: l People l Employing Organizations l Phone Stations (as opposed to just phone numbers as such) uSo what the entity types are in a given working environment are partly a matter of judgment, as explained earlier. But we’ll see that in designing a DB we may need to introduce new, not immediately obvious, entity types. u“Entities” are, or should be, the things of a type: e.g., individual people. An entity is represented by a row in the appropriate table.

5 Entity Terminology uUnfortunately: “entity” is often used to mean entity type. “entity set” is often used for entity type. “entity occurrence” is often used to mean individual entity.

6 New for Week 4

7 Relationships uThese are the relationships between entity types, such as l A person being employed by an organization l A person having a phone station uHave to think about both directions of a relationship: e.g., both employed-by and employs. uCAUTION: Tables are also called “relations” [hence “relational” DB] (much more on this later). This is to do with the internals of tables/entities rather than with “relationships” between entities.

8 Relationship Connectivity uRelationships are importantly categorized as to uniqueness or multiplicity of entities at either end – “connectivity.” Has big effect on DB design. l 1:1 (“one to one”): e.g., the people/phone-stations relationship, if each person has at most one phone station and each phone station is assigned to at most one person. l M:N (“many to many”): e.g., the employs relationship, assuming a person may have more than one employing organization (or none) and an organization may have more than one employee (or none). (Don’t take “many” seriously – just means possibly more than one.) l 1:M (“one to many”): e.g., the employs relationship, if an organization may have more than one employee (or none) but a person has at most one employing-org.

9 Relationship Cardinality uRelationships can be further specified as to “how many entities allowed or required at either end” – cardinality. Also has significant effect on DB design. uIn a relationship from entity type A to entity type B, a minimum and a maximum can be specified for the number of B entities for each A entity. uA maximum greater than 1 can only be specified if the relationship from A to B is 1:M or M:N. (So the notions of connectivity and cardinality are not properly separated). l E.g., could be specified that a person can only be employed by up to five organizations. uMost normally, the important choice for the minimum is between none and one. E.g., the minimum for employed-by could be none, but the minimum for employs could be one. But the minimum number of wheels for a car could be specified to be three. uIf the minimum is none, then B is optional for A. Otherwise, it is mandatory for A.

10 Attributes uAttributes of entities of a given type are the names of the different pieces of information that need to be stored for entities of that type. So they’re just the column names for the table for the entity type. l E.g., entities of the type “people” could have the following attributes: person ID number, last name, first name, phone number, age. uNote: Attributes include artificial ones like the employer identity numbers (EMPL. NUM.) that we introduced in an example above. These may have no significance outside the DB itself. uRelationships are represented by associative linking by means of shared attributes. (For now, will always assume that the same attribute name is used in each of the tables involved.)

11 Attribute Determination uREMEMBER: Rows in a table are uniquely determined (picked out) by the values in some set of columns, i.e. the values of some collection of attributes. That is, given some values for those attributes, there is at most one entity that has those values for those attributes. uHence, that collection of attributes determines all the other attributes. uThat is, given some values for the determining attributes, there’s at most one value for each of the other attributes.

12 Attribute Determination, contd. uMore generally, a collection of one or more attributes determines another attribute A if only one value for A is possible given the values for the former attributes. E.g., the collection DAY-NUMBER, MONTH and YEAR specifying birth-date in a table about people could determine DAY-NAME, even though it doesn’t determine other attributes such as NATIONALITY: several people could have the same birth-date but be of different nationalities. uWe alternatively say that DAY-NAME is functionally dependent on DAY-NUMBER, MONTH and YEAR.

13 Attribute Determination, contd. uThe determining collection of attributes is called the determinant in the determination. uWrite determination as: DAY-NUMBER, MONTH, YEAR  DAY-NAME uDetermination does not mean that you have at hand an algorithm for working out a dependent attribute from the determinant, although you may do. E.g. Consider a “father” attribute.

14 Keys uA key for a table is a collection of one or more attributes that determines at least some other attribute(s) in that same table. l CAUTION: But sometimes people use “key” as a sloppy abbreviation for “primary key” (see below) or other notions. uA superkey for a table is a collection of one or more attributes that determines all the other attributes in the table, i.e. determines a whole row. l Trivially, the collection of all the attributes is a superkey. uA candidate key is a minimal superkey (i.e., you can’t remove attributes from it and still have a superkey.) It does NOT necessarily mean a numerically smallest superkey.

15 Superkeys & Candidate Keys: Example uSuppose a “day” entity type has attributes DAY-NAME, DAY-NUMBER, MONTH, YEAR, IS-HOLIDAY, … uThen DAY-NAME, DAY-NUMBER, MONTH, YEAR would be a superkey for the day type. uBut it’s not a candidate key because DAY-NUMBER, MONTH, YEAR is also a superkey. uThis smaller collection is a candidate key because no sub- collection of it uniquely identifies a day.

16 Primary Keys uA primary key for a table (entity type) is a candidate key that the DB designer has chosen as being the main way of uniquely identifying a row (entity). Extra restriction: Its attributes are not allowed to have null values. l It could be that there’s only one candidate key in practice anyway, such as a person’s ID number. uPrimary keys are the main way of identifying target entities in entity relationships, e.g., the way to identify someone’s employing organization. uFor efficiency reasons, the simpler that primary keys are, the better. l Identity numbers (of people, companies, products, courses, etc.), or combinations of them with one or two other attributes, are the typical primary keys in examples in the textbook and handouts.

17 Relationships and Foreign Keys uStandardly, a relationship is represented by means of foreign keys. T U U uA foreign key in a table T is a chosen collection of attributes intended to match the attributes that constitute (usually) the primary key in another table, U, and thereby to refer to entities in U. TU T Intuitively, the foreign key in T is U’s “ambassador” [my word] in T.

18 Primary & Foreign Keys PERS-IDNAMEPHONEEMPL. IDAGE 9568876Chopples 0121-414-3816 E2256137 2544799Blurp 01600-719975 E8570421 1698674Rumpel 07970-852657 E2256188 EMPL. IDEMPL. NAMEADDRESSNUM. EMPLSSECTOR E48693BTBT House, London, … 1,234,5678Private TCOM E85704Monmouth School for Girls Hereford Rd, Monmouth, … 245Private 2E E22561University of Birmingham Edgbaston Park Rd, …. 4023Public HE PHONETYPESTATUS 0121-414-3816 officeOK 01600-719975 homeFAULT 0121-440-5677 homeOK 07970-852657 mobileUNPAID Foreign keys are in italics Primary keys are underlined

19 Composite Primary and Foreign Keys PERS-IDNAMEAREA CODEPHONE BODYEMPL IDAGE 9568876Chopples 0121414-3816 E2256137 2544799Blurp 01600719975 E8570421 1698674Rumpel 07970852657 E2256188 AREA CODEPHONE BODYTYPESTATUS 0121414-3816 officeOK 01600719975 homeFAULT 0121440-5677 homeOK 07970852657 mobileUNPAID Phones People

20 1:1 Connectivity between Tables PERS-IDNAMEPHONEEMPL IDAGE 9568876Chopples 0121-414-3816 E2256137 2544799Blurp 01600-719975 E8570421 1698674Rumpel 07970-852657 E2256188 5099235BigglesE2256129 PHONETYPESTATUS 0121-414-3816 officeOK 01600-719975 homeFAULT 0121-440-5677 homeOK 07970-852657 mobileUNPAID Note: the representation is still asymmetric in that the People table mentions phones but not vice versa – symmetry would create extra redundancy. NB: Biggles has no phone listed, and 0121-440-5677 has no person recorded. Suggests a possible reason for not combining such tables. 1:1: that is, no more than one phone allowed per person, and vice versa. People Phones

21 1:M Connectivity between Tables PERS-IDNAMEPHONEEMPL IDAGE 9568876Chopples 0121-414-3816 E2256137 2544799Blurp 01600-719975 E8570421 1698674Rumpel 07970-852657 E2256188 1800748Dunston 0121-414-3886 E2256129 EMPL IDEMPL NAMEADDRESSNUM EMPLSSECTOR E48693BTBT House, London, … 1,234,5678Private TCOM E85704Monmouth School Hereford Rd, Monmouth, … 245Private 2E E22561University of Birmingham Edgbaston Park Rd, …. 3023Public HE People Organizations More than one employee allowed per organization, but no more than one employer per person. NOTE direction of use of the foreign key. Why so??


Download ppt "Fundamentals/ICY: Databases 2012/13 Week 4 John Barnden Professor of Artificial Intelligence School of Computer Science University of Birmingham, UK."

Similar presentations


Ads by Google