Presentation is loading. Please wait.

Presentation is loading. Please wait.

Dr. Chaitali Basu Mukherji

Similar presentations


Presentation on theme: "Dr. Chaitali Basu Mukherji"— Presentation transcript:

1 Dr. Chaitali Basu Mukherji
Data Modelling Dr. Chaitali Basu Mukherji

2 Data Modeling Concepts
A data model is a simply a diagram that describes the most important “things” in your business environment from a data-centric point of view.

3 Data Model for representation of a part of a real world
it is an abstraction of the reality : ignores unnecessary details represents operational data about real world events, entities, activities, etc. model may be at various levels depending of requirements : logical or physical external, conceptual, internal

4 Data Model…… a good model
is easy to understand has a few concepts permits top-down specifications model offers concepts, constructs and operations must capture meaning of data (data semantics) which help us in interpreting and manipulating data

5 Data Model…… semantics captured through data types, inter-relationships and data integrity constraints uniqueness existence dependence restrictions on some operations such as insertions, deletions

6 Example : Data Model in a PL
data structuring concepts data field/variable data groups and arrays Record/structure : unit of file i/o file : collection of records

7 Example : Data Model in a PL …
Operations file level : open, close record level : read next/random write next/random field level : computations no inter-file and inter-record relationships no constraints except primary key for indexed files

8 Why We Model We build models of complex systems because we cannot comprehend any such system in its entirety Need to develop a common understanding of the problem and the solution Cannot afford a trial-and-error approach to communicate the desired structure and behavior of our systems to visualize and control system’s architecture to understand the system we are building, often exposing opportunities for simplification and reuse

9 Why We Model to manage risk
The choice of which model we use has a profound influence on how a problem is attacked and how a solution is shaped No single model is sufficient; every complex system is best approached through a set of independent models Every model may be expressed at different levels of fidelity The best models are connected to reality

10 Conceptual Data Modeling
Data can be modeled at many levels, including the conceptual, logical and physical level. Conceptual Data Modeling is a very high level representation of organizational data. The purpose is to show the basic building blocks for the organization, i.e. the entities and rules about their meaning and interrelationships Logical data modeling adds more detail to conceptual modeling, but is still concerned only with how the organization/business uses data. Physical data modeling adds more detail, but is especially concerned with the actual physical implementation of the data.

11 Gathering Information
Two perspectives Top-down Data model is derived from an intimate understanding of the business Bottom-up Data model is derived by reviewing specifications and business documents

12 Four Types of Data Models
Current System Proposed System Project-Level E-R diagram for system being replaced Covers just data needed in the project’s application Enterprise-Level An E-R diagram for the whole database from which data for the application system being replaced is drawn An E-R diagram for the whole database from which the new application’s data are extracted

13 Sample conceptual data model diagram

14 Business Principles Business Principles IT Principles
Organizational consensus Data integrity Implementation efficiency User friendliness Operational efficiency IT Principles Scalability Compliance with IT standards

15 Primary steps Problem definition Do I need data warehouse?
What specific problems will it solve? What are my available resources (time, money, and personnel)? What criteria will I use to measure success? Should I outsource all, some, or none of the development and operation? Am I upgrading an existing system, converting from a legacy system, or developing from scratch?

16 Primary steps Requirement analysis Group and "bubble-up" requirements.
Generate a prioritized requirements table listing the requirement, where it came from, the success criteria, and priority. Keep this table high-level. A table with a dozen requirements will be much easier to manage than one with hundreds. Produce a detailed development schedule including hardware, software, personnel, documentation, and reviews. Include outsourcing requirements and long lead-time items. Get a sign-off of the requirements, resource allocation, and schedule from top management before you go any further.

17 Primary steps Requirement analysis
Clearly state the problem(s) you wish to solve. Identify all data sources and formats. Identify the users of the completed system. Formulate a specific budget - time, money, personnel. Ask identified users to specifically state what they expect the system to do. Ask management to specifically state their success criteria Separate their requirements from their "desirements." Only design to requirements. The enhancement phase is where you address the "desirements."

18 Primary steps Conceptual design Design and prototyping
Information or data modeling First level – ER Modeling Second level – Normalization Design and prototyping Rapid prototyping Structured development

19 Primary steps Development and Documentation Test and Review
Deployment and Training Operation Enhancement Help Desk

20 Objectives of ER Model Define terms related to entity relationship modeling, including entity, entity instance, attribute, relationship and cardinality, and primary key. Describe the entity modeling process. Discuss how to draw an entity relationship diagram. Describe how to recognize entities, attributes, relationships, and cardinalities.

21 Database Model A database can be modeled as:
a collection of entities, relationship among entities. Database systems are often modeled using an Entity Relationship (ER) diagram as the "blueprint" from which the actual data is stored — the output of the design phase.

22 Entity Relationship Diagram (ERD)
ER model allows us to sketch database designs ERD is a graphical tool for modeling data. ERD is widely used in database design ERD is a graphical representation of the logical structure of a database ERD is a model that identifies the concepts or entities that exist in a system and the relationships between those entities

23 Purposes of ERD An ERD serves several purposes
The database analyst/designer gains a better understanding of the information to be contained in the database through the process of constructing the ERD. The ERD serves as a documentation tool. Finally, the ERD is used to communicate the logical structure of the database to users. In particular, the ERD effectively communicates the logic of the database to users.

24 Components of an ERD 2. Relationship 3. Cardinality 4. Attribute
An ERD typically consists of four different graphical components: 1. Entity 2. Relationship 3. Cardinality 4. Attribute

25 Classification of Relationship
Optional Relationship An Employee may or may not be assigned to a Department A Patient may or may not be assigned to a Bed Mandatory Relationship Every Course must be taught by at least one Teacher Every mother have at least a Child

26 Cardinality Constraints
Express the number of entities to which another entity can be associated via a relationship set. Cardinality Constraints - the number of instances of one entity that can or must be associated with each instance of another entity. Minimum Cardinality If zero, then optional If one or more, then mandatory Maximum Cardinality The maximum number

27 Cardinality Constraints (Contd.)
For a binary relationship set the mapping cardinality must be one of the following types: One to one A Manager Head one Department and vice versa One to many ( or many to one) An Employee Works in one Department or One Department has many Employees Many to many A Teacher Teaches many Students and A student is taught by many Teachers

28 Cardinality Constraints (Contd.)

29 Cardinality Constraints Example
In our model, we wish to indicate that each school may enroll many students, or may not enroll any students at all. We also wish to indicate that each student attends exactly one school. The following diagram indicates this optionality and cardinality:

30 Cardinality Constraints Example (Contd.)

31 Developing an ERD The process has ten steps: 1. Identify Entities
2. Find Relationships 3. Draw Rough ERD 4. Fill in Cardinality 5. Define Primary Keys 6. Draw Key-Based ERD 7. Identify Attributes 8. Map Attributes 9. Draw fully attributed ERD 10. Check Results

32 A Simple Example A company has several departments. Each department has a supervisor and at least one employee. Employees must be assigned to at least one, but possibly more departments. At least one employee is assigned to a project, but an employee may be on vacation and not assigned to any projects. The important data fields are the names of the departments, projects, supervisors and employees, as well as the supervisor and employee number and a unique project number.

33 Identify entities One approach to this is to work through the information and highlight those words which you think correspond to entities. A company has several departments. Each department has a supervisor and at least one employee. Employees must be assigned to at least one, but possibly more departments. At least one employee is assigned to a project, but an employee may be on vacation and not assigned to any projects. The important data fields are the names of the departments, projects, supervisors and employees, as well as the supervisor and employee number and a unique project number. A true entity should have more than one instance

34 Find Relationships Aim is to identify the associations, the connections between pairs of entities. A simple approach to do this is using a relationship matrix (table) that has rows and columns for each of the identified entities.

35 Find Relationships (Contd.)
Go through each cell and decide whether or not there is an association. For example, the first cell on the second row is used to indicate if there is a relationship between the entity "Employee" and the entity "Department".

36 Identified Relationships
Names placed in the cells are meant to capture/describe the relationships. So you can use them like this A Department is assigned an employee A Department is run by a supervisor An employee belongs to a department An employee works on a project A supervisor runs a department A project uses an employee

37 Draw Rough ERD Draw a diagram and:
Place all the entities in rectangles Use diamonds and lines to represent the relationships between entities. General Examples

38 Drawing Rough ERD (Contd.)

39 Drawing Rough ERD (Contd.)

40 Drawing Rough ERD (Contd.)

41 Fill in Cardinality Supervisor Department Employee Project
Each department has one supervisor. Department Each supervisor has one department. Each employee can belong to one or more departments Employee Each department must have one or more employees Each project must have one or more employees Project Each employee can have 0 or more projects.

42 Fill in Cardinality (Contd.)
The cardinality of a relationship can only have the following values One and only one One or more Zero or more Zero or one

43 Cardinality Notation

44 Cardinality Examples

45 ERD with cardinality

46 Examples

47 ERD for Course Enrollment

48 ERD for Course Registration

49 Rough ERD Plus Primary Keys

50 Identify Attributes In this step we try to identify and name all the attributes essential to the system we are studying without trying to match them to particular entities. The best way to do this is to study the forms, files and reports currently kept by the users of the system and circle each data item on the paper copy.

51 Identify Attributes Cross out those which will not be transferred to the new system, extraneous items such as signatures, and constant information which is the same for all instances of the form (e.g. your company name and address). The remaining circled items should represent the attributes you need. You should always verify these with your system users. (Sometimes forms or reports are out of date.) The only attributes indicated are the names of the departments, projects, supervisors and employees, as well as the supervisor and employee NUMBER and a unique project number.

52 Map Attributes For each attribute we need to match it with exactly one entity. Often it seems like an attribute should go with more than one entity (e.g. Name). In this case you need to add a modifier to the attribute name to make it unique (e.g. Customer Name, Employee Name, etc.) or determine which entity an attribute "best' describes. If you have attributes left over without corresponding entities, you may have missed an entity and its corresponding relationships. Identify these missed entities and add them to the relationship matrix now.

53 Map Attributes (Contd.)

54 Draw Fully Attributed ERD

55 Check ERD Results Look at your diagram from the point of view of a system owner or user. Is everything clear? Check through the Cardinality pairs. Also, look over the list of attributes associated with each entity to see if anything has been omitted.


Download ppt "Dr. Chaitali Basu Mukherji"

Similar presentations


Ads by Google