MIS2502: Data Analytics Relational Data Modeling

1 MIS2502: Data Analytics Relational Data Modeling
Zhe (Joe) Deng

2 The information architecture of an organization
Now we’re here…
Data entry → Transactional Database → Data extraction → Analytical Data Store → Data analysis
Series of tables stored in a relational database.
Stored as structured or unstructured data in a variety of formats.

3 What is a model? Representation of something in the real world

4 Why bother modeling?
Creates a blueprint before you start building the database
Gets the story straight: easy for non-technical people to understand
Minimizes having to go back and make changes in the implementation stage

5 Modeling a database
A representation of the information to be captured
Describes the data contained in the database
Explains how the data interrelates

6 Four Steps of Modeling
Problem Statement → ER Diagram (Entity Relationship Diagram) → Database Schema → Transactional Database

7 Start with a problem statement
Design a database to track orders for a store. A customer places an order for a product. People can place an order for multiple products. Record first name, last name, city, state, and zip code for customers. We also want to know the date an order was placed. Finally, we want to track the name and price of products and the quantity of each product for each order.

8 The Entity Relationship Diagram (ERD)
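The primary way of modeling a relational database
ER diagrams are sketches like this one!
[Diagram: entity Student (attributes TUID, Name, GPA) linked by the relationship "contains" to entity Course (attributes Course number, Course Title). Callouts label the primary key, cardinality, attribute, entity, and relationship.]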

9 The Entity Relationship Diagram (ERD)
Three main diagrammatic elements:
Entity: a uniquely identifiable thing (e.g., person, order)
Attribute: a characteristic of an entity or relationship (e.g., first name, order number)
Relationship: describes how two entities relate to one another (e.g., makes)

10 Entity
Represents an object/construct we want to manage
Another way to think of it: the name of a table (student, class, bill, purchase order, …)
Properties: normally a singular noun, e.g., Book, Student

11 Attribute
A characteristic of an entity or relationship
Represents actual data we would like to manage
 - For the entity ‘Student’, ‘StudentID’, ‘GPA’, and ‘Class Year’ are examples of attributes
Properties: normally a singular noun
Within one entity, each attribute name should be different
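As a rough illustration of the rule that attribute names must differ within one entity, the sketch below treats the Student entity as a table in an in-memory SQLite database. Mapping attributes to columns, the column types, and the table name `Broken` are assumptions made for this example; they are not part of the slides.

```python
import sqlite3

# In-memory database, used only for this illustration.
conn = sqlite3.connect(":memory:")

# Entity 'Student' with the three example attributes from the slide.
# Column types are assumed.
conn.execute("CREATE TABLE Student (StudentID INTEGER, GPA REAL, ClassYear INTEGER)")

# PRAGMA table_info returns one row per column; index 1 is the column name.
columns = [row[1] for row in conn.execute("PRAGMA table_info(Student)")]
print(columns)

# Two attributes with the same name inside one entity are rejected.
try:
    conn.execute("CREATE TABLE Broken (GPA REAL, GPA REAL)")
    duplicate_allowed = True
except sqlite3.OperationalError as err:
    duplicate_allowed = False
    print("Rejected:", err)
```

The same attribute name can still appear in two different entities; the restriction applies only within a single entity.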

12 Relationship
Represents a relationship between entities
Each entity must be related to at least one other entity
Properties: normally a verb, e.g., Enrolls In, as in: “Student” --- “Enrolls In” --- “Class”
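One way to picture the three elements together is a minimal Python sketch in which each entity is a class, each attribute is a field, and the relationship is a record linking two entities. The names `SchoolClass`, `class_id`, and `title`, and all sample values, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Student:
    """Entity: a singular noun."""
    student_id: int   # attribute
    gpa: float        # attribute
    class_year: int   # attribute


@dataclass(frozen=True)
class SchoolClass:
    """Entity standing in for 'Class' (hypothetical attributes)."""
    class_id: int
    title: str


@dataclass(frozen=True)
class EnrollsIn:
    """Relationship: a verb phrase connecting two entities."""
    student: Student
    school_class: SchoolClass


def describe(rel: EnrollsIn) -> str:
    """Render the relationship the way the slide writes it."""
    return (f"Student {rel.student.student_id} --- Enrolls In --- "
            f"Class {rel.school_class.class_id}")


# Made-up sample data.
alice = Student(student_id=1, gpa=3.7, class_year=2025)
analytics = SchoolClass(class_id=2502, title="Data Analytics")
enrollment = EnrollsIn(alice, analytics)
print(describe(enrollment))
```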

13 Begin with Identifying the Entities
Step 1: List the nouns in the problem statement.
Step 2: When nouns are synonyms for other nouns, choose the best one.
Step 3: Make a note of nouns that describe other nouns. These will be your entities’ attributes.
Step 4: Rule out the nouns that don’t relate to the process to be captured. (This is what your database is about.)
What’s left are your entities!

14 So here are the nouns…
Design a database to track orders for a store. A customer places an order for a product. People can place an order for multiple products. Record first name, last name, city, state, and zip code for customers. We also want to know the date an order was placed. Finally, we want to track the name and price of products and the quantity of each product for each order.
Which nouns are entities?
Which nouns are attributes?
Which nouns are irrelevant?
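The four identification steps can be sketched as a small Python function applied to this problem statement. The classification below is one plausible reading and is not an official answer key: treating "people" as a synonym for "customer", ruling out "database" and "store", and attaching "quantity" to the relationship between order and product are all assumptions.

```python
def identify_entities(nouns, synonyms, attribute_of, irrelevant):
    """Apply Steps 2-4 to the noun list from Step 1; what is left are entities."""
    # Step 2: replace synonyms with the preferred noun, dropping duplicates.
    canonical = []
    for noun in nouns:
        noun = synonyms.get(noun, noun)
        if noun not in canonical:
            canonical.append(noun)
    # Step 3: nouns that describe other nouns are attributes, not entities.
    # Step 4: rule out nouns unrelated to the process being captured.
    return [n for n in canonical if n not in attribute_of and n not in irrelevant]


# Step 1: nouns listed from the problem statement.
nouns = ["database", "order", "store", "customer", "product", "people",
         "first name", "last name", "city", "state", "zip code",
         "date", "name", "price", "quantity"]

synonyms = {"people": "customer"}      # assumed synonym
irrelevant = {"database", "store"}     # assumed context, not tracked
attribute_of = {
    "first name": "customer", "last name": "customer", "city": "customer",
    "state": "customer", "zip code": "customer",
    "date": "order",
    "name": "product", "price": "product",
    "quantity": "order-product relationship",   # assumed placement
}

entities = identify_entities(nouns, synonyms, attribute_of, irrelevant)
print(entities)
```

Under these assumptions the remaining nouns are order, customer, and product.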

15 The ERD Based on the Problem Statement

16 Here’s where it gets tricky…
In this case, “store” is the context.
Store is not an entity because we are not tracking specific information about the store (e.g., store location).
BUT… if there were many stores and we wanted to track sales by store, then store would be an entity!
But that isn’t part of the problem statement…

17 The primary key
Entities need to be uniquely identifiable
So you can tell them apart
Use a primary key
One or more attributes that uniquely identify an entity
Primary keys may not be explicitly part of the problem statement, but you need them!
How about these as primary keys for Customer: First name and/or last name? Social security number?
Customer ID: uniquely identifies a customer
Order number: uniquely identifies an order
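To see why a name makes a poor primary key while Customer ID works, here is a minimal sketch using Python's built-in sqlite3 module. The table layout, column types, and all customer rows are made up for illustration and do not come from the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Customer entity with CustomerID as the primary key (column types assumed).
conn.execute("""
    CREATE TABLE Customer (
        CustomerID INTEGER PRIMARY KEY,
        FirstName  TEXT,
        LastName   TEXT,
        City       TEXT,
        State      TEXT,
        Zip        TEXT
    )
""")

# Two different customers can share the same first and last name.
conn.execute("INSERT INTO Customer VALUES (1, 'John', 'Smith', 'Philadelphia', 'PA', '19122')")
conn.execute("INSERT INTO Customer VALUES (2, 'John', 'Smith', 'Pittsburgh', 'PA', '15213')")

same_name = conn.execute(
    "SELECT COUNT(*) FROM Customer WHERE FirstName = 'John' AND LastName = 'Smith'"
).fetchone()[0]
print("Customers named John Smith:", same_name)

# Reusing an existing CustomerID violates the primary key and is rejected.
try:
    conn.execute("INSERT INTO Customer VALUES (1, 'Mary', 'Jones', 'Camden', 'NJ', '08102')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True
print("Duplicate CustomerID rejected:", duplicate_rejected)
```

The name cannot tell the two customers apart, but the ID can, which is the property a primary key needs.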

18 Time for our first ICA!

