Presentation is loading. Please wait.

Presentation is loading. Please wait.

21.1 Introduction to Information Integration CS257 Fan Yang.

Similar presentations


Presentation on theme: "21.1 Introduction to Information Integration CS257 Fan Yang."— Presentation transcript:

1 21.1 Introduction to Information Integration CS257 Fan Yang

2 Need for Information Integration All the data in the world could put in a single database (ideal database system) In the real world (impossible for a single database): databases are created independently hard to design a database to support future use

3 University Database Registrar: to record student and grade Bursar: to record tuition payments by students Human Resources Department: to record employees Other department….

4 Inconvenient Record grades for students who pay tuition Want to swim in SJSU aquatic center for free in summer vacation? (all the cases above cannot achieve the function by a single database) Solution: one database

5 How to integrate Start over build one database: contains all the legacy databases; rewrite all the applications result: painful Build a layer of abstraction (middleware) on top of all the legacy databases this layer is often defined by a collection of classes BUT…

6 Heterogeneity Problem What is Heterogeneity Problem Aardvark Automobile Co. 1000 dealers has 1000 databases to find a model at another dealer can we use this command: SELECT * FROM CARS WHERE MODEL=“A6”;

7 Type of Heterogeneity Communication Heterogeneity Query-Language Heterogeneity Schema Heterogeneity Data type difference Value Heterogeneity Semantic Heterogeneity

8 Conclusion One database system is perfect, but impossible Independent database is inconvenient Integrate database 1. start over 2. middleware heterogeneity problem

9 Thank you very much


Download ppt "21.1 Introduction to Information Integration CS257 Fan Yang."

Similar presentations


Ads by Google