Presentation is loading. Please wait.

Presentation is loading. Please wait.

Andy Jenkinson, EBI An Introduction to DAS. Summary of Topics What is Data Integration? Problems in Data Integration An architectural overview of DAS.

Similar presentations


Presentation on theme: "Andy Jenkinson, EBI An Introduction to DAS. Summary of Topics What is Data Integration? Problems in Data Integration An architectural overview of DAS."— Presentation transcript:

1 Andy Jenkinson, EBI An Introduction to DAS

2 Summary of Topics What is Data Integration? Problems in Data Integration An architectural overview of DAS Brief History of DAS

3 What is Data Integration

4 All These are Data Integration Reading some papers so you can write a report Exploring some database websites so you can learn about a topic Downloading some data from different databases so you can analyse it Downloading some data from different databases so you can combine it with your own

5 All These are Data Integration Reading some papers so you can write a report Exploring some database websites so you can learn about a topic Downloading some data from different databases so you can analyse it Downloading some data from different databases so you can combine it with your own

6 Data Integration “Automatic” data integration pulling in data from different locations processing it creating a resource derived from the data done via computers, not humans e.g. creating/updating a data warehouse Warehouse PDBEnsemblUniProt

7 Warehouse model

8

9 Databases are all different

10 Databases evolve

11 Data ages

12 Databases are big

13 Distributed Annotation System Distributed Client-Server architecture Federation RESTful web services

14 Warehouse model

15 DAS model

16 Architectural Overview

17 DAS Databases are all different DAS is a uniform facet of a database – always the same Databases change their structure when the database changes, DAS stays the same Databases are updated DAS data comes directly from the provider so is always fresh Databases are big DAS uses real-time targeted queries

18 History Developed circa 1999 for sharing genome annotations Expanded 2004 onwards more data types better metadata addition of Registry DAS/2 project split from DAS, not backwards compatible inspired some DAS developments

19 To Summarise… The Distributed Annotation System is… A network of biological data sources An example of federation A collection of REST web services The DAS Protocol is… An integration platform A client-server protocol An agreed standard

20 Image Credits Flickr/muir.ceardach Flickr/Horia Varlan Flickr/Alessandro Pinna Fotopedia/Jean-Marie Hullot listicles.com/?p=3485 Google Earth/Cnes/Spot Image Olivier H. Beauchesne


Download ppt "Andy Jenkinson, EBI An Introduction to DAS. Summary of Topics What is Data Integration? Problems in Data Integration An architectural overview of DAS."

Similar presentations


Ads by Google