DiGIR: Distributed Databases and Applications. John Wieczorek, Museum of Vertebrate Zoology, UC Berkeley.

1 DiGIR 1 Distributed Databases and Applications. John Wieczorek, Museum of Vertebrate Zoology, UC Berkeley

2 DiGIR 2 Distributed Databases – Discipline-specific: The Species Analyst (TSA); The Integrated Taxonomic Information System (ITIS); FishNet; The Mammal Networked Information System (MaNIS); HerpNET; The Ornithological Information System (ORNIS); …

3 DiGIR 3 Distributed Databases – International: European Natural History Science Information Network (ENHSIN); Biological Collection Access for Europe (BioCASE); Australia Virtual Herbarium (AVH); Red Mundial de Información Sobre Biodiversidad, Comisión Nacional para el Conocimiento y Uso de la Biodiversidad (REMIB, CONABIO); …

4 DiGIR 4 Distributed Databases – Regional: Mountain and Plains Spatio-Temporal Database-Informatics (MaPSTeDI); Ocean Biogeographic Information System (OBIS); Pacific Basin Information Node, National Biological Information Infrastructure (PBIN, NBII); Species Link, Centro de Referência em Informação Ambiental (Species Link, CRIA); A Virtual Herbarium of the Chicago Region (vPlants); Spatial Analysis of Local Vegetation Inventories Across Scales (SALVIAS); …

5 DiGIR 5 Distributed Databases – Intra-institutional: Berkeley Natural History Museums (BNHM); Association of Biological Collections, UC Davis; …

6 DiGIR 6 Distributed Databases – “Nodes”: LifeMapper; National Biological Information Infrastructure (NBII); Global Biodiversity Information Facility (GBIF)

7 DiGIR 7 GBIF Work Programmes: NODES; ECAT – Electronic Catalogue of Names of Known Organisms; DIGIT – Digitisation of Natural History Collections; OCB – Outreach and Capacity Building; DADI – Data Access and Database Interoperability

8 DiGIR 8 Taxonomic Databases Working Group (TDWG). Standards development and maintenance: Access to Biological Collections Data (ABCD); Darwin Core Version 2 (DwC2); Structure of Descriptive Data (SDD); DiGIR; Others…

9 DiGIR 9 DiGIR: Distributed Generic Information Retrieval. John Wieczorek, Stan Blum, Dave Vieglais, P.J. Schwartz

10 DiGIR 10 Information Retrieval. Distributed: a protocol for retrieving structured data from multiple, heterogeneous databases across the Internet. Generic: a protocol independent of the data retrieved and of the software used to retrieve it.

11 DiGIR 11 Project Rationale: Avoid multiple incongruous development efforts; Pool resources and create a support community of experts; Solve scalability problems

12 DiGIR 12 Design Goals: Use open protocols and standards, such as HTTP and XML; Decouple the protocol, software, and semantics; Make new data provider installations as easy as possible; Develop open source software under the GNU General Public License (it’s free).

13 DiGIR 13 DiGIR Component Summary

14 DiGIR 14 DiGIR Architecture (diagram): Provider

15 DiGIR 15 Provider: Receives requests; Retrieves data from the database; Sends results to the requestor; Supplies metadata to describe content, contacts, and capabilities; Logs requests
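The provider responsibilities listed on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Provider` class, its dict-based requests, the field names, and the two sample records are all invented for the example and are not the DiGIR provider software or its message format.

```python
import logging
from dataclasses import dataclass, field

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("provider")


@dataclass
class Provider:
    """Toy provider; every name here is hypothetical, not the DiGIR API."""
    name: str
    contact: str
    records: list[dict] = field(default_factory=list)      # stands in for the database
    request_log: list[dict] = field(default_factory=list)  # stands in for a log file

    def metadata(self) -> dict:
        # Describe content, contacts, and capabilities.
        fields = sorted({key for rec in self.records for key in rec})
        return {
            "name": self.name,
            "contact": self.contact,
            "record_count": len(self.records),
            "searchable_fields": fields,
        }

    def handle(self, request: dict) -> dict:
        # Log the request, then answer it from the local data.
        self.request_log.append(request)
        log.info("%s received a %s request", self.name, request["type"])
        if request["type"] == "metadata":
            return {"provider": self.name, "metadata": self.metadata()}
        if request["type"] == "search":
            wanted = request["filter"]  # e.g. {"Genus": "Manis"}
            hits = [r for r in self.records
                    if all(r.get(k) == v for k, v in wanted.items())]
            return {"provider": self.name, "records": hits}
        return {"provider": self.name,
                "error": f"unsupported request type {request['type']!r}"}


# Invented sample records, for illustration only.
mvz = Provider("MVZ", "curator@example.org", [
    {"Genus": "Manis", "CatalogNumber": "demo-1"},
    {"Genus": "Sorex", "CatalogNumber": "demo-2"},
])
result = mvz.handle({"type": "search", "filter": {"Genus": "Manis"}})
print(result["records"])
```

The point of the sketch is the separation of duties: the provider knows its own database and describes itself through metadata, so callers never need direct database access.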

16 DiGIR 16 DiGIR Architecture (diagram): Portal Engine

17 DiGIR 17 Portal Engine: The entry point for an application; Can query a registry to discover potential providers; Can determine, based on provider metadata, whether a provider should be queried; Can send requests to multiple providers

18 DiGIR 18 Portal Engine, continued: Assembles responses from providers; Returns packaged results to the requesting application; Communicates via protocol-compliant messaging only; Logs activity
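A minimal sketch of the portal engine behaviour described on the two slides above: choose providers from their metadata, send the request to each, and assemble the responses. The `PortalEngine` class, the `searchable_fields` metadata key, and the use of a thread pool are assumptions made for this example; the actual DiGIR portal engine may work quite differently.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

# A provider is modelled as a function from a request dict to a list of records.
ProviderFn = Callable[[dict], list[dict]]


def make_provider(records: list[dict]) -> ProviderFn:
    """Build a hypothetical in-memory provider over the given records."""
    def search(request: dict) -> list[dict]:
        wanted = request["filter"]
        return [r for r in records
                if all(r.get(k) == v for k, v in wanted.items())]
    return search


class PortalEngine:
    def __init__(self, providers: dict[str, ProviderFn], metadata: dict[str, dict]):
        self.providers = providers
        self.metadata = metadata            # provider name -> advertised metadata
        self.activity_log: list[str] = []

    def should_query(self, name: str, request: dict) -> bool:
        # Skip providers whose metadata says they cannot answer this filter.
        fields = self.metadata[name]["searchable_fields"]
        return all(key in fields for key in request["filter"])

    def search(self, request: dict) -> dict:
        targets = [n for n in self.providers if self.should_query(n, request)]
        self.activity_log.append(f"querying {targets}")
        # Send the same request to every selected provider concurrently.
        with ThreadPoolExecutor(max_workers=max(1, len(targets))) as pool:
            futures = {n: pool.submit(self.providers[n], request) for n in targets}
            responses = {n: f.result() for n, f in futures.items()}
        # Assemble one packaged result, tagging each record with its source.
        merged = [dict(rec, provider=n)
                  for n, recs in responses.items() for rec in recs]
        return {"queried": targets, "records": merged}


engine = PortalEngine(
    providers={
        "A": make_provider([{"Genus": "Manis", "CatalogNumber": "demo-1"}]),
        "B": make_provider([{"Genus": "Manis", "CatalogNumber": "demo-7"},
                            {"Genus": "Sorex", "CatalogNumber": "demo-8"}]),
        "C": make_provider([{"Family": "Manidae"}]),
    },
    metadata={
        "A": {"searchable_fields": ["Genus", "CatalogNumber"]},
        "B": {"searchable_fields": ["Genus", "CatalogNumber"]},
        "C": {"searchable_fields": ["Family"]},
    },
)
response = engine.search({"filter": {"Genus": "Manis"}})
print(response["queried"], len(response["records"]))
```

Provider "C" is never contacted here because its advertised fields do not include `Genus`, which is the kind of metadata-based decision the slide describes.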

19 DiGIR 19 Registry: Provides a “yellow pages” to advertise the existence and capabilities of a provider; Provides a means to discover potential providers of interest; May be public or private; Need not be a part of the architecture
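The register and discover roles of a registry can be illustrated with a small in-memory lookup table. The `Registry` class, the capability strings, and the `example.org` endpoints are hypothetical; the slides do not specify how a real registry stores or exposes its entries.

```python
from dataclasses import dataclass, field


@dataclass
class Registry:
    """A toy "yellow pages" of providers; the structure is an assumption."""
    entries: dict[str, set[str]] = field(default_factory=dict)

    def register(self, endpoint: str, capabilities: set[str]) -> None:
        # A provider advertises its existence and what it can do.
        self.entries[endpoint] = set(capabilities)

    def discover(self, required: set[str]) -> list[str]:
        # A portal engine asks for providers offering everything it needs.
        return sorted(url for url, caps in self.entries.items()
                      if required <= caps)


registry = Registry()
registry.register("http://a.example.org/provider", {"search", "metadata"})
registry.register("http://b.example.org/provider", {"search", "metadata", "inventory"})

found = registry.discover({"inventory"})
print(found)

# The registry is optional: a portal can equally be given a fixed provider list.
static_providers = ["http://a.example.org/provider"]
```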

20 DiGIR 20 DiGIR Architecture (diagram): Provider, Registry (register)

21 DiGIR 21 DiGIR Architecture (diagram): Portal Engine, Registry (discover)

22 DiGIR 22 DiGIR Protocol: Defines request and response message formats for communication between provider, portal engine, and applications (metadata requests, search requests, inventory requests); Remains unfettered by the structure of the data it transfers
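To make the idea of a data-independent message format concrete, here is a sketch that builds XML request documents for the three request types named on the slide. The element names (`request`, `header`, `type`, `destination`, `filter`, `equals`) are placeholders chosen for this example and should not be read as the actual DiGIR schema; only the Python standard library is used.

```python
import xml.etree.ElementTree as ET
from typing import Optional

# The three request types named on the slide.
REQUEST_TYPES = {"metadata", "search", "inventory"}


def build_request(req_type: str, destination: str,
                  body: Optional[ET.Element] = None) -> ET.Element:
    """Wrap an optional body in a hypothetical request envelope."""
    if req_type not in REQUEST_TYPES:
        raise ValueError(f"unknown request type: {req_type!r}")
    root = ET.Element("request")
    header = ET.SubElement(root, "header")
    ET.SubElement(header, "type").text = req_type
    ET.SubElement(header, "destination").text = destination
    if body is not None:
        root.append(body)
    return root


def equals_filter(field_name: str, value: str) -> ET.Element:
    # The envelope never inspects field_name: the data schema (for example,
    # Darwin Core terms) is supplied by the caller, not fixed by the protocol.
    flt = ET.Element("filter")
    comparison = ET.SubElement(flt, "equals")
    ET.SubElement(comparison, field_name).text = value
    return flt


doc = build_request("search", "http://provider.example.org/endpoint",
                    equals_filter("Genus", "Manis"))
xml_text = ET.tostring(doc, encoding="unicode")
print(xml_text)
```

Because `equals_filter` accepts any field name, the same envelope could carry queries against a different schema without changing `build_request`, which is the decoupling the slide refers to.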

23 DiGIR 23 DiGIR Architecture (diagram): Application

24 DiGIR 24 DiGIR Architecture (diagram): Application, Protocol (request), Portal Engine

25 DiGIR 25 DiGIR Architecture (diagram): Application, Protocol (request), Portal Engine, Protocol (request), Provider

26 DiGIR 26 DiGIR Architecture (diagram): Application, Protocol (request), Portal Engine, Protocol (response), Provider

27 DiGIR 27 DiGIR Architecture (diagram): Application, Protocol (response), Portal Engine

28 DiGIR 28 Applications: Must be able to assemble and send a request document to a portal; Must be able to receive and interpret a response document from the portal; Must do something incredibly useful and interesting with the data. This is where the real fun is!
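The three application duties above (assemble a request, interpret the response, do something with the data) can be sketched end to end. Everything here is assumed for illustration: `fake_portal` replaces a real HTTP round trip and returns a canned document, and the XML element and attribute names are invented rather than taken from the DiGIR schema.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def assemble_request(field_name: str, value: str) -> str:
    """Build a hypothetical search request document as a string."""
    root = ET.Element("request")
    flt = ET.SubElement(root, "filter")
    ET.SubElement(flt, field_name).text = value
    return ET.tostring(root, encoding="unicode")


def fake_portal(request_xml: str) -> str:
    """Stand-in for a portal: parses the request, returns a canned response."""
    ET.fromstring(request_xml)  # a real portal would validate and dispatch here
    return (
        "<response>"
        "<record provider='A'><Genus>Manis</Genus></record>"
        "<record provider='A'><Genus>Manis</Genus></record>"
        "<record provider='B'><Genus>Manis</Genus></record>"
        "</response>"
    )


def summarize(response_xml: str) -> Counter:
    # Interpret the response: count returned records per contributing provider.
    root = ET.fromstring(response_xml)
    return Counter(rec.get("provider") for rec in root.iter("record"))


request_xml = assemble_request("Genus", "Manis")
summary = summarize(fake_portal(request_xml))
print(dict(summary))
```

The per-provider tally is a deliberately modest stand-in for the "useful and interesting" step; a real application might map, validate, or georeference the returned records instead.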

29 DiGIR 29 Hot topics – Interesting problems: Persistent unique identifiers; Web services (converters, translators, calculators, transformation services); Data validation (taxonomic and geographic thesauri, ecological niche modeling, expedition analysis, outlier detection, …); Spatial query interfaces; Data quality feedback mechanisms; Automated georeferencing, event gazetteers; Concept libraries, schema extensions, schema libraries, federated ontologies

30 DiGIR 30 MaNIS – The Mammal Networked Information System. It’s more than just a pangolin…

31-50 DiGIR 31-50 MaNIS Network Configuration (animated diagram, 20 frames): institutional databases LACM (MS Access), MVZ (Sybase), LSUMZ (4D-Mac), UAM (Oracle), and CAS (SQL Server); three online MS Access databases; five DiGIR Providers; MaNIS DiGIR Portals; and the MVZ-MaNIS, UMNH-MaNIS, and UWBM-MaNIS presentation layers. Frames 39 to 49 show a single MaNIS DiGIR Portal with only the MVZ-MaNIS presentation layer; the final frame shows three portals and three presentation layers again, with added ellipses.

51 DiGIR 51 “CalNet” Network Configuration (diagram): LACM MS Access, MVZ Sybase, LSUMZ 4D-Mac, UAM Oracle, and CAS SQL Server databases; three online MS Access databases; five DiGIR Providers; CalNet DiGIR Portal

52 DiGIR 52 Event Gazetteer Network Configuration (diagram): LACM MS Access, MVZ Sybase, LSUMZ 4D-Mac, UAM Oracle, and CAS SQL Server databases; three online MS Access databases; five DiGIR Providers; CalNet DiGIR Portal; BioGeomancer Web Service

53 DiGIR 53 NBII Network Configuration (diagram): LACM MS Access, MVZ Sybase, LSUMZ 4D-Mac, UAM Oracle, and CAS SQL Server databases; three online MS Access databases; five DiGIR Providers; NBII DiGIR Portal

54 DiGIR 54 GBIF Network Configuration (diagram): LACM MS Access, MVZ Sybase, LSUMZ 4D-Mac, UAM Oracle, and CAS SQL Server databases; three online MS Access databases; five DiGIR Providers; NBII DiGIR Portal; GBIF Presentation Layers
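The network configuration slides show the same five institutional providers appearing behind several different portals. One way to picture that reuse is as a simple mapping from portals to providers. The data structure below is only a sketch: the portal and institution labels come from the slides, but the mapping itself and the helper function are assumptions, not a DiGIR configuration format.

```python
# Institution codes as they appear on the network configuration slides.
PROVIDERS = ["LACM", "MVZ", "LSUMZ", "UAM", "CAS"]

# Hypothetical topology: each portal lists the providers it may query.
PORTALS: dict[str, list[str]] = {
    "MaNIS": list(PROVIDERS),
    "CalNet": list(PROVIDERS),
    "NBII": list(PROVIDERS),
}


def portals_using(provider: str) -> list[str]:
    """Return every portal whose provider list includes this provider."""
    return sorted(name for name, members in PORTALS.items()
                  if provider in members)


print(portals_using("MVZ"))
```

Under this reading, an institution installs one provider and participates in as many networks as choose to include it, without changing its own setup.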

55 DiGIR 55 Intra-Network Configuration (BNHM) (diagram): working databases for PHMA, UCBG, UCJEPS, UCMP (4), and Essig; online databases; DiGIR Provider; BNHM DiGIR Portal; BNHM Presentation Layer

56-61 DiGIR 56-61 Other Network Configurations (animated diagram, 6 frames): alternative arrangements of working databases, online databases, DiGIR Providers, and DiGIR Portals. The final frame shows one working database with an online database, a DiGIR Provider, and a DiGIR Portal, alongside two further working databases.

62 DiGIR 62 Project Information: DiGIR is a collaborative open source development project on SourceForge (https://sourceforge.net/projects/digir). Software and documentation are available on the DiGIR web site (http://digir.net). MaNIS is an international network collaboration among mammal specimen collections (http://elib.cs.berkeley.edu/manis).

63 DiGIR 63 Hot topics – Interesting problems: Persistent unique identifiers; Web services (converters, translators, calculators, transformation services); Data validation (taxonomic and geographic thesauri, ecological niche modeling, expedition analysis, outlier detection, …); Spatial query interfaces; Data quality feedback mechanisms; Automated georeferencing, event gazetteers; Concept libraries, schema extensions, schema libraries, federated ontologies

