The SADE mini-project of the EGI DARIAH Competence Centre

1 The SADE mini-project of the EGI DARIAH Competence Centre
Giuseppe LA ROCCA, INFN
EGI Community Forum 2015, Bari, Italy

2 Outline
- Introduction to the SADE mini-project
- INFN contributions in the EGI DARIAH CC
- Development of gLibrary 2.0
- The Node.js API framework LoopBack
- Customization of the Simple/Parallel Semantic Search Engine (SSE)
- Future activities
- Summary & Conclusions
EGI Community Forum November 2015, Bari, Italy

3 Storing and Accessing DARIAH (SADE) contents in EGI
The overall goal of this mini-project is to create a digital repository of DARIAH contents using [tool logos not preserved in transcript].
Datasets for this CC are provided by the Austrian Academy of Sciences (AAS):
- Headwords (about 50,000, A-Z) [1]
- Records (about 40,000 plants; about 70,000 in general) [2]
- Multimedia with link to audio file (examples; to be improved) [3]
- Multimedia with collection (about 3,000; planned to be published within the mini-project) [4]
- Multimedia connected to headword (about 3,000; planned to be digitized) [5]
- Project-specific biographies [6]
- Locations [7]

4 gLibrary in a nutshell
gLibrary is a platform developed by INFN that:
- provides a simple yet powerful system to organize, search, store and retrieve “digital assets” in distributed repositories built on Grid/Cloud/local storage infrastructures
- hides the underlying technical details from the users
“Digital asset”: a digital object plus its corresponding metadata
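The "digital object + metadata" pairing can be illustrated with a minimal sketch. This is not gLibrary code: the `DigitalAsset` class, the `search` helper, the URIs and the metadata keys are all hypothetical and only show the idea of searching assets by metadata while the object itself stays on some storage back end.

```python
from dataclasses import dataclass, field


@dataclass
class DigitalAsset:
    """A digital object plus its descriptive metadata (hypothetical model)."""
    object_uri: str                      # where the file itself is stored
    metadata: dict = field(default_factory=dict)


def search(assets, **criteria):
    """Return the assets whose metadata matches every given key/value pair."""
    return [
        asset for asset in assets
        if all(asset.metadata.get(key) == value for key, value in criteria.items())
    ]


# Made-up sample assets on two different storage back ends.
assets = [
    DigitalAsset("local:///scans/0001.tiff", {"type": "image", "lang": "de"}),
    DigitalAsset("grid:///audio/0002.wav", {"type": "audio", "lang": "de"}),
]
audio = search(assets, type="audio")
```

The caller only deals with metadata and an opaque URI, which is the sense in which the storage details stay hidden from the user.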

5 gLibrary 1.0 Architecture
[Architecture diagram. Labels: eToken service; Front ends; glibrary.ct.infn.it REST API; AuthN / AuthZ; Science Gateway; User Tracking DB; "Call gLibrary REST API through API Server Gateway"; Metadata Service; Local storage; Grid storage; Cloud storage; Authorization]

6 gLibrary 2.0 Architecture
[Architecture diagram. Labels: eToken service; glibrary.ct.infn.it REST API; User Tracking DB; Local storage; Grid storage; Cloud storage; Local DB for repo; SQL and/or NoSQL DBs]

7 gLibrary 2.0 features
- Removed the AMGA + PostgreSQL dependency
- In gLibrary 2.0 each repo can be configured to store metadata in a local DB or in remote DBs
- Metadata storage is no longer tied to a single DB. We can use:
  - PostgreSQL
  - MySQL
  - MongoDB
  - Oracle
  - SQL Server
  - Community connectors: CouchDB, SQLite, ArangoDB, etc.
- gLibrary 2.0 code and documentation are available online [links not preserved in transcript]
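The per-repository choice of metadata back end can be sketched as follows. This is an illustration of the design idea only, not gLibrary 2.0 or LoopBack code: the `REPO_CONFIG` layout, the `open_store` function, both store classes, and the repo names and sample values are assumptions made up for this example, with SQLite and an in-memory dict standing in for the real databases.

```python
import json
import sqlite3


class InMemoryStore:
    """Hypothetical metadata store backed by a plain dict."""

    def __init__(self):
        self._rows = {}

    def put(self, key, metadata):
        self._rows[key] = dict(metadata)

    def get(self, key):
        return self._rows.get(key)


class SQLiteStore:
    """Hypothetical metadata store backed by SQLite, one JSON document per key."""

    def __init__(self, path=":memory:"):
        self._db = sqlite3.connect(path)
        self._db.execute(
            "CREATE TABLE IF NOT EXISTS meta (key TEXT PRIMARY KEY, doc TEXT)"
        )

    def put(self, key, metadata):
        self._db.execute(
            "INSERT OR REPLACE INTO meta VALUES (?, ?)", (key, json.dumps(metadata))
        )
        self._db.commit()

    def get(self, key):
        row = self._db.execute(
            "SELECT doc FROM meta WHERE key = ?", (key,)
        ).fetchone()
        return json.loads(row[0]) if row else None


# Registry of available back ends and a made-up per-repo configuration.
BACKENDS = {"memory": InMemoryStore, "sqlite": SQLiteStore}
REPO_CONFIG = {
    "headwords": {"backend": "sqlite", "options": {"path": ":memory:"}},
    "multimedia": {"backend": "memory", "options": {}},
}


def open_store(repo):
    """Instantiate the metadata store configured for the given repo."""
    cfg = REPO_CONFIG[repo]
    return BACKENDS[cfg["backend"]](**cfg["options"])


store = open_store("headwords")
store.put("h1", {"headword": "sample"})
```

Because both stores expose the same `put`/`get` interface, switching a repo to another database is a configuration change rather than a code change, which is the point the slide makes.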

8 The Parallel Semantic Search Engine (SSE)
- The Semantic Search Engine (SSE) is a framework conceived to demonstrate the potential of Open Access Data infrastructures coupled with semantic web technologies to address the issues of data discovery and correlation
- The framework aims to develop an open access infrastructure for linking scientists and scientific data/information resources
- The SSE framework has been developed within the CHAIN-REDS project
- The Semantic Search Engine code and documentation are available online [links not preserved in transcript]
CHAIN-REDS School on Cloud Computing, Catania

9 The Architecture of the SSE
[Architecture diagram. Labels: Linked-data search engine; Semantic-web enrichment; Harvester (running on grid/cloud), shown twice; OAI-PMH end-points; OAI-PMH data repositories; OADRs]
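The harvesting step in the diagram relies on OAI-PMH. The sketch below shows how a harvester could build a `ListRecords` request and read one response page using only the Python standard library. It is not the SSE harvester: the endpoint URL, the function names and the sample XML are assumptions, and the example parses a canned response instead of contacting a real repository.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

OAI = "{http://www.openarchives.org/OAI/2.0/}"
DC = "{http://purl.org/dc/elements/1.1/}"


def list_records_url(endpoint, resumption_token=None):
    """Build a ListRecords request; a resumption token replaces other arguments."""
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    return endpoint + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (records, next_token) extracted from one ListRecords response page."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iter(OAI + "record"):
        identifier = rec.findtext(OAI + "header/" + OAI + "identifier")
        titles = [t.text for t in rec.iter(DC + "title")]
        records.append({"identifier": identifier, "titles": titles})
    token = root.findtext(".//" + OAI + "resumptionToken")
    return records, (token or None)


# Canned, made-up response page used instead of a network call.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
 <ListRecords>
  <record>
   <header><identifier>oai:example.org:1</identifier></header>
   <metadata>
    <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
               xmlns:dc="http://purl.org/dc/elements/1.1/">
     <dc:title>Sample record</dc:title>
    </oai_dc:dc>
   </metadata>
  </record>
  <resumptionToken>page-2</resumptionToken>
 </ListRecords>
</OAI-PMH>"""

records, next_token = parse_list_records(SAMPLE)
next_url = list_records_url("https://example.org/oai", next_token)
```

A real harvester would repeat the request with each returned token until the response carries no token, then hand the collected records to the enrichment step.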

10 Semantic enrichment

11 The Parallel Semantic Search Engine for EGI DARIAH CC
More than 30 million resources, almost 600 million triples
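For readers unfamiliar with the term, a triple is a (subject, predicate, object) statement about a resource, so the figures above work out to roughly 20 triples per resource. The sketch below shows how one harvested Dublin Core record might expand into several triples. The `record_to_triples` function, the resource URI and the sample values are hypothetical, and the actual SSE ontology and vocabulary may differ.

```python
DC_NS = "http://purl.org/dc/elements/1.1/"


def record_to_triples(subject_uri, record):
    """Expand a {field: [values]} record into (subject, predicate, object) triples."""
    triples = []
    for field_name, values in record.items():
        for value in values:
            triples.append((subject_uri, DC_NS + field_name, value))
    return triples


# Made-up record: one title and two creators give three triples.
record = {"title": ["Sample record"], "creator": ["A. Author", "B. Author"]}
triples = record_to_triples("http://example.org/resource/1", record)
```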

12 Future activities
- Semantically enrich and correlate the present Linked Data: CHAIN-REDS KB, OpenAgris, Europeana, Isidore, PubMed, Cultura Italia
- Engage with: GEONAMES, GERMANET, ISLEX, DBPEDIA, etc.
- Extend the ontology based on Protégé to include DARIAH-specific knowledge
- Add to the current functionalities (Google Scholar and LodLive) others required by DARIAH: GIS, GeoBrowser

13 Summary & Conclusions
- The gLibrary framework has been enhanced to better support the requirements coming from the A&H VRC
- The Parallel Semantic Search Engine is under development to include the newly requested user scenarios, domain-specific contents and Linked Data
