VI-SEEM Data Discovery Service


VI-SEEM Data Discovery Service Vladimir Dimitrov Institute of Information and Communication Technologies (IICT-BAS)

Agenda
  VI-SEEM Data Discovery Service (DDS)
  Software implementation
  Hardware implementation
  Features
  DDS front end
  Example screens
  Data Synchronization Tool
  Conclusion
Total of 18 slides.
VI-SEEM Regional Climate Training event, 11-13 October 2017, Belgrade, Serbia

VI-SEEM Data Discovery Service
  The VI-SEEM Data Discovery Service provides flexible search functions for the datasets and metadata used in the project.
    Main access point: https://search.vi-seem.eu
    Backup access point: https://dds.avitohol.acad.bg
  Use case: to make these datasets searchable by associating metadata with them. The datasets are hosted at the VI-SEEM Data Repository and elsewhere.

Software implementation
  The Data Discovery Service is based on a customized implementation of the CKAN system (https://ckan.org).
  CKAN is built with Python on the backend and JavaScript on the frontend. It uses the Pylons web framework, with SQLAlchemy (a database toolkit for Python, https://www.sqlalchemy.org) as its ORM (Object-Relational Mapping).
  Its database engine is PostgreSQL and its search is powered by the Apache Solr search platform (http://lucene.apache.org/solr).
  CKAN has a modular architecture that allows extensions to be developed to provide additional features such as harvesting or data upload.
  CKAN uses its internal model to store metadata about the different records and presents it on a web interface that allows users to browse and search this metadata.
  It also offers a powerful API that allows third-party applications and services to be built around it.
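As an illustration of the API mentioned above, the sketch below builds a dataset search request and reads the reply. It assumes the deployment exposes CKAN's standard Action API route (/api/3/action/package_search), which the slides do not confirm. The helper names and the sample response are hypothetical and only mirror the shape CKAN returns.

```python
import json
from urllib.parse import urlencode

# Main access point named on the slide. The API path used below is CKAN's
# standard Action API route and is ASSUMED to be enabled on this deployment.
BASE_URL = "https://search.vi-seem.eu"


def build_search_url(query, rows=10, base_url=BASE_URL):
    """Return the URL of a CKAN package_search request (hypothetical helper)."""
    params = urlencode({"q": query, "rows": rows})
    return f"{base_url}/api/3/action/package_search?{params}"


def parse_search_response(body):
    """Extract (total count, dataset titles) from a package_search JSON body."""
    payload = json.loads(body)
    if not payload.get("success"):
        raise ValueError("CKAN reported an unsuccessful request")
    result = payload["result"]
    titles = [dataset.get("title", "") for dataset in result["results"]]
    return result["count"], titles


# Hand-written sample in the shape CKAN returns; NOT real service output.
SAMPLE_BODY = json.dumps({
    "success": True,
    "result": {
        "count": 1,
        "results": [{"name": "demo", "title": "Demo dataset"}],
    },
})

if __name__ == "__main__":
    print(build_search_url("climate"))
    print(parse_search_response(SAMPLE_BODY))
```

To query the live service, the URL from build_search_url could be passed to urllib.request.urlopen and the response body handed to parse_search_response.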

Hardware implementation
  Currently VI-SEEM DDS is installed on one physical server with:
    two Intel Xeon 2.6 GHz CPUs, 4 physical cores each
    16 GB DDR2 RAM
    250 GB SATA HDD
    1 Gbps UTP Ethernet external connectivity
  Hosted and maintained by IICT-BAS.
  So far the service has not shown heavy demand for HPC resources.

Features
  User Interface
    Web-based, for search queries, end users and system administrators
    Search and retrieval of items by browsing or searching the metadata
  Workflow
    Enables differing searches for communities (groups) and organizations
    Models "e-people" who have "roles" in the workflow of a particular community (group) in the context of a given dataset collection
    Integrated with the VI-SEEM Data Repository
      Data Synchronization Tool
  Access Control
    Most of the features are available to all users. However, editing particular metadata records requires free registration and requesting a role for a particular community or organization.
    Integrated with the VI-SEEM Login Service (work in progress)
  Metadata Schema
    Follows the guidelines for the VI-SEEM Data Repository
    Possibility to add new metadata schemas, different from the above

Data Discovery Service front end
  The service is organized into Organizations and Groups.
  Organizations are the member institutions of the VI-SEEM project. However, it is easy to add departments, labs, schools etc.
  Groups correspond to the main VI-SEEM communities: Climate Sciences, Digital Cultural Heritage and Life Sciences.
  There are also a Software projects group and a general-purpose Generic group.
  Finally, there are groups and organizations (named Zen*) for testing purposes only. These could be deleted at any time.

Example screen: Organizations

Example screen: Groups

Example screen: simple browsing datasets

Example search

Example dataset view

Example: User registration form

Example: Add/create a new dataset

VI-SEEM datasets synchronization tool
  Synchronizes the VI-SEEM repo with the VI-SEEM search service.
  Written in Python 3. Uses only modules from the Python Standard Library.
  Runs in either create or update mode:
    create (the default) filters out already synchronized items and uploads only new items from the repo to search.vi-seem.eu;
    update, if specified, checks and updates the metadata of all items.
  Two cron jobs, one for each mode, synchronize the data automatically every 24 hours.
  Records every unsuccessful upload in a log file.
  Developed and supported by IICT-BAS.
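The two modes described above can be sketched as follows. This is not the IICT-BAS tool itself. All function names and the data layout are hypothetical, and the sketch assumes that "checked and updated" means an item is re-sent only when its repository metadata differs from what is already indexed.

```python
import logging

logger = logging.getLogger("sync_sketch")  # hypothetical logger name


def plan_sync(repo_items, indexed_items, mode="create"):
    """Return the ids of the items that should be sent to the search service.

    repo_items    -- dict: item id -> metadata dict from the data repository
    indexed_items -- dict: item id -> metadata dict already in the search service
    mode          -- "create" (default) or "update"
    """
    if mode == "create":
        # Filter out already synchronized items, keep only new ones.
        return [i for i in sorted(repo_items) if i not in indexed_items]
    if mode == "update":
        # ASSUMPTION: only items whose metadata changed are updated.
        return [
            i for i in sorted(repo_items)
            if i in indexed_items and repo_items[i] != indexed_items[i]
        ]
    raise ValueError(f"unknown mode: {mode!r}")


def run_sync(repo_items, indexed_items, upload, mode="create"):
    """Upload the planned items; log and return the ids that failed."""
    failed = []
    for item_id in plan_sync(repo_items, indexed_items, mode):
        try:
            upload(item_id, repo_items[item_id])
        except Exception as exc:  # record every unsuccessful upload
            logger.error("upload failed for %s: %s", item_id, exc)
            failed.append(item_id)
    return failed
```

Here upload stands for whatever callable pushes one record to the search service. Attaching a logging.FileHandler to the logger would write the failures to a log file, as the slide describes.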

Synchronization with VI-SEEM Data repository brief workflow

Metadata matching
  The tool converts the metadata tags of each item in the following way:

  Data Repository (DSpace)            Data Discovery Service (CKAN)
  title                               Name
  lastModified                        modified
  dc.identifier.uri                   url
  dc.type                             type
  dc.contributor.author (multiple)    author
  dc.creator (multiple)               author
  dc.description                      notes
  dc.description.abstract             description
  dc.description.sponsorship          sponsorship
  dc.publisher                        organization
  dc.rights                           rights
  dc.rights.uri                       rights_uri
  Collection                          collection
  dc.subject (multiple)               subject
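The mapping above could be expressed as a simple lookup table. The sketch below is illustrative only: the convert function is hypothetical, the record layout (plain strings or lists of strings per tag) is an assumption, and joining multiple values with ", " is a choice made for the example, not something the slides specify.

```python
# Tag mapping copied from the slide: DSpace tag -> CKAN field.
FIELD_MAP = {
    "title": "Name",
    "lastModified": "modified",
    "dc.identifier.uri": "url",
    "dc.type": "type",
    "dc.contributor.author": "author",
    "dc.creator": "author",
    "dc.description": "notes",
    "dc.description.abstract": "description",
    "dc.description.sponsorship": "sponsorship",
    "dc.publisher": "organization",
    "dc.rights": "rights",
    "dc.rights.uri": "rights_uri",
    "Collection": "collection",
    "dc.subject": "subject",
}


def convert(dspace_record):
    """Convert a DSpace-style record to CKAN-style fields (hypothetical helper).

    Values may be a single string or a list of strings. Tags missing from
    FIELD_MAP are ignored. ASSUMPTION: multiple values, including several
    source tags that map to the same field, are joined with ", ".
    """
    collected = {}
    for source_tag, target_field in FIELD_MAP.items():
        if source_tag not in dspace_record:
            continue
        value = dspace_record[source_tag]
        values = value if isinstance(value, list) else [value]
        collected.setdefault(target_field, []).extend(values)
    return {field: ", ".join(values) for field, values in collected.items()}
```

For example, a record with two dc.contributor.author values and one dc.creator value would end up with a single author field listing all three names.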

Conclusion
  The Data Discovery Service is a CKAN-based search tool for indexing the datasets and metadata used during the VI-SEEM project.
  It uses a custom-developed dataset synchronization tool to index the VI-SEEM Data Repository, and possibly other external sources, on a regular basis.
Thank you for your attention. Questions?