Data Catalogue Service Work Package 4. Main Objective: Deployment, Operation and Evaluation of a cataloguing service for scientific data. Why: Potential.

Slides:



Advertisements
Similar presentations
SWaNI Project Update Report April Project Outcomes Under review, might not all be possible in conjunction with Skillnet or SITS Interoperability.
Advertisements

Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
UKOLN is supported by: Put functionality Augmenting interoperability across scholarly repositories 20/21 April 2006 Rachel Heery, UKOLN, University of.
Supersites infrastructure: data access services and tools Chuck Meertens, Jeff McWhirter, Fran Boler, Stuart Wier, Susanna Gross, Scott Baker, UNAVCO Supersites.
IUFRO International Union of Forest Research Organizations Eero Mikkola Description of WP2 – NEFIS Metadata and Controlled Vocabularies Standards - work.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
Slide: 1 Welcome to the workshop ESRFUP-WP7 User Single Entry Point.
Introduction on WP7/WP9 Dominique PORTE 29/05/2008 Menu What is WP7? What is WP9? Goal of the brainstorming Introduction on WP7/WP9.
CHORUS Implementation Webinar May 16, 2014 Mark Martin Assistant Director, Office of Scientific and Technical Information Office of Science U.S. Department.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.
Federated data catalogues supporting cross-facility, cross- discipline interaction at the scale of atoms and molecules Neutron diffraction X-ray diffraction.
The SMARTFREIGHT project Hans Westerheim SINTEF ICT.
Thee-Framework for Education & Research The e-Framework for Education & Research an Overview TEN Competence, Jan 2007 Bill Olivier,
SpaceGRID and EGSO Satu Keski-Jaskari Maria Vappula Parallal Computing – Seminar
Software Architecture April-10Confidential Proprietary Master Data Management mainly inspired from Enterprise Master Data Management – An SOA approach.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
PaN-data WP4 - Users Gordon Brown STFC-e-Science Alun Ashton DLS Bill Pulford DLS.
PaN-data Meeting 4-5 October 2010 HZB, Berlin. Project Summary.
PaNdata Europe Midpoint workshop 8-10 February 2011 Soleil, Paris PaN-data Europe – building a sustainable data infrastructure for Neutron and Photon laboratories.
Mantychore Oct 2010 WP 7 Andrew Mackarel. Agenda 1. Scope of the WP 2. Mm distribution 3. The WP plan 4. Objectives 5. Deliverables 6. Deadlines 7. Partners.
Benchmarking in WP 2.1. Sep 28th, 2004 © R. García-Castro, A. Gómez-Pérez Raúl García-Castro Asunción Gómez-Pérez September 28th, 2004 Benchmarking.
Metadata for Large Science: The ICAT Data Model Brian Matthews, Leader, Scientific Applications Group, E-Science Centre, STFC Rutherford Appleton Laboratory.
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
Page 1 Informatics Pilot Project EDRN Knowledge System Working Group San Antonio, Texas January 21, 2001 Steve Hughes Thuy Tran Dan Crichton Jet Propulsion.
Testing and Improving Interoperability The Z39.50 Interoperability Testbed William E. Moen School of Library and Information Sciences Texas Center for.
Benchmarking Methodology. Sep 27th, 2004 © R. García-Castro, A. Gómez-Pérez Raúl García-Castro Asunción Gómez-Pérez September 27th, 2004 Benchmarking.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
BlogForever Project Presentation Vangelis Banos, Project Manager, ALTEC Software Stratos Arampatzis, Dissemination Manager, Tero Dr. Alexandra Cristea,
Federated Discovery and Access in Astronomy Robert Hanisch (NIST), Ray Plante (NCSA)
SmartNets Results Overview SmartNets SmartNets Methods.
Jamie Hall (ILL). SciencePAD Persistent Identifiers Workshop PANData Software Catalogue January 30th 2013 Jamie Hall Developer IT Services, Institut Laue-Langevin.
Workpackage 2: Implementation Infrastructure. WP2: Objectives Main Objective of WP2: Integrated Optique Platform Main Objective of WP2: Integrated Optique.
WP5 – Virtual Laboratories. WP5 Deliverables  D5.1: Specific requirements for the virtual laboratories M6  D5.2: Deployment of Specification of the.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval Annual Review Meeting - Introduction.
Metadata Mòrag Burgon-Lyon University of Glasgow.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Unit 18 Advanced Database Design
Metadata for structural science Workshop on research metadata in context Nijmegen, 7–8 September 2010 Simon Lambert STFC e-Science UK.
ICAT Schema Current Schema organization What’s there but not yet implemented What could we want in the future 1 ICAT developer workshop, August 2009.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
WP3 Information and Monitoring Rob Byrom / WP3
PaNdata ODI Open Data Infrastructure INFRA : Data infrastructures for e-Science PaNdata-ODI will develop, deploy and operate an Open Data Infrastructure.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
Core Task Status, AR Doug Nebert September 22, 2008.
INFSO-RI SA2 ETICS2 first Review Valerio Venturi INFN Bruxelles, 3 April 2009 Infrastructure Support.
ICAT Status Alistair Mills Project Manager Scientific Computing Department.
CRISP WP 17 1 / 2 Proposed Metadata Catalogue Architecture Document.
E-infrastructure requirements from the ESFRI Physics, Astronomy and Analytical Facilities cluster Provisional material based on outcome of workshop held.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Share.TEC realizing the vision.
INTAROS WP5 Data integration and management
Capacity Building Enhance the coordination of efforts to strengthen individual, institutional and infrastructure capacities, particularly in developing.
Pandata Service Verification
EGI-Engage Engaging the EGI Community towards an Open Science Commons
PaNdata Photon and Neutron Data Infrastructure Juan Bicarregui
ESS roadmap on Linked Open Data State of play
ITDG meeting of of October 2011
Mirjam van Daalen, (Stephan Egli, Derek Feichtinger) :: Paul Scherrer Institut Status Report PSI PaNDaaS2 meeting Grenoble 12 – 13 December 2016.
JISC and SOA A view Robert Sherratt.
IPET-DD-1 meeting Feb 2019 Thorsten Busselberg -DWD
IPET-DD-1 meeting Feb 2019 Thorsten Busselberg -DWD
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
Expand portfolio of EGI services
Presentation transcript:

Data Catalogue Service Work Package 4

Main Objective: Deployment, Operation and Evaluation of a cataloguing service for scientific data. Why: Potential benefits beyond the convenience of powerful data searching/retrieving. Nov. 4, 2011WP4

Outcomes develop the generic software infrastructure to support the interoperation of facility data catalogues, deploy this software to establish a federated catalogue of data across the partners, provide data services based upon this generic framework which will enable users to deposit, search, visualise, and analyse data across the partners’ data repositories, evaluation of the service (also from the perspective of facility users) manage jointly the evolution of this software and the services based upon it, promote the take up of this technology and the services based upon it beyond the project. Nov. 4, 2011WP4

Relations and dependencies 1.user AAA services (WP3) 2.Virtual Laboratories (WP5) 3.Requires an established shared user AAA service – underpin the integrated data catalogue both of these are required to enable seamless access to the content through the virtual laboratories. Nov. 4, 2011WP4

Methodology Builds on: PaNdata Support Action user AAA services – in order to provide: service to the virtual labs No intention for a new metadata catalogue ICAT STFC’s ICAT is an advanced implementation Deployed in various facilities including Elettra/NFFA (+VCR) Comparison with other systems will be necessary MCA, MCAT, Artemis and Fireman. (outdated candidates?) – Check: AMGA (Fireman replacement in GLite) Nov. 4, 2011WP4

The current system will need further development. Issues that have to be addressed: how to link logical files (indexed by metadata) to physical files how to query metadata how to authorize user access to metadata (WP3 feedback?) what API to propose to programs to access metadata and data – (ICAT API at the catalogue level - pHDF5/ NeXus, Common Data Model? For the actual data in, line with PaNdata) Nov. 4, 2011WP4

Additional Should we “migrate” old files / archived datasets too? (converters?) Initial requirement Set of keywords for the metadata catalogue Expansion based on existing implementations + PaNdata SA Integration WP outcome + Dublin Core? Nov. 4, 2011WP4

Populating the catalogue virtual laboratories (WP5) – demonstration & test Existing data archives of other partners – May require converters + metadata generation Distributed access accessing data distributed over multiple sites via their metadata performance and scalability will be evaluated (as elaborated in WP5) Nov. 4, 2011WP4

Task 4.1 Survey existing systems – ICAT and other Examine them against the metadata, authorisation, performance, and ontological requirements of vLab (WP5) and uCAT AAA (WP3) Task 4.2. Deployment of the chosen metadata catalogue solution (=ICAT) Task 4.3. Remote API access to the individual catalogues Single search capability across the collaborating facilities. Task 4.4. Benchmarking - evaluation of the performance. Nov. 4, 2011WP4

Indicators of success 1.Searchable data catalogue established in participating facilities (more than 50% uptake) 2.Cross facility searching in place for data from different facilities. Nov. 4, 2011WP4

Deliverables D4.1. Requirements analysis for common data catalogue (M9: June 2012) D4.2. Populated metadata catalogue with data from the virtual laboratories (M15: Dec. 2012) D4.3 : Deployment of cross-facility metadata searching (M21: June 2013) D4.4. Benchmark of performance of the metadata catalogue (M27: Dec. 2013) Nov. 4, 2011WP4