Measurement Data Archive – Project Highlights
GEC12, Nov 2011
Giridhar Manepalli, Corporation for National Research Initiatives


Why Archive?
- The obvious: for use by others or by yourself in the future
- The Fourth Paradigm
  - Data-intensive science
  - Emergent phenomena
- Funding bodies increasingly asking for data plans
- Citations from journal articles to data sets on the rise
- Consistent archiving standards enhance the use of data over time and within a domain

Measurement Data Archive
[Diagram labels: Experimenter X, Experimenter Y, Workspace, Slice = Data Model TBD, Public Journals, Internet]
Key:
1. Experiment Initiated
2. Measurement Data Collected
3. Measurement Data Archived
4. Archived Data Referenced
5. Archived Data Retrieved

Current Usage
- Early adopters in GENI:
  - OnTimeMeasure - Ohio State University
  - INSTOOLS - University of Kentucky
- Possible usage in other projects:
  - DARPA Transformative Apps program, for managing mobile apps related data
  - Internal to CNRI, for sharing documents and presentations across groups

Next Steps – I&M Standpoint
- Revisit the protocols for pushing data into workspace
- Associate metadata with data effectively
  - Where does the metadata live?
  - How is it associated with data?
  - At what level of granularity is it specified?
- Support GENI and I&M schemes of authentication, authorization, metadata enforcement, etc.
- Allow multiple workspace deployments
- Identify the process to push data from workspace into the archive
  - Should metadata be enforced before data is pushed into the archive?
  - How is the data serialized in the archive?
  - How is data visibility managed in the archive?

Next Steps – GENI-wide
- Extend services offered by the archive beyond data storage
  - Developed a visualization service prototype to demonstrate automatic visualization of data for DataCite
  - Designed a theoretical model for enforcing terms & conditions, licenses, etc. prior to disseminating data
- Goal: Expand archive into an eco-system to entice communities into using it
- Use archive for experiments, not just for I&M

SUITE OF SERVICES
Suite of extensible services end users can leverage by following the ID.
[Diagram labels: Science Times article (Article Title, Data ID); Archive Services; Ohio University VDC Experiment; Experimenter; Other Experiments; Other Experimenters; Archive (Stores & Retrieves); Data Visualization; License Enforcement ("I Agree" to Terms prompts); Data Set Dissemination; Data Processing]
1. User follows Data ID into the Archive
2. User is redirected to requested Archive Service

Related Slides

Prototype Limitations
- Only one workspace service is deployed
  - Multiple workspaces, within and outside GENI networks, can be hosted that push data to the archive
- Authentication and authorization model is simple and redundant
  - Should conform to and use one scheme across GENI (or at least across I&M)
- No metadata standard applied
  - I&M metadata requirements must be applied once identified

What is Metadata and Why Do I Need It?
- Lots of miscommunication because
  - Metadata is not a type of data
  - Metadata is a type of relationship between two pieces of data
- Needed for Understanding and Finding
  - Understanding (sometimes called Descriptive MD)
    - How do I parse this?
    - How do I interpret this?
  - Finding (sometimes called Subject MD)
    - Finding one item in a population of 10 is easy
    - Finding one item in a population of 1M is impossible w/o some way to distinguish them
- Generally requires a human in the loop at some level
  - Sometimes the object is self-describing (journal article)
  - Automatic indexing/classification works for some domains
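The point that metadata is a relationship rather than a kind of data can be made concrete with a minimal sketch. Everything here is an assumption made for illustration: the `DataItem` and `Describes` classes, the `descriptors_of` helper, and the item IDs are hypothetical and are not part of the archive prototype.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataItem:
    """Any piece of data; nothing marks it as 'metadata' by itself."""
    item_id: str
    content: str


@dataclass(frozen=True)
class Describes:
    """The metadata relationship: `descriptor` describes `subject`."""
    descriptor: DataItem
    subject: DataItem
    purpose: str  # "understanding" or "finding", following the slide


def descriptors_of(subject, relations, purpose=None):
    """Return the items that act as metadata for `subject`."""
    return [
        r.descriptor
        for r in relations
        if r.subject == subject and (purpose is None or r.purpose == purpose)
    ]


# Hypothetical items, for illustration only.
trace = DataItem("trace-1", "<binary packet capture>")
fmt_note = DataItem("note-1", "pcap format, timestamps in UTC")
keywords = DataItem("note-2", "latency; throughput")

relations = [
    Describes(fmt_note, trace, "understanding"),
    Describes(keywords, trace, "finding"),
]

finding = descriptors_of(trace, relations, purpose="finding")
print([d.item_id for d in finding])  # prints ['note-2']
```

In this sketch the notes are ordinary data items; they count as metadata only through the `Describes` link to the trace.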

Why is Metadata Hard?
- To be effective it must be consistent, and consistently applied, within a given domain
  - What is the scope of the domain?
  - What aspects of the object need to be described?
  - What is the vocabulary, and is it open or closed?
- Even within a defined domain, there are many points of view
  - Especially true for any sort of subject description
  - May have to allow for multiple metadata records for a single described object
- Spending time on creating good metadata is Good For You
  - The best sources for good metadata are the creators/owners of the described object, but they may lack interest and training
  - Some types of metadata are difficult to automate, e.g., a good title
- Keep it simple: favor consistency and coverage over depth

Misc Points
- Precision and Recall: useful concepts in searching
  - Precision: % of search results that are on target
  - Recall: % of the correct result set that my search retrieved
  - Desirable tradeoff is situational
- Consider University Libraries as reliable archive holders
- Variety of approaches to managing a useful vocabulary of terms
  - Controlled vocabulary: set of terms; use these instead of slight variations
  - Taxonomy: parent-child relationships
  - Ontologies: introduce other types of relationships
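The precision and recall definitions above can be worked through on a small example. This is a minimal sketch: the function name `precision_recall` and the data set IDs are hypothetical, and both measures are returned as fractions rather than percentages.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one search, as fractions in [0, 1]."""
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = retrieved & relevant  # search results that are on target
    # Guard against empty sets to avoid division by zero.
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical data set IDs, for illustration only.
relevant = {"ds-1", "ds-2", "ds-3", "ds-4"}  # the correct result set
retrieved = ["ds-1", "ds-2", "ds-9"]         # what the search returned

precision, recall = precision_recall(retrieved, relevant)
print(f"precision={precision:.2f} recall={recall:.2f}")
# prints: precision=0.67 recall=0.50
```

Here two of the three results are on target (precision 2/3), but the search found only two of the four correct items (recall 1/2). Broadening the query would typically raise recall at the cost of precision, which is why the desirable tradeoff is situational.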