Measurement Data Archive GEC11 July 2011 Giridhar Manepalli Corporation for National Research Initiatives

Why Archive?
- The obvious: for use by others or by yourself in the future
- The Fourth Paradigm
  - Data-intensive science
  - Emergent phenomena
- Funding bodies increasingly asking for data plans
- Citations from journal articles to data sets on the rise
- Consistent archiving standards enhance the use of data over time and within a domain

Measurement Data Archive
[Figure: data flow among Experimenter X, Experimenter Y, the Workspace, Public Journals, and the Internet. Other labels: "Slice", "= Data Model TBD".]
Key:
1. Experiment Initiated
2. Measurement Data Collected
3. Measurement Data Archived
4. Archived Data Referenced
5. Archived Data Retrieved

Prototype Limitations
- Only one workspace service is deployed
  - Multiple workspaces, within and outside GENI networks, can be hosted that push data to the archive
- Authentication and authorization model is simple and redundant
  - Should conform to and use one scheme across GENI (or at least across I&M)
- No metadata standard applied
  - I&M metadata requirements must be applied once identified

Current Usage
- Early adopters in GENI:
  - OnTimeMeasure - Ohio State University
  - INSTOOLS - University of Kentucky
- Possible usage in other projects:
  - DARPA Transformative Apps program, for managing mobile-app-related data
  - Internal to CNRI, for sharing documents and presentations across groups

Next Steps – I&M Standpoint
- Revisit the protocols for pushing data into workspace
- Associate metadata with data effectively
  - Where does the metadata live?
  - How is it associated with data?
  - At what level of granularity is it specified?
- Support GENI and I&M schemes of authentication, authorization, metadata enforcement, etc.
- Allow multiple workspace deployments
- Identify the process to push data from workspace into the archive
  - Should metadata be enforced before data is pushed into the archive?
  - How is the data serialized in the archive?
  - How is data visibility managed in the archive?

Next Steps – GENI-wide
- Extend services offered by the archive beyond data storage
  - Developed a visualization service prototype to demonstrate automatic visualization of data for DataCite
  - Designed a theoretical model for enforcing terms & conditions, licenses, etc. prior to disseminating data
- Goal: Expand archive into an ecosystem to entice communities into using it
- Use archive for experiments, not just for I&M

Suite of Services
[Figure: a Science Times article (Article Title, Data ID) points into the Archive. Other labels: Ohio University VDC Experiment; Experimenter; Other Experiments; Other Experimenters; Stores & Retrieves Data; Visualization; License Enforcement ("I Agree", "Terms:…"); Data Set Dissemination; Data Processing.]
Archive Services: suite of extensible services end users can leverage by following the ID.
1. User follows Data ID into the Archive
2. User is redirected to requested Archive Service
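
The flow on this slide (follow a Data ID into the archive, then get redirected to the requested service, with terms agreed to before dissemination) can be sketched roughly as below. This is only an illustration under stated assumptions: the `Archive` class, its method names, the `"id/123"` identifier, and the two services are all hypothetical and are not the archive's actual interface.

```python
class TermsNotAccepted(Exception):
    """Raised when a data set has terms the user has not agreed to."""


class Archive:
    """Hypothetical archive: stores data by ID and dispatches to services."""

    def __init__(self):
        self._records = {}   # data_id -> (payload, terms or None)
        self._services = {}  # service name -> function applied to the payload

    def store(self, data_id, payload, terms=None):
        self._records[data_id] = (payload, terms)

    def register_service(self, name, handler):
        # New services can be added without changing the archive itself.
        self._services[name] = handler

    def follow(self, data_id, service, agreed=False):
        """Resolve a Data ID, enforce terms, then hand off to a service."""
        payload, terms = self._records[data_id]
        if terms is not None and not agreed:
            raise TermsNotAccepted(terms)
        return self._services[service](payload)


archive = Archive()
archive.store("id/123", [12.0, 15.5, 11.2], terms="Cite the data set")
archive.register_service("retrieve", lambda payload: payload)
archive.register_service(
    "summary", lambda payload: {"n": len(payload), "max": max(payload)}
)
```

Calling `archive.follow("id/123", "summary")` without `agreed=True` raises `TermsNotAccepted`; the check sits in front of every service, so it applies uniformly to retrieval and to any service registered later.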

Related Slides

What is Metadata and Why Do I Need It?
- Lots of miscommunication because
  - Metadata is not a type of data
  - Metadata is a type of relationship between two pieces of data
- Needed for Understanding and Finding
- Understanding (sometimes called Descriptive MD)
  - How do I parse this?
  - How do I interpret this?
- Finding (sometimes called Subject MD)
  - Finding one item in a population of 10 is easy
  - Finding one item in a population of 1M is impossible w/o some way to distinguish them
  - Generally requires a human in the loop at some level
  - Sometimes the object is self-describing (journal article)
  - Automatic indexing/classification works for some domains

Why is Metadata Hard?
- To be effective it must be consistent, and consistently applied, within a given domain
  - What is the scope of the domain?
  - What aspects of the object need to be described?
  - What is the vocabulary? Is it open or closed?
- Even within a defined domain, there are many points of view
  - Especially true for any sort of subject description
  - May have to allow for multiple metadata objects for a single described object
- Spending time on creating good metadata is Good For You
- The best sources for good metadata are the creators/owners of the described object, but they may lack interest and training
- Some types of metadata are difficult to automate, e.g., a good title
- Keep it simple – favor consistency and coverage over depth
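
One way to picture "metadata is a relationship, not a type of data" together with "multiple metadata objects for a single described object" is the minimal sketch below. Every name in it (`DataObject`, `Catalog`, the IDs, the viewpoint labels) is a hypothetical choice for illustration, not a schema from the project.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DataObject:
    """Any stored item; nothing in the type marks it as 'metadata'."""
    object_id: str
    content: str


@dataclass
class Catalog:
    """Records 'describes' links as (describer_id, described_id, viewpoint)."""
    links: list = field(default_factory=list)

    def describe(self, describer, described, viewpoint):
        # An object becomes metadata only by being linked to what it describes.
        self.links.append((describer.object_id, described.object_id, viewpoint))

    def metadata_for(self, described):
        # A single object may be described from several points of view.
        return [
            (describer_id, viewpoint)
            for describer_id, described_id, viewpoint in self.links
            if described_id == described.object_id
        ]


catalog = Catalog()
trace = DataObject("data/1", "raw packet trace")
parse_md = DataObject("md/1", "format: pcap")
subject_md = DataObject("md/2", "subject: latency")
catalog.describe(parse_md, trace, "descriptive")
catalog.describe(subject_md, trace, "subject")
```

Here `catalog.metadata_for(trace)` returns both descriptions, while `catalog.metadata_for(parse_md)` is empty: the same kind of object plays the metadata role only through the link.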

Misc Points
- Precision and Recall are useful concepts in searching
  - Precision: % of search results that are on target
  - Recall: % of the correct result set that my search retrieved
  - Desirable tradeoff is situational
- Consider University Libraries as reliable archive holders
- Variety of approaches to managing a useful vocabulary of terms
  - Controlled vocabulary: set of terms – use these instead of slight variations
  - Taxonomy: parent-child relationships
  - Ontologies: introduce other types of relationships
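
The two search measures above can be computed as in this small sketch. The function name and the item IDs are made up for illustration, and returning 0.0 for an empty set is a convention assumed here.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one search.

    retrieved: items the search returned
    relevant:  items that should have been returned (the correct result set)
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)  # results that are on target
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


precision, recall = precision_recall(["a", "b", "c", "d"], ["a", "b", "e"])
```

In this example two of the four results are on target (precision 0.5) and two of the three correct items were found (recall 2/3). Returning more results tends to raise recall and lower precision, which is why the desirable tradeoff is situational.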