Presentation on theme: "Toward a Data Repository for Evolutionary Biology: Toward a Data Repository for Evolutionary Biology: Jane Greenberg, Associate Professor, Director SILS/Metadata."— Presentation transcript:
Toward a Data Repository for Evolutionary Biology: Toward a Data Repository for Evolutionary Biology: Jane Greenberg, Associate Professor, Director SILS/Metadata Research Center, UNC-CH Jackson Dube, Visiting Scholar, SILS/MRC Ruth Monnig, Doctoral Research Assistant, SILS/MRC
Overview 1. Metadata defined 2. Role of metadata in a repository 3. Range of metadata standards – Principles and objectives – Domains – Architectural Layout 4. Issues 5. Discussion
Metadata Data about the content, quality, condition, and other characteristics of data (FGDC Glossary, 1992) Additional information necessary for data to be useful (Musik, 1997) Structured, descriptive information about a resource (DCMI Glossary; Weibel, 1995) *Resource = data = object = entity = document = data object
Metadata types and properties Metadata typeProperty, etc. *Resource/data discovery Title, subject ProvenanceCreator, source Terms and condition metadata (intellectual use) Access rights, manipulation rights. Structural metadata (technical use) Software and hardware needs *Resource = data = object = entity = document = data object
Range of published data objects Table, graph Dataset Research methods//procedures Agent/s: scientists/s, organizations Project Publication Related data objects –All these levels again and more…
Range of metadata standards Schemes (just a few…) LSID TEI Header; MARC bibliographic format, Dublin Core EAD FGDC/CSGSM; NBII EML DDI ODRL (Creative Commons Profile) A Core PREMISCharacteristics Objectives and principles Domains –Environment –Object type/format Architectural Layout –Extent –Level of Complexity Flat, hierarchical –Granularity
Metadata continuum LSID FGDC EML TEI Header, MARC; Dublin Core EAD DDI Draft – jed, after we meet, maybe you can make pretty!
Range of metadata standards Data structure standards Data communication standards Data value standards –Content representation, ontologies, authority files Data syntax standards Data models, architectures/packaging
The Knowledge Network for Biocomplexity (KNB) *http://knb.ecoinformatics.org//data.htmlhttp://knb.ecoinformatics.org//data.html
The Knowledge Network for Biocomplexity (KNB) *http://knb.ecoinformatics.org//data.htmlhttp://knb.ecoinformatics.org//data.html ontologies Data structures
Issues Cost –More metadata, more cost to produce –Less metadata, cost to users Metadata creation –Who, when, how? (Insuring quality, timely creation) –What applications are needed? Interoperability –What levels of interoperability do we need? With what systems? Preservation –How should the metadata/data be preserved and sustained Open access –How do we ensure open access to the metadata+?
Questions for discussion What level do you think metadata needs to be applied to facilitate data object discovery/use? What will help or invite users? What other issues come to mind?
Why metadata? Facilitate discovery of data objects Permit use – intellectual and technical Asset/object management and preservation Security Help advance the field of evolutionary biology
Range of published data objects Table, graph Dataset (supplementary data, entire data set) Research methods, procedures Coverage: Temporal and spatial aspects Agent/s: scientists/s, organizations Project Publication (journal volume, issue, pagination) Related data objects All these levels again and more… / Ruth/Jed, please consider altering/or another slide/s..