Download presentation
Presentation is loading. Please wait.
Published byAlbert Simon Modified over 7 years ago
1
OceanTeacher Global Academy Pilot Course Digital Asset Management 30 September/4 October, 2013 Kenya Marine and Fisheries Research Institute (KMFRI) OceanTeacher Regional Training Centre Mombasa, Kenya Mombasa, Kenya Data and Data Citation Linda Pikula NOAA Linda.pikula@noaa.gov
2
Is Data a digital asset? Class?
3
Statement on digital assets: “Unlike traditional analog objects such as books or photographs where the user has unmediated access to the content, a digital object always needs a software environment to render it”
4
Does that statement apply to data? Does data always need a software environment to render it?
5
“The data deluge is a reality in many fields. Scientific instruments are generating data at greater speed, densities and detail than before possible.” “Digital technologies are reshaping the practice of science” “Increases in computational capacity and capability drive more powerful modeling, simulation and analysis” There is a place for Data in the scholarly life cycle and a role for Librarians in this cycle. How will we define our role?
6
Digital Assets In the case of born-digital content (e.g., institutional archives, Web sites, electronic audio and video content, born-digital photography and art, research data sets, observational data), the enormous and growing quantity of content presents significant scaling issues to digital preservation efforts.
7
Data Management Data Access Data Preservation Data Rights Management
8
2.1 Main concepts related with data sharing, data publication, data citation and data metrics In this report the following concepts are used: “Data sharing” has been defined as the “voluntary provision of information from one individual or institution to another for purposes of legitimate research” (Fienberg et al., 1985) or simply “the release of research data for use by others” (Borgman, 2012). This general concept is grounded in the assumption that data are a valuable long-term resource and that sharing them and making them publicly-available is essential if their potential value is to be realized (Swan & Brown, 2008). Data sharing requires the systematic collection, curation and dissemination of data. “Data citations” have been defined as formal citations included in the reference list of published articles to data resources that led to a given research result (Mayernik, 2012). In this sense, the concept of data citation is tied to the idea that datasets should be published just as other kinds of scholarly products, being considered also as first class research outputs, both from social and funding policy perspectives (Lawrence, Jones, & Matthews, 2011). “Data publication”: The idea of publication of datasets mirrors the scientific publication model, although some criticisms have been also raised (Mayernik, 2012) as this model does not fully fit all the idiosyncrasies related with the sharing and publication of datasets. “Data metrics”: Data metrics are mainly related with data publication and data citation (but not exclusively, for example we could also potentially include ‘altmetrics’ on datasets here). Both data publication and data citation can be considered as signals of use of data. Use of data can generate new data, which may feed back into the collection phase (see Figure 1). Thus, for data metrics to build up, data sharing is a necessary prerequisite. Whether it will work the other way round (metrics leading to sharing) remains to be seen. In the rest of this report data sharing (i.e. collection, curation, dissemination) and data metrics (metrics on production and use) will be dealt with separately
10
Scholarly Information Cycle
12
The Scientific Communication Life- Cycle *Björk, B-C (2007): “A model of scientific communication as a global distributed information system”, Information Research, 12(2) paper 307
13
Scholarly Communication Cycle -Open Access - L. Lyon
14
Why Link Data? Class?
15
Scholarly Info Cycle for Data, Value Chain MethodLibrary Role Legitimization of Data Trust in DataPeer Review Registration of DataMetadata, preservation, curation Certification of Data Dissemination of DataAccess, preservation
17
Four categories of data Observational Computational Experimental Records
18
Incentives to share data Scholars concernsAgreements among research partners COLLABORATION RECOGNITION Reciprocity Coercion Open Science Publishers concernsEconomic Preservation, access, documentation Librarians concernsDocumentation, provenance, access, preservation Performance evaluations, bibliometrics “peer citing”
19
Incentives NOT to share data Rewarded for publication not data management Difficult and time consuming to document data for another's use subsequent to own use Competition for grant funding and recognition amongst scientists Keep control of intellectual propertyPublishers, scientists
20
Examples of efforts to include data along with scholarly journal articles SCOR/IODE/WHOI/MBL project AMS- American Meteorological Society publications Data Cite and Crosstalk registries NOAA catalog Elsevier - Pangea
21
Examples continued American Geophysical Union (AGU) and European Geophysical Union (EGU)publications Online Repositories_ WHOAS Woods Hole
22
Key issues for Data and e-science Issues for Librarians Discovery and Identification : What data exist? Where are the data and how can they be accessed? Access : Who has access? How will the privacy of both users and research subjects be protected? What kinds of rights management structures need to be established, if any? Interoperability : In what formats will data be stored and presented? What kinds of metadata will be applied? How will variables be described? What data models apply? Retention Criteria : Is the data likely to be reused? Will another researcher be able to reasonably replicate or build upon the original results using this data? What is the cost of metadata creation, and how does that compare to the expected value of the data to other researchers?
23
Definition of “Linked Data” In computing, linked data (often capitalized as Linked Data) describes a method of publishing structured data so that it can be interlinked and become more useful. It builds upon standard Web technologies such as Http, RDF and URLS, but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers. This enables data from different sources to be connected and queried. [1] [1] Tim Berners Lee, director of the World Wide Web Consortium, coined the term in a design note discussing issues around the Semantic Web project. [2] [2]
25
Persistent Identifiers
26
How to Link Data
27
Librarian’s Role in Data Management Roles: 1.Data management including collection, organization, description, curation, archiving, and dissemination-creating a plan 2. Creation of new data- and scholarship-based electronic resources for university and/or public use 3. Development of new models, standards, and architectures for various aspects of data management, description, etc. 4. Building accessible linkages between all the components and stages of research, from data to researchers to publications 5. Bridging institutional hierarchies and departmental divisions in service of interdisciplinary initiatives
29
Name of the course [date]x – x month, 201x [host organisation] IODE Project Office [place: city, country] Oostende, Belgium Data Citation Name of the trainer Trainer’s affiliation Email address Space for Trainer’s organisation logo, in case he/she wants/needs
30
Citation and Peer Review of Data Citation Metrics? Class Discussion Thomson Reuters Web of Science and Data Citation Databases
32
Digital Object Identifier (DOI) A digital object identifier (DOI) is a character string used to uniquely identify an object. Metadata about the object is stored in association with the DOI name. Libraries have been using for years, now de facto standard for data. http://dx.doi.org/10.1575/1912/5105 10 -DOI registry 1575 - DOI registry agent – CrossRef 1912 - publisher – MBLWHOI 5105 – “item” number
33
E-Repositories and Data How many of you have an E-Repository? OceanDocs? Other? Software used? For Your Information: DSpace Repository Accepts both text documents and datasets Accepts data related to articles as well as data not associated with a paper
34
Current Status of Linked Data How many of you currently access/link to data through your repositories? …through your online library catalog? …through your Data Divisions web pages?
35
Use the following slide on Lat/Long To discuss Bibliographies that have data links to geospatial data The slide shows metadata needed
36
Lat / Long
37
List of Most Common Metadata Fields
38
How to ‘mint’ a DOI Mint? Registries for DOI’s: CrossRef DATACite ESIP Parts of the DOI Explained
39
http://dx.doi.org/10.1575/1912/5105 10 -DOI registry 1575 - DOI registry agent – CrossRef 1912 - publisher – MBLWHOI 5105 – “item” number
40
NOAA Examples Cruise Videos Here? Historic International Climate Data (NOAA,IODE) NOAA Pilot Project DOI’s: Landing pages: http://www.ngdc.noaa.gov/docucomp/page?xml=NOAA/NESDIS/NGDC/Collec tion/iso/xml/Hazard_Images_Database.xml&view=iso2html http://www.ngdc.noaa.gov/docucomp/page?xml=NOAA/NESDIS/NGDC/Collec tion/iso/xml/Hazard_Images_Database.xml&view=iso2html
41
NOAA DOI Pilot Project Wiki-How to Assign a DOI https://geo-ide.noaa.gov/wiki/index.php?title=Data_Citation
46
TYPES OF DATA Oceanography Fisheries Atmospheric Sciences
49
Carbon Cycles Ocean Temperature, Color,Depth,Salinity Time Series Mixed layer surface currents Meridional heat transport Global Heat Storage Global Surface Currents Essential Climate Variables data(salinity chlorophyll, altimetry, surface wind and current) Wave Data Coastal Climatologies Data for use in Marine Spatial Planning and Decision support applications for climate, ecosystems and coastal planning Oceanographic Data
50
Fisheries Catch, abundance, sex, size Commercial Fisheries landings/exploitation recreational fisheries Stock assessments/abundance, species, habitat assessments, surveys at sea, recruitment Environmental- habitat, water quality, climate cycles International Organizations that collect or maintain Fisheries Statistics http://www.st.nmfs.noaa.gov/st1/International_National_Org anizations.html Fisheries Data
51
Air pressure and winds Near surface winds (ocean surface) Hurricane and storm data Other Atmospheric -Air/Sea Interaction
52
Other Examples Historic Oceanographic Cruises Marine Photo Libraries (Images) Geospatial Data- what is? Bibliographies online with Geospatial Data
53
Exercise 1 Create a DOI and Metadata for the following Publication: 2005 Carbon dioxide, hydrographic, and chemical data obtained during the R/V Maurice Ewing cruise in the Atlantic Ocean : (WOCE section A17, 4 January-21 March 1994) Online version in PDF format http://cdiac.ornl.gov/ftp/ndp084/ndp084.pdfhttp://cdiac.ornl.gov/ftp/ndp084/ndp084.pdf
54
Exercise 2 Create a Metadata record and DOI for the following digital asset 2013 Seafloor video footage and still-frame grabs from U.S. Geological Survey cruises in Hawaiian nearshore waters Online document in PDF and MOV (PURL) http://purl.fdlp.gov/GPO/gpo37015 Find a NOAA Video cataloged and have students do the above exercise
55
Exercise 3 On the web, find a publication in a marine science journal which shows links to data Copy the information on a slide, for class discussion
57
Data Citation http://wiki.esipfed.org/index.php/Interagency_Data_Stewardship/Citations/provider_guidelines Google search: esip data citation for more examples http://wiki.esipfed.org/index.php/Interagency_Data_Stewardship/Citations/provider_guidelines The core required elements of a citation are Author(s)--the people or organizations responsible for the intellectual work to develop the data set. The data creators. Release Date--when the particular version of the data set was first made available for use (and potential citation) by others. Title--the formal title of the data set Version--the precise version of the data used. Careful version tracking is critical to accurate citation. Archive and/or Distributor--the organization distributing or caring for the data, ideally over the long term. Locator/Identifier--this could be a URL but ideally it should be a persistent service, such as a DOI, Handle or ARK, that resolves to the current location of the data in question. Access Date and Time--because data can be dynamic and changeable in ways that are not always reflected in release dates and versions, it is important to indicate when on-line data were accessed. Additional fields can be added as necessary to credit other people and institutions, etc. Additionally, it is important to provide a scheme for users to indicate the precise subset of data that were used. This could be the temporal and spatial range of the data, the types of files used, a specific query id, or other ways of describing how the data were subsetted. An example citation: Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated 2003. CLPX-Ground: ISA snow depth transects and related measurements ver. 2.0. Edited by M. Parsons and M. J. Brodzik. National Snow and Ice Data Center. Data set accessed 2008-05-14 at http://dx.doi.org/10.5060/D4MW2F23zhttp://dx.doi.org/10.5060/D4MW2F23z
58
DataCite Metadata Version 3.0 now available July 2013 http://schema.datacite.org New features include: -Better support for recording data location -Discipline specific meta data fields to supplement the generic schema -Better documentation as a whole for DataCite
59
Cruises and Expeditions-Data Management from Library Perspective Example: NOAA Video Data Management
60
Video Data Management System (VDMS) Archives, Preserves and Provides Online Access to NOAA Digital Video and Image Data Anna Fiolek, Metadata Librarian National Oceanographic Data Center, NOAA Central Library Silver Spring, Maryland Project’s e-mail: vdms@noaa.govvdms@noaa.gov All images from: NOAA Photo Library at: http://www.photolib.noaa.gov/ http://www.photolib.noaa.gov/ NOAA
61
OER IPT/VDMS VDMS Objectives Provide timely online information about NOAA’s Office of Exploration and Research video data to the general public. Educate our Nation about NOAA oceanographic expeditions and underwater explorations through digital video and related information. Archive and preserve unique video and related data for future generations. Collaborate with NOAA librarians, data managers, and scientists from different NOAA offices and programs. Use or extend existing library tools, guidelines, and metadata standards to support new media formats: digital video, digital image, and digital text documents. Enhance data access and metadata sharing between NCL NOAALINC, NODC Ocean Archive System (OAS), and NCDDC MerMaid catalogs, Digital Atlas, CoRIS, and ASFA.
62
Example of Data about a research Cruise Available through the online catalog NOAALinc http://oceanexplorer.noaa.gov/ 5-12 Signature Explorations per year. 1-12 Summary Explorations per year. VDMS archives and provide either online or off-line access to over 3000 OER digital video tapes or DVD discs, and over 300 video highlights and video clips. *Sustainable Seas Expeditions (SSE) 1999-2003, 18 missions (13 of them to NMS areas)
63
Access through a Web Landing Page
64
Access through Library Catalog NOAA Library Catalog: 2009 Bermuda Caves 2009 (Collection) http://oceanexplorer.noaa.gov/explorations/ 09bermuda/
66
CoRIS metadata searches include Library metadata via Z39.50 protocol NOAA Photo Library
67
Two Methods of Digital Research Data Management shown Access through online Catalog and specialized topic catalog Access through Internet web page (Landing Page) to data archive
68
Cruise Data Received and Distributed by OE/NCDDC Original/Copy Digital Video Tapes for Long- Term Archiving and Preservation NOAA Central Library Catalog Online NODC Ocean Archive System Video Annotations Image Annotations Cruise Reports Quick Look ReportsSituational Reports Peer-Review Publications Educational Lesson Plans K-12 NGDC Archive System Original Raw CTD DATA for Archiving, Preservation, and Online Access Digital Video Highlights Digital Image Highlights Original Raw Multi-Beam Data for Preservation and Online Access Uncompressed Digital Video Data for Online Archiving and Preservation Digital Video HighlightsVideo Annotations Cruise Reports Quick Look ReportsSituational Reports Peer-Review Publications Digital Images Image Annotations Digital Image Highlights Web Sites and Related Home PagesEducational Lesson Plans Video Supporting Documents For Online Access Video Supporting Documents For Archiving, Preservation and Central Online Access OER DATA FLOW shows collaboration between NOAA Data Centers Digital Atlas MERMAid Catalog
70
Another Example of Data Management- Access
71
NOAA AOML This is the home page of the NOAA AOML Laboratory – showing types of data available at the laboratory and focus of research http://www.aoml.noaa.gov/
73
Software tools for data discovery, access, visualization and analysis Data Discovery and Access: Web Based User Interfaces Programmatic access interfaces Links to the literature EXAMPLES: GeoMapApp Virtual Ocean EarthChem Portals to complementary data in other repositories: ASP, EarthChem, USAP-DCC, GoogleEarth Publication E-Repositories
74
Other Common Digital Assets in Library? Photo collections online Weather records online Cruise reports- digitized? Other possible digital assets? Historical documents? Videos of cruises?
75
Not Data-But Example of Marine Digital Photo Library Management http://www.photolib.noaa.gov/
77
Sample Data Sources NOAA Coastal Water Temps http://www.nodc.noaa.gov/dsdt/cwtg/catl.html http://www.nodc.noaa.gov/dsdt/cwtg/catl.html World Sea Temperatures http://www.seatemperature.org/http://www.seatemperature.org/ BODC CTD and Underway Data https://www.bodc.ac.uk/data/online_delivery/amt/ https://www.bodc.ac.uk/data/online_delivery/amt/ Global Ocean Data http://www.coriolis.eu.org/Observing-the- ocean/Global-and-regional-views/Global-Oceanhttp://www.coriolis.eu.org/Observing-the- ocean/Global-and-regional-views/Global-Ocean Real Time Arctic Data http://www.arctic.noaa.gov/data.htmlhttp://www.arctic.noaa.gov/data.html BCO-DMO http://bcodmo.org/datahttp://bcodmo.org/data
78
Other Online Resources MANTRA: online course on Research Data Management http://datalib.edina.ac.uk/mantra/ SUNScholar/Digital Preservation http://wiki.lib.sun.ac.za/index.php/SUNScholar/Digital_Preservation/Electronic_Archives_Preservation_Policy/Preservable_Digital_Objects Data Curation Profiles Symposium mms://video1.itap.purdue.edu/DCPSymposium Data Curation Profiles Toolkit http://datacurationprofiles.org/purpose Data Curation Symposium http://docs.lib.purdue.edu/dcpsymposium/
79
Mantra
80
Sunscholar
81
Standards ISQ (NISO)Information Standards Quarterly Spring/Summer 2012 v.24 issue 2/3 : Linked Data for Libraries, Archives, and Museums
82
Credits Lisa Raymond –E-Repositories Presentation L.Pikula – OT Digital Asset Management, 2009 Ball, A. and Duke, M. (2012) “How to Cite Datasets and Link to Publications.” DCC How-to Guides. Edinburgh: Digital Curation Centre. Available online: Http://www.dcc.ack.uk/resources/how-guides Http://www.dcc.ack.uk/resources/how-guides Fiolek, Anna. Video Data Management System VDMS, NOAA. Cycle of Scholarly Information, Washington and Lee University Libraries NOAA Central Library Website
83
Credits Lyons, Scholarly information cycle
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.