OceanTeacher Global Academy Pilot Course Digital Asset Management 30 September/4 October, 2013 Kenya Marine and Fisheries Research Institute (KMFRI) OceanTeacher.


1 OceanTeacher Global Academy Pilot Course Digital Asset Management 30 September/4 October, 2013 Kenya Marine and Fisheries Research Institute (KMFRI) OceanTeacher Regional Training Centre Mombasa, Kenya Data and Data Citation Linda Pikula NOAA Linda.pikula@noaa.gov

2 Is Data a digital asset? Class?

3 Statement on digital assets: “Unlike traditional analog objects such as books or photographs where the user has unmediated access to the content, a digital object always needs a software environment to render it”

4 Does that statement apply to data? Does data always need a software environment to render it?

5 “The data deluge is a reality in many fields. Scientific instruments are generating data at greater speed, densities and detail than before possible.” “Digital technologies are reshaping the practice of science” “Increases in computational capacity and capability drive more powerful modeling, simulation and analysis” There is a place for Data in the scholarly life cycle and a role for Librarians in this cycle. How will we define our role?

6 Digital Assets In the case of born-digital content (e.g., institutional archives, Web sites, electronic audio and video content, born-digital photography and art, research data sets, observational data), the enormous and growing quantity of content presents significant scaling issues to digital preservation efforts.

7 Data Management, Data Access, Data Preservation, Data Rights Management

8 2.1 Main concepts related with data sharing, data publication, data citation and data metrics In this report the following concepts are used: “Data sharing” has been defined as the “voluntary provision of information from one individual or institution to another for purposes of legitimate research” (Fienberg et al., 1985) or simply “the release of research data for use by others” (Borgman, 2012). This general concept is grounded in the assumption that data are a valuable long-term resource and that sharing them and making them publicly-available is essential if their potential value is to be realized (Swan & Brown, 2008). Data sharing requires the systematic collection, curation and dissemination of data. “Data citations” have been defined as formal citations included in the reference list of published articles to data resources that led to a given research result (Mayernik, 2012). In this sense, the concept of data citation is tied to the idea that datasets should be published just as other kinds of scholarly products, being considered also as first class research outputs, both from social and funding policy perspectives (Lawrence, Jones, & Matthews, 2011). “Data publication”: The idea of publication of datasets mirrors the scientific publication model, although some criticisms have been also raised (Mayernik, 2012) as this model does not fully fit all the idiosyncrasies related with the sharing and publication of datasets. “Data metrics”: Data metrics are mainly related with data publication and data citation (but not exclusively, for example we could also potentially include ‘altmetrics’ on datasets here). Both data publication and data citation can be considered as signals of use of data. Use of data can generate new data, which may feed back into the collection phase (see Figure 1). Thus, for data metrics to build up, data sharing is a necessary prerequisite. Whether it will work the other way round (metrics leading to sharing) remains to be seen. 
In the rest of this report, data sharing (i.e. collection, curation, dissemination) and data metrics (metrics on production and use) will be dealt with separately.

9

10 Scholarly Information Cycle

11

12 The Scientific Communication Life-Cycle *Björk, B-C (2007): “A model of scientific communication as a global distributed information system”, Information Research, 12(2) paper 307

13 Scholarly Communication Cycle -Open Access - L. Lyon

14 Why Link Data? Class?

15 Scholarly Info Cycle for Data (table columns: Value Chain, Method, Library Role). Legitimization of Data: Trust in Data, Peer Review. Registration of Data: Metadata, preservation, curation. Certification of Data. Dissemination of Data: Access, preservation.

16

17 Four categories of data: Observational, Computational, Experimental, Records

18 Incentives to share data. Scholars' concerns: agreements among research partners, collaboration, recognition, reciprocity, coercion, open science. Publishers' concerns: economic; preservation, access, documentation. Librarians' concerns: documentation, provenance, access, preservation. Performance evaluations, bibliometrics, “peer citing”.

19 Incentives NOT to share data: Scientists are rewarded for publication, not data management. It is difficult and time-consuming to document data for another's use after one's own use. Competition for grant funding and recognition amongst scientists. Keeping control of intellectual property (publishers, scientists).

20 Examples of efforts to include data along with scholarly journal articles: SCOR/IODE/WHOI/MBL project; AMS (American Meteorological Society) publications; DataCite and CrossRef registries; NOAA catalog; Elsevier and PANGAEA

21 Examples continued: American Geophysical Union (AGU) and European Geosciences Union (EGU) publications; Online Repositories: WHOAS (Woods Hole Open Access Server), Woods Hole

22 Key issues for Data and e-science: Issues for Librarians
Discovery and Identification: What data exist? Where are the data and how can they be accessed?
Access: Who has access? How will the privacy of both users and research subjects be protected? What kinds of rights management structures need to be established, if any?
Interoperability: In what formats will data be stored and presented? What kinds of metadata will be applied? How will variables be described? What data models apply?
Retention Criteria: Is the data likely to be reused? Will another researcher be able to reasonably replicate or build upon the original results using this data? What is the cost of metadata creation, and how does that compare to the expected value of the data to other researchers?

23 Definition of “Linked Data” In computing, linked data (often capitalized as Linked Data) describes a method of publishing structured data so that it can be interlinked and become more useful. It builds upon standard Web technologies such as HTTP, RDF and URIs, but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers. This enables data from different sources to be connected and queried. [1] Tim Berners-Lee, director of the World Wide Web Consortium, coined the term in a design note discussing issues around the Semantic Web project. [2]
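A minimal sketch of the idea in this definition, using plain Python tuples instead of a real RDF library. Every URI, predicate name, and title below is a hypothetical placeholder chosen for illustration. It shows how statements published by two separate sources can be merged and queried once they refer to the same identifier.

```python
# Toy "triple store": each statement is (subject, predicate, object).
# All URIs and values are hypothetical examples, not real identifiers.
LIBRARY_CATALOG = [
    ("http://example.org/article/42", "title", "Cruise report A17"),
    ("http://example.org/article/42", "isSupportedBy", "http://example.org/dataset/7"),
]
DATA_CENTER = [
    ("http://example.org/dataset/7", "title", "WOCE A17 hydrographic data"),
    ("http://example.org/dataset/7", "format", "CSV"),
]


def match(triples, subject=None, predicate=None, obj=None):
    """Return triples matching the pattern; None acts as a wildcard."""
    return [
        (s, p, o)
        for (s, p, o) in triples
        if (subject is None or s == subject)
        and (predicate is None or p == predicate)
        and (obj is None or o == obj)
    ]


def datasets_for(article_uri, graph):
    """Follow isSupportedBy links from an article and collect dataset titles."""
    titles = []
    for _, _, dataset_uri in match(graph, article_uri, "isSupportedBy"):
        for _, _, title in match(graph, dataset_uri, "title"):
            titles.append(title)
    return titles


# Because both sources use the same dataset URI, merging is just concatenation.
merged = LIBRARY_CATALOG + DATA_CENTER
print(datasets_for("http://example.org/article/42", merged))
```

The shared URI is what does the linking here: neither source needs to know about the other, only to name the dataset the same way.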

24

25 Persistent Identifiers

26 How to Link Data

27 Librarian’s Role in Data Management
Roles:
1. Data management including collection, organization, description, curation, archiving, and dissemination; creating a plan
2. Creation of new data- and scholarship-based electronic resources for university and/or public use
3. Development of new models, standards, and architectures for various aspects of data management, description, etc.
4. Building accessible linkages between all the components and stages of research, from data to researchers to publications
5. Bridging institutional hierarchies and departmental divisions in service of interdisciplinary initiatives

28

29 Data Citation

30 Citation and Peer Review of Data Citation Metrics? Class Discussion Thomson Reuters Web of Science and Data Citation Databases

31

32 Digital Object Identifier (DOI) A digital object identifier (DOI) is a character string used to uniquely identify an object. Metadata about the object is stored in association with the DOI name. Libraries have been using DOIs for years, and they are now the de facto standard for data. Example: http://dx.doi.org/10.1575/1912/5105, where 10 = DOI registry; 1575 = DOI registry agent (CrossRef); 1912 = publisher (MBLWHOI); 5105 = “item” number.
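As a rough illustration of how a catalog or repository script might handle the DOI on this slide, the sketch below strips a resolver URL or "doi:" label down to the bare DOI name and rebuilds the resolver link. The regular expression is a loose, assumed syntax check, not an official validation rule, and the function names are invented for this example.

```python
import re

# Loose, illustrative pattern: "10", a dot, a numeric registrant code,
# a slash, then a non-empty suffix. This is an assumption, not a formal rule.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


def normalize_doi(text):
    """Strip a resolver URL or 'doi:' label and return the bare DOI name."""
    value = text.strip()
    value = re.sub(r"^https?://(dx\.)?doi\.org/", "", value, flags=re.IGNORECASE)
    value = re.sub(r"^doi:\s*", "", value, flags=re.IGNORECASE)
    if not DOI_PATTERN.match(value):
        raise ValueError(f"not a DOI name: {text!r}")
    return value


def resolver_url(doi):
    """Build a resolver link in the same form the slide uses."""
    return "http://dx.doi.org/" + normalize_doi(doi)


print(normalize_doi("http://dx.doi.org/10.1575/1912/5105"))
print(resolver_url("doi:10.1575/1912/5105"))
```

Storing the bare DOI name and generating the link on display keeps records stable if the resolver address changes.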

33 E-Repositories and Data How many of you have an E-Repository? OceanDocs? Other? Software used? For Your Information: DSpace Repository Accepts both text documents and datasets Accepts data related to articles as well as data not associated with a paper

34 Current Status of Linked Data How many of you currently access/link to data through your repositories? …through your online library catalog? …through your data division's web pages?

35 Use the following slide on Lat/Long to discuss bibliographies that have data links to geospatial data. The slide shows the metadata needed.

36 Lat / Long

37 List of Most Common Metadata Fields

38 How to ‘mint’ a DOI. Mint? Registries for DOIs: CrossRef, DataCite, ESIP. Parts of the DOI explained.

39 http://dx.doi.org/10.1575/1912/5105 where 10 = DOI registry; 1575 = DOI registry agent (CrossRef); 1912 = publisher (MBLWHOI); 5105 = “item” number
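The breakdown on this slide can be sketched in a few lines of Python. The example assumes the usual DOI layout, in which everything before the first slash is the prefix ("10." plus a registrant code) and everything after it is a suffix chosen by the registrant. Reading the suffix as "publisher/item" follows the slide's interpretation of this particular DOI and should not be assumed for DOIs in general. The function and key names are made up for illustration.

```python
def split_doi(doi):
    """Split a DOI name into prefix parts and suffix parts."""
    prefix, _, suffix = doi.partition("/")
    directory, _, registrant = prefix.partition(".")
    if directory != "10" or not registrant or not suffix:
        raise ValueError(f"unexpected DOI form: {doi!r}")
    return {
        "directory": directory,             # "10" on the slide
        "registrant": registrant,           # "1575" on the slide
        "prefix": prefix,                   # "10.1575"
        "suffix": suffix,                   # everything after the first slash
        "suffix_parts": suffix.split("/"),  # slide reads these as publisher, item
    }


parts = split_doi("10.1575/1912/5105")
for name, value in parts.items():
    print(name, "=", value)
```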

40 NOAA Examples Cruise Videos Here? Historic International Climate Data (NOAA, IODE) NOAA Pilot Project DOIs: Landing pages: http://www.ngdc.noaa.gov/docucomp/page?xml=NOAA/NESDIS/NGDC/Collection/iso/xml/Hazard_Images_Database.xml&view=iso2html

41 NOAA DOI Pilot Project Wiki: How to Assign a DOI https://geo-ide.noaa.gov/wiki/index.php?title=Data_Citation

42

43

44

45

46 TYPES OF DATA: Oceanography, Fisheries, Atmospheric Sciences

47

48

49 Oceanographic Data: Carbon cycles; ocean temperature, color, depth, salinity; time series; mixed layer surface currents; meridional heat transport; global heat storage; global surface currents; Essential Climate Variables data (salinity, chlorophyll, altimetry, surface wind and current); wave data; coastal climatologies; data for use in marine spatial planning and decision support applications for climate, ecosystems and coastal planning

50 Fisheries Data: Fisheries catch, abundance, sex, size; commercial fisheries landings/exploitation; recreational fisheries; stock assessments/abundance, species, habitat assessments, surveys at sea, recruitment; environmental: habitat, water quality, climate cycles. International Organizations that collect or maintain Fisheries Statistics: http://www.st.nmfs.noaa.gov/st1/International_National_Organizations.html

51 Air pressure and winds Near surface winds (ocean surface) Hurricane and storm data Other Atmospheric -Air/Sea Interaction

52 Other Examples: Historic oceanographic cruises; marine photo libraries (images); geospatial data (what is it?); bibliographies online with geospatial data

53 Exercise 1 Create a DOI and Metadata for the following Publication: 2005 Carbon dioxide, hydrographic, and chemical data obtained during the R/V Maurice Ewing cruise in the Atlantic Ocean : (WOCE section A17, 4 January-21 March 1994) Online version in PDF format http://cdiac.ornl.gov/ftp/ndp084/ndp084.pdf

54 Exercise 2 Create a Metadata record and DOI for the following digital asset 2013 Seafloor video footage and still-frame grabs from U.S. Geological Survey cruises in Hawaiian nearshore waters Online document in PDF and MOV (PURL) http://purl.fdlp.gov/GPO/gpo37015 Find a NOAA Video cataloged and have students do the above exercise

55 Exercise 3 On the web, find a publication in a marine science journal which shows links to data Copy the information on a slide, for class discussion

56

57 Data Citation
Source: http://wiki.esipfed.org/index.php/Interagency_Data_Stewardship/Citations/provider_guidelines (Google search "esip data citation" for more examples)
The core required elements of a citation are:
Author(s): the people or organizations responsible for the intellectual work to develop the data set; the data creators.
Release Date: when the particular version of the data set was first made available for use (and potential citation) by others.
Title: the formal title of the data set.
Version: the precise version of the data used. Careful version tracking is critical to accurate citation.
Archive and/or Distributor: the organization distributing or caring for the data, ideally over the long term.
Locator/Identifier: this could be a URL but ideally it should be a persistent service, such as a DOI, Handle or ARK, that resolves to the current location of the data in question.
Access Date and Time: because data can be dynamic and changeable in ways that are not always reflected in release dates and versions, it is important to indicate when on-line data were accessed.
Additional fields can be added as necessary to credit other people and institutions, etc. Additionally, it is important to provide a scheme for users to indicate the precise subset of data that were used. This could be the temporal and spatial range of the data, the types of files used, a specific query id, or other ways of describing how the data were subsetted.
An example citation: Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated 2003. CLPX-Ground: ISA snow depth transects and related measurements ver. 2.0. Edited by M. Parsons and M. J. Brodzik. National Snow and Ice Data Center. Data set accessed 2008-05-14 at http://dx.doi.org/10.5060/D4MW2F23z
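The core elements listed on this slide map naturally onto a small formatting routine. The sketch below is one possible way to assemble them into a citation string in the order used by the slide's example. The function name, parameter names, and the choice to treat editors as an optional extra field are assumptions made for illustration, not part of the ESIP guidelines.

```python
def format_data_citation(authors, release_date, title, version, archive,
                         locator, access_date, editors=None):
    """Assemble a data citation from the core elements on the slide."""
    required = {
        "authors": authors, "release_date": release_date, "title": title,
        "version": version, "archive": archive, "locator": locator,
        "access_date": access_date,
    }
    missing = [name for name, value in required.items() if not value]
    if missing:
        raise ValueError("missing core elements: " + ", ".join(missing))

    parts = [f"{authors}.", f"{release_date}.", f"{title} ver. {version}."]
    if editors:  # optional additional field crediting other people
        parts.append(f"Edited by {editors}.")
    parts.append(f"{archive}.")
    parts.append(f"Data set accessed {access_date} at {locator}")
    return " ".join(parts)


# Values taken from the example citation on the slide.
citation = format_data_citation(
    authors="Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston",
    release_date="2002, Updated 2003",
    title="CLPX-Ground: ISA snow depth transects and related measurements",
    version="2.0",
    archive="National Snow and Ice Data Center",
    locator="http://dx.doi.org/10.5060/D4MW2F23z",
    access_date="2008-05-14",
    editors="M. Parsons and M. J. Brodzik",
)
print(citation)
```

Checking for missing elements before formatting is a design choice: a citation without a version or access date would still print, but it would no longer identify exactly which data were used.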

58 DataCite Metadata Schema Version 3.0, available since July 2013 http://schema.datacite.org New features include: -Better support for recording data location -Discipline-specific metadata fields to supplement the generic schema -Better documentation as a whole for DataCite
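To make the idea of a DataCite record concrete, here is a hedged sketch that builds a minimal XML record with Python's standard library. It assumes the schema's mandatory properties are identifier, creator, title, publisher, and publication year, and that version 3 records use the namespace shown. Both points should be checked against the schema documentation at the URL on the slide before any real use. The sample values come from the example citation in the Data Citation slide, and the function name is invented.

```python
import xml.etree.ElementTree as ET

# Assumed namespace for version 3 of the schema; verify before real use.
DATACITE_NS = "http://datacite.org/schema/kernel-3"


def datacite_record(doi, creators, title, publisher, year):
    """Build a minimal DataCite-style XML record as a string."""
    ET.register_namespace("", DATACITE_NS)

    def tag(name):
        return f"{{{DATACITE_NS}}}{name}"

    resource = ET.Element(tag("resource"))
    identifier = ET.SubElement(resource, tag("identifier"), identifierType="DOI")
    identifier.text = doi
    creators_el = ET.SubElement(resource, tag("creators"))
    for name in creators:
        creator = ET.SubElement(creators_el, tag("creator"))
        ET.SubElement(creator, tag("creatorName")).text = name
    titles = ET.SubElement(resource, tag("titles"))
    ET.SubElement(titles, tag("title")).text = title
    ET.SubElement(resource, tag("publisher")).text = publisher
    ET.SubElement(resource, tag("publicationYear")).text = str(year)
    return ET.tostring(resource, encoding="unicode")


xml_text = datacite_record(
    doi="10.5060/D4MW2F23z",
    creators=["Cline, D.", "Armstrong, R.", "Davis, R.", "Elder, K.", "Liston, G."],
    title="CLPX-Ground: ISA snow depth transects and related measurements",
    publisher="National Snow and Ice Data Center",
    year=2002,
)
print(xml_text)
```

A record like this is what a registry would store alongside the DOI; optional properties such as version, dates, or geolocation would be added as further elements.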

59 Cruises and Expeditions-Data Management from Library Perspective Example: NOAA Video Data Management

60 Video Data Management System (VDMS) Archives, Preserves and Provides Online Access to NOAA Digital Video and Image Data. Anna Fiolek, Metadata Librarian, National Oceanographic Data Center, NOAA Central Library, Silver Spring, Maryland. Project’s e-mail: vdms@noaa.gov All images from: NOAA Photo Library at: http://www.photolib.noaa.gov/

61 OER IPT/VDMS: VDMS Objectives
Provide timely online information about video data from NOAA’s Office of Ocean Exploration and Research (OER) to the general public.
Educate our Nation about NOAA oceanographic expeditions and underwater explorations through digital video and related information.
Archive and preserve unique video and related data for future generations.
Collaborate with NOAA librarians, data managers, and scientists from different NOAA offices and programs.
Use or extend existing library tools, guidelines, and metadata standards to support new media formats: digital video, digital image, and digital text documents.
Enhance data access and metadata sharing between NCL NOAALINC, NODC Ocean Archive System (OAS), and NCDDC MerMaid catalogs, Digital Atlas, CoRIS, and ASFA.

62 Example of Data about a research Cruise Available through the online catalog NOAALinc http://oceanexplorer.noaa.gov/ 5-12 Signature Explorations per year. 1-12 Summary Explorations per year. VDMS archives and provides either online or off-line access to over 3000 OER digital video tapes or DVD discs, and over 300 video highlights and video clips. *Sustainable Seas Expeditions (SSE) 1999-2003, 18 missions (13 of them to NMS areas)

63 Access through a Web Landing Page

64 Access through Library Catalog NOAA Library Catalog: 2009 Bermuda Caves 2009 (Collection) http://oceanexplorer.noaa.gov/explorations/09bermuda/

65

66 CoRIS metadata searches include Library metadata via Z39.50 protocol NOAA Photo Library

67 Two Methods of Digital Research Data Management shown: (1) access through an online catalog and a specialized topic catalog; (2) access through an Internet web page (landing page) to the data archive

68 OER DATA FLOW (diagram) shows collaboration between NOAA Data Centers. Cruise Data Received and Distributed by OE/NCDDC. Diagram labels: Original/Copy Digital Video Tapes for Long-Term Archiving and Preservation; NOAA Central Library Catalog Online; NODC Ocean Archive System; Video Annotations; Image Annotations; Cruise Reports; Quick Look Reports; Situational Reports; Peer-Review Publications; Educational Lesson Plans K-12; NGDC Archive System; Original Raw CTD Data for Archiving, Preservation, and Online Access; Digital Video Highlights; Digital Image Highlights; Original Raw Multi-Beam Data for Preservation and Online Access; Uncompressed Digital Video Data for Online Archiving and Preservation; Digital Video Highlights; Video Annotations; Cruise Reports; Quick Look Reports; Situational Reports; Peer-Review Publications; Digital Images; Image Annotations; Digital Image Highlights; Web Sites and Related Home Pages; Educational Lesson Plans; Video Supporting Documents for Online Access; Video Supporting Documents for Archiving, Preservation and Central Online Access; Digital Atlas; MERMAid Catalog

69

70 Another Example of Data Management- Access

71 NOAA AOML This is the home page of the NOAA AOML Laboratory – showing types of data available at the laboratory and focus of research http://www.aoml.noaa.gov/

72

73 Software tools for data discovery, access, visualization and analysis Data Discovery and Access: Web Based User Interfaces Programmatic access interfaces Links to the literature EXAMPLES: GeoMapApp Virtual Ocean EarthChem Portals to complementary data in other repositories: ASP, EarthChem, USAP-DCC, GoogleEarth Publication E-Repositories

74 Other Common Digital Assets in Library? Photo collections online Weather records online Cruise reports- digitized? Other possible digital assets? Historical documents? Videos of cruises?

75 Not Data-But Example of Marine Digital Photo Library Management http://www.photolib.noaa.gov/

76

77 Sample Data Sources
NOAA Coastal Water Temps http://www.nodc.noaa.gov/dsdt/cwtg/catl.html
World Sea Temperatures http://www.seatemperature.org/
BODC CTD and Underway Data https://www.bodc.ac.uk/data/online_delivery/amt/
Global Ocean Data http://www.coriolis.eu.org/Observing-the-ocean/Global-and-regional-views/Global-Ocean
Real Time Arctic Data http://www.arctic.noaa.gov/data.html
BCO-DMO http://bcodmo.org/data

78 Other Online Resources MANTRA: online course on Research Data Management http://datalib.edina.ac.uk/mantra/ SUNScholar/Digital Preservation http://wiki.lib.sun.ac.za/index.php/SUNScholar/Digital_Preservation/Electronic_Archives_Preservation_Policy/Preservable_Digital_Objects Data Curation Profiles Symposium mms://video1.itap.purdue.edu/DCPSymposium Data Curation Profiles Toolkit http://datacurationprofiles.org/purpose Data Curation Symposium http://docs.lib.purdue.edu/dcpsymposium/

79 Mantra

80 Sunscholar

81 Standards ISQ (NISO) Information Standards Quarterly, Spring/Summer 2012, v.24 issue 2/3: Linked Data for Libraries, Archives, and Museums

82 Credits
Lisa Raymond: E-Repositories Presentation
L. Pikula: OT Digital Asset Management, 2009
Ball, A. and Duke, M. (2012) “How to Cite Datasets and Link to Publications.” DCC How-to Guides. Edinburgh: Digital Curation Centre. Available online: http://www.dcc.ac.uk/resources/how-guides
Fiolek, Anna. Video Data Management System (VDMS), NOAA.
Cycle of Scholarly Information, Washington and Lee University Libraries
NOAA Central Library Website

83 Credits Lyon, L. Scholarly information cycle

