Presentation is loading. Please wait.

Presentation is loading. Please wait.

Mark Servilla & Duane Costa LTER Network Office LTER 2012 All Scientist Meeting LTER Network Office.

Similar presentations


Presentation on theme: "Mark Servilla & Duane Costa LTER Network Office LTER 2012 All Scientist Meeting LTER Network Office."— Presentation transcript:

1 Mark Servilla & Duane Costa LTER Network Office LTER 2012 All Scientist Meeting LTER Network Office

2 Why LTER Data Co-op? A Diamond in the Rough Demonstrations How can I contribute data? How do I find data? How can I see who is using my data? How is Network synthesis enabled? How is provenance captured? Where do we go from here? Panel Discussion

3 LTER Network Office Its about community A cooperative … is an autonomous association of persons who voluntarily cooperate for their mutual social, economic, and cultural benefit. - Wikipedia Producers – LTER sites Middleware - PASTA Consumers – Science Community

4 LTER Network Office

5 Data producers can evaluate their data package prior to harvesting into PASTA Data packages are discovered via browsing and/or search tools Derived data may be generated when a data package insert or update event occurs Provenance metadata can be generated for derived data packages Data package use information is viewed by a contributor LTER Network Office

6 LTER Network Data Portal portal.lternet.edu

7 PASTA Web Service API

8 Subcomponent of the Data Package Manager component in PASTA Generates a quality report for each data package A quality report contains a set of quality checks Stored as XML but usually rendered in HTML for human readability 27 quality checks implemented in the NIS prototype (of 52 proposed by EML Metrics Working Group) Available to the greater ecoinformatics community via the Data Manager Library (ecoinformatics.org) LTER Network Office

9 An individual metric or a best practice May involve looking at: metadata (independent of data), or data (independent of metadata), or congruency between metadata and data Can result in one of four statuses valid info warn error LTER Network Office

10 Users can evaluate data packages before inserting them into PASTA An error status reported by any quality check blocks insertion of the data package into PASTA Every data package stored in PASTA has a quality report that can be accessed along with its metadata and data LTER Network Office

11 Data Package Quality Report

12 Evaluate Runs quality checks on the data package but doesnt insert it into PASTA May reveal more diagnostic information (as compared to harvest) because it doesnt necessarily halt after encountering the first error Harvest Runs quality checks on the data package; if no errors are discovered, inserts (or updates) the data package into PASTA May reveal less diagnostic information (as compared to evaluate) because it may halt as soon as an error is encountered Bottom line: Always evaluate before harvesting! LTER Network Office

13 EML is version 2.1.0 or beyond Document is schema-valid EML Document is EML parser-valid All entity-level data URLs are live The packageId pattern matches scope.identifier.revision There are no duplicate entity names An entity-level URL which is not set to information returns data Data table does not have more fields than metadata attributes Data table does not have fewer fields than metadata attributes Database table can be created from EML metadata Field delimiter in metadata is a single character Document is schema-valid after dereferencing enumeratedDomain codes are unique (not yet implemented) LTER Network Office

14 Data can be loaded into the database Length of entityName is not excessive A methods element is present Record delimiter is present in metadata Data examined and possible record delimiters returned Number of records in metadata matches number of rows loaded At least one keyword element is present Dataset title length is at 5 least words Dataset abstract element is a minimum of 20 words...others not yet implemented LTER Network Office

15 Display downloaded data Display first insert row coverage element is present temporalCoverage element is present geographicCoverage element is present taxonomicCoverage element is present...others not yet implemented LTER Network Office

16

17 Data producers can evaluate their data package prior to harvesting into PASTA Data packages are discovered via browsing and/or search tools Derived data may be generated when a data package insert or update event occurs Provenance metadata can be generated for derived data packages Data package use information is viewed by a contributor LTER Network Office

18 North Inlet Meteorological – Air Temperature Yearly aggregation of data Down-sample Hourly to Daily and Monthly LTER Network Office 1982 1983 1984 1983 1984 1992 … … 1. 2. 3. 11.

19 LTER Network Office PASTA NIN Workflow NIN Workflow Source Data

20 LTER Network Office PASTA NIN Workflow NIN Workflow Notify

21 LTER Network Office PASTA NIN Workflow NIN Workflow Request Data

22 LTER Network Office PASTA NIN Workflow NIN Workflow Source Data

23 LTER Network Office PASTA

24 LTER Network Office PASTA NIN Workflow NIN Workflow Derived Data

25 Subscribe to a Data Package event

26 LTER Network Office

27 Source Data Package Derived Data Package Workflow Description

28 Provenance Metadata

29

30 LTER Network Office

31

32

33 LTER Network Office

34

35 December 2012 Support DOI assignment to metadata and data objects Refine NIS Data Portal Complete metadata rendering Improve catalog browsing Hang out shingle Summer 2013 Standup DataONE member node

36


Download ppt "Mark Servilla & Duane Costa LTER Network Office LTER 2012 All Scientist Meeting LTER Network Office."

Similar presentations


Ads by Google