Presentation is loading. Please wait.

Presentation is loading. Please wait.

Using Specimen Data in Scientific Workflow Environments to Connect to Metadata Archive and Discovery Services in Environmental Biology CJ Grady, J.H. Beach,

Similar presentations


Presentation on theme: "Using Specimen Data in Scientific Workflow Environments to Connect to Metadata Archive and Discovery Services in Environmental Biology CJ Grady, J.H. Beach,"— Presentation transcript:

1 Using Specimen Data in Scientific Workflow Environments to Connect to Metadata Archive and Discovery Services in Environmental Biology CJ Grady, J.H. Beach, A. Stewart, J. Cavner University of Kansas Biodiversity Institute

2 Geospatial Metadata Describes – What it is – What it looks like – Who assembled it – When it was collected – Etc

3 EML Ecological Metadata Language – XML Schema – Open Source – Community Driven – Describes ecological data Occurrence Data Climate Layers Species Ranges

4 Narratives Transformation of metadata into a story that is appropriate for the intended audience Same metadata can be used to create narrative for: – Scientists – Undergrads – K-12 students

5 Narrative Example

6 DataONE – Distributed system for: Queries Data replication – Initially supports EML

7 Study of Experiment Reproducibility Ellison, Aaron Repeatability and transparency in ecological research. Ecology 90.

8 There is a Solution!

9 Process Metadata Data about the process used Descriptive and prescriptive Documents process used to generate data / metadata – Quality control – SDM experiments – RAD experiments

10 Capabilities Reproducibility – Actions are documented Transparency – Experiments can be evaluated and validated Publishing – Metadata can be published along with results

11 What we have done EML for all of our Species Distribution Modeling services Simple process metadata – Documents how an experiment is ran through our cluster including what versions of software – Also describes what web services would be called to execute the experiment again

12 What we have done Clients – Python library – VisTrails – QGIS

13 What we are doing Publishing EML to a repository Client extensions Extending process metadata – HTTP message – XPath

14 Process Metadata Extensions HTTP Message – Documents any web resource call over HTTP XPath processing

15

16

17 What we will do next Use standard APIs to communicate with DataONE Continue to search for standard process metadata and include it whenever possible Contribute process metadata extensions back to the community Add additional conditional analysis elements to the schema (JSON, etc)

18 Reproducibility Simple process metadata EML process metadata extensions Lifemapper client EML reader

19 Transparency EML for all service objects Descriptive process metadata

20 Publishing Aid Client access to public data / metadata catalogs Publish buttons

21 Lessons Learned Had success starting with narrow, specific, process steps and generalizing them – Calls to our web services expanding to any HTTP call Easy to get carried away with all of the possibilities

22 Lifemapper funded by: U.S. National Science Foundation NSF EPSCoR NSF EPSCoR EHR/DRL BIO/DBI OCI/CI-TEAM


Download ppt "Using Specimen Data in Scientific Workflow Environments to Connect to Metadata Archive and Discovery Services in Environmental Biology CJ Grady, J.H. Beach,"

Similar presentations


Ads by Google