The CEINT Database Sandra Karcher Carnegie Mellon University / CEE To the Nanotechnology Working Group on September.

Slides:



Advertisements
Similar presentations
Statistical basics Marian Scott Dept of Statistics, University of Glasgow August 2008.
Advertisements

Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
Water Analysis of the University of Cincinnati, Clermont Campus Bridget Jennings, Brittany Miller, Sara Neel, Kristopher Thomas, Tricia Wright June 6,
Managing Sensor Infrastructure: A Sensor Extension for ODM2 Amber Spackman Jones, Jeffery S. Horsburgh, Juan Caraballo, Maurier Ramírez Utah Water Research.
Wetland Soils Phosphorus Criteria Development V.D. Nair*, M.W. Clark, K.R. Reddy, and S.G. Haile Soil and Water Science Department, University of Florida,
Modeling of Soil Nutrients – An Introduction to Logical Spreadsheeting Russell Yost Department of Tropical Plant and Soil Science, University of Hawai`i.
Graphic Vision of Environment Threat in New Orleans Area after Katrina Student: Ke Yang Mentors: Dr. Wendy Zhang, Dr. Ju Chou COMPUTER SCIENCE, COLLEGE.
16 months…. The Visibility Information Exchange Web System is a database system and set of online tools originally designed to support the Regional Haze.
An Overview of Today’s Class
PAGE # 1 Presented by Stacey Hancock Advised by Scott Urquhart Colorado State University Developing Learning Materials for Surface Water Monitoring.
Lecture 10 Comparison and Evaluation of Alternative System Designs.
QA/QC FOR ENVIRONMENTAL MEASUREMENT Unit 4: Module 13, Lecture 2.
Distributed Data Analysis & Dissemination System (D-DADS) Prepared by Stefan Falke Rudolf Husar Bret Schichtel June 2000.
What is in Aarhus’ backyard Dr. Sándor Fülöp, Csaba Sándor Environmental Management and Law Association Comissioned by UNEP/GRID Arendal Dr. Sándor Fülöp,
Material Flow Analysis
Annex I: Methods & Tools prepared by some members of the ICH Q9 EWG for example only; not an official policy/guidance July 2006, slide 1 ICH Q9 QUALITY.
DeAnn Huinker & Kevin McLeod University of Wisconsin-Milwaukee Designing High Quality Professional Development Knowledge, Management, & Dissemination Conference.
An EPA Sponsored Literature Review Database to Support Stressor Identification Benjamin Jessup Cathy Cresswell Tetra Tech, Inc. Patricia Shaw-Allen, Ph.D.
Nanomaterial Registry: Minimal Information Standards for Well-Characterized Nanomaterials in Environmental and Biological Studies Michele L. Ostraat April.
Economic Cooperation Organization Training Course on “Drought and Desertification” Alanya Facilities, Antalya, TURKEY presented by Ertan TURGU from Turkish.
WARE RET 2014 –Rain Garden Construction-C. Leon King High School Nicole Richards-Works 1, Ryan Locicero 2, Dr. Maya Trotz 2 1. C. Leon King High School;
The NCIP Nanotechnology Working Group Nano WG Fall 2013 Kick-Off.
A unifying model of cation binding by humic substances Class: Advanced Environmental Chemistry (II) Presented by: Chun-Pao Su (Robert) Date: 2/9/1999.
1 Data Integration Community of Practice Meeting September 15, 2009 Science Data Integration.
Preserving the Scientific Record: Preserving a Record of Environmental Change Matthew Mayernik National Center for Atmospheric Research Version 1.0 [Review.
University of Florida Florida State University
1 / 9 ASTM D19 Method Validation Procedures William Lipps Analytical & Measuring Instrument Division July, 2015.
Water Quality Data, Maps, and Graphs Over the Web · Chemical concentrations in water, sediment, and aquatic organism tissues.
Laboratory QA/QC An Overview.
EPA’s Bioassessment Performance and Comparability “Guidance”
Michael Witt Interdisciplinary Research Librarian & Assistant Professor Purdue Libraries & Distributed Data Curation Center (D2C2) Eliciting.
10/10/2012ISC239 Isabelle Bichindaritz1 Physical Database Design.
Numerical Methods Continuous Fourier Series Part: Continuous Fourier Series
Critical Loads and Target Loads: Tools for Assessing, Evaluating and Protecting Natural Resources Ellen Porter Deborah Potter, Ph.D. National Park Service.
Presented by: Ashgan Fararooy Referenced Papers and Related Work on:
- 1 - HDF5, HDF-EOS and Geospatial Data Archives HDF and HDF-EOS Workshop VII September 24, 2003.
IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).
What is Snapshot Day? An estuary wide sampling event for students, teachers and educators. What makes it so special? Same Day Sampling - A single Fall.
BASINS 2.0 and The Trinity River Basin By Jóna Finndís Jónsdóttir.
IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).
Children’s Health Exposure Analysis Resource (CHEAR) CHEAR Center for Data Science Susan Teitelbaum, PhD November 4, 2015.
HDF and HDF-EOS Workshop VII September 24, 2003 HDF5, HDF-EOS and Geospatial Data Archives Don Keefer Illinois State Geological Survey Mike Folk Univ.
Virtual Experiment © Oregon State University Models as a communication tool for HJA scientists Kellie Vache and Jeff McDonnell Dept of Forest Engineering.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Preliminary Scoping Effort. Presentation Objectives Identify need for additional sources of future funding Provide background on how elements were identified.
Linking Data with Action Part 1: Seven Steps of Using Information for Decision Making.
Supported in part by the National Science Foundation under Grant No. HRD Any opinions, findings, and conclusions or recommendations expressed.
Robert W. Pinder, Alice B. Gilliland, Robert C. Gilliam, K. Wyat Appel Atmospheric Modeling Division, NOAA Air Resources Laboratory, in partnership with.
1 1 EPA Nanotechnology Research Program – LCA Considerations Jeff Morris National Program Director for Nanotechnology 5 November 2009.
CIMA and Semantic Interoperability for Networked Instruments and Sensors Donald F. (Rick) McMullen Pervasive Technology Labs at Indiana University
Numerical Methods Multidimensional Gradient Methods in Optimization- Example
Statistical Concepts Basic Principles An Overview of Today’s Class What: Inductive inference on characterizing a population Why : How will doing this allow.
Investigating Mass Gain and Mass Loss Power point Power point to accompany Carbon Teaching Experiment Written by: Jonathon Schramm A, Eric Keeling B, Dijanna.
Table 3. Merits and Demerits of Selected Water Quality Indices Shweta Tyagi et al. Water Quality Assessment in Terms of Water Quality Index. American Journal.
Center of Excellence for Oceans and Human Health at the Hollings Marine Laboratory Metadata Development in Support of the Oceans and Human Health Tidal.
Delivery of Science Components to NRCS Business Applications
Pb Solids Precipitated Under Laboratory Conditions
Computational Reasoning in High School Science and Math
Outdoor Water Sediment Study – Adding Effects of Sunlight to Aquatic System Exposure Assessment Cecilia Mucha Hirata (DuPont Crop Protection, Newark DE,
Data Management: The Data Repatriation Re-integration Step or …
A (prototype) Shiny app for QCing continuous stream sensor data
Title of Poster Site Visit 2017 Introduction Results
People Who Did the Study Universities they are affiliated with
What is Data? Information, in the form of facts or figures obtained from experiments or surveys, used as a basis for making calculations or drawing.
Using Soil Moisture and Matric Potential Observations to Identify Subsurface Convergent Flow Pathways Qing Zhu, Henry Lin, and Xiaobo Zhou Dept . Crop.
Introduction to Systems Analysis and Design Stefano Moshi Memorial University College System Analysis & Design BIT
Bird of Feather Session
Title of Poster Site Visit 2018 Introduction Results
The role of metadata in census data dissemination
This material is based upon work supported by the National Science Foundation under Grant #XXXXXX. Any opinions, findings, and conclusions or recommendations.
Presentation transcript:

The CEINT Database Sandra Karcher Carnegie Mellon University / CEE To the Nanotechnology Working Group on September 4, 2014 with Christine Hendren and Yuan Tian 1

Goal: Database and Tools 2 Database Data Analysis Data Visualization Risk Assessment

Our Design Fundamentals 1)Structure for storage 2)Protocols for populating 3)Key fields for querying 3 database software, such as MySQL or Access, requires the use of a specific structure can be challenging - requires planning and thorough documentation to facilitate consistency very challenging - requires understanding of the interconnections between all the individual pieces of information

Target Information  Nanomaterial characterization (intrinsic and extrinsic)  Nanomaterial meta-data (synthesis methods and protocols, characterization protocols)  System characterization (relevant information to describe the nanomaterial surroundings throughout the duration of the experimental process, including experimental, environmental, and biological system parameters)  System meta-data (synthesis or growth protocols, characterization protocols)  Nanomaterial dosing information  Experimental methods  Experimental results (raw and derived or calculated)  Associated laboratory and/or field quality control information  Modeled results (internal to a specific research group and/or spanning across research groups) 4

Design Concepts 5 an Experiment occurs at a Location in a System of Matrix (Matrices) Sample(s) are collected, something (nanomaterial, plant, fish, etc.) is added, and/or something is measured At specified time points: following a set of Methods resulting in a Parameter measurement and/or description

What can we do with it? 6

Example – Two Mesocosms  Reference: Long-Term Transformation and Fate of Manufactured Ag Nanoparticles in a Simulated Large Scale Freshwater Emergent Wetland (Lowry et al, Environ. Sci. Technol. 2012, 46, 7027−7036)  One mesocosm dosed with AgNPs in soil, one dosed in water  ~ 6 measurements above 0 of surface water interface – measured dissolved oxygen  18 samples (3 x 6 grid) at 4 depth intervals – determined mass of silver in sediment/soil 7 Surface Water Interface (set at 0)

Find the Average - 1  What is the average dissolved oxygen concentration, found in both mesocosms, just above the surface water interface? 1)We find the relevant records (just above the surface water interface, with the parameter of dissolved oxygen). 8 SsdmGrpTypeSsdmSBDepSsdmSEDepDepUnitSsdmMxSsdmMxSpcparLabelparValueparUnit Mes millimeterwatersurface water interfacedissolved oxygen157.8micromolar Mes millimeterwatersurface water interfacedissolved oxygen185.2micromolar Mes millimeterwatersurface water interfacedissolved oxygen202.1micromolar Mes millimeterwatersurface water interfacedissolved oxygen200.5micromolar Mes millimeterwatersurface water interfacedissolved oxygen216.1micromolar Mes millimeterwatersurface water interfacedissolved oxygen221.8micromolar Mes millimeterwatersurface water interfacedissolved oxygen208.4micromolar Mes millimeterwatersurface water interfacedissolved oxygen202.8micromolar Mes millimeterwatersurface water interfacedissolved oxygen203.2micromolar Mes millimeterwatersurface water interfacedissolved oxygen191.8micromolar

Find the Average - 2 2) Then we compute an average. 9 SsdmMxSsdmMxSpcparLabelAvgOfparValueparUnit watersurface water interfacedissolved oxygen199.0micromolar Simple Query We can also build in unit conversions (using 1 micromole of oxygen = mg/L) ParVal_in_mg_per_L 6.4

Associate Lab QC  What does the Lab QC tell us about potential bias in our samples? 1) Find the Lab QC records 2) Find samples associated with the Lab QC 3) Compute an adjusted sample result based on the reported recovery in the Lab QC 10

Associate Lab QC SampleField NameMesocosm 1Mesocosm 2Mesocosm 1Mesocosm 2 sampleResManGrpLowry Lab at Duke bothExpGrpGVL2012Long-Term sampleExpPurGrpNano Material Partitioning sampleSrcDataGVL2012Long-Term sampleExpTypemesocosm sampleSsdmGrpTypeMes01Mes02Mes01Mes02 sampleSsdmLocTypeExp01Exp02Exp01Exp02 sampleSsdmXcoord sampleSsdmYcoord sampleSsdmSBDep sampleSsdmSEDep sampleSsdmColMthcore sampleSsdmMxsediment sampleAnayMthSo_ICPMS-002 sampleparValue sampleparUnitmilligram per kilogram labQCSsdmMxsoil labQCExpTypelabQC AnayMthSo_ICPMS-002 labQCSsdmGrpTypeSRM labQCSsdmLocTypeLqc01 Lqc01R labQCparValue labQCparUnitpercent recovered labQCparLabelsilver LabQC sample Replicate LabQC sample Find the Lab QC samples

Associate Lab QC SampleField NameMesocosm 1Mesocosm 2Mesocosm 1Mesocosm 2 sampleResManGrpLowry Lab at Duke sampleExpGrpGVL2012Long-Term sampleExpPurGrpNano Material Partitioning sampleSrcDataGVL2012Long-Term sampleExpTypemesocosm sampleSsdmGrpTypeMes01Mes02Mes01Mes02 sampleSsdmLocTypeExp01Exp02Exp01Exp02 sampleSsdmXcoord sampleSsdmYcoord sampleSsdmSBDep sampleSsdmSEDep sampleSsdmColMthcore sampleSsdmMxsediment sampleAnayMthSo_ICPMS-002 sampleparValue sampleparUnitmilligram per kilogram labQCSsdmMxsoil labQCExpTypelabQC AnayMthSo_ICPMS-002 labQCSsdmGrpTypeSRM labQCSsdmLocTypeLqc01 Lqc01R labQCparValue labQCparUnitpercent recovered sampleparLabelsilver Find the samples associated with the Lab QC (use key field that indicates experimental group)

Associate Lab QC SampleField NameMesocosm 1Mesocosm 2Mesocosm 1Mesocosm 2 sampleResManGrpLowry Lab at Duke bothExpGrpGVL2012Long-Term sampleExpPurGrpNano Material Partitioning sampleSrcDataGVL2012Long-Term sampleExpTypemesocosm sampleSsdmGrpTypeMes01Mes02Mes01Mes02 sampleSsdmLocTypeExp01Exp02Exp01Exp02 sampleSsdmXcoord sampleSsdmYcoord sampleSsdmSBDep sampleSsdmSEDep sampleSsdmColMthcore sampleSsdmMxsediment sampleAnayMthSo_ICPMS-002 sampleparValue sampleparUnitmilligram per kilogram labQCSsdmMxsoil labQCExpTypelabQC AnayMthSo_ICPMS-002 labQCSsdmGrpTypeSRM labQCSsdmLocTypeLqc01 Lqc01R labQCparValue labQCparUnitpercent recovered bothparLabelsilver Both samples are each associated with two lab QC results (since the lab QC was performed in replicate). The average recovery of the two lab replicates is 80.3% The adjusted value of silver in mesocosm 1 = 3.7 (2.99/0.803) And the adjusted value of silver in mesocosm 2 = 7.2 (5.75/0.803) (could adjust the measured results, or perhaps use the recoveries to determine an upper an lower bound)

Example: Two Mesocosm Study Location and orientation of mesocosms estimated using Google Maps. Locations imported into GIS. Mass of Ag in sediment also imported into GIS. We zoom in and…. Visualize Results in GIS 14

Notice mass is in mg and the same gradient (shade of grey) is used at all sample depths (0-1, 1-2, 2-4, 4-22 cm). This mesocosm was dosed in the soil compartment This mesocosm was dosed in the water compartment We see 18 core sample locations in each mesocosm. 15 soil side water side

16 The diameter of the circles increases with depth. The darker the shade of grey, the more silver mass found in the sediment. soil side water side Focusing on the water dosed mesocosm

Still Considering  What other visualizations do we want to perform?  When do we need raw data? What level of aggregation would be acceptable?  Do we have all the appropriate grouping fields we need?  Do we need more “query friendly” fields in some parts of our database?  How can we best engage researchers during the curation process? 17

Acknowledgements CEINT Data Integration Team Duke: Christine Hendren, Yuan Tian, Lichen He, Mark Wiesner Carnegie Mellon: Jeanne VanBriesen, Greg Lowry, and Sandra Karcher This material is based upon work supported by the National Science Foundation (NSF) and the Environmental Protection Agency (EPA) under NSF Cooperative Agreements EF and DBI , Center for the Environmental Implications of NanoTechnology (CEINT). Any opinions, findings, conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF or the EPA. This work has not been subjected to EPA review and no official endorsement should be inferred. 18