Semantically Enhanced Model Experiment Evaluation Process (SeMEEP) within the Atmospheric Chemistry Community Chris Martin 1,2, Mo Haji 2, Peter Dew 2,

Slides:



Advertisements
Similar presentations
S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge b. a School of Chemistry, University of Southampton, UK.; b School of Electronics.
Advertisements

Who are the Experts?Simon KampaSlide 1 Who are the Experts? Simon Kampa IAM Group University of Southampton
Copyright 1999 all rights reserved Why Conduct a Task Analysis? n Uncovered Information may lead to abandoning a bad idea –The world is ready for video.
Geographical Investigation Friday 29 th April 2009 Deadline.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
SWIM WEB PORTAL by Dipti Aswath SWIM Meeting ORNL Oct 15-17, 2007.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
The KB on its way to Web 2.0 Lower the barrier for users to remix the output of services. Theo van Veen, ELAG 2006, April 26.
1 genSpace: Community- Driven Knowledge Sharing for Biological Scientists Gail Kaiser’s Programming Systems Lab Columbia University Computer Science.
Final Year Project COMP39X COMP390/3/4/5 Final Year Project Design Darek Kowalski
Introduction and Overview “the grid” – a proposed distributed computing infrastructure for advanced science and engineering. Purpose: grid concept is motivated.
GenSpace: Exploring Social Networking Metaphors for Knowledge Sharing and Scientific Collaborative Work Chris Murphy, Swapneel Sheth, Gail Kaiser, Lauren.
Architecture for Pattern- Base Management Systems Manolis TerrovitisPanos Vassiliadis National Technical Univ. of Athens, Dept. of Electrical and Computer.
Integrating data sources on the World-Wide Web Ramon Lawrence and Ken Barker U. of Manitoba, U. of Calgary
GenSpace: Exploring Social Networking Metaphors for Scientific Collaborative Work Gail Kaiser, Swapneel Sheth, Chris Murphy {kaiser, swapneel, cmurphy}
Methodology Conceptual Database Design
Software Engineer Report What should contains the report?!
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Research Data Management Philip Tarrant Global Institute of Sustainability.
Provenance in my Grid Jun Zhao School of Computer Science The University of Manchester, U.K. 21 October, 2004.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Final Year Project COMP39X COMP390/3/4/5 Final Year Project Design Irina Biktasheva
Cyber-Infrastructure for Agro-Threats Steve Goddard Computer Science & Engineering University of Nebraska-Lincoln.
Unit 2: Engineering Design Process
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Chapter 8: Systems analysis and design
Towards an activity-oriented and context-aware collaborative working environments Presented by: Ince T Wangsa Supervised by:
The Integration of Peer-to-peer and the Grid to Support Scientific Collaboration Tran Vu Pham, Lydia MS Lau & Peter M Dew {tranp, llau &
The Information Environment for Neuroscientists David R Newman
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
Miguel Branco CERN/University of Southampton Enabling provenance on large-scale e-Science applications.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
EVALUATING PAPERS KMS quality- Impact on Competitive Advantage Proceedings of the 41 st Hawaii International Conference on System Sciences
Secure Systems Research Group - FAU Classifying security patterns E.B.Fernandez, H. Washizaki, N. Yoshioka, A. Kubo.
Domain Modeling In FREMA David Millard Yvonne Howard Hugh Davis Gary Wills Lester Gilbert Learning Societies Lab University of Southampton, UK.
Peter Bajcsy, Rob Kooper, Luigi Marini, Barbara Minsker and Jim Myers National Center for Supercomputing Applications (NCSA) University of Illinois at.
The Brain Project – Building Research Background Part of JISC Virtual Research Environments (Phase 3) Programme Based at Coventry University with Leeds.
DAME: A Distributed Diagnostics Environment for Maintenance Duncan Russell University of Leeds.
Professional Certificate in Electoral Processes Understanding and Demonstrating Assessment Criteria Facilitator: Tony Cash.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Streamflow - Programming Model for Data Streaming in Scientific Workflows Chathura Herath.
Data Grid Research Group Dept. of Computer Science and Engineering The Ohio State University Columbus, Ohio 43210, USA David Chiu & Gagan Agrawal Enabling.
COMM89 Knowledge-Based Systems Engineering Lecture 8 Life-cycles and Methodologies
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Chapter 6 CASE Tools Software Engineering Chapter 6-- CASE TOOLS
Electronic labnotes Mari Wigham COMMIT/. Information WUR  Organising, sharing, finding and reusing data  Expertise in: ● Modelling data.
OVERVIEW OF RESEARCH METHODS Dr. Kimaro. INTRODUCTION Why Research; Increase body of knowledge Explain social phenomena Test theoretical hypothesis Explain.
Community Data Evaluation using a Semantically Enhanced Modelling Process Mohammed Haji 1, Peter Dew 1, Chris Martin 1,2.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
Centre for Strategic Manufacturing Assessing the impact of e-business on the business processes of manufacturing companies: A simulation approach Pável.
Module 4: Systems Development Chapter 13: Investigation and Analysis.
Making a great Project 2 OCR 1994/2360. Implementation This is about how you make your system. It should have enough detail for a competent user to be.
The Final Report.  Once scientists arrive at conclusions, they need to communicate their findings to others.  In most cases, scientists report the results.
Knowledge Support for Modeling and Simulation Michal Ševčenko Czech Technical University in Prague.
June 3-6, 2003E-Society Lisbon Automatic Metadata Discovery from Non-cooperative Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science.
Prepare for the Possibility: Response and Recovery Steve Goddard Computer Science & Engineering University of Nebraska-Lincoln.
A Collaborative e-Science Architecture towards a Virtual Research Environment Tran Vu Pham 1, Dr. Lydia MS Lau 1, Prof. Peter M Dew 2 & Prof. Michael J.
ETRIKS Harmonization System October Fabien Richard CNRS, EISBM 1.
MScSED Thesis Workshop: Introduction June 12 th, 2015.
Making a great Project 2 OCR 1994/2360. Testing The point of testing is to make sure that it does what it set out to do Test to see that the user requirements.
Research Methods Dr. X.
Designing for people CPSC 481: HCI I.
CS 641 – Requirements Engineering
CS 641 – Requirements Engineering
Gail Kaiser, Swapneel Sheth, Chris Murphy
Chapter 2 – Software Processes
Chapter 2: Designing your project
Three Uses for a Technology Roadmap
Presentation transcript:

Semantically Enhanced Model Experiment Evaluation Process (SeMEEP) within the Atmospheric Chemistry Community Chris Martin 1,2, Mo Haji 2, Peter Dew 2, Peter Jimack 2, Mike Pilling 1 1 School of Chemistry, University of Leeds 2 School of Computing, University of Leeds

2 Outline of the Presentation Introduction Atmospheric community SeMEEP ELN Provenance capture Conclusion and next stage

3 Section 1 Overview Application domain – atmospheric community –Reliance on computational models to evaluate data Motivation –Study how to transition from today's ad-hoc processes practises –Sustainable process of Gathering, community evaluation and sharing data & models between scientists Minimising changes to proven working practises of the scientist Within world-wide co-laboratories

4 Related projects CombeChem –Experimental organic chemistry –From source to long term data –perseveration (knowledge) –Semantically-enabled ELN –Data-driven workflow Collaboratory for Multi-Scale Chemical Science –Multi-layer chemical model myGrid –Bio-informatics and related areas (semantic pattern matching –Reusable semantic workflow using SMD (semantic metadata) –Data Quality Karama2 –Weather forecasting – computation modelling –Data-driven workflow Quantum Thermo Kinetic Mechanism Reacting Flow Chemistry Chemistry Simulation

5 Section 2 Atmospheric Chemistry Seeks to understand the chemical processes (reactions) taking place in the lower atmosphere (e.g. smoke) It has significant implication for both: –Air Quality –Climate Change

6 The Master Chemical Mechanism (MCM) Data repository of elementary chemical reactions & rate constants The mechanism is described by a computational model that is evaluated against experimental data –Chamber experiments –Field experiments

7 Section 3 SeMEEP Today –Typically within the atmospheric chemistry community the provenance is recorded in an ad-hoc, unstructured fashion, using a combination of traditional lab-book, word processing documents and spreadsheet. Move to more sustainable evaluation process supports the gathering, evaluation and sharing of data and models Using semantic metadata

8 SeMEEP Vision SeMEEP semantically-enabled MEEP –Supports the organisation of information but critically, records its provenance (say to recover secondary data) Mike Pilling : “SeMEEP approach will radically enhance the effectiveness of a research community to deliver new science“

9 Requirements with metadata for elementary chemical reactions

10 Requirements for metadata capture for elementary reactions Only published data Rate constants from several labs No access to the raw data No access to secondary data SeMEEP will provide this.

11 Current Evaluation Processes for the MCM

12 Envisioned Evaluation Processes

13 Section 4 Electronic Lab-Books (ELNs) ELNs address the limitations of the current methods of provenance capture. Southampton ELN for organic chemistry experiments. Benefits to the modeller Modelling process can be automatically captured Searchable Remote access is possible Provenance is structured Possible to use resolvable references to resources

14 Will User attach quality metadata? Motivate users: –By demonstrating the value of provenance in their day-to-day work Writing publication Managing their data Reinterpretting the data. –Management –Publishers

15 Recoding provenance for modelling 3-level process –Experimental modelling plan –Modelling iterations –Modelling layer where provenance is capture has the modelling process proceeds using data-driven workflows where data is a first class object

16 The Modelling Process - A Three Layer Mapping

17 MCM Mechanism being investigated

18 ELN Process

19 ELN Screenshots Prompts displayed when changing the changing the chemical mechanism; Editing a reaction Adding a new reaction

20 ELN Screenshots

21 ELN Modelling SMD Architecture

22 Evaluation Methodology In-depth interviews with members of the atmospheric chemistry model group here at Leeds, covering: –Demonstration of the prototype –User testing of the prototype –Discussion of scenarios involving the use of the prototype (e.g. ) Analysis –Interviews recorded and transcribed –Analysed using techniques from grounded theory

23 Evaluation Barriers to adoption: –Effort required at modelling time for provenance capture “[in] your lab book you can write down what ever you want [but with an ELN] it is going to take time to go through the different protocol steps”. –When asked if they would use an ELN requiring a similar amount of user input to the prototype the response was positive: “Yeah, I think it would be a good thing. I don’t think it is too much extra … work.” –Rather than viewing the prompts for user annotation as interruption to their normal work the user recognised the value of being prompted “is a good way to do it because otherwise you won’t [record the provenance].”

24 Evaluation Users intuitively grasped the benefits of recording provenance with an ELN and that the benefits would be realised after the time of modelling by a number of stakeholders: –“if someone else wants to look at … [your provenance], that’s great because the person can see exactly what you have done, where you have been and where to go next. And for yourself, if you are writing up a PhD... [you can] … see exactly what you’ve done whereas currently you have to rifle through lab-books to see exactly what you have done.”

25 Section 5 Conclusions and future work Outlined SeMEEP and ELN –User evaluated proposed modelling ELN Addressed case studies –IUPAC –MCM Developing a case study with the Geomagnetic community User and System issues –Application of actively theory to capture requirements and user evaluation –Querying and inference –Address QoS issues (e.g. security, scalabilty, dynamic roles-based access control)

26 Questions