Recording application executions enriched with domain semantics of computations and data Master of Science Thesis Michał Pelczar Krakow, 30.9.2008.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
UTPB: A Benchmark for Scientific Workflow Provenance Storage and Querying Systems Artem Chebotko Joint work with E. De Hoyos, C. Gomez, A. Kashlev, X.
Provenance in Open Distributed Information Systems Syed Imran Jami PhD Candidate FAST-NU.
WS-VLAM Introduction presentation WS-VLAM Semantic tools Systems, Networking, and Engineering group Institute of informatics University of Amsterdam.
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Dagstuhl, February 16, 2009 Layers in Grids Uwe Schwiegelshohn 17. Februar 2009 Layers in Grids.
Semantic Web Research: Visual Modelling of OWL-S Services Computer Science Annual Workshop September 2004 Charlie Abela, James Scicluna Department of Computer.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Chapter 1 Overview of Databases and Transaction Processing.
February Semantion Privately owned, founded in 2000 First commercial implementation of OASIS ebXML Registry and Repository.
June Amsterdam A Workflow Bus for e-Science Applications Dr Zhiming Zhao Faculty of Science, University of Amsterdam VL-e SP 2.5.
January, 23, 2006 Ilkay Altintas
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
Information Integration Intelligence with TopBraid Suite SemTech, San Jose, Holger Knublauch
Environment for Management of Experiments on the Grid Master of Science Thesis AGH University of Science and Technology, Krakow, Poland Faculty of Electrical.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
revised CmpE 583 Fall 2006Discussion: OWL- 1 CmpE 583- Web Semantics: Theory and Practice DISCUSSION: OWL Atilla ELÇİ Computer Engineering.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
04.10.’04 updated ’06CmpE 583 Fall 2008Terminology- 1 CmpE 583- Web Semantics: Theory and Practice TERMINOLOGY Atilla ELÇİ Computer Engineering Department.
CGW 2003 Institute of Computer Science AGH Proposal of Adaptation of Legacy C/C++ Software to Grid Services Bartosz Baliś, Marian Bubak, Michał Węgiel,
1 CSE 2102 CSE 2102 Ph.D. Proposal A Process Framework For Ontology Modeling, Design, And Development Realized By Extending OWL and ODM Candidate: Rishi.
Polish Infrastructure for Supporting Computational Science in the European Research Space QoS provisioning for data-oriented applications in PL-Grid D.
Knowledge based Learning Experience Management on the Semantic Web Feng (Barry) TAO, Hugh Davis Learning Society Lab University of Southampton.
Deploying Trust Policies on the Semantic Web Brian Matthews and Theo Dimitrakos.
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
ANSTO E-Science workshop Romain Quilici University of Sydney CIMA CIMA Instrument Remote Control Instrument Remote Control Integration with GridSphere.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
Cracow Grid Workshop, October 27 – 29, 2003 Institute of Computer Science AGH Design of Distributed Grid Workflow Composition System Marian Bubak, Tomasz.
Aude Dufresne and Mohamed Rouatbi University of Montreal LICEF – CIRTA – MATI CANADA Learning Object Repositories Network (CRSNG) Ontologies, Applications.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
WORKS08, Austin, Texas, November 17th, 2008 Monitoring Infrastructure for Grid Scientific Workflows Institute of Computer Science and ACC CYFRONET AGH.
EC-project number: Universal Grid Client: Grid Operation Invoker Tomasz Bartyński 1, Marian Bubak 1,2 Tomasz Gubała 1,3, Maciej Malawski 1,2 1 Academic.
EC-project number: ViroLab Virtual Laboratory Marian Bubak ICS / CYFRONET AGH Krakow virolab.cyfronet.pl.
AKOGRIMO Integration of Grid services with mobile technologies; validation in e-health, e-learning and disaster management areas CoreGRID European Grid.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
The Knowledge Grid Methodology  Concepts, Principles and Practice Hai Zhuge China Knowledge Grid Research Group Chinese Academy of Sciences.
Semantic Publishing Benchmark Task Force Fourth TUC Meeting, Amsterdam, 03 April 2014.
Information Architecture The Open Group UDEF Project
Lessons learned from Semantic Wiki Jie Bao and Li Ding June 19, 2008.
1 A Medical Information Management System Using the Semantic Web Technology Networked Computing and Advanced INFORMATION MANAGEMENT, NCM '08. Fourth.
Télé-université Synthesis From Research to Practice Montreal, November 7, 2007 EFPC/CSPS.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
K-WfGrid: Grid Workflows with Knowledge Ladislav Hluchy II SAS, Slovakia.
Collection and storage of provenance data Jakub Wach Master of Science Thesis Faculty of Electrical Engineering, Automatics, Computer Science and Electronics.
Chapter 1 Overview of Databases and Transaction Processing.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Chapter 1 Characterization of Distributed Systems
DICE - Distributed Computing Environments Team
Middleware independent Information Service
Similarities between Grid-enabled Medical and Engineering Applications
Stanford Medical Informatics
knowledge organization for a food secure world
Knowledge Based Workflow Building Architecture
Data Provenance.
Business Process Management and Semantic Technologies
The ViroLab Virtual Laboratory for Viral Diseases
Complex Information Management Using a Framework Supported by ECA Rules in XML Presented By Essam Mansour.
Presentation transcript:

Recording application executions enriched with domain semantics of computations and data Master of Science Thesis Michał Pelczar Krakow,

Outline Background Objectives Provenance model Information building Feasibility study QUaTRO State of the art Research outline Publications

Background E-Science –Advanced computing technologies supporting scientists –Global collaboration in key areas of science Semantic Web provides data scalability –XML, RDF, RDFS, OWL –Ontology serves as taxonomy Grid computing provides computation scalability Virtual experiments influence scientific discoveries pace

Provenance metadata that pertains to the derivation history of a data product starting from its original sources the seven W’s: Who, What, Where, Why, When, Which, hoW Scientific results reproducibility Guarantee of data reliability and quality Regulatory mechanism of sensitive data protection Mean of e ffi ciency optimization

ViroLab Virtual laboratory for infectious diseases Prevention, diagnosis and treatment Medical science, computer science, healthcare

Objectives Design information model for provenance Design data model for monitoring system Adapt existing monitoring infrastructure to the provenance requirements Define ontology creation process –Ontology and data model independent –Manageable –Augmentable –Described semantically Design and implement component realizing the process Incorporate the component into system grid infrastructure Design and implement provenance querying component

Provenance model Experiment re-execution Data dependencies Results management Performance Resources availability Related with ontologies: –Data –Domain

Ontology extension Derivation concepts –XML –Delegates Aggregation rules Annotations –Classes –Properties

Information building OWL and XSD independent Manageable Events correlation Events aggregation Experiment transaction support Knowledge history tracking Association strategy

Proof of concept: Drug resistance case study Alignment Subtyping Drug ranking Different levels of semantics –Data –Computation

QUaTRO Abstract query language –Data representation and storage transparent –Understandable by non-IT specialist –Configurable by ontologies –Easy to integrate with GUI –Extendible

Query processing Provenance ontologies Mapping ontologies File systems Databases Operators

Summary Data model for operations and resources Ontologies for data, experiments and geno2drs scenario Monitoring infrastructure: remote logging, automatic generation of helpers Semantic Event Aggregator implemented and deployed as OneJAR application QUaTRO integrated into GridSphere portal

Future work QUaTRO extensions –Join operation –Provenance graph rendering –File system querying Model extensions –Performance recording –Data origin recording Explicit provenance recording –Domain ontologies generation –Partial results storage –Domain events publication

Publications B. Balis, M. Bubak, M. Pelczar, From Monitoring Data to Experiment Information – Monitoring of Grid Scientific Workflows. In G. Fox, K. Chiu, and R. Buyya, editors, Third IEEE International Conference on e-Science and Grid Computing, e-Science 2007, Bangalore, India, December 2007, pages IEEE Computer Society, B. Balis, M. Bubak, M. Pelczar, J. Wach, Provenance Tracking and Querying in ViroLab. In Cracow GridWorkshop 2007Workshop Proceedings, pp.71-76, ACC CYFRONET AGH B. Balis, M. Bubak, M. Pelczar, J. Wach, Provenance Querying for End-Users: A Drug Resistance Case Study. In: Bubak, M., Albada, G.D.v., Dongarra, J., Sloot, P.M.A. (Eds.), Proceedings ICCS 2008, Krakoland, June 23-25, 2008, LNCS 5103, pp , Springer 2008.

Detailed information ViroLab: VLvl: QUaTRO: Ontologies: