Health-e-Child: A Grid Platform for European Paediatrics On behalf of the Health-e-Child Consortium (& with thanks to David Manset Maat-G) CHEP07 3 rd.

Slides:



Advertisements
Similar presentations
ATLAS/LHCb GANGA DEVELOPMENT Introduction Requirements Architecture and design Interfacing to the Grid Ganga prototyping A. Soroko (Oxford), K. Harrison.
Advertisements

Data Management Expert Panel - WP2. WP2 Overview.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
NeuGRID PROJECT: A PRACTICAL EXAMPLE OF RESEARCH THROUGH THE GARR.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
The CrossGrid project Juha Alatalo Timo Koivusalo.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
The Health-e-Child Project & Platform Data Integration - Semantic and Syntactic Interoperability David Manset – MAAT-G March 5th, 2009 EGEE-UF/OGF25 Catania,
DESIGN OF A PLATFORM OF VIRTUAL SERVICE CONTAINERS FOR SERVICE ORIENTED CLOUD COMPUTING Carlos de Alfonso Andrés García Vicente Hernández.
© 2008 by M. Stümpert, A. Garcia; made available under the EPL v1.0 | Access the power of Grids with Eclipse Mathias Stümpert (Karlsruhe Institute.
New Task Group CRIS Architecture & Development Maximilian Stempfhuber RWTH Aachen University Library
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
Imaging Workspace An Overview and Roadmap Eliot L. Siegel, MD Imaging Workspace Lead SME January 23, 2008.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Computational grids and grids projects DSS,
A DΙgital Library Infrastructure on Grid EΝabled Technology ETICS Usage in DILIGENT Pedro Andrade
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Towards a Universal Client for Grid Monitoring Systems Towards a Universal Client for Grid Monitoring Systems Design and Implementation of the Ovid Browser.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
Health-e-Child Project Requirements EGEE 2006 – HealthGrid David Manset MAAT GKnowledge.
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Health-e-Child Konstantin Skaburskas Based on presentations made by HeC partners.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Project Overview Jörg Freund, Siemens David Manset, MaatGKnowledge HG09, 29-Jun-09, Berlin.
CEOS WGISS-21 CNES GRID related R&D activities Anne JEAN-ANTOINE PICCOLO CEOS WGISS-21 – Budapest – 2006, 8-12 May.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
Metadata Mòrag Burgon-Lyon University of Glasgow.
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
HeC workshop, EGEE07 5 October 2007, Budapest Health-e-Child: A Platform for European Paediatrics Tamás Hauer University of the West of England, Bristol.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
In Vivo Imaging Middleware and Applications RSNA 2007 Berkant Barla Cambazoglu The Ohio State University Department of Biomedical Informatics.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ignacio Blanquer Vicente Hernández Damià.
HeC PMT Meeting Tallinn 10 July 2008 Brainstorming session: what interest can there be in thinking about a possible new HeC 2 project to follow-up the.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
1 Health-e-Child project Konstantin Skaburskas CERN IT/GD University of Tartu, Estonia 16 March 2007, CERN.
INFSO-RI SA2 ETICS2 first Review Valerio Venturi INFN Bruxelles, 3 April 2009 Infrastructure Support.
The Health-e-Child Project EGEE 2006 – Industry Task Force David Manset MAAT GKnowledge.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
Imaging Workspace An Overview and Roadmap Eliot L. Siegel, MD Imaging Workspace Lead SME January 23, 2008.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
CSE 5810 Biomedical Informatics and Cloud Computing Zhitong Fei Computer Science & Engineering Department The University of Connecticut CSE5810: Introduction.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Grid Computing: Running your Jobs around the World
Health-e-Child: An Integrated Platform for European Paediatrics
Similarities between Grid-enabled Medical and Engineering Applications
GSAF Grid Storage Access Framework
GSAF Grid Storage Access Framework
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Data, Databases, and DBMSs
AMGA Web Interface Vincenzo Milazzo
The Anatomy and The Physiology of the Grid
Presentation transcript:

Health-e-Child: A Grid Platform for European Paediatrics On behalf of the Health-e-Child Consortium (& with thanks to David Manset Maat-G) CHEP07 3 rd September 2007, Victoria, CA Richard McClatchey, UWE-Bristol UK

Contents Project Objectives & Challenges Health-e-Child Gateway The HeC Grid architecture Data Integration & Ontologies Futures Conclusions

3 Motivation for the Project Clinical demand for integration and exploitation of heterogeneous biomedical information vertical dimension – multiple data sources horizontal dimension – multiple sites Need for generic and scalable platforms (Grid?) integrate traditional and emerging sources – in vivo and in vitro provide decision support ubiquitous access to knowledge repositories in clinical routine connect stakeholders in clinical research Need for complex integrated disease models build holistic views of the human body early disease detection exploiting in vitro information personalized diagnosis, therapy and follow-up

4 Objectives of Health-e-Child Build enabling tools & services that improve the quality of care and reduce cost with Integrated disease models Database-guided decision support systems Cross modality information fusion and data mining for knowledge discovery Establish multi-site, vertical, and longitudinal integration of data, information and knowledge Develop a GRID based platform, supported by robust search, optimisation and matching Healthy Child Decision Support Systems Integrated Disease Modeling Knowledge Discovery Augment Guidance Guidance Enrich Real-time alert On-line learning Observation Process Sensors Imaging Genomics Lab Data Proteomics Demographics Physician Notes Life Style Time Organ Tissue Cell Molecule Population Individual Vertical Data Integration Integrated Medical Database

5 Focus on Paediatric Diseases Three Paediatric Diseases with at least partly unknown cause, classification and/or treatment outcomes Heart diseases (Right Ventricular Overload, Cardiomyopathy) Inflammatory diseases (Juvenile Idiopathic Arthritis) Brain tumours (Gliomas) Many Clinical Departments Cardiology Rheumatology (Neuro-)Oncology Radiology Lab (Genetics, Proteomics) Administration, IT Main Modalities / Data Sources Imaging (MR, US/echocardiography, CT, x-ray) Clinical (Patient information, Lab results etc) Genetics & Proteomics

6 GOSH NECKER UWE CERN IGG SIEMENS ASPER UOA INRIA LYNKEUS UCL EGF FGG MAAT A Geographically Distributed Environment Introduction Clinical Site R&D Site

7 Applications Integration Challenge Introduction IGG NECKER GOSH Highlights Networks - Different Networks: LANs, WANs, Internet Security - Security Constraints: Local & National Regulations Bandwidth - Bandwidth Limitations: LAN/WAN & Internet uplinks … Highlights Networks - Different Networks: LANs, WANs, Internet Security - Security Constraints: Local & National Regulations Bandwidth - Bandwidth Limitations: LAN/WAN & Internet uplinks …

8 Contents Project Objectives & Challenges Health-e-Child Gateway The HeC Grid architecture Data Integration & Ontologies Futures Conclusions

9 HeC System Overview Grid Infrastructure databases, resource and user management, data security HeC Gateway HeC specific models and Grid services like query processing, security Heart Disease Applications Inflammatory Diseases Applications Brain Tumour Applications Common Client Applications user interface for authentication, viewing, editing, similarity search

10 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Services Our building blocks are Services Our Gateway is a SOA Existing blocks are available in the community WSRF Containers: Globus Toolkit4, Tomcat, WSRF::Lite… Data Access Layers: OGSA-DAI, AMGA… File Transfer facilities: gridFTP… Applies to other distributed systems Our approach is to efficiently reuse and combine blocks to satisfy our requirements Building blocks might be enhanced, i.e. HeC Authentication Service Data Management & Integration The Health-e-Child Gateway

11 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Gateway The HeC Gateway An intermediary access layer to decouple client applications from the complexity of the grid Towards a platform independent implementation To add domain specific functionality not available in Grid middleware Status √ SOA architecture and design √ implementation of privacy and security modules

12 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Our Approach Workstation Gateway User Hospital X Highlights Simplicity - Simplicity, “all-in-one box” Modularity & Scalability - Modularity & Scalability, “off-the-shelf” components State-of-the-art - State-of-the-art Approaches Highlights Simplicity - Simplicity, “all-in-one box” Modularity & Scalability - Modularity & Scalability, “off-the-shelf” components State-of-the-art - State-of-the-art Approaches One Access Point Per Institution One Key To enter the system

13 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Platform & Grid Middleware The Health-e-Child Access Point One Access Point Hosting Domain 1 Stable & Secure Environment Hosting Domain 2 Stable & Secure Environment Security & Registration Job Managnt HeC Gateway Storage HeCScheduling Monitoring Info System Computation Unit Data Unit HeCDBMS 200GB50GB1TB Health-e-Child EGEE gLite HeC Gateway Virtualization…

14 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 The Platform The Health-e-Child Gateway (1) Inside the box… Client Applications One Access Point HeC Gateway Computing Resources Functionality Access Infrastructure Abstraction

15 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 The Platform The Health-e-Child Gateway (2 ) Secur ity GT4 Inside the box… One Access Point HeC Gateway

16 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Platform & Grid Middleware The Health-e-Child Gateway (3) Grid GT4 Inside the box… One Access Point HeC Gateway

17 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 The Platform The Health-e-Child Gateway (4) Client Connectivity GT4 Inside the box… One Access Point HeC Gateway

18 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 The Platform The Health-e-Child Gateway (5) Client Applications Inside the box… One Access Point HeC Gateway Data Integration

19 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Contents Project Objectives & Challenges Health-e-Child Gateway The HeC Grid architecture Data Integration & Ontologies Futures Conclusions

20 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Grid Grid technology (gLite 3.0) as the enabling infrastructure A distributed platform for sharing storage and computing resources HeC Specific Requirements Need support for medical (DICOM) images Need high responsiveness for use in clinical routine Need to guarantee patient data privacy:  access rights management  storage of anonymized patient data only Status √ Testbed installation since Mai 2006 √ HeC Certificate Authority √ HeC Virtual Organisation √ Security Prototype (clients & services) √ Logging Portal & Appender

21 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 EGEE gLite - Grid Middleware to be used in HeC is EGEE gLite (DoW) - gLite provides several modules, 1 module encompassing 1 or * services/software  Selected the minimal set of services for enacting a Test-bed: - Core Services (services accessed by all sites following a centralised architecture) - Top BDii: Berkeley Database Information Index. Grid Information System Database - VOMS: Virtual Organisation Membership Service - PX: Proxy - LFC: File Catalogue - WMSLB: Workload Management System - Site Services (services installed at each site) - Site BDii: Berkeley Database Information Index. Top BDii replica. - gLiteSE/DPM: Storage Element - gLiteCE/Torque: Computing Element - WN: Worker Node - UI: User Interfaces. Grid API & Clients - However… Limited Resources  Virtualisation of OSs & m/w Services clustering - Modular Grid Access Points:  Virtual machines can be spread over the infrastructure Grid Platform Grid Middleware - Adopted Solution

22 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Grid Platform Grid Month18 Virtualization Applied 2 nd Test Node Deployed Being deployed

23 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Solution Facts & Conclusion Grid Platform Installed & Tested - Test-bed Installed & CERN Middleware Architecture - Validated the Middleware Architecture (V1.0) to be installed at the different Sites centralized - However: Architecture still too centralized (VOMS, LFC…) - Being addressed in the second planning period Installed & Tested - Test-bed Installed & CERN Middleware Architecture - Validated the Middleware Architecture (V1.0) to be installed at the different Sites centralized - However: Architecture still too centralized (VOMS, LFC…) - Being addressed in the second planning period CERN

24 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Grid Platform Current HeC gLite Interface covers: Proxies Management, File(s) & Catalog Management, Workload Management, File Transfer Management Proxy File & CatWorkloadFile Transf Grid Info Solution Facts & Conclusion (2) Highest priority was File & Catalog functionality Highlights - Fast prototyping and integration - Fast prototyping and integration of grid functionality autumn Test-bed is ready since autumn 2006 Modular - Modular architecture (virtualization) resource limitations - Satisfies Hospitals’ IT resource limitations Highlights - Fast prototyping and integration - Fast prototyping and integration of grid functionality autumn Test-bed is ready since autumn 2006 Modular - Modular architecture (virtualization) resource limitations - Satisfies Hospitals’ IT resource limitations

25 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Common Client Applications Status √ authentication (certificate-based single sign-on) √ simple browsing and viewing USB key solution authentication basic functionality + applications that do not require specific resources available from any PC supporting USB in the hospital without SW installation autorun from Windows XP Supports basic functionality browsing, viewing, editing patient data safety, security, privacy, anonymisation similarity search

26 Health-e-Child Technical Review Meeting, Paris, 17 January 2007 Contents Project Objectives & Challenges Health-e-Child Gateway The HeC Grid architecture Data Integration & Ontologies Futures Conclusions

27 Vertical Levels Split the domain into vertical levels Data/Information from multiple levels Implicit semantics connecting these levels. Vertical Integration in applications Data models should provide seamless access to all the infor- mation stored in HeC Vertically Integrated Modelling Coherently integrate information from different levels (instances) Provide integrated view of the data to the users (concept-oriented views, etc) Queries against the integrated knowledge Topics Vertical Integration along the longitudinal axis in pediatrics Semantics of relevance for presenting clinical data Fragment extraction, alignment and integration from biomedical knowledge sources Vertical Integration PopulationIndividualOrganTissueCellMolecule

28 Data and knowledge modelling Requirements Data Acquisition Protocols (WP9) Users Requirements Specifications (WP2) Modelling with Domain Experts (WP6) Integrated Data Modelling Applications DSS Similarity Knowledge Representation - ontologies Query HeC-universal Application specific

29 Data and knowledge representation approaches in the medical domain relational schemas, e.g. traditional DB + well-understood and robust method for representing data + performant persistent data management implementations - no intuitive modeling of hierarchically structured objects, e.g. biological entities semi-structured data, e.g. XML + more natural modeling of biological entities (e.g. by using the features like nesting) - it is still difficult to model complex (non-tree) relationships ontologies + can represent declarative and procedural knowledge and arbitrarily complex relations - can be cumbersome for simple schemata - non-performant storage + provides reasoning over distributed knowledge resources + best suited for medical data

30 Knowledge Representation: Ontologies and Data Models Ontologies generic and task-independent (universal) represent knowledge that formally specifies agreed logical theories for an application domain provide the basis for reusability, reliability, shareability, portability, and interoperabiltity can be easily extended and combined expressive domain rule languages lead to a more correct and precise conceptualisation of a domain (integrated reasoning support) Data Models task-specific schemata, implementation oriented represent structure and integrity of the data elements conceptualisation and vocabulary of the data model are not to be shared extension of the data model is limited less expressiveness

31 Ontologies in Health-e-Child Reuse existing medical knowledge to structure our feature space Working with established knowledge Linkage to “live” knowledge base(s) Saves effort on validating the models with domain experts Non-domain specific conceptualization (upper level ontologies): reusing these, we establish shareable/shared knowledge Ontological modelling of paediatric medicine Integration of information – resolving semantic heterogeneity Different countries, hospitals, protocols Research potential OWL DL representation of paediatrics medical knowledge, identification of knowledge patterns Decision-support systems Similarity metrics and query enhancement. Alignment of ontologies that overlap over the Health-e-Child domain

32 Contents Project Objectives & Challenges Health-e-Child Gateway The HeC Grid architecture Data Integration & Ontologies Futures Conclusions

33 Clinical and Application Roadmap Phase I (- 06/06) Phase II (07/ /07) Phase III (07/ /08) Study Design and Approval Phase IV (2009) Clinical Validation Refinement of Models and Algorithms Dissemination Data acquisition, genetic tests, ground truth annotations User Requirements State of the Art Reports Knowledge Discovery Methods Segmentation/Registration Feature Extraction from Imaging Disease Model Development generic  subtype specific  patient and treatment specific Integrated decision support Classifiers Based on Genetics

34 Future Work – Decentralized Architecture Grid Platform Working out a possible alternatives to decentralize key components of the grid middleware  Autonomous Sites

35 Ontology layer in the data management architecture HeC Ontology Ontological Layer Ontology-Data Model Mappings Query Processing Engine GUI Semantics Rules External public DBs External Biomedical ontologies Hospitals’ DBs Applications (e.g.DSS) HeC-External Ontologies Mappings

36 Contents Project Objectives & Challenges Health-e-Child Gateway The HeC Grid architecture Data Integration & Ontologies Futures Conclusions

37 HEP vs. BioMed - Comparison Number & location of sites, Job vs. Interactive processing, Data distribution & governance, Nature of applications, skills sets of users, Data Types & lifetimes, Data Replication Middleware-centric vs. Operating system-centric Vs.

38 Obstacles for Biomed Users Grid Middleware is difficult to set up, configure, maintain and use. New skills needed. Support for non-scientific applications is limited. Hierarchical environments favoured, largely client-server, inflexible in nature - no support for P2P. Grid software has concentrated on job-oriented (batch-like) computing rather than interactive computing. Cannot share the cpu (CE) or storage capabilities (SE) with the Grid : all or nothing for the resourcing.  We need out-of-the-box Grid functionality for biomedical end-users.

39 Conclusions HeC Gateway designed and implementation progressing through Grid platform decided – EGEE gLite BUT Grid is not yet mature in non-HEP settings Biomedical communities are very different to HEP communities. So no one-size-fits-all Future must be towards user-participation in Grids – what about a Grid OS ?