Health-e-Child Konstantin Skaburskas Based on presentations made by HeC partners.

Slides:



Advertisements
Similar presentations
Distributed Data Processing
Advertisements

Database System Concepts and Architecture
Plateforme de Calcul pour les Sciences du Vivant SRB & gLite V. Breton.
CREAM-CE status and evolution plans Paolo Andreetto, Sara Bertocco, Alvise Dorigo, Eric Frizziero, Alessio Gianelle, Massimo Sgaravatto, Lisa Zangrando.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
NeuGRID PROJECT: A PRACTICAL EXAMPLE OF RESEARCH THROUGH THE GARR.
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
MammoGrid: A Service Oriented Architecture based Medical Grid Application CERN (Switzerland, Project Coordination) Mirada Solutions (UK) – Medical Image.
Lecture Nine Database Planning, Design, and Administration
Configuration Management
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
The Health-e-Child Project & Platform Data Integration - Semantic and Syntactic Interoperability David Manset – MAAT-G March 5th, 2009 EGEE-UF/OGF25 Catania,
WP6: Grid Authorization Service Review meeting in Berlin, March 8 th 2004 Marcin Adamski Michał Chmielewski Sergiusz Fonrobert Jarek Nabrzyski Tomasz Nowocień.
08/11/908 WP2 e-NMR Grid deployment and operations Technical Review in Brussels, 8 th of December 2008 Marco Verlato.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
ITEC224 Database Programming
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
X-Road – Estonian Interoperability Platform
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
A DΙgital Library Infrastructure on Grid EΝabled Technology ETICS Usage in DILIGENT Pedro Andrade
Health-e-Child: A Grid Platform for European Paediatrics On behalf of the Health-e-Child Consortium (& with thanks to David Manset Maat-G) CHEP07 3 rd.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ignacio Blanquer Vicente Hernández Damià.
TESTBED FOR FUTURE INTERNET SERVICES TEFIS at the EU-Canada Future Internet Workshop, March Annika Sällström – Botnia Living Lab at Centre for.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
Health-e-Child Project Requirements EGEE 2006 – HealthGrid David Manset MAAT GKnowledge.
NMI End-to-End Diagnostic Advisory Group BoF Fall 2003 Internet2 Member Meeting.
The GRelC Project: architecture, history and a use case in the environmental domain G. Aloisio - S. Fiore The Climate-G testbed is an interdisciplinary.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
JRA Execution Plan 13 January JRA1 Execution Plan Frédéric Hemmer EGEE Middleware Manager EGEE is proposed as a project funded by the European.
Project Overview Jörg Freund, Siemens David Manset, MaatGKnowledge HG09, 29-Jun-09, Berlin.
GAAIN Virtual Appliances: Virtual Machine Technology for Scientific Data Analysis Arihant Patawari USC Stevens Neuroimaging and Informatics Institute July.
© 2006 STEP Consortium ICT Infrastructure Strand.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval Annual Review Meeting - Introduction.
HeC workshop, EGEE07 5 October 2007, Budapest Health-e-Child: A Platform for European Paediatrics Tamás Hauer University of the West of England, Bristol.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Workforce Scheduling Release 5.0 for Windows Implementation Overview OWS Development Team.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Approaching Fine-grain Access Control for Distributed Biomedical Databases within Virtual Environments Onur Kalyoncu, Yi Pan, Matthias Assel High Performance.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
TCS-ICS interactions Kuvvet Atakan 1 and the WP6 and WP7 Teams 1 University of Bergen / Department of Earth Science.
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
Desktop Integration Rhidian Bramley PACS & Teleradiology Group Meeting November 2005.
In Vivo Imaging Middleware and Applications RSNA 2007 Berkant Barla Cambazoglu The Ohio State University Department of Biomedical Informatics.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ignacio Blanquer Vicente Hernández Damià.
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
ATLAS Database Access Library Local Area LCG3D Meeting Fermilab, Batavia, USA October 21, 2004 Alexandre Vaniachine (ANL)
1 Health-e-Child project Konstantin Skaburskas CERN IT/GD University of Tartu, Estonia 16 March 2007, CERN.
INFSO-RI SA2 ETICS2 first Review Valerio Venturi INFN Bruxelles, 3 April 2009 Infrastructure Support.
The Health-e-Child Project EGEE 2006 – Industry Task Force David Manset MAAT GKnowledge.
Site Authorization Service Local Resource Authorization Service (VOX Project) Vijay Sekhri Tanya Levshina Fermilab.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
DGAS Distributed Grid Accounting System INFN Workshop /05/1009, Palau Giuseppe Patania Andrea Guarise 6/18/20161.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
Chapter 9 Database Planning, Design, and Administration Transparencies © Pearson Education Limited 1995, 2005.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Dr. Ir. Yeffry Handoko Putra
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
Regional Operations Centres Core infrastructure Centres
JRA3 Introduction Åke Edlund EGEE Security Head
Ian Bird GDB Meeting CERN 9 September 2003
Health-e-Child: An Integrated Platform for European Paediatrics
High Performance Computing Center – HLRS
Presentation transcript:

Health-e-Child Konstantin Skaburskas Based on presentations made by HeC partners

2Health-e-Child 07. November 2006 Establish Horizontal and Vertical integration of data, information and knowledge Develop a grid-based biomedical information platform, supported by sophisticated and robust search, optimisation, and matching techniques for heterogeneous information, Build enabling tools and services that improve the quality of care and reduce its cost by increasing efficiency Integrated disease models exploiting all available information levels Database-guided decision support systems Large-scale, cross-modality information fusion and data mining for knowledge discovery A Knowledge Repository? Project Objectives

3Health-e-Child 07. November 2006  Instrument:Integrated Project (IP) of the Framework Program FP6  Project Identifier:IST  Coordinator:Siemens AG, Dr. Jörg Freund  Partner:14 European (companies, hospitals, institutions)  Timetable: 01-Jan-06 to 31-Dec-09 (4 years)  Total cost: 16.7 Mio. €  EC funding: 12.2 Mio. €  Web page: Project General Info

4Health-e-Child 07. November 2006 GOSH NECKER UWE CERN IGG SIEMENS ASPER UOA INRIA Project Map LYNKEUS UCL EGF FGG MAAT

5Health-e-Child 07. November 2006 Clinical Context  Diseases Heart diseases (Right Ventricle Overload, Cardiomyopathy), Inflammatory diseases (Juvenile Idiopathic Arthritis), and Brain tumours (Gliomas)  Clinical Institutions I.R.C.C.S. Giannina Gaslini (IGG), Genoa, Italy University College London, Great Ormond Street Children’s Hospital (GOSH), London, UK Assistance Publique Hopitaux de Paris – NECKER, Paris, France  Clinical Departments Cardiology Rheumatology (Neuro-)Oncology Radiology Lab (Genetics, Proteomics, Lab) Administration

6Health-e-Child 07. November 2006 Data Integration Challenge (1) 3 Hospital Nodes Integration of data stored in Hospital’s IS + fresh new data to be acquired Acquisition of large samples of Imaging data 3 diseases X 300 cases X 2 modalities X 300 images – i.e. at most images ~ 270 GB A Distributed Platform for sharing, manipulating and inferring data Decision Support System Disease Modelling Knowledge Discovery / Data Mining Image Processing Automatic segmentation of right ventricle – to determine volume, ejection fractions etc for cardiac MR and ultrasound images Brain tumour segmentation/registration to determine volume, location etc Volume of synovial fluid in wrist MR scans Grid technology as the enabling infrastructure HeC Components

7Health-e-Child 07. November 2006 Data Integration Challenge (2) Heterogeneous Data/Imaging Sources DB Backends: from simple MS ACCESS to complex Patient Information Systems like TOMCAT, RIS … No or few linkage bw department’s IS Various imaging modalities: MRI, CT, US, X-Ray… Various imaging devices: Siemens Bi-Plan, GE Vivid7, Sequoia, HP128… Heterogeneous Connectivity PACS not yet present in all Hospitals/Departments Hospitals have different Hardware/Network/Security constraints A 3-Phase Data Integration Scheme 1 st : A temporary offline data acquisition application 2 nd : An online data acquisition application (interacting with the platform) 3 rd : A background data integration service (in the platform)

8Health-e-Child 07. November 2006 A1: Project Management WP1 – Project management A2: User Requirements Specifications WP2 – User requirements specifications A3: Ethical, Legal and Social Issues WP3 – Legal, ethical, and regulatory issues WP4 – Privacy and security A4: Platform Development, Vertical Data Integration and Knowledge Representation WP5 – Grid platform (Maat, CERN, UWE, Siemens) WP6 – Medical vertical knowledge representation WP7 – Data management layer and data integration mechanisms WP8 – Medical query processing A5: Data Collection, Annotation and Knowledge Gathering WP9 – Data collection WP10 – Ground truth (annotated data) and clinical knowledge gathering A6: Disease Modelling, Decision Support and Knowledge Discovery based on Integrated Biomedical Information WP11 – Integrated disease modelling WP12 – Decision support systems WP13 – Biomedical knowledge discovery A7: System Integration WP14 – Deployment of the data management system and Grid gateway A8: Dissemination Policy and Broader Impact WP15 – Training WP16 – Dissemination Activities and Work Packages

9Health-e-Child 07. November 2006 WP5 Objectives To provide a Grid-aware infrastructure on which the Health-e-Child applications can operate To develop appropriate APIs for applications to invoke Grid services To provide education to application developers on the use and operation of the Grid infrastructure To ensure that updates and releases of the Grid platform are available as the infrastructure evolves

10Health-e-Child 07. November 2006 Challenges Security: Strong authentication VOMS based authorization to all Grid resources Fine-grained data sharing/access. Granularity: File DB views, tables, table rows

11Health-e-Child 07. November 2006 Early Faced Issues Mainly Non-Functional since project has just started Selecting grid m/w services wrt project requirements Lots of services/functionalities available Different implementations with different levels of maturity Clustering grid m/w services To reduce the h/w requirements & maintenance (1 server / Hospital) To facilitate deployment (3 clinical sites + at least 5 institutional sites) Decentralisation of grid m/w services Sites need to be as much as possible autonomous “Griddification” of Applications Some of the HeC applications might be “gridified” Griddification has to be balanced against runtime and development complexity criteria

12Health-e-Child 07. November 2006 Current Investigations Selecting grid m/w services wrt project requirements => Services selection based on URS + Grid Questionnaire Clustering grid m/w services => “Xenification” of OSs + clustering services wrt functionality Decentralisation of Grid Services => Dependent on gLite developments, but already some possibilities with Master/Slave configurations “Griddification” of Applications => Introduced a classification of applications. Grid Questionnaire will certainly help in making decisions Grid Access => Abstracting grid access through dedicated service

13Health-e-Child 07. November 2006 Remaining Challenges Data Integration in Hospitals (post phase 2) What mechanisms to use? What will be the limitations (in particular with proprietary systems? Patient Data Distribution & Sharing What technology/implementation? Patient Image Files Sharing Enabling the sharing of large files over the internet GOSH = 500MB/patient NECKER = 3.5GB/patient …raises bandwidth problems Griddification of Applications Appears relevant for computation heavy algorithms or batch processing However many clinical algorithms have short runtime (e.g. image processing, since clinicians need almost instantaneous results)

14Health-e-Child 07. November 2006 Non-functional Requirements autonomous Hospital Sites should be autonomous Sites should not depend on any central services Hardware requirements remain too high for Hospitals one box Getting access to the grid through one box would be ideal 1 Server per Hospital e.g. 1 Server per Hospital Fine-grained security mechanism for accessing data (at the record level?) Functional Requirements Pseudonymisation Pseudonymisation as a native middleware service? Streaming Native Streaming facilities for sharing large DICOM files patient-centric [ Native patient-centric data model(s) (flexibility) Optionally data model could be selected from existing standards (e.g. HL7…) or even created from scratch (interoperability) Optionally a native commodity for exporting/exposing data through different data models would be nice (model-driven) (interoperability) Optionally a data model (schema) discovery mechanism could help Native connectors to external backends for batch data integration ] HealthGridsDistributed PACS 1. Are HealthGrids likely to become the enabling infrastructure for Distributed PACS? GridKnowledge Repositories 2. Is the Grid likely to become the enabling infrastructure for Knowledge Repositories? Middleware Requirements

15Health-e-Child 07. November 2006 HeC Platform HeC Server GOSH HeC Server Clinicians Laptops/Desktops Workstation IGG NECKER One server per Hospital Single entry point to HeC Platform One workstation per Department dedicated user interface For complex tasks a dedicated user interface is used Generic computers on Intranet web browsers Most functionalities accessible from generic web browsers Clinician’s HeC Identity Approach (1)

16Health-e-Child 07. November 2006 HeC Platform Centralized

17Health-e-Child 07. November 2006 HeC Platform DeCentralized

18Health-e-Child 07. November 2006 Approach (2) An intermediary access layer: the HeC Gateway To decouple client applications from the complexity of the grid and other computing resources Towards a platform independent implementation Domain Specific Functionality exposed in the HeC Gateway Grid mainly used as a Distributed & Federated PACS Different modalities of images to be anonymised and shared Clinical Reports Misc. Files

19Health-e-Child 07. November 2006 Gateway Architecture

20Health-e-Child 07. November 2006 Platform Use Cases Local & GlobalRequires high responsivenessView Case Global--Maintain Grid Global--Knowledge Mining Local & GlobalRequires high responsivenessFind Similarity Local & GlobalRequires high responsivenessQuery Local--Data Acquisition Local & Global--Data Annotation Global--Manage Sharing Global--Maintain VO Global--Maintain Tools Local & Global--Maintain Information Schema Local--Maintain Patient Database 3. Maintain Platform Local & GlobalRequires high responsivenessUse Disease Models Local & GlobalRequires high responsivenessUse Decision Support System 2. Retrieve & Exploit Information 1. Collect Information ScopeComment(high-level) Use Case

21Health-e-Child 07. November st Technical Accomplishments Establishment of a Common Development Environment Indispensible to synchronise partners and leverage synergy Creation of the Health-e-Child Virtual Organisation (VO) Establishment of a Certificate Authority (55 certs delivered so far) HeC VO (10 users) Structure in place, being tested 1st gLite Test-bed deployed in May 2006 on HeC dedicated servers ~20 computers involved Being refined according to project requirements HeC gateway (prototype version) Authentication Client Application & Grid Service (VOMS enabled) HeC Portal & Factory (exposing domain specific functionality)

22Health-e-Child 07. November 2006 What has been achieved, practically speaking? T4.2 Health-e-Child Certificate Authority Redundant & Secured OpenCA Repositories 2 repositories synchronized by a USB stick 1 repository exposing non-sensitive data 1 repository installed in a secure computer 56 certificates delivered so far Being used by all the services in the infrastructure from gLite to gateway to common development environment T4.3 Security Prototype Gateway service container and security policy in place Authentication Client & Service – Improved GSI & PKI model Logging Web-Portal – Featuring all facilities to browse gateway logs » Can process statistics on gateway usage/reliability » Users Operations can be traced efficiently & real-time » Based on G LogAppender for Java – Asynchronous logger, therefore almost no computation cost – Log appended in a G DBMS VO Configured and under Test Infrastructure Protected by Firewalls – respecting the security rules defined for gateways WP4 – Security & Privacy

23Health-e-Child 07. November 2006 WP5 – Grid Platform What has been achieved? T5.1 EGEE gLite Evaluation (M1-6 / 4MM) OK. Done via gLite Test-bed deployment at CERN & MAAT – Report should be appended to D5.1 T5.2 Education / Training(M1-9 / 4MM) OK. Done, undertaken several gLite tutorials until recently WP5 Team might be interested in having further technical tutorials T5.3 API Development(M6-12 / 10MM) UP. API Information Collection completed – Functionality wrapping is under process T5.4 gLite Test-bed(M9-15 / 14MM) OK. First Test-bed, so-called « Roma »… demonstrated end May Central Site 3 Local Sites (mimicking hospitals) T5.5 Prototype Testing(M15-18 / 3MM) NS. However: Trying out some test suites for gLite Started Feedback to EGEE through CERN Milestones and Deliverables D5.1 Report on delivered Grid prototype (due date: Month18) M5.1 Prototype Grid Platform (due date: Month18) NS : Not Started yet UP : OnGoing OK : Done

24Health-e-Child 07. November 2006 What has been achieved, practically speaking? T5.2 Education / Training gLite CERN gLite CERN gLite CERN T5.3 API Development HeC Infrastructure Requirements Questionnaire Delivered HeC Infrastructure Requirements Questionnaire Collected all filled-in questionnaires (by 23-Oct. 2006) Questionnaires now have to be processed API integration scheme has been selected according to last tutorial results API development has been split Already some embryo functionality can be tested through the HeC Shell (bundled in the Authentication Client Application for sysadmins) T5.1 & T5.4 gLite Test-bed Test-bed, so-called « Roma »… demonstrated end May 2006 Test-bed now features: 1 Central 3 Hospital-like sites 1 Development CERN 15 VMs~6 Servers For a total of 15 VMs~6 Servers WP5 – Grid Platform

25Health-e-Child 07. November 2006 Infrastructure Outlook Node dedicated to “Xenification” tests of gLite services. Temporary site, should disappear later. OKCERN3 / 1Test SiteLocalCERN Production Site Development Site to train DSS connected to Grid Mimicking a Hospital site. Could become a public Gateway for external clinicians/hospitals get access Infrastructure should expand to 20 servers gradually Mimicking a Hospital site. Temporary site, should disappear later. Health-e-Child CA Running core gLite services Note NSGOSH London 4 / 2Hospital SiteLocalGOSH NSNECKER Paris 4 / 2Hospital SiteLocalNECKER NSIGG Genova 4 / 2Hospital SiteLocalIGG OKMAAT Madrid 4 / 1Hospital / Development Site LocalMAAT UP Month? SIEMENS Munich 4 / 2Development SiteLocalSIEMENS Computational Site Hospital / Test Site VO Sub-Type UP Month12 UWE Bristol4 / 2LocalUWE OKCERN2 / 2LocalCERN OKCERN1 / 1CentralMAAT OKCERN6 / 1CentralMAAT StatusLocation# of VM / Servers TypeOwner WP5 – Grid Platform

26Health-e-Child 07, November 2006 WP14 – Connectivity Deployment NS : Not Started yet UP : OnGoing OK : Done Milestones and Deliverables D14.1 Report on Deployed Prototype (due date: Month18) What has been achieved? UP. T14.1 Deployment of the Connection (M5-10 / 8MM) To speed up the process, MAAT is planning to: –1. Order servers for Hospitals –2. Batch install them –3. Ship them to Hospitals and –4. Finalize deployment onsite Currently preparing the deployment logistic –MAAT delivered recently Hospitals servers specs proposal –Deployment Team constituted and ready to proceed NS. T14.2 Linkage to DSS (M11-18 / 4MM) NS. T14.3 Browsing of Data through Specified Ontologies (M11-18 / 2MM)

27Health-e-Child 07, November 2006 What has been achieved, practically speaking? Test-bed used as a learning environment to perfect up-coming deployment Deployment recently exercised through installation of a “Hospital-like” MAAT Server Pricing for Hospitals Got proposal from DELL Spain (good prices ~ 1K€ lower than public prices) Deployment will be documented. Procedures are being properly described Deployment Procedure Documentation Hardware Test Report (1 per server delivered to Hospitals) –For properly assessing the servers to be deployed in Hospitals Software Test Report (1 per Hospital Site) –For properly validating a site installation All Reports to be appended to D14.1 WP14 – Connectivity Deployment

28Health-e-Child 07, November 2006 What is planned? once orderedServers preparation phase to last 3 months (once ordered) Targeted 1st Hospital: IGG (GOSH 2 nd & Necker last) Ideal Objective would be to order h/w before the end of November 2006 (Month11) Deployment in Hospitals to start at Month12 “Ideal” Objective is to deploy one Hospital before European Review (Month14) Where do you foresee problems, risks and/or delays? WP14 Description not relevant for T14.2 & T14.3 (deserves amendment) Deployment already delayed Hardware deprecation – Servers should be ordered no later than December 2006 Servers integration in Hospitals What could be the counter measures to avoid the problems, risks and delays? Amend WP14 for the DoW2 Order the h/w at Month11/12 Tight interaction with Hospitals’ IT Departments pre-deployment visitmight require a pre-deployment visit at each Hospital WP14 – Connectivity Deployment