Grid Engineering Experience & Biological Applications Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics.

Slides:



Advertisements
Similar presentations
Delivery of Industrial Strength Middleware Federated Strengths Agility & Coordination Prof. Malcolm Atkinson Director 21 st January 2004.
Advertisements

S.L.LloydATSE e-Science Visit April 2004Slide 1 GridPP – A UK Computing Grid for Particle Physics GridPP 19 UK Universities, CCLRC (RAL & Daresbury) and.
S.L.LloydGrid Steering Committee 8 March 2002 Slide 1 Status of GridPP Overview Financial Summary Recruitment Status EU DataGrid UK Grid Status GridPP.
Tony Doyle - University of Glasgow GridPP EDG - UK Contributions Architecture Testbed-1 Network Monitoring Certificates & Security Storage Element R-GMA.
Dr Richard Sinnott Dr Dave Berry 5 th February 2004 National e-Science Centre Local Developments Technical Director National e-Science Centre ||| Deputy.
Particle physics – the computing challenge CERN Large Hadron Collider –2007 –the worlds most powerful particle accelerator –10 petabytes (10 million billion.
The National Grid Service Mike Mineter.
Research Councils ICT Conference Welcome Malcolm Atkinson Director 17 th May 2004.
National e-Science Centre Glasgow e-Science Hub Opening: Remarks NeSCs Role Prof. Malcolm Atkinson Director 17 th September 2003.
NeSC: National e-Science Centre. NeSC Mission Help the UK develop international strength in Grid computing Industry, Commerce, Scientific Research, …
DyVOSE Status Report Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University.
SWITCH Visit to NeSC Malcolm Atkinson Director 5 th October 2004.
BRIDGES Status Report Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University.
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
Directors Meeting Oxford e-Research Centre Malcolm Atkinson Director e-Science Institute & e-Science Envoy 10 th October 2006.
R e D R e S S Resource Discovery for Researchers in e-Social Science ReDReSS A Joint Application from Lancaster and Daresbury (7 social scientists, 6 computer/computational.
E-Science Update Steve Gough, ITS 19 Feb e-Science large scale science increasingly carried out through distributed global collaborations enabled.
From AgentLink II to AgentLink III Co-ordinators: Peter McBurney, University of Liverpool, UK Terry Payne, University of Southampton, UK.
National e-Science Centre Arthur Trew Director, EPCC Deputy Director, NeSC.
Peter Clarke UK National e-Science Centre University of Edinburgh e-Infrastructure in the UK.
GEODE Workshop 16 th January 2007 Issues in e-Science Richard Sinnott University of Glasgow Ken Turner University of Stirling.
UK e-Science and the White Rose Grid Paul Townend Distributed Systems and Services Group Informatics Research Institute University of Leeds.
Welcome e-Science in the UK Building Collaborative eResearch Environments Prof. Malcolm Atkinson Director 23 rd February 2004.
GridPP Tuesday, 23 September 2003 Tim Phillips. 2 Bristol e-Science Vision National scene Bristol e-Science Centre Issues & Challenges.
ScotGrid: a Prototype Tier-2 Centre – Steve Thorn, Edinburgh University SCOTGRID: A PROTOTYPE TIER-2 CENTRE Steve Thorn Authors: A. Earl, P. Clark, S.
The OMII Perspective on Grid and Web Services At the University of Southampton.
Techniques for Data Linkage and Anonymisation – A Funders View Turing Gateway Meeting 23 rd October 2014 Dr Mark Pitman.
BRIDGES Status Report Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University.
The National e-Science Centre: Role and Activities Dave Berry, Research Manager Indo-UK Workshop on e-Science Delhi, February 2004.
12th November 2003LHCb Software Week1 UK Computing Glenn Patrick Rutherford Appleton Laboratory.
Can Grids Deliver the Vision for Future Hypothesis Driven Life Science Research? Professor Richard Sinnott Technical Director National e-Science Centre.
Sys-Bio Talk, 24 th Feb 2005 Towards Grid-Based System Biology Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director (Technical)
The Environmental Genomics Thematic Programme Data Centre Dawn Field, Director.
ScotGRID:The Scottish LHC Computing Centre Summary of the ScotGRID Project Summary of the ScotGRID Project Phase2 of the ScotGRID Project Phase2 of the.
Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University of Glasgow
Edinburgh Investment in e-Science Infrastructure Dr Arthur Trew.
GGF-16 Athens Production Grid Computing in the UK Neil Geddes CCLRC Director, e-Science.
Neil Geddes GridPP-10, June 2004 UK e-Science Grid Dr Neil Geddes CCLRC Head of e-Science Director of the UK Grid Operations Support Centre.
Future of e-Science Malcolm Atkinson Director 18 th March 2004.
UK e-Science AHM th September 2005 Comparison of Data Access and Integration Technologies in the Life Science Domain Dr Richard Sinnott Technical.
Usability Talk, 26 th January 2006 Development of Usable Grid Services for the Biomedical Community Prof Richard Sinnott Technical Director National e-Science.
Grid enabled e-Research in the Life Sciences Prof Richard Sinnott Technical Director National e-Science Centre Anthony Stell University of Glasgow, Scotland,
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
“Grids and eScience” Mark Hayes Technical Director - Cambridge eScience Centre GEFD Summer School 2003.
The UK eScience Grid (and other real Grids) Mark Hayes NIEeS Summer School 2003.
Data and the UK e-Science Programme Paul Watson Director North-East Regional e-Science Centre School of Computing Science University of.
The National Grid Service Mike Mineter
AHM September 2004 Grid Services Supporting the Usage of Secure Federated, Distributed Biomedical Data Dr Richard Sinnott Technical Director National e-Science.
IBM & HSBC visit Malcolm Atkinson Director & e-Science Envoy UK National e-Science Centre & e-Science Institute 30 th March 2006.
Utility Computing: Security & Trust Issues Dr Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College London.
Tony Doyle - University of Glasgow Introduction. Tony Doyle - University of Glasgow 6 November 2006ScotGrid Expression of Interest Universities of Aberdeen,
An Introduction to UK e-Science Anne E Trefethen Deputy Director UK e-Science Core Programme.
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
The National Grid Service Mike Mineter.
GRIDSTART Brussels 20/9/02 1www.gridstart.org GRIDSTART and European activities Dr Francis Wray EPCC The University of Edinburgh.
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
VOTES Virtual Organisations for Trials and Epidemiological Studies Overview The development.
RC ICT Conference 17 May 2004 Research Councils ICT Conference The UK e-Science Programme David Wallace, Chair, e-Science Steering Committee.
Shibboleth Use at the National e-Science Centre Hub Glasgow at collaborating institutions in the Shibboleth federation depending.
Realising MRC’s Vision in Health and Bioinformatics MRC Open Council Meeting July 2014 Janet Valentine Head of Population Health and Informatics.
The UK National Grid Service Andrew Richards – CCLRC, RAL.
Bob Jones EGEE Technical Director
Building Castles with Shifting Sands?
Tools and Services Workshop
Joslynn Lee – Data Science Educator
UK Grid: Moving from Research to Production
Welcome to National e-Science Centre Official Opening
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
The National Grid Service
The National Grid Service Mike Mineter NeSC-TOE
Presentation transcript:

Grid Engineering Experience & Biological Applications Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University of Glasgow 28 th May 2004

Cambridge Newcastle Edinburgh Oxford Glasgow Manchester Cardiff Southampton London Belfast Daresbury Lab RAL Hinxton NeSC in the UK NeSC Prof Malcolm Atkinson (Director) Dr Richard Sinnott (Technical Director - Glasgow) NeSC and UK Grid Engineering Background Achievements Current/future Life sciences & Grids Challenges & Opportunities Life science projects involving NeSC Glasgow –Bridges (Security focused Grid infrastructure for CFG) –Scottish Bioinformatics Research Network (coming soon) –JDSS (data sharing for life sciences) –VOTES…? Core National Grid Service White Rose Grid HPC(x ) CSA R Previous work on UK e-Science Grid based on GT2 Demonstrated broad set of applications across it Monte Carlo simulations of ionic diffusion through radiation damaged crystal structures Integrated Earth system modelling BLAST on the Grid Grid Integration Test Script Suite … Transition to OGSI/OGSA under discussion Two UK OGSA Test Grid projects started in January UCL, Imperial College, Universities of Edinburgh and Newcastle Universities of Portsmouth, Reading, Manchester, Westminster and CCLRC There are still issues to be resolved OGSA definition and delivery Standards OGSI, WSRF, … …and Technologies GT3, GT4… Hosting environments & Platforms Combinations of services supported Material and grids to support adopters

Glasgow e-Science Hub E-Science Hub Externally Glasgow end of NeSC –Involved in UK wide activities »ETF: In May 2003 became first UK e-Science Centre to run integration tests across every site of the UK (Level 2) Grid. Therefore 100% access to UK Grid resources at this time –Public visibility of NeSC »responsible for NeSC web site Internally Focal point for e-Science research/activities at Glasgow Work closely with foundation departments –Department of Computing Science –Department of Physics & Astronomy Also working closely with other groups including –Bioinformatics Research Centre –Electronics and Electrical Engineering –Biostatistics –…

Glasgow e-Science Activities Consolidating resources Building around ScotGrid Providing shared Grid resource for wide variety of scientists inside/outside Glasgow –Particle physicists, computer scientists, bioinformaticians, … »Target shares established –Focal point for e-Science at Glasgow Hardware 59 IBM X Series 330 dual 1 GHz Pentium III with 2GB memory 2 IBM X Series 340 dual 1 GHz Pentium III with 2GB memory 3 IBM X Series 340 dual 1 GHz Pentium III with 2GB memory and Mbit/s ethernet 1TB disk LTO/Ultrium Tape Library Cisco ethernet switches New.. IBM X Series 370 PIII Xeon with 32 x 512 MB RAM 5TB FastT500 disk 70 x 73.4 GB IBM FC Hot-Swap HDD eDIKT 28 IBM blades dual 2.4 GHz Xeon with 1.5GB memory eDIKT 6 IBM X Series 335 dual 2.4 GHz Xeon with 1.5GB memory CDF 10 Dell PowerEdge GHz Xeon with 1.5GB memory CDF 7.5TB Raid disk Shared Resources: Disk ~15TB CPU ~ 330 1GHz CDF LHC BIO

Grids & Life Sciences Extensive Research Community >1000 per research university Extensive Applications Many people care about them Health, Food, Environment, … Interacts with many disciplines Physics, Chemistry, Maths/Statistics, Nano-engineering, … Huge and expanding number of databases relevant to bioinformatics community Heterogeneity, Interdependence, Complexity, Change, Dirty… Linking using in co-ordinated, secure manner full of open issues to be addressed Compute demands growing as more in-silico research undertaken

Database Growth PDB Content Growth DBs growing exponentially!!! Biobliographic (MedLine, …) Amino Acid Seq (SWISS-PROT, …) 3D Molecular Structure (PDB, …) Nucleotide Seq (GenBank, EMBL, …) Biochemical Pathways (KEGG, WIT…) Molecular Classifications (SCOP, CATH,…) Motif Libraries (PROSITE, Blocks, …)

More genomes …... Arabidopsis thaliana mouse rat Caenorhabitis elegans Drosophila melanogaster Mycobacterium leprae Vibrio cholerae Plasmodium falciparum Mycobacterium tuberculosis Neisseria meningitidis Z2491 Helicobacter pylori Xylella fastidiosa Borrelia burgorferi Rickettsia prowazekii Bacillus subtilis Archaeoglobus fulgidus Campylobacter jejuni Aquifex aeolicus Thermotoga maritima Chlamydia pneumoniae Pseudomonas aeruginosa Ureaplasma urealyticum Buchnerasp. APS Escherichia coli Saccharomyces cerevisiae Yersinia pestis Salmonella enterica Thermoplasma acidophilum

Complexity of Biological Data Nucleotide sequences Nucleotide structures Gene expressions Protein Structures Protein functions Protein-protein interaction (pathways) Cell Cell signalling Tissues Organs PhysiologyOrganisms Populations + links to plant/crops, environmental, health, … information sources Fascinating scientific questions Why do mice, worms, humans… live longer if they eat less? How does the brain work? Why do we stop growing? …

Bioinformatics Grid Needs Taken from C. Goble myGrid presentation Workflow / Virtual Organisation Needs OGSA_DAI/DAIT, IBM Information Integrator, … Single sign on authentication, Granularity of authorisation National Data Curation Centre (GU,EU,UKOLN, CCLRC) BioInf community, Database schemas, … UDDI repositories, BioInf portals, … Grid engineering (scheduling, resource reservation, workflow enactment, …) WSDL descriptions, Semantic grid, …

Bio e-Science Projects

Overview of BRIDGES Biomedical Research Informatics Delivered by Grid Enabled Services (BRIDGES) NeSC (Edinburgh and Glasgow) and IBM Supporting project for CFG project Generating data on hypertension Rat, Mouse, Human genome databases Variety of tools used BLAST, BLAT, Gene Prediction, visualisation, … Variety of data sources and formats Microarray data, genome DBs, project partner research data, medical records, … Aim is integrated infrastructure supporting Data federation Security

Bridges Project Synteny Grid Service blast +

To sequence To multiple alignment To tabular summaries DRILL-DOWN FUNCTIONS Future tools available via Portal

Where we are today! Information Integrator DB repository established and populated … with public data sets … linking to relevant resources (ensembl…) GT3 based Grid services developed (BLAST, …) General usage of ScotGrid (solution being re-engineered with help from eDIKT - will include Condor pool) Initial portal developed using IBM WebSphere Genome visualisation browsers SyntenyVista – for viewing synteny between local/remote data sets MagnaVista – for exploring genetic information across multiple (remote) resources Gaining experience with security technologies Setting up policies with Grid security authorisation software etc Initial roll-out to CFG planned for 4 th June

Lessons learnt Public data resources openness Often cannot query directly Often not easy/possible to find schemas Joint Data Standards Study investigating this Starts on 1 st June and involves –Digital Archiving Consultancy –Bioinformatics Research Centre (Glasgow) –NeSC (Edinburgh and Glasgow) Look at technical, political, social, ethical etc issues involved in accessing and using public life science resources –Will liase with NDCC –Interview relevant scientists, data curators/providers 8 month project with final report in January –Funded by MRC, BBSRC, Wellcome Trust, JISC, NERC, DTI GT3 not without pain! Hopefully GT4 will be better?

Scottish Bioinformatics Research Network Four year proposal starting imminently Funded by Scottish Enterprise, Scottish Higher Education Funding Council, Scottish Executive Environment and Rural Affairs Department Involves Glasgow, Dundee, Edinburgh, Scottish Bioinformatics Forum Aim to provide bioinformatics infrastructure for Scottish health, agriculture and industry Infrastructure support at Dundee, Edinburgh and Glasgow to support first-rate research in bioinformatics at each academic institute Infrastructure support at three institutes, to support inter-institutional sharing of compute and data resources through application of Grid computing Outreach and training activities mediated by the Scottish Bioinformatics Forum

VOTES Plans to develop Grid infrastructure to address key components of clinical trial/observational study Recruitment of potentially eligible participants Data collection during the study Study administration and coordination Involves Glasgow, Oxford, Leicester, Nottingham, Manchester Hopefully to be funded in August 2004 by MRC

Summary NeSC Glasgow establishing itself as leading centre in Grid Security Authentication, authorisation, usability Data access and integration Working closely with NeSC Edinburgh (OGSA-DAI, DAIT, ELDAS) Education Developing Grid Computing courses in advanced MSc at Glasgow –DyVOSE project »Two year project started 1 st May »Grids & security to the masses! Life sciences focal point for NeSC Glasgow Close liaison with –Bioinformatics Research Centre (Prof David Gilbert) –Biostatistics (Prof Ian Ford) … others?