Dr Richard Sinnott Dr Dave Berry 5 th February 2004 National e-Science Centre Local Developments Technical Director National e-Science Centre ||| Deputy.

Slides:



Advertisements
Similar presentations
SE Name SE Title Blackboard Training: Approaches and Opportunities.
Advertisements

Delivery of Industrial Strength Middleware Federated Strengths Agility & Coordination Prof. Malcolm Atkinson Director 21 st January 2004.
S.L.LloydATSE e-Science Visit April 2004Slide 1 GridPP – A UK Computing Grid for Particle Physics GridPP 19 UK Universities, CCLRC (RAL & Daresbury) and.
S.L.LloydGrid Steering Committee 8 March 2002 Slide 1 Status of GridPP Overview Financial Summary Recruitment Status EU DataGrid UK Grid Status GridPP.
Tony Doyle - University of Glasgow GridPP EDG - UK Contributions Architecture Testbed-1 Network Monitoring Certificates & Security Storage Element R-GMA.
Particle physics – the computing challenge CERN Large Hadron Collider –2007 –the worlds most powerful particle accelerator –10 petabytes (10 million billion.
The National Grid Service Mike Mineter.
Chinese Delegation visit Malcolm Atkinson Director 18 th November 2004.
Research Councils ICT Conference Welcome Malcolm Atkinson Director 17 th May 2004.
National e-Science Centre Glasgow e-Science Hub Opening: Remarks NeSCs Role Prof. Malcolm Atkinson Director 17 th September 2003.
Enterprise Java and Data Services Designing for Broadly Available Grid Data Access Services.
Open Grid Service Architecture - Data Access & Integration (OGSA-DAI) Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:+44.
NeSC: National e-Science Centre. NeSC Mission Help the UK develop international strength in Grid computing Industry, Commerce, Scientific Research, …
DyVOSE Status Report Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University.
The National Grid Service and OGSA-DAI Mike Mineter
Eldas 1.0 Enterprise Level Data Access Services Design Issues, Implementation and Future Development Davy Virdee.
E-Science Data Information and Knowledge Transformation Edikt : e-Science Data, Information and Knowledge Transformation NeSC Review, 30 September 2003.
Grid Engineering Experience & Biological Applications Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics.
NeSC Data Projects and Initiatives Dr. Dave Berry Research Manager.
SWITCH Visit to NeSC Malcolm Atkinson Director 5 th October 2004.
BRIDGES Status Report Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University.
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
Knowledge Exchange and Economic Benefit Dr Mark Parsons Commercial Director EPCC and NeSC.
E-Science Update Steve Gough, ITS 19 Feb e-Science large scale science increasingly carried out through distributed global collaborations enabled.
Introduction to NeSC: The Gateway to UK e-Science Dave Berry, Research Manager HEPix Meeting, May 2004.
An Overview of OGSA-DAI Kostas Tourlas
GEODE Workshop 16 th January 2007 Issues in e-Science Richard Sinnott University of Glasgow Ken Turner University of Stirling.
Requirements from astronomy in the Virtual Observatory era Bob Mann Institute for Astronomy & NeSC University of Edinburgh.
© HATII, University of Glasgow Introduction to the UK ’ s Digital Curation Centre Prof Seamus Ross Visiting Fellow at Oxford Internet Institute ,
UK e-Science Dave Berry, Research Manager National e-Science Centre e-Science: Computational Grid Infrastructure and Scientific & Engineering Applications.
Welcome e-Science in the UK Building Collaborative eResearch Environments Prof. Malcolm Atkinson Director 23 rd February 2004.
GridPP Tuesday, 23 September 2003 Tim Phillips. 2 Bristol e-Science Vision National scene Bristol e-Science Centre Issues & Challenges.
ScotGrid: a Prototype Tier-2 Centre – Steve Thorn, Edinburgh University SCOTGRID: A PROTOTYPE TIER-2 CENTRE Steve Thorn Authors: A. Earl, P. Clark, S.
A long tradition. e-science, Data Centres, and the Virtual Observatory why is e-science important ? what is the structure of the VO ? what then must we.
Extensible Framework for Data Access & Integration Malcolm Atkinson Director 10 th November 2004.
Astronomical data curation and the Wide-Field Astronomy Unit Bob Mann Wide-Field Astronomy Unit Institute for Astronomy School of Physics University of.
A DΙgital Library Infrastructure on Grid EΝabled Technology ETICS Usage in DILIGENT Pedro Andrade
The National e-Science Centre: Role and Activities Dave Berry, Research Manager Indo-UK Workshop on e-Science Delhi, February 2004.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Sys-Bio Talk, 24 th Feb 2005 Towards Grid-Based System Biology Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director (Technical)
ScotGRID:The Scottish LHC Computing Centre Summary of the ScotGRID Project Summary of the ScotGRID Project Phase2 of the ScotGRID Project Phase2 of the.
Dr Richard Sinnott Technical Director National e-Science Centre ||| Deputy Director Technical Bioinformatics Research Centre University of Glasgow
Edinburgh Investment in e-Science Infrastructure Dr Arthur Trew.
E-Science Data Information and Knowledge Transformation Edikt : e-Science Data, Information and Knowledge Transformation E-Science Centres of Excellence.
Future of e-Science Malcolm Atkinson Director 18 th March 2004.
UK e-Science AHM th September 2005 Comparison of Data Access and Integration Technologies in the Life Science Domain Dr Richard Sinnott Technical.
Usability Talk, 26 th January 2006 Development of Usable Grid Services for the Biomedical Community Prof Richard Sinnott Technical Director National e-Science.
Bioinformatics Core Facility Guglielmo Roma January 2011.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
AHM September 2004 Grid Services Supporting the Usage of Secure Federated, Distributed Biomedical Data Dr Richard Sinnott Technical Director National e-Science.
Tony Doyle - University of Glasgow 8 July 2005Collaboration Board Meeting GridPP Report Tony Doyle.
IBM & HSBC visit Malcolm Atkinson Director & e-Science Envoy UK National e-Science Centre & e-Science Institute 30 th March 2006.
Grid Glasgow Outline LHC Computing at a Glance Glasgow Starting Point LHC Computing Challenge CPU Intensive Applications Timeline ScotGRID.
OGSA-DAI & DAIT projects Update for TAG Prof. Malcolm Atkinson Director 30 th October 2003.
National e-Science Institute and National e-Science Centre The Way Ahead Prof. Malcolm Atkinson Director 30 th September 2003.
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
Chinese Delegation Visit High Performance Computer Mission UK e-Science & The National e-Science Centre Prof. Malcolm Atkinson Director
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
RC ICT Conference 17 May 2004 Research Councils ICT Conference The UK e-Science Programme David Wallace, Chair, e-Science Steering Committee.
Shibboleth Use at the National e-Science Centre Hub Glasgow at collaborating institutions in the Shibboleth federation depending.
The National e-Science Centre. eSI in Edinburgh NeSC Roles National: help coordinate and lead the UK e- Science Programme Community building activities,
Building Castles with Shifting Sands?
UK Grid: Moving from Research to Production
UK GridPP Tier-1/A Centre at CLRC
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
Grid Portal Services IeSE (the Integrated e-Science Environment)
The National Grid Service
Collaboration Board Meeting
Presentation transcript:

Dr Richard Sinnott Dr Dave Berry 5 th February 2004 National e-Science Centre Local Developments Technical Director National e-Science Centre ||| Deputy Director (Technical) Bioinformatics Research Centre University of Glasgow Research Manager National e-Science Centre University of Edinburgh

Overview NeSC Role in UK e-Science NeSC Edinburgh developments e-Science Institute Infrastructure/set-up Projects Plans NeSC Glasgow developments Infrastructure/set-up Projects Plans Conclusions

NeSCs Role Help coordinate and lead the UK e-Science Programme Community building activities, regional support & outreach Grid building as a member of the Engineering Task Force Skill building through training events & support centre Help establish the UKs international role International meetings, standardisation work & presentations Undertake R&D projects To deliver reliable middleware To engage industry To stimulate the uptake of e-Science technology and methods Run the e-Science Institute Knowledge building through workshops and conferences Research visitors and events

NeSC at Edinburgh: Recent Developments Globus Alliance Digital Curation Centre Edinburgh, Glasgow, UKOLN, CCLRC New e-Science Lecturer (Particle Physics) Training Team PPARC and EGEE funding Manager + 4 trainers Europe-wide role DAI Two (Extension of OGSA-DAI) OGSA Test Grid

Digital Curation Centre Industry research collaborators standards bodies testbeds & tools communities of practice: users communit y support & outreach research development services management & co- ordination curation organisations Collaborative Associates Network of Data Organisations

e-Science Institute A meeting place The focus for presenting UK e-Science Visiting researchers Collaborate in our research and development Engage in and develop our event programme Build bridges with their community Visits last between one week and six months Research-oriented event programme e-Science research topics Training to e-Science research teams

eSI Workshops Space for real work Crossing communities Creativity: new strategies and solutions Written reports Scientific Data Mining, Integration and Visualisation Grid Information Systems Portals and Portlets Virtual Observatory as a Data Grid Imaging, Medical Analysis and Grid Environments Open Issues in Grid Scheduling Data Provenance & Annotation e-Science Workflow Services GeoSciences & Scottish Bioinformatics Forum Suggestions always welcome!

Projects OGSA-DAI/DAIT, MS.NETGrid, SunDCG, GridWeaver, BRIDGES, PGPGrid, FirstDIG, ODD-Genes EGEE, NextGrid OGSA Test Grid, IBM Early Evaluation edikt Publishing Scientific Data GridPP, AstroGrid, QCDGrid, RealityGrid Portal Biological Spatio-Temporal Databases CoAKTinG, Grid-enabled Modelling Tools and Databases for Neuroinformatics, e-Diamond Dynamic Configuration of Grid Fabrics, Dependable Grid Services, Deductive Synthesis Techniques, Inferring QoS Properties for Grid Applications, Mobile Resource Guarantees TIES, TIES-II

The Virtual Observatory International Virtual Observatory Alliance UK, Australia, EU, China, Canada, Italy, Germany, Japan, Korea, US, Russia, France, India How to integrate many multi-TB collections of heterogeneous data distributed globally? Sociological and technological challenges to be met

Data Services GGF Data Access and Integration Svcs (DAIS) OGSI-compliant interfaces to access relational and XML databases Needs to be generalized to encompass other data sources (see next slide…) Generalized DAIS becomes the foundation for: Replication: Data located in multiple locations Federation: Composition of multiple sources Provenance: How was data generated?

1a. Request to Registry for sources of data about x 1b. Registry responds with Factory handle 2a. Request to Factory for access to database 2c. Factory returns handle of GDS to client 3a. Client queries GDS with XPath, SQL, etc 3b. GDS interacts with database 3c. Results of query returned to client as XML SOAP/HTTP service creation API interactions RegistryFactory 2b. Factory creates GridDataService to manage access Grid Data Service Client XML / Relationa l database Data Access & Integration Services

edikt The team: 8 professional software engineers, support staff, project manager, commercialisation manager, architect, and SAB SHEFC funded research and development grant 3 years funding: May 2002 – years funding upon successful project and review Standards Edikt project Requirements analysis Technology matchmaking Gap fillingRigorous engineering CS Research Grid Services for e-Science Data Management Commercial SW components and skills E-Science Apps

Java Framework ELDAS – Data Access Service Implemented using Enterprise Java Beans Data Access Components interface to distinct DBMSs Accessible as a grid data service or a web data service ELDAS DB2 DBMySQL DBXindice DB Web User1 Oracle 9i DB EJB - DAS DAC ELDAS runs anywhere Web Servlet Grid Proxy Grid User1Grid User2 Suitable for grid & web

e-Science Application Binary Data File BinX – accessing legacy binary data The Problem: Many binary data files Applications must know the data format Binary data formats are machine-specific BinX Library The Solution: Write a stand-aside format description in XML Provide a library to Interpret the description Provide file access across different machines Build higher-level services BinX file describes binary file structure simulations

NeSC at Glasgow E-Science Hub Externally Glasgow end of NeSC –Involved in UK wide activities »ETF: In May 2003 became first UK e-Science Centre to run integration tests across every site of the UK (Level 2) Grid. Therefore 100% access to UK Grid resources at this time –Public visibility of NeSC »responsible for NeSC web site Internally Focal point for e-Science research/activities at Glasgow Work closely with foundation departments –Department of Computing Science –Department of Physics & Astronomy Also working closely with other groups including –Bioinformatics Research Centre –Electronics and Electrical Engineering –Biostatistics, …

Glasgow e-Science Investment Major investment by university 230m 2 of newly renovated floor space in Kelvin Building offices access grid facility training room –equipped with 20PCs/server for training courses Funding Technical Director

Resource Consolidation at Glasgow Building around ScotGrid Providing shared Grid resource for wide variety of scientists inside/outside Glasgow Particle physicists, computer scientists, electronic engineers, bioinformaticians, … Focal point, knowledge pool, primary resource for e-Science activity at Glasgow Target shares –60% PP, 20% Bioinf, 20% open share… Hardware 59 IBM X Series 330 dual 1 GHz Pentium III with 2GB memory 2 IBM X Series 340 dual 1 GHz Pentium III with 2GB memory 3 IBM X Series 340 dual 1 GHz Pentium III with 2GB memory and Mbit/s ethernet 1TB disk LTO/Ultrium Tape Library Cisco ethernet switches IBM X Series 370 PIII Xeon with 32 x 512 MB RAM 5TB FastT500 disk 70 x 73.4 GB IBM FC Hot-Swap HDD eDIKT 28 IBM blades dual 2.4 GHz Xeon with 1.5GB memory eDIKT 6 IBM X Series 335 dual 2.4 GHz Xeon with 1.5GB memory CDF 10 Dell PowerEdge GHz Xeon with 1.5GB memory CDF 7.5TB Raid disk Shared Resources: Disk ~15TB CPU ~ 330 1GHz CDF LHC BIO

Projects with NeSC Glasgow Involvement DCC National Digital Curation Centre AMUSE Autonomous Management of Ubiquitous Systems for e-Health P2Popt Performance measurement & mgt of 2-Layer Peer to Peer NWs… PGPGrid Peppers Ghost Productions Equator Environmental e-Science Interdisciplinary Research Project BPS Biochemical Pathway Simulator BRIDGES

Overview of BRIDGES Biomedical Research Informatics Delivered by Grid Enabled Services (BRIDGES) NeSC (Edinburgh and Glasgow) and IBM 2 year project started 1 st October 2003 Supporting project for CFG project Generating data on hypertension Rat, Mouse, Human genome databases Variety of tools used BLAST, FASTA, MPsrch, BLAT, Gene Prediction, visualisation, … Variety of data sources and formats Microarray data, genome DBs, project partner research data, medical records, … Aim is integrated infrastructure supporting Data federation Security

Shared data CFG Partner Distribution Glasgow Edinburgh Leicester Oxford London Netherlands Public curated data Private data Private data Private data Private data Private data Private data

Problems specific to Bio- Community PDB Content Growth DBs growing exponentially!!! Biobliographic (MedLine, …) Amino Acid Seq (SWISS-PROT, …) 3D Molecular Structure (PDB, …) Nucleotide Seq (GenBank, EMBL, …) Biochemical Pathways (KEGG, WIT…) Molecular Classifications (SCOP, CATH,…) Motif Libraries (PROSITE, Blocks, …) …

More genomes …... Arabidopsis thaliana mouse rat Caenorhabitis elegans Drosophila melanogaster Mycobacterium leprae Vibrio cholerae Plasmodium falciparum Mycobacterium tuberculosis Neisseria meningitidis Z2491 Helicobacter pylori Xylella fastidiosa Borrelia burgorferi Rickettsia prowazekii Bacillus subtilis Archaeoglobus fulgidus Campylobacter jejuni Aquifex aeolicus Thermotoga maritima Chlamydia pneumoniae Pseudomonas aeruginosa Ureaplasma urealyticum Buchnerasp. APS Escherichia coli Saccharomyces cerevisiae Yersinia pestis Salmonella enterica Thermoplasma acidophilum

Complexity of Biological Data Nucleotide sequences Nucleotide structures Gene expressions Protein Structures Protein functions Protein-protein interaction (pathways) Cell Cell signalling Tissues Organs Physiology Organisms

BRIDGES Data Integration/Federation Local repository being developed Populated with data that cannot be federated e.g. public data sets with no programmatic interface Shared data sets of CFG scientists Security through X.509 PKI (authentication) PERMIS (authorisation) Will make use of e-Science technologies (OGSA- DAI/DAIT, ELDAS, IBMs DiscoveryLink) Automatically keep fresh/updated data Web (Grid) services offered that allow to make use of these local data sets For example for visualising, searching, querying, … Example usage scenario …

System Usage Scenario BRIDGES Portal Client Site X Secure access for CFG VO Shared/ Private Data Sets Personalised Services BLAST Smith W SV DL OGSA-DAI Authorisation Per user, per site Remote data in Oracle, DB2,Sybase, Excel, flat files, XML... Browser based clients… Java App downloaded (via WebStart) Push relevant data onto ScotGrid for BLASTing Secure Data Repository Up to date results input to DB wrappers Generic services used by other projects

Conclusions NeSC continues to provide leadership in UK e-Science Difficult with multitude of scientific research areas, heterogeneity of systems and fluidity of technologies, GT2, GT3, WSRF, GT4…? Closer working with GridPP beneficial for everyone move towards Production Grid ScotGrid a good model for co-operation Planning for soft landing through diversification and more integration into university MRC bids, BBSRC bids, EPSRC bids, … UK e-Science operating as community for upcoming DTI funding opportunities Plans for developing Grid Computing teaching modules as part of advanced MSc

Website National e-Science Centrehttp:// Mission, Background, Foundation Locations, Staff, Resources, Projects Register interest, Mailing lists, NeSCForge Regional associations and Collaborations News, Notices Presentations & Lectures e-Science Institute Mission, Events (Future and Past) Register for Events, Visitor Programme UK e-Science Map and Index of Centres Technical Papers Index of >100 Projects Task Forces General Information Glossary, Bibliography, Whos who E-Science job vacancies

Questions…?