The my Grid Information Model Nick Sharman, Nedim Alpdemir, Justin Ferris, Mark Greenwood, Peter Li, Chris Wroe AHM2004, 1 September
The my Grid Information Model Outline my Grid in context Science & e-Science The information model Next steps Conclusions
Third-party tools Utopia Haystack (IBM) LSID Launchpad (IBM) my Grid information model The my Grid Information Model my Grid in Context External Services Applications Web portals Taverna e-Science workbench Legacy apps Web Services OGSA-DAI databases Websites Core Services Service & workflow discovery Feta semantic discovery View federated UDDI+ Workflow enactment Freefluo workflow engine Metadata Management RDF-based Metadata store Provenance capture tool my Grid ontology Notification service LSID support Data Management my Grid information repository AMBIT text extraction service Soaplab Gowlab OGSA-DAI DQP service Web Service (Grid Service) communication fabric
The my Grid Information Model Science and e-Science The scientific process: 1.Observe and describe phenomena and study existing knowledge 2.Formulate a hypothesis to explain the phenomena 3.From the hypothesis, predict other phenomena 4.Develop and perform repeatable experiments that test the predictions. E-science parallels: 1.Search online repositories, with workflows and queries 2.Create domain ontologies to express hypotheses and … 3.… predictions 4.Workflows and queries can be preserved, shared, re-enacted
The my Grid Information Model Aspects of the model Based on the CCLRC Scientific Metadata Model (Matthews & Sufi) Programmes, studies & experiments People & organizations Data types Provenance metadata Annotation & argumentation
The my Grid Information Model Programmes, studies & experiments
The my Grid Information Model Provenance metadata
AC Homo sapiens BAC clone CTA-315H11 from 7, complete sequence AC Homo sapiens BAC clone RP11-622P13 from 7, complete sequence AL Human DNA sequence from clone RP11-553N16 on chromosome 1, complete sequence AL Homo sapiens chromosome 21 segment HS21C AL Human chromosome 14 DNA sequence BAC R-775G15 of library RPCI-11 from chromosome 14 of Homo sapiens (Human), complete sequence BX Homo sapiens mRNA; cDNA DKFZp686G08119 (from clone DKFZp686G08119) AC Homo sapiens 12q22 BAC RPCI11-256L6 (Roswell Park Cancer Institute Human BAC Library) complete sequence AK Homo sapiens cDNA FLJ45040 fis, clone BRAWH AC Homo sapiens chromosome 17, clone RP11-104J23, complete sequence AL Human DNA sequence from clone RP4-715N11 on chromosome 20q Contains two putative novel genes, ESTs, STSs and GSSs, complete sequence AC Homo sapiens BAC clone RP11-731I19 from 2, complete sequence AC Homo sapiens chromosome 15, clone RP11-342M21, complete sequence AL Human DNA sequence from clone RP11-461K13 on chromosome 10, complete sequence AC Homo sapiens PAC clone RP3-368G6 from X, complete sequence AC Homo sapiens chromosome 4 clone B200N5 map 4q25, complete sequence AF Homo sapiens chromosome 21q22.3 PAC 171F15, complete sequence >gi| |gb|AC | Homo sapiens BAC clone CTA-315H11 from 7, complete sequence AAGCTTTTCTGGCACTGTTTCCTTCTT CCTGATAACCAGAGAAGGAAAAGATC TCCATTTTACAGATGAG GAAACAGGCTCAGAGAGGTCAAGGCT CTGGCTCAAGGTCACACAGCCTGGGA ACGGCAAAGCTGATATTC AAACCCAAGCATCTTGGCTCCAAAGC CCTGGTTTCTGTTCCCACTACTGTCAG TGACCTTGGCAAGCCCT GTCCTCCTCCGGGCTTCACTCTGCAC ACCTGTAACCTGGGGTTAAATGGGCT CACCTGGACTGTTGAGCG urn:lsid:taverna:datathing:15..BLAST_Report rdf:type urn:lsid:taverna:datathing:13..similar_sequences_to.. nucleotide_sequence rdf:type service invocation..created_by workflow invocation workflow definition experiment definition project person group service description organisation..described_by..run_during..invocation_of..part_of..works_for..part_of..author..run_for..masked_sequence_of..filtered_version_of The my Grid Information Model Annotation & argumentation
The my Grid Information Model Next Steps: modelling e-science events Third-party tools Utopia Haystack (IBM) LSID Launchpad (IBM) my Grid information model Applications Core Services External Services Service & workflow discovery Feta semantic discovery View federated UDDI+ Web portals Taverna e-Science workbench Workflow enactment Freefluo workflow engine Metadata Management RDF-based Metadata store Provenance capture tool my Grid ontology Soaplab Gowlab AMBIT text extraction service Legacy apps Web Services OGSA-DAI databases Websites OGSA-DAI DQP service e-Science coordination e-Science Mediator e-Science process patterns e-Science events LSID support Data Management my Grid information repository Web Service (Grid Service) communication fabric Notification service
The my Grid Information Model Conclusions Builds on existing work –CCLRC Scientific Metadata Model –LSID: gives access via third party tools –Semantic web Persistent types almost implemented Transient types in progress Needs validating –Early versions already doing useful work
The my Grid Information Model The my Grid team Matthew Addis, Nedim Alpdemir, Pinar Alper, Rich Cawley, Neil Davis, Vijay Dialani, Stefan Egglestone, Alvaro Fernandes, Justin Ferris, Rob Gaizauskas, Kevin Glover, Carole Goble (director), Chris Greenhalgh, Mark Greenwood, Yikun Guo, Ananth Krishna, Peter Li, Xiaojian Liu, Phil Lord, Darren Marvin, Karon Mee, Simon Miles, Luc Moreau, Arijit Mukherjee, Tom Oinn, Juri Papay, Norman Paton, Terry Payne, Steve Pettifer, Milena Radenkovic, Peter Rice, Angus Roberts, Alan Robinson, Martin Senger, Nick Sharman, Robert Stevens, Victor Tan, Paul Watson, Anil Wipat, Chris Wroe & Jun Zhao.