Reactome a pathways knowledgebase Imre Vastrik EMBL-European Bioinformatics Institute 6/10/2005
The Plan Why? How? What does it look like/what can you do with it?
From data to knowledge Decrease in computational access
Insulin binds the insulin receptor, causing it to dimerise. The dimerised form the autophosphorylates on 6 cytoplasmic tyrosines. This phosphorylated form recruits the IRS adaptor....
Decrease in computational access …and exhaustion
Why? How? What does it look like/what can you do with it?
History of Reactome Started as Genome Knowledgebase in spring Aim: capture the knowledge of biological experts in a form that could be searched and reasoned over electronically, and which could act as a connecting link between sequence records and primary biomedical literature. Initially tried to capture and standardise the language used to describe molecular processes. 2001/2002 realised that what we are trying to capture are reactions and pathways. Rebranded as Reactome June 2004.
plasma membrane [GO: ] Cytosol [GO: ] extracellular region [GO: ] Reactome data model Insulin Insulin receptor Insulin Insulin receptor PP Insulin IRS Insulin Insulin receptor PP ATP x12 ADP x12
Reactome data model UniProt:P01308 UniProt:P06213 PMID: PMID: PMID: PMID: PMID: ChEBI:2359ChEBI:2342 IRS-1IRS-2 DOK1 UniProt :Q9Y4H2UniProt :Q99704UniProt :P35568 plasma membrane [GO: ] Insulin Insulin receptor Insulin Insulin receptor Insulin Insulin receptor PP PP Insulin IRS ATP x12 ADP x12 Cytosol [GO: ] extracellular region [GO: ] transmembrane receptor protein tyrosine kinase activity [GO: ]
plasma membrane [GO: ] Reactome data model Insulin Insulin receptor Insulin Insulin receptor Insulin Insulin receptor PP PP Insulin IRS IRS-1IRS-2 DOK1 UniProt :Q9Y4H2UniProt :Q99704UniProt :P35568 UniProt:P01308 UniProt:P06213 ATP x12 ADP x12 ChEBI:2359ChEBI:2342 transmembrane receptor protein tyrosine kinase activity [GO: ] PMID: PMID: PMID: PMID: PMID: Cytosol [GO: ] extracellular region [GO: ] Insulin signalling
Ambiguity of connection maps… AB C D ++ + Do you need A & B or just A | B to get active C?
…is avoided by using states and reactions A C C’ C’’ B D D’ A C C’’ B D D’ A & BA | B
About mice and men… humanmouserathuman PMID:5555PMID:4444PMID:8976PMID:3924
… and how not to mix them human PMID:5555PMID:4444 mouse rat Direct evidence Indirect evidence PMID:8976 PMID:3924
Two FAQs What about tissue specific reactions? –We annotate to the union of all possible reactions: gene expression data gives the set of reactions feasible in a cell What about fine dynamic balances? –We only capture qualitative information. The quantitative/model aspects has to be handled by ODEs/Kds and SBML like techniques. We can link to these resources, but they are out of scope for the moment
Reviewer (external) Curator (staff) Expert (external)
Release cycle Repository Release DB Extract finished & reviewed topics Computationally project pathways to other organisms Add cross-references (Ensembl, Entrez Gene, MIM, KEGG,…)
Reactome in numbers (release 15, 26/9/2005) Human: Reactions1524 Pathways659 Proteins1095 “Small molecules”379 Complexes982 Literature references1408 Interactions19471
Why? How? What does it look like/what can you do with it?
HSA MMU ANA BSU ECO SSO MJA PFA DDI ATH ANI SPO SCE CEL DME TNI Homo sapiens Schizosaccharomyces pombe Mus musculus Tetraodon nigroviridis Drosophila melanogaster Caenorhabditis elegans Saccharomyces cerevisiae Aspergillus nidulans Arabidopsis thaliana Dictyostelium discoideum Plasmodium falciparum Methanococcus jannaschii Sulpholobus solfataricus Escherichia coli Bacillus subtilis Anabaena
Human Species 1 Species 2 Rules for orthology-based inference 75% of a complex must have orthologs Lineage specific paralogs are allowed All small molecules presumed to exist if reactions exist Otherwise every input, output, catalyst must be present
HSA MMU ANA BSU ECO SSO MJA PFA DDI ATH ANI SPO SCE CEL DME TNI Finding lineage-specific deletions ?++ ? ? + ?
Lineage-specific deletion rates
Absent in cerevisiae and pombe, but present in aspergillus Lipid metabolism Xenobiotic metabolism Metabolism of amino acids Nucleotide metabolism (transport)
Lineage Deletion rates Trp Catabolism Head or Tail DNA Repair Redundant Paths Insulin Signalling Pathway modules
Presence of “small molecules”
Tissue expression Data from Human Novartis Affy scan more correlated
Reactome at a glance Catalogue of all possible reactions (topology) in an organism - reactome Authored by experts Currently human orientated Computational predictions to other species Data & code freely available ( –MySQL database, SBML, BioPAX + specialised datasets –Perl and Java APIs –Website mirror –Data entry tool
Cold Spring Harbor LaboratoryEuropean Bioinformatics InstituteGene Ontology Consortium Lincoln Stein Peter D'Eustachio Lisa Matthews Gopal Gopinath Marc Gillespie Guanming Wu Elizabeth Nickerson Marcela Tello-Ruiz Geeta Joshi-Tope Ewan Birney Imre Vastrik Esther Schmidt Bijay Jassal Bernard de Bono David Croft Suzanna Lewis Groups & People NHGRI Grant # R01 HG EU STREP EMI-CD EBI Industry program www: