Presentation is loading. Please wait.

Presentation is loading. Please wait.

ISMB, June.26.2005. From protein sequences…to protein networks Database DNA and protein sequences Query Sequence GACTGCATTAC Family of homologous genes.

Similar presentations


Presentation on theme: "ISMB, June.26.2005. From protein sequences…to protein networks Database DNA and protein sequences Query Sequence GACTGCATTAC Family of homologous genes."— Presentation transcript:

1 ISMB, June.26.2005

2 From protein sequences…to protein networks Database DNA and protein sequences Query Sequence GACTGCATTAC Family of homologous genes Cellular response of interest Interaction pathways associated with cellular response Database / Scaffold of Molecular Interactions

3 www.cytoscape.org Cytoscape.org Cytoscape is a freely-available (open-source, java-based) bioinformatics software platform for visualizing biological networks (e.g. molecular interaction networks) and analyzing networks with gene expression profiles and other state data. Additional features are available as plugins. jActiveModules: identify significant “active” subnetworks Expression Correlation Network: cluster expression data Agilent Literature Search: build networks by extracting interactions from scientific literature. MCODE: finds clusters of highly interconnected regions in networks cPath: query, retrieve and visualize interactions from the MSKCC Cancer Pathway database BiNGO: determine which Gene Ontology (GO) categories are statistically over-represented in a set of genes Motif Finder: runs a Gibbs sampling motif detector on sequences for nodes in a Cytoscape network. CytoTalk: Interact with Cytoscape from Perl, Python, R, shell scripts or C or C++ programs. Core Features Customize network data display using visual styles Powerful graph layout tools Easily organize multiple networks Easily navigate large networks Filter the network Plugin API Input/Output Protein protein interactions from BIND, TRANSFAC databases Gene functional annotations from Gene Ontology (GO) and KEGG databases Biological models from Systems Biology Markup Language (SBML) cPath: Cancer Pathway database Proteomics Standards Initiative Molecular Interaction (PSI-MI) or Biopathway Exchange Language (BioPAX) formats Oracle Spatial Network data model

4 www.cytoscape.org Outline Introduction (5 min) Cytoscape as a network integration and query tool Basic features demo (15 min) Load network Navigate/Zoom/Select/Filter Nodes Create subnetworks Visual styles Layout Plugin demo (25 min) MCODE and BinGO Agilent Literature Search Plug-in cPATH Future work (5 min)

5 www.cytoscape.org cPath PlugIn cPath: Overview cPath: XML Web Service cPath Cytoscape PlugIn –Demo: Download sample protein-protein interaction network. –Demo: Drill down to protein details.

6 www.cytoscape.org

7 cPath: XML Web Services API Provides a URL-HTTP XML Web Services API to all cPath Data. Formats: –PSI-MI: Proteomics Standards Initiative Molecular Interaction Format –BioPAX: Biological Pathway Exchange Format Commands: –Query by keyword; query by interactor name; query by Pub Med ID, etc. Example Query: http://www.cbio.mskcc.org/cpath/webservice.do?version=1.0 &cmd=get_by_interactor_name_xref&q=P04273&format=psi _mi&startIndex=0&organism=&maxHits=10

8 www.cytoscape.org cPath Cytoscape PlugIn Enables Cytoscape users to easily query, download and visually render interactions stored in cPath. Utilizes the cPath XML Web Service Automatically bundled with Cytoscape 2.1 –Works out of the box Additional details available on the Cytoscape PlugIn home page: –http://cytoscape.org/plugins2.php

9 www.cytoscape.org cPath PlugIn Demo

10 www.cytoscape.org cPath PlugIn Demo

11 www.cytoscape.org Baker’s yeast (Saccharomyes cerevisiae) Nematode worm (Caenorhabditis elegans) http://www.pathblast.org FUTURE DIRECTIONS: Cross-comparison of networks (1) Alignment of networks across species (network conservation) (2) Correspondence between physical and genetic networks (3) Conserved regions in the presence vs. absence of stimulus Fruit fly (Drosophila melanogaster)

12 Network alignment with PathBLAST P is a path in the global alignment graph. The v and e represent vertices and edges in P. The value p(v) is the prob. of true homology for the proteins aligned at v. The value q(e) is the prob. that the protein interaction at e is real, i.e., not a false-positive.

13 Example yeast/worm/fly alignments Roded Sharan et al. PNAS 2005

14 Integration of genetic and physical interactions 160 between- pathway models 101 within- pathway models Num interactions: 1,102 genetic 933 physical Ryan Kelley et al. Nature Biotechnology 2005

15 A between-pathway model

16 www.cytoscape.org Upcoming Events Cytoscape Conference Nov 30 th and Dec 1 st, 2005 RECOMB Satellite Conference on Network Biology and Gene Regulation Dec 2 nd -4 th, 2005 Mailing lists –cytoscape-announce@googlegroups.com –cytoscape-discuss@googlegroups.com

17 www.cytoscape.org Cytoscape Team Trey Ideker Mark Anderson Nerius Landys Ryan Kelley Chris Workman Past contributors: Nada Amin Owen Ozier Jonathan Wang Benno Schwikowski Lee Hood Richard Bonneau Rowan Christmas Past contributors: Iliana Avila-Campillo Larissa Kamenkovich Andrew Markiel Paul Shannon Chris Sander Gary Bader Ethan Cerami Rob Sheridan Agilent Annette Adler Allan Kuchinsky Aditya Vailaya Mike Creech

18 www.cytoscape.org Funding Sources NIH (NIGMS) R01 GM070743-01 Program Manager: John Whitmarsh NCI caBIG Ken Buetow, Peter Kovitz Unilever, PLC Guy Werner PathBLAST network comparison NSF Quantitative Systems Biology Program Manager: Mitra Basu

19 www.cytoscape.org Extensible Architecture: 100% open source Java –Core + plugin API –Plugins are independently licensed –“Just need to write the algorithm” –Template code samples Plugin

20 www.cytoscape.org Layout 16 algorithms available through plugins Zooming, hide/show, alignment

21 yFiles Organic

22 yFiles Circular

23 www.cytoscape.org Visual Styles Map graph attributes to visual attributes Define visual styles for later use Graph has node and edge attributes E.g. expression data, interaction type, GO function Mapped to visual attributes E.g. node/edge size, shape, color, font… Take continuous gene expression data and visualize it as continuous node colors

24 Visual Styles Load “Your Favorite Network”

25 Visual Styles Load “Your Favorite Expression” Dataset

26 Visual Styles Map expression values to node colors using a continuous mapper

27 Visual Styles Expression data mapped to node colors

28 www.cytoscape.org Visual Styles Node attributes: node color, border color, border type, node shape, size, label, font Edge attributes: edge color, line types, arrows, label, font Multidimensional visual attribute mapping soon

29

30 www.cytoscape.org MCODE and Biomodules Plugins (MSKCC and ISB) Clusters in a protein-protein interaction network have been shown to represent protein complexes and parts of pathways Clusters in a protein similarity network represent protein families Network clustering is available through the MCODE Cytoscape plugin

31 Proteasome 26S Proteasome 20S Ribosome RNA Pol core RNA Splicing

32 Biomodules (ISB) Prinz S, Avila-Campillo I, Aldridge C, Srinivasan A, Dimitrov K, Siegel AF, and Galitski T Genome Res. 2004 14: 380-390

33 www.cytoscape.org Agilent Literature Search Plugin for Cytoscape Extract Nouns/Verbs (User Context/ BNS) Sentence Tokenization No Is Interesting Sentence? Yes Normalize Nouns (User Context/BNS) Classify Sentence Into Interaction Type Bind Cleave Inhibit Promote Catalyze Convert to ALFA Retrieved Documents Meta - Search TermsContext Query Get Document Output ALFA Network Query Interface Information Extraction Routine Output Cytoscape Network

34 www.cytoscape.org

35 Cytoscape Network produced by Literature Search. Abstract from the scientific literature Sentences for an edge

36 www.cytoscape.org Active Modules (UCSD) Ideker T, Ozier O, Schwikowski B, Siegel AF Bioinformatics. 2002;18 Suppl 1:S233-40

37 www.cytoscape.org Active Modules

38 www.cytoscape.org Biomodules (ISB) Prinz S, Avila-Campillo I, Aldridge C, Srinivasan A, Dimitrov K, Siegel AF, and Galitski T Genome Res. 2004 14: 380-390


Download ppt "ISMB, June.26.2005. From protein sequences…to protein networks Database DNA and protein sequences Query Sequence GACTGCATTAC Family of homologous genes."

Similar presentations


Ads by Google