Network biology An introduction to STRING and Cytoscape

Slides:



Advertisements
Similar presentations
STRING Prediction of protein networks through integration of diverse large-scale data sets Lars Juhl Jensen EMBL Heidelberg.
Advertisements

BiNoM, a Cytoscape plugin for accessing and analyzing pathways using standard systems biology formats Eric Bonnet Computational Systems Biology of Cancer.
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Pathways analysis Iowa State Workshop 11 June 2009.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
The STRING database Michael Kuhn EMBL Heidelberg.
STRING Modeling of biological systems through cross-species data integration.
Mining text and data on chemicals Lars Juhl Jensen.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Mining the Medical Literature Chirag Bhatt October 14 th, 2004.
The STRING Database What it does and how it interfaces to other resources The STRING Database What it does and how it interfaces to other resources Christian.
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
Thomas Lemberger Chief Editor, Molecular Systems Biology Deputy Head, Scientific Publications, EMBO Publishing actionable data.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Department of Biomedical Informatics Biomedical Data Visualization Kun Huang Department of Biomedical Informatics OSUCCC Biomedical Informatics Shared.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Introduction to Information Retrieval CS 5604: Information Storage and Retrieval ProjCINETViz by Maksudul Alam, S M Arifuzzaman, and Md Hasanuzzaman Bhuiyan.
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Copyright OpenHelix. No use or reproduction without express written consent1.
Networks and Interactions Boo Virk v1.0.
Creating Metabolic Network Models using Text Mining and Expert Knowledge J.A. Dickerson, D. Berleant, Z. Cox, W. Qi, and E. Wurtele Iowa State University.
Tutorial session 2 Network annotation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Lars Juhl Jensen Biomedical text mining. exponential growth.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
Copyright OpenHelix. No use or reproduction without express written consent1.
GeWorkbench Highlights caBIG ® Molecular Analysis Tools Knowledge Center AACR Annual Meeting, April 3, 2011.
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang.
Tutorial session 1 Network generation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
PPI team Progress Report PPI team, IDB Lab. Sangwon Yoo, Hoyoung Jeong, Taewhi Lee Mar 2006.
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
A curated database of biological pathways.
A collaborative tool for sequence annotation. Contact:
Introduction to biological molecular networks
GO based data analysis Iowa State Workshop 11 June 2009.
Network construction and exploration using CORNET and Cytoscape - Excercises SPICY WORKSHOP Wageningen, March 8 th 2012 Stefanie De Bodt.
Web mining is the use of data mining techniques to automatically discover and extract information from Web documents/services
Open access – making the most of biomedical literature mining Lars Juhl Jensen EMBL Heidelberg.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
GPML Plugin for Cytoscape Thomas Kelder Maastricht University
Lab Interactions and Ontologies LAB CBW Bioinformatics Workshop February 23 th 2006, Toronto Christopher Hogue Blueprint Initiative.
Computational Biology Signaling networks and drug repositioning Lars Juhl Jensen.
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
Networks and Interactions
Biological Databases By: Komal Arora.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Volume 19, Issue 5, Pages (May 2017)
Data-intensive Computing: Case Study Area 1: Bioinformatics
Protein association networks with STRING
STRING Large-scale data and text mining
Tutorial 12 Biological networks.
Pathway Visualization
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
STRING Protein networks from data and text mining
PIR: Protein Information Resource
Batyr Charyyev.
A Call for Systematic Research on Solute Carriers
Anastasia Baryshnikova  Cell Systems 
Visualization of Content Information in Networks using GlyphNet
The Omics Dashboard.
Pathway Visualization
Volume 19, Issue 5, Pages (May 2017)
Figure 1. PaintOmics 3 workflow diagram
Integrative omic approaches for the study of host–pathogen interactions Integrative omic approaches for the study of host–pathogen interactions (A) Proteomic.
Analyzing Massive Graphs - ParT I
Presentation transcript:

Network biology An introduction to STRING and Cytoscape Lars Juhl Jensen jensenlab.org

core concepts

STRING

Cytoscape

stringApp

core concepts

nodes / vertices

things to be connected

proteins

diseases

edges

connections between things

undirected

directed

weighted

topology

degree

centrality

clustering coefficient

robustness

protein networks

physical interactions

functional associations

guilt by association

STRING

5090 genomes

24.6 million proteins

functional associations

string-db.org Szklarczyk et al., Nucleic Acids Res., 2017

heavily used

4000 users daily

easy to use

works well

Huang et al., Cell Syst., 2018

data integration

genomic context

gene fusion

Korbel et al., Nat. Biotechnol., 2004

gene neighborhood

Korbel et al., Nat. Biotechnol., 2004

phylogenetic profiles

Korbel et al., Nat. Biotechnol., 2004

experimental data

gene coexpression

physical interactions

von Mering et al., Nucleic Acids Res., 2005

curated knowledge

pathways

Letunic & Bork, Trends Biochem. Sci., 2008

many databases

different formats

different identifiers

varying quality

not comparable

not same species

hard work

parsers

mapping files

quality scores

von Mering et al., Nucleic Acids Res., 2005

score calibration

gold standard

von Mering et al., Nucleic Acids Res., 2005

common identifiers

common scale

orthology transfer

Franceschini et al., Nucleic Acids Res., 2013

cross-species integration

missing most of the data

>10 km

too much to read

computer

as smart as a dog

teach it specific tricks

named entity recognition

dictionary

synonyms

cyclin dependent kinase 1

CDC2

orthographic variation

cyclin dependent kinase 1

cyclin-dependent kinase 1

CDC2

hCDC2

black list

SDS

a

co-mentioning

counting

documents

paragraphs

sentences

weighted count

quality score

score calibration

orthology transfer

combine all evidence

Cytoscape

network tool

analysis

visualization

not a database

user interface

networks

tables

visual styles

load/save sessions

import network

local file

public database

import table

property mappings

passthrough mapping

discrete mapping

continuous mapping

default & bypass

selection filters

select by attributes

layout algorithms

clustering

app store

stringApp

STRING  Cytoscape

protein query

disease query

PubMed query

visualize your own data

proteomics experiment

Emdal et al., Science Signaling, 2015

protein query

STRING network

import table

user data

new node attributes

color by log-ratio

Szklarczyk et al., Nucleic Acids Research, 2017

term enrichment analysis

select terms of interest

visualize on network

easy example

typical omics dataset

100–1000 proteins

1000–10000 interactions

Doncheva et al., J. Proteome Res., 2018

ridiculogram

clustering

functional modules

intra-cluster interactions

re-layout network

Doncheva et al., J. Proteome Res., 2018

compare diseases

disease query

merged STRING network

ridiculogram

clustering

color by diseases

highlight drug targets

summarize literature

topic of interest

PubMed query

retrieve abstracts

enriched proteins

STRING network

color by literature support

highlight physical interactions

visualize enriched terms

summary

networks

nodes & edges

useful abstraction

lends itself to visualization

STRING

protein network database

functional associations

curated knowledge

experimental data

genomic context

text mining

unified identifiers

scored

transferred

web interface

Cytoscape

network analysis

network visualization