Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
Transcription Networks And The Cell’s Functional Organization Presenter: Roni Sharf.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
2 March, 2005 Chapter 12 Mutational dissection Normal gene Altered gene with altered phenotype mutagenesis.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
Biological networks: Types and origin Protein-protein interactions, complexes, and network properties Thomas Skøt Jensen Center for Biological Sequence.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Global topological properties of biological networks.
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Chip arrays and gene expression data. Motivation.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Bacterial Physiology (Micr430)
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Fuzzy K means.
Biological networks: Types and origin
ANALYSIS OF GENETIC NETWORKS USING ATTRIBUTED GRAPH MATCHING.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Epistasis Analysis Using Microarrays Chris Workman.
25 and 27 February, 2004 Chapter 6C Proteomics Structural and Functional Characterization in the Post- genomic era.
Protein Classification A comparison of function inference techniques.
MCB 7200: Molecular Biology
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 5 A Story of Transcription.
Proteomics Understanding Proteins in the Postgenomic Era.
Analysis of microarray data
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
with an emphasis on DNA microarrays
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Network Analysis and Application Yao Fu
Research Methodology of Biotechnology: Protein-Protein Interactions
Biological Pathways & Networks
(D) Crosslinking Interacting proteins can be identified by crosslinking. A labeled crosslinker is added to protein X in vitro and the cell lysate is added.
Presentation by: Kyle Borge, David Byon, & Jim Hall
Networks and Interactions Boo Virk v1.0.
Network Biology Presentation by: Ansuman sahoo 10th semester
Finish up array applications Move on to proteomics Protein microarrays.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:
Proteome and interactome Bioinformatics.
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Network visualization –Cytoscape Cool.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Protein Microarrays and detection via MALDI-TOF: Project Overview Kyle Nordquist Advisor: Dr. Andrew Link.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Control of Gene Expression. Ways to study protein function by manipulating gene expression Mutations –Naturally occurring, including human and animal.
Two powerful transgenic techniques Addition of genes by nuclear injection Addition of genes by nuclear injection Foreign DNA injected into pronucleus of.
Biol 729 – Proteome Bioinformatics Dr M. J. Fisher - Protein: Protein Interactions.
Proteome and Gene Expression Analysis Chapter 15 & 16.
1 Protein-Protein Interactions High-throughput strategy –Prediction from sequence In silico analysis –Protein A from species A: domain 1 and 2 –Protein.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Biological networks CS 5263 Bioinformatics.
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Presentation transcript:

Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput data amenable to network analysis 2.Network theory and its relationship to biology 1

Physical Interactions: protein-protein interactions Data from: 1. Large-scale yeast-two hybrid assay: recovers binary (1:1) interactions 2. Protein immunoprecipitation & mass-spec identification: recovers complexes Giorgini & Muchowski Gen. Biol PEPTIDE TAG mass spectrometry to identify recovered proteins 3. Literature curation 2

Protein Arrays Proteins or antibodies immobilized onto a solid surface Antibody arrays: for identification & quantification of fluorescently labeled proteins in complex mixtures … proteins bind to immobilized Ab. Functional arrays: for measuring protein function * ppi: detect binding of fluorescent protein to immobilized peptides/proteins * kinase targets: detect phosphorylation of immobilized peptides/proteins by query kinase * ligand binding: detect DNA/carbohydrate/small molecule bound to immobilized proteins Reverse-phase arrays (lysate arrays): cells lysed in situ and immobilized cell lysate is screened 3

From Hall, Ptacek, & Snyder review 2007 Challenges: 1. Large-scale protein purification 2. Protein structure/stability requirements vary widely (unlike DNA) 3. Conditions for protein function vary widely 4. Protein epitope/binding domain must be displayed properly 4

High-throughput identification of gene/protein function: Functional Genomics Gene knock-out libraries: library of single-gene deletions for every gene done in yeast, E. coli, other fungi/bacteria S. cerevisiae libraries: heterozygous deletion (nonessential genes) OR homozygous deletion of all genes. * Each gene replaced with a short, unique ‘barcode’ sequence Strains can be phenotyped individually (screening) OR Selected for particular phenotypes – Strains surviving the selection can be readout on DNA arrays designed against the barcode sequences 5

Yeast deletion library used to: a) Identify ‘essential’ yeast genes and genes required for normal growth a) Genes required for survival of particular conditions/drugs b) Features of functional genomics, gene networks, etc * Screened deletion libraries for >700 conditions * Found ‘phenotype’ for nearly all yeast genes * Characterized which genes could be functionally profiled by which assays (e.g. phenotype, gene expression, etc) 6

Challenges: 1. Difficult to probe ‘essential’ and slow-growing strains 2. Cells likely to pick up secondary mutations to complement missing gene (chromosomal anueploidy in yeast) 7

High-throughput identification of gene/protein function: Functional Genomics RNAi knock-down libraries (C. elegans, flies, humans) Image from David Shapiro Small double-stranded DNAs complementary to mRNA can be injected (or fed) … … these are targeted by the RNAi pathway to inhibit mRNA stability/translation of target gene … knocks down protein abundance/function Challenges: 1. Doesn’t work for all genes/ds DNAs 2. Doesn’t work in all tissues 3. Delay in protein decrease, timing different for different proteins 8

9

Insights from whole-genome knockout / knockdown studies * Screens for genes important for specific phenotypes/processes * Identifying off-target drug effects * Clustering of genes based on common phenotypes from knockdowns * Clustering/analysis of phenotypes with similar underlying genetics/processes * Integrative analysis with genomic expression, etc * Network analysis 10

Network structures Random network: Each node has roughly equal number of connections k, distributed according to Poisson distribution Scale-free network: Some nodes with few connections, other nodes (‘hubs’) with many connections (distributed according to Power Law) Directed vs. Undirected Graphs 11

Network Terminology Connectivity (degree) k: number of connections of a given node (average degree of all nodes ) Degree distribution: probability that a selected node has k connections Shortest path l: fewest number of links connecting two given nodes (average shortest path between all node pairs) Clustering coefficient: # of links connecting the k neighbors of Node X together 12

Scale-free Networks Connectivity: most nodes have few connections but joined by ‘hub’ nodes with many connections 13 ‘Small world’ effect: each node can be connected to any other node through relatively few connections ‘Disassortative’: hubs tend NOT to directly connect to one another ‘Robust’: network structure remains despite node removal (up to 80% removal!) ‘Hub vulnerability’: network structure is particularly reliant on few nodes (hubs)

14 Networks Challenges 1.Identifying relevant subnetworks 2.Integrating multiple data types (see #1 above) 3.Capturing temporary interactions and dynamic relationships 4.Using network structure/subnetworks to infer new insights about biology Networks Challenges 1.Infer hypothetical functions based on network connectivity 2.Reveal new connections between functional groups and complexes 3.Identify motifs and understand motif behaviors (more next time)

A gazillion plugins for Cytoscape …