Introduction to Systems Biology. Overview of the day Background & Introduction Network analysis methods Case studies Exercises.

Slides:



Advertisements
Similar presentations
Biological pathway and systems analysis An introduction.
Advertisements

The Yeast Two-Hybrid System Anne C. Luebke. What is the yeast two-hybrid system used for? n Identifies novel protein-protein interactions n Can identify.
Darwinian Genomics Csaba Pal Biological Research Center Szeged, Hungary.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae Article by Peter Uetz, et.al. Presented by Kerstin Obando.
Systems Biology Biological Sequence Analysis
Gene expression analysis summary Where are we now?
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Introduction to Systems Biology. Overview of the day Background & Introduction Network analysis methods Case studies Exercises.
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Functional genomics and inferring regulatory pathways with gene expression data.
Biological networks: Types and origin Protein-protein interactions, complexes, and network properties Thomas Skøt Jensen Center for Biological Sequence.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Protein domains vs. structure domains - an example.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Systems Biology Biological Sequence Analysis
Indiana University Bloomington, IN Junguk Hur Computational Omics Lab School of Informatics Differential location analysis A novel approach to detecting.
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Chip arrays and gene expression data. Motivation.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Systems Biology Biological Sequence Analysis
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Biological networks: Types and origin
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Genetics: From Genes to Genomes
Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Affinity chromatography/mass spec Bait protein GST Page 252.
Case study reveals transcription factor (TF) modules, dynamic TF binding and an expanded role for cell cycle regulators Mapping the DNA Damage Response.
Epistasis Analysis Using Microarrays Chris Workman.
Case study reveals transcription factor (TF) modules, dynamic TF binding and an expanded role for cell cycle regulators Mapping the DNA Damage Response.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Systems Biology The search for the syntax of biological information, that is, the study of the dynamic networks of interacting biological elements. The.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
BUDDING TECHNOLOGIES AND BUDDING YEAST 2012 HHMI Summer Workshop for High School Science Teachers.
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Does gene order matter? Cis-regulatory elements, proteins, and messengers are integrated into biological circuits. Does gene location in the genome affect.
Interactions and more interactions
Network Biology Presentation by: Ansuman sahoo 10th semester
Finish up array applications Move on to proteomics Protein microarrays.
Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:
Discovering Macromolecular Interactions. An experimental strategy for identifying new molecular actors in a process candidate approach general screen.
Proteome and interactome Bioinformatics.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Introduction to Systems Biology. Overview of the day Background & Introduction Network analysis methods Case studies Exercises.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Central dogma: the story of life RNA DNA Protein.
Introduction to biological molecular networks
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Biol 729 – Proteome Bioinformatics Dr M. J. Fisher - Protein: Protein Interactions.
1 Protein-Protein Interactions High-throughput strategy –Prediction from sequence In silico analysis –Protein A from species A: domain 1 and 2 –Protein.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Protein interactions: main methods for detection (all organisms) Two-hybrid8,446 (Co-)Immunoprecipitation567 Interaction adhesion assay225 In vitro binding138.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Schedule for the Afternoon
Principle of Epistasis Analysis
Presentation transcript:

Introduction to Systems Biology

Overview of the day Background & Introduction Network analysis methods Case studies Exercises

Why Systems Biology? …and why now?

Frederick Sanger In 1975, he developed the chain termination method of DNA sequencing, also known as the Dideoxy termination method or the Sanger method. Two years later he used his technique to successfully sequence the genome of the Phage Φ-X174; the first fully sequenced genome. This earned him a Nobel Prize in Chemistry (1980) (his second) –Sanger earned his first Nobel prize in Chemistry (1958) for determining the complete amino acid sequence of insulin in Concluded that insulin had a precise amino acid sequence.

The genomic era Human genome sequence “completed”, Feb 2001

PubMed abstracts indicate a recent interest in Systems Biology Human genome completed

Functional genomics Study of Genomes is called “Genomics” Genomics led to Functional Genomics which aims to characterize and determine the function of biomolecules (mainly proteins), often by the use of high-throughput technologies. Today, people talk about: –Genomics –Transcriptomics –Proteomics –Metabolomics –[Anything]omics

High-throughput applications of microarrays Gene expression De novo DNA sequencing (short) DNA re-sequencing (relative to reference) SNP analysis Competitive growth assays ChIP-chip (interaction data) Array CGH Whole genome tiling arrays

Tiling microarrays Huber W, et al., Bioinformatics 2006

Systematic phenotyping yfg1  yfg2  yfg3  CTAACTCTCGCGCATCATAAT Barcode (UPTAG): Deletion Strain: Growth 6hrs in minimal media (how many doublings?) Rich media … Harvest and label genomic DNA

Systematic phenotyping with a barcode array ( Ron Davis and others) These oligo barcodes are also spotted on a DNA microarray Growth time in minimal media: –Red: 0 hours –Green: 6 hours

Protein-DNA interactions: ChIP-chip Simon et al., Cell 2001 Lee et al., Science 2002

Mapping transcription factor binding sites Harbison C., Gordon B., et al. Nature 2004

Dynamic role of transcription factors Harbison C., Gordon B., et al. Nature 2004

Mass spectrometry Peptide identification Relative peptide levels Protein-protein interactions (complexes) Post-translational modifications Many many technologies

Comparing mRNA levels to protein levels

Protein-protein interaction data Physical Interactions –Yeast two hybrid screens –Affinity purification (mass spec) –Peptide arrays –Protein-DNA by chIP-chip Other measures of ‘association’ –Genetic interactions (double deletion mutants) –Genomic context (STRING)

Yeast two-hybrid method Y2H assays interactions in vivo. Uses property that transcription factors generally have separable transcriptional activation (AD) and DNA binding (DBD) domains. A functional transcription factor can be created if a separately expressed AD can be made to interact with a DBD. A protein ‘bait’ B is fused to a DBD and screened against a library of protein “preys”, each fused to a AD.

Issues with Y2H Strengths –High sensitivity (transient & permanent PPIs) –Takes place in vivo –Independent of endogenous expression Weaknesses: False positive interactions –Auto-activation –‘sticky’ prey –Detects “possible interactions” that may not take place under real physiological conditions –May identify indirect interactions (A-C-B) Weaknesses: False negatives interactions –Similar studies often reveal very different sets of interacting proteins (i.e. False negatives) –May miss PPIs that require other factors to be present (e.g. ligands, proteins, PTMs)

Exercise: Y2H Construct a protein-protein interaction network for proteins A,B,C,D

Systems biology and emerging properties

Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002

Building models from parts lists

Protein-DNA interactions Gene levels (up/down) Protein-protein interactions Protein levels (present/absent) Biochemical reactions Biochemical levels ▲ Chromatin IP ▼ DNA microarray ▲ Protein coIP ▼ Mass spectrometry ▲none Metabolic flux ▼ measurements

Mathematical abstraction of biochemistry

Metabolic models

“Genome scale” metabolic models Genes708 Metabolites584 –Cytosolic559 –Mitochondrial164 –Extracellular121 Reactions1175 –Cytosolic702 –Mitochondrial124 –Exchange fluxes349 Forster et al. Genome Research 2003.

One framework for Systems Biology 1.The components. Discover all of the genes in the genome and the subset of genes, proteins, and other small molecules constituting the pathway of interest. If possible, define an initial model of the molecular interactions governing pathway function (how?). 2.Pathway perturbation. Perturb each pathway component through a series of genetic or environmental manipulations. Detect and quantify the corresponding global cellular response to each perturbation.

One framework for Systems Biology 3.Model Reconciliation. Integrate the observed mRNA and protein responses with the current, pathway- specific model and with the global network of protein- protein, protein-DNA, and other known physical interactions. 4.Model verification/expansion. Formulate new hypotheses to explain observations not predicted by the model. Design additional perturbation experiments to test these and iteratively repeat steps (2), (3), and (4).

From model to experiment and back again

Systems biology paradigm Aebersold R, Mann M., Nature, 2003.

Continuum of modeling approaches Top-downBottom-up

Need computational tools able to distill pathways of interest from large molecular interaction databases (top-down) Data integration and statistical mining

List of genes implicated in an experiment What do we make of such a result? Jelinsky S & Samson LD, Proc. Natl. Acad. Sci. USA Vol. 96, pp. 1486–1491,1999

Types of information to integrate Data that determine the network (nodes and edges) –protein-protein –protein-DNA, etc… Data that determine the state of the system –mRNA expression data –Protein modifications –Protein levels –Growth phenotype –Dynamics over time

Mapping the phenotypic data to the network Begley TJ, Rosenbach AS, Ideker T, Samson LD. Damage recovery pathways in Saccharomyces cerevisiae revealed by genomic phenotyping and interactome mapping. Mol Cancer Res Dec;1(2): Systematic phenotyping of 1615 gene knockout strains in yeast Evaluation of growth of each strain in the presence of MMS (and other DNA damaging agents) Screening against a network of 12,232 protein interactions

Mapping the phenotypic data to the network Begley TJ, Rosenbach AS, Ideker T, Samson LD. Damage recovery pathways in Saccharomyces cerevisiae revealed by genomic phenotyping and interactome mapping. Mol Cancer Res Dec;1(2):

Mapping the phenotypic data to the network Begley TJ, Rosenbach AS, Ideker T, Samson LD. Damage recovery pathways in Saccharomyces cerevisiae revealed by genomic phenotyping and interactome mapping. Mol Cancer Res Dec;1(2):

Network models can be predictive Green nodes represent proteins identified as being required for MMS resistance; gray nodes were not tested as part of the 1615 strains used in this study; blue lines represent protein-protein interactions. The untested gene deletion strains (ylr423c, hda1, and hpr5) were subsequently tested for MMS sensitivity; all were found to be sensitive (bottom). Begley TJ, Rosenbach AS, Ideker T, Samson LD. Damage recovery pathways in Saccharomyces cerevisiae revealed by genomic phenotyping and interactome mapping. Mol Cancer Res Dec;1(2):

Summary Systems biology can be either top-down or bottom-up We are now in the post genomic era (don’t ignore that) Systematic measurements of all transcripts, proteins, and protein interactions enable top- down modeling Metabolic models, built bottom-up, are being refined with genomic information Data – Model – Predictions – Data: cycle as a Systems Biology theme