The ultimate complex system: networks in molecular biology A. W. Schreiber Australian Centre for Plant Functional Genomics Waite Campus, University of.

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

Topology and Dynamics of Complex Networks FRES1010 Complex Adaptive Systems Eileen Kraemer Fall 2005.
Network biology Wang Jie Shanghai Institutes of Biological Sciences.
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Biotechnology - Using an organism to make a product, …or using advanced methods to study an organism GMO - Genetically Modified Organism Transgenic - describing.
1 Harvard Medical School Mapping Transcription Mechanisms from Multimodal Genomic Data Hsun-Hsien Chang, Michael McGeachie, and Marco F. Ramoni Children.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
6 Mark Tester Australian Centre for Plant Functional Genomics University of Adelaide Research developments in genetically modified grains.
Genetics and Health Jennifer Eyvindson Epi 6181 November 2006.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Signal Processing in Single Cells Tony 03/30/2005.
1 Genetics The Study of Biological Information. 2 Chapter Outline DNA molecules encode the biological information fundamental to all life forms DNA molecules.
Systems Biology Biological Sequence Analysis
Gene expression analysis summary Where are we now?
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Experimental and computational assessment of conditionally essential genes in E. coli Chao WANG, Oct
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Network Motifs: simple Building Blocks of Complex Networks R. Milo et. al. Science 298, 824 (2002) Y. Lahini.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Genetics: From Genes to Genomes
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Unit 1: The Language of Science  communicate and apply scientific information extracted from various sources (3.B)  evaluate models according to their.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
PREVIEW 1 ST SIX WEEKS – 5 WEEKS LONG 2 ND SIX WEEKS – 5 WEEKS LONG 3 RD SIX WEEKS – 6 WEEKS LONG 2 WEEKS OF TESTING SEMESTER ENDS BEFORE CHRISTMAS.
Compare and contrast prokaryotic and eukaryotic cells.[BIO.4A] October 2014Secondary Science - Biology.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
Paper prepared for presentation at the 16 th ICABR Conference – 128 th EAAE Seminar “The Political Economy of the Bioeconomy: Biotechnology and Biofuel”
PattArAn – From Annotation Triplets to Sentence Fingerprints Motivation Motivation  Scientific concepts are annotated with controlled vocabulary (CV)
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
GTL User Facilities Facility IV: Analysis and Modeling of Cellular Systems Jim K. Fredrickson.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Genomes To Life Biology for 21 st Century A Joint Initiative of the Office of Advanced Scientific Computing Research and Office of Biological and Environmental.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Central dogma: the story of life RNA DNA Protein.
Mechanisms for Diversity and Genetics Big Idea #3 In conjunction with Big Idea #2.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
© 2011 Pearson Education, Inc. Lectures by Stephanie Scher Pandolfi BIOLOGICAL SCIENCE FOURTH EDITION SCOTT FREEMAN 17 Control of Gene Expression in Bacteria.
Increase in complexity in evolution (questions, answers, research programme) Eörs Szathmáry Collegium BudapestEötvös University Budapest.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
FROM GENE TO PROTEIN: TRANSLATION & MUTATIONS Chapter
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Biology Test Content ETS Major Field Test – 150 Questions.
Structures of Networks
Regulation of Gene Expression
Dept of Biomedical Informatics University of Pittsburgh
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Schedule for the Afternoon
Summary of the Standards of Learning
Biology in the 21st Century
The Study of Biological Information
Interactome Networks and Human Disease
Presentation transcript:

The ultimate complex system: networks in molecular biology A. W. Schreiber Australian Centre for Plant Functional Genomics Waite Campus, University of Adelaide Achievements and new directions in Subatomic Physics: Workshop in Honour of Tony Thomass 60 th birthday February 2010

First operational: 2003 Mission: to improve abiotic stress tolerance in cereal crops (salinity, drought, nutrient deficiency etc.) > 100 scientists O(M$10)/annum

Agricultural scenes, tomb of Nakht, 18 th dynasty, Thebes Source: Wikimedia commons Like physics, improving stress tolerance of crops is one of humanitys most ancient pursuits! Plant breeding, 5500 BC The Plant Accelerator Plant breeding, 20 th century High throughput technologies Genetics Molecular Biology Plant breeding, 21 st century

Internet encyclopedia of science At the heart of it all: the molecular cell

Gene expression Proteins Metabolites Protein degradation Metabolic reactions Complex formation, protein-protein interactions Transcriptional regulation Signalling hormones, ligands,extracellular metabolites Post- transcriptional regulation Post- translational regulation cell nucleus extra-cellular space RNA ncRNA Transcription factors Genes DNA

Genes Proteins Metabolites Protein degradation Metabolic reactions Complex formation, protein-protein interactions Transcriptional regulation Signalling hormones, ligands,extracellular metabolites Post- transcriptional regulation Post- translational regulation cell nucleus extra-cellular space RNA ncRNA Transcription factors Gene expression Post- transcriptional regulation Proteins Transcriptional regulation Gene expression

Regulatory network of genes involved in the transition to flowering J.J.B.Keurentjes et al, Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loc, PNAS 2007, 104, 1708 Gene regulatory networks Gene Regulator Positive regulation inhibition (directed graph)

Genes Gene expression Proteins Metabolites Protein degradation Metabolic reactions Complex formation, protein-protein interactions Transcriptional regulation Signalling hormones, ligands,extracellular metabolites Post- transcriptional regulation Post- translational regulation cell nucleus extra-cellular space RNA ncRNA Transcription factors Complex formation, protein-protein interactions

Albert, R. J Cell Sci 2005;118: C. elegans protein interaction network Protein-protein interaction network Protein interaction, e.g. binding (undirected graph)

Genes Gene expression Proteins Metabolites Protein degradation Metabolic reactions Complex formation, protein-protein interactions Transcriptional regulation Signalling hormones, ligands,extracellular metabolites Post- transcriptional regulation Post- translational regulation cell nucleus extra-cellular space RNA ncRNA Transcription factors Metabolic reactions

Metabolic networks: represent metabolism as directed graphs taken from KEGG Pathway database Nodes: Compounds Edges: Enzymes Links to other pathway maps e.g.

Genes Gene expression Proteins Metabolites Protein degradation Metabolic reactions Complex formation, protein-protein interactions Transcriptional regulation Signalling hormones, ligands,extracellular metabolites Post- transcriptional regulation Post- translational regulation cell nucleus extra-cellular space RNA ncRNA Transcription factors Gene expression

Gene co-expression network Transcriptional response to drought stress Gene High correlation of expression patterns (undirected graph) Modularity discovery of function

Genes Gene expression Proteins Metabolites Protein degradation Metabolic reactions Complex formation, protein-protein interactions Transcriptional regulation Signalling hormones, ligands,extracellular metabolites Post- transcriptional regulation Post- translational regulation cell nucleus extra-cellular space RNA ncRNA Transcription factors

Why are networks so important in biology? 1)Molecular biology, like high energy physics, is all about about parts (genes, proteins, metabolites,...) and how they interact: 2)Classification of network structures, definition of functional modules, etc. are part of the effort to move away from the one gene-one function paradigm 3)High-throughput data is becoming prevalent. How does one interpret this data? How does one generate hypotheses? There is a need to formalize analysis techniques 4) Scale-free networks The search for more suitable d.o.f.s Tools Genomic eraGenes, Proteins: sequences of letters (e.g. A,T,C,G) String comparison, computational linguistics, informatics Post-genomic eraInteractions: links, networks Graph & network theory

Metabolomic networks are scale-free (as well as the WWW, transportation system, food-webs, social and sexual networks, citation networks, protein-protein interaction networks, transcriptional regulatory networks, co-expression networks) Barabasi et al, Nature 2000 Number of metabolites 6 archaea, 32 bacteria, 5 eukaryotes Degree distribution Universality:

Natures normal abhorrence of power laws is suspended when the system is forced to undergo a phase transition. Then power laws emergenatures unmistakable sign that chaos is departing in favor of order. The theory of phase transitions told us loud and clear that the road from disorder to order is maintained by the powerful forces of self-organization and is paved by power laws. It told us that power laws are the patent signatures of self-organization in complex systems. Barabasi 2002 The new science of networks The proposed significance of scale-free-ness: This interpretation is a little controversial, but universality of power-law (or at least power-law-like) behaviour is less so: The first law of genomics Slonimski 1998

How do these networks arise in molecular biology? Gene duplication point mutations: under selective pressure, slow (e.g. cystic fibrosis, sickle-cell anaemia) gene duplications and deletions: under more limited selective pressure The most important factor in evolution (Ohno, 1967) (e.g. α- and β- globin arose from globin) The fundamental process is evolution: inheritable changes coupled with a selection process (survival of the fittest) Inheritable changes are: To understand biological network structure, one should study gene duplications

Gene duplications (cont): give rise to (gene) copy number variations among individuals – a hot topic at present! CNV and human disease (compilation taken from Cohen, Science 07)

Gene duplications (cont): give rise to gene families: Somerville, Plant Phys The CesA superfamily

Cluster ( gene family) size distribution

In the absence of selective pressure (i.e. neutral model of evolution), the evolution of gene family sizes is amenable to modelling: gene duplications gene loss gene innovation branching of existent families Departures from model predictions can indicate presence of selective pressure These models predict functional form of family size distributions e.g. f(i) with = duplication rate/(loss rate + branching rate) i /i Wojtowicz and Tiuryn, J. Comp. Biology (2007)

Summary Networks are the natural language to use for understanding molecular biology on a system-wide scale. They are complex ubiquitous interdependent evolving Concepts from network theory provide both conceptual insights (e.g. spontaneous emergence of order in living systems, higher-level degrees of freedom) practical tools (e.g. discovery of gene function through modules in co-expression networks) We are only at the very beginning of understanding biological networks we only have a very incomplete parts list network integration is needed both spatial and temporal aspects are largely neglected Where is the rich phenomenology so familiar from statistical physics? (e.g. collective degrees of freedom, phase transitions)