Paper prepared for presentation at the 16 th ICABR Conference – 128 th EAAE Seminar “The Political Economy of the Bioeconomy: Biotechnology and Biofuel”

Slides:



Advertisements
Similar presentations
Chapter 2 The Process of Experimentation
Advertisements

Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute.
The Robert Gordon University School of Engineering Dr. Mohamed Amish
Complex Networks Advanced Computer Networks: Part1.
Research Methodology For reader assistance, have an introductory paragraph in which attention is given to the organization of the section in relation to.
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Threshold selection in gene co- expression networks using spectral graph theory techniques Andy D Perkins*,Michael A Langston BMC Bioinformatics 1.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Genomic analysis of regulatory network dynamics reveals large topological changes Paper Study Speaker: Cai Chunhui Sep 21, 2004.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Global topological properties of biological networks.
Predicting protein functions from redundancies in large-scale protein interaction networks Speaker: Chun-hui CAI
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Triangulation of network metaphors The Royal Netherlands Academy of Arts and Sciences Iina Hellsten & Andrea Scharnhorst Networked Research and Digital.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Fig Theory construction. A good theory will generate a host of testable hypotheses. In a typical study, only one or a few of these hypotheses can.
Abstract: In recent years, advances in genetic engineering and techniques of molecular biology have enabled the creation and commercial release of “Genetically.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
The importance of enzymes and their occurrences: from the perspective of a network W.C. Liu 1, W.H. Lin 1, S.T. Yang 1, F. Jordan 2 and A.J. Davis 3, M.J.
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Networks and Interactions Boo Virk v1.0.
CSD 5100 Introduction to Research Methods in CSD Observation and Data Collection in CSD Research Strategies Measurement Issues.
WELNS 670: Wellness Research Design Chapter 5: Planning Your Research Design.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Paola Gramatica, Elena Bonfanti, Manuela Pavan and Federica Consolaro QSAR Research Unit, Department of Structural and Functional Biology, University of.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Gene Expression Networks Esra Erdin CS 790g Fall 2010.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Enriching Assessment of the Core Albert Oosterhof, Faranak Rohani, & Penny J. Gilmer Florida State University Center for Advancement of Learning and Assessment.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Scale-free and Hierarchical Structures in Complex Networks L. Barabasi, Z. Dezso, E. Ravasz, S.H. Yook and Z. Oltvai Presented by Arzucan Özgür.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Graph clustering to detect network modules
EQTLs.
Biological networks CS 5263 Bioinformatics.
AN INTRODUCTION TO EDUCATIONAL RESEARCH.
Gene-set analysis Danielle Posthuma & Christiaan de Leeuw
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Building and Analyzing Genome-Wide Gene Disruption Networks
Modelling Structure and Function in Complex Networks
Biological Science Applications in Agriculture
Interactome Networks and Human Disease
Presentation transcript:

Paper prepared for presentation at the 16 th ICABR Conference – 128 th EAAE Seminar “The Political Economy of the Bioeconomy: Biotechnology and Biofuel” Ravello, Italy, June 24-27, 2012 Pasquale Lucio Scandizzo, Alessandra Imperiali

Introduction Literature and review Data and variables Empirical analysis Concluding remarks Outline 25/06/12

Investigate the characteristics of genetic networks. Account for the role of desirable traits for modern agriculture. Explore the implication of a model, where the desirable traits depend not only on the properties of the individual genes, but also on their connections and the architecture of the network Aim of this paper 25/06/12 Introduction

Is biotechnology research in agriculture making substantial progress along new lines? Are there new paradigms for research in the sector? What is the role of network theory ? Does it promise to yield revolutionary results? Key Questions 25/06/12 Introduction

Part 1 Consistent with empirical research on large networks, our findings point to a scale-free relationship between the number of links among genes and the number of genes, with a significant and proportional or more than proportional increase in the percentage of links in response to a percentage increase in nodes inside each co-expression network. This relationship can be interpreted as the result of information exchanges, i.e. as a relationship between the information contained and the information exchanged by the genes. Part 2 Our findings point to a positive, but less than proportional, scale free relationship, between the number of co-expressed genes and the number of genes inside each coexpression network. Results’ overview 25/06/12 Introduction

An overview of a Rice coexpression network with 4,495 genes and 32,544 edges (Pearson’s correlation r ≥ 0.93) 25/06/12 Literature and review Source: A.Fukushima et al., “Characterizing gene coexpression modules in Oryza sativa based on a graph-clustering approach”, 2009.

From multiplicity of characters to co-expression 25/06/12 Literature and review In the past 15 years, an intense research activity has been directed towards biological Networks, where the Network Theory finds its natural application. These studies have produced remarkable progress in understanding the topological and chemical structures of the genes, and promise to make spectacular improvements to agricultural crops. Today genes can be modified and recombined into the cells of living organisms to improve crop productivity or to make crops more resistant to stress, diseases and chemical treatments (Steven D. Tanksley and Susan R. McCouch, 1997). This new technique is known as recombinant DNA technology, and has allowed scientists to carry out procedures using genes and DNA that are extremely advanced and innovative.

From multiplicity of characters to co-expression 25/06/12 Literature and review Recombinant DNA (rDNA) consists of DNA sequences resulting from laboratory methods that bring together genetic material from multiple sources. Scientists succeded in isolating genes responsible for main adaptive and improvement traits and were able to determine their chemical structure, together with their function. This acquired knowledge was then used to develop the potential of our wild and cultivated germ-plasm resources for improving agricultural crops. After spending decades to disassemble nature, and having provided a wealth of knowledge about the individual components, scientists developed a theory of complexity where nothing happens in isolation and most of the characteristics of living being derive from the interactions among their constituents.

From multiplicity of characters to co-expression 25/06/12 Literature and review Understanding and unraveling the interactions and the orchestrated activity of many interacting components constitutes a major goal for biologists of the genome era. The network approaches are used to integrate various types of genomics data in order to increase the reliability of predicted interactions. One increasingly important method used to identify interacting gene sets is represented by the construction of gene co-expression networks where traits are the result of cooperative expression (co-expression) of genes, organized according to the topology of networks. Co-expression networks includes genes involved in related biological pathways, which are expressed cooperatively for their functions. It is constructed by determining the tendency of m transcripts to exhibit similar expression patterns across a set of n microarrays.(P. Ficklin and F. Alex Feltus, 2011)

Gene co-expression network 25/06/12 Literature and review In gene co-expression networks, each gene corresponds to a node. Two genes are connected by an edge if their expression values are highly correlated. Definition of “high” correlation is somewhat tricky – One can use statistical significance… – Alternatively, one can use a threshold parameter: scale free topology criterion.

From multiplicity of characters to co-expression 25/06/12 Literature and review By exploring several large databases describing the topology of large networks, Albert and Barabasi (1999) found that the degree distribution follows a power- law for large k: Where K stands for the average degree of a node i, which represents the number of edges incident with the node. P stands for the probability that a node chosen uniformly at random has degree k. The value of the exponent varies between 2 and 3. Following this approach, the literature indicates that the intricate interwoven relationships that govern cellular functions follow a universal law. They are “scale-free, modular, hierarchical, small worlds of short paths and their connections are highly clustered” (Albert-Laszlo Barabasi and Zoltan N. Oltvai, 2004).

Distribution of connections per node in the coexpression network 25/06/12 Literature and review Source: V. van Noort et al, “The yeast coexpression network has a small- world, scale-free architecture and can be explained by a simple model, 2004.”

Meta Analysis 25/06/12 Data and variables In the paper we discuss the recombinant DNA techniques and their application to agricultural crops, focusing our attention on the regulatory mechanism in gene interactions. We aim to study the interactions underlying expressed traits, using three crops species: Arabidopsis thaliana, maize (Zea mays) and rice (Oryza sativa) We selected 57 studies aimed to identify the gene co-expression networks among these crops by examining the co-expression patterns of genes over a large number of experimental conditions. We collected both the data and the results presented by the studies on 101 networks.

Estimates 25/06/12 Empirical analysis Aim: Identify the correspondence between the underlying genes and the observed traits. We analyzed the role of the number of genes on two different characteristics: The number of edges (L) and the number of coexpressed genes (C). Using the data assembled from the studies, we estimated two different relations by means of ordinary least squares (OLS):

Variables used for our estimates 25/06/12 Empirical analysis

The Estimates 25/06/12 Empirical analysis Dep. Variable Number of Links (1)(2)(3) Nodes1.13*** (7.65) 0.99*** (7.35) 1.37*** (6.26) Dummy Arabidopsis0.99* (1.75) -1.73* (-1.83) Dummy Biosynthesis Dummy Metabolic Pathways Dummy Photosynthesis2.30** (3.43) 1.57* (1.82) Dummy Seed Development-0.41 (-0.40) Dummy Signaling Pathways1.48** (2.23) Dummy Stress Response0.16 (-1.27) N41 19 R-squared

The Estimates 25/06/12 Empirical analysis Dep. Variable: N. of coexpressed genes (1)(2)(3) Nodes0.60*** (5.44) 0.79*** (10.72) 0.65*** (5.82) Dummy Arabidopsis0.33 (0.85) Dummy Biosynthesis-0.83* (-1.73) Dummy Metabolic Pathways-0.75** (-2.17) -0.53** (-2.81) Dummy Photosynthesis-0.30 (-0.60) Dummy Seed Development Dummy Signaling Pathways-1.24*** ( Dummy Stress Response-1.25*** (-3.35) -0.90*** (-3.42) -0.83* (-1.73) N R-squared

Conclusion 25/06/12 Concluding remarks Our Analysis confirms the existence of a scale-free relationship which has been found ubiquitous in complex networks. We found also that as the number of genes increase inside the biological networks considered, the number of co-expressed genes increase, less than proportionally. This finding suggests that a strategy of research aimed to identify relevant clusters of co- expression may be more successful than one aimed at identifying single traits or groups of traits and corresponding gene determinants. Althout this conclusion seems to hold for stress response, metabolic pathways and biosynthesis, which have a lower influence on the number of edges, the intensity of seed development appears to be associated instead with an increase in the connectivity of the network.