Reconstructing Gene Networks Presented by Andrew Darling Based on article  “Research Towards Reconstruction of Gene Networks from Expression Data by Supervised.

Slides:



Advertisements
Similar presentations
Bayesian network for gene regulatory network construction
Advertisements

Slides from: Doug Gray, David Poole
An Intro To Systems Biology: Design Principles of Biological Circuits Uri Alon Presented by: Sharon Harel.
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
Decision Tree Approach in Data Mining
Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Part I Introduction to Data Mining by Tan,
DYNAMICS OF RANDOM BOOLEAN NETWORKS James F. Lynch Clarkson University.
A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Decision Tree Algorithm
Inference in Probabilistic Ontologies with Attributive Concept Descriptions and Nominals Rodrigo Bellizia Polastro and Fabio Gagliardi Cozman.
Final Project: Project 9 Part 1: Neural Networks Part 2: Overview of Classifiers Aparna S. Varde April 28, 2005 CS539: Machine Learning Course Instructor:
Yeast Dataset Analysis Hongli Li Final Project Computer Science Department UMASS Lowell.
Radosław Wesołowski Tomasz Pękalski, Michal Borkowicz, Maciej Kopaczyński
Sai Moturu. Introduction Current approaches to microarray data analysis –Analysis of experimental data followed by a posterior process where biological.
Inference of Transcriptional Regulation Network with Gene Expression Data Andrew Kwon.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
6. Gene Regulatory Networks
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Bayes Net Perspectives on Causation and Causal Inference
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Next Generation Techniques: Trees, Network and Rules
DATA MINING : CLASSIFICATION. Classification : Definition  Classification is a supervised learning.  Uses training sets which has correct answers (class.
Cis-regulation Trans-regulation 5 Objective: pathway reconstruction.
INTRODUCTION TO MACHINE LEARNING. $1,000,000 Machine Learning  Learn models from data  Three main types of learning :  Supervised learning  Unsupervised.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Inductive learning Simplest form: learn a function from examples
Graph and Topological Structure Mining on Scientific Articles Fan Wang, Ruoming Jin, Gagan Agrawal and Helen Piontkivska The Ohio State University The.
Microarrays to Functional Genomics: Generation of Transcriptional Networks from Microarray experiments Joshua Stender December 3, 2002 Department of Biochemistry.
GATree: Genetically Evolved Decision Trees 전자전기컴퓨터공학과 데이터베이스 연구실 G 김태종.
Treatment Learning: Implementation and Application Ying Hu Electrical & Computer Engineering University of British Columbia.
A COMPREHENSIVE GENE REGULATORY NETWORK FOR THE DIAUXIC SHIFT IN SACCHAROMYCES CEREVISIAE GEISTLINGER, L., CSABA, G., DIRMEIER, S., KÜFFNER, R., AND ZIMMER,
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Lecture 5: Writing the Project Documentation Part III.
Decision Trees. Decision trees Decision trees are powerful and popular tools for classification and prediction. The attractiveness of decision trees is.
Learning Linear Causal Models Oksana Kohutyuk ComS 673 Spring 2005 Department of Computer Science Iowa State University.
A Biology Primer Part IV: Gene networks and systems biology Vasileios Hatzivassiloglou University of Texas at Dallas.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Slides for “Data Mining” by I. H. Witten and E. Frank.
Decision Trees Binary output – easily extendible to multiple output classes. Takes a set of attributes for a given situation or object and outputs a yes/no.
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Introduction to biological molecular networks
Support Vector Machines and Gene Function Prediction Brown et al PNAS. CS 466 Saurabh Sinha.
Biclustering of Expression Data by Yizong Cheng and Geoge M. Church Presented by Bojun Yan March 25, 2004.
Getting the story – biological model based on microarray data Once the differentially expressed genes are identified (sometimes hundreds of them), we need.
DECISION TREE Ge Song. Introduction ■ Decision Tree: is a supervised learning algorithm used for classification or regression. ■ Decision Tree Graph:
Evaluation of gene-expression clustering via mutual information distance measure Ido Priness, Oded Maimon and Irad Ben-Gal BMC Bioinformatics, 2007.
Decision Tree Learning Presented by Ping Zhang Nov. 26th, 2007.
Classification using Decision Trees 1.Data Mining and Information 2.Data Mining and Machine Learning Techniques 3.Decision trees and C5 4.Applications.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
1 Applications of Symbolic Logic to Gene Regulation Systems Department of Computer Science and Information Engineering of National Chung-Cheng University.
Inference of Gene Relations from Microarray Data by Abduction Irene Papatheodorou & Marek Sergot Imperial College, London UK.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Yiming Kang, Hien-haw Liow, Ezekiel Maier, & Michael Brent
Building and Analyzing Genome-Wide Gene Disruption Networks
Presented by Laur Tooming Bioinformatics journal club 28 Oct 2009
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Presentation transcript:

Reconstructing Gene Networks Presented by Andrew Darling Based on article  “Research Towards Reconstruction of Gene Networks from Expression Data by Supervised Learning” - Soinov, Krestyaninova, Brazma

Outline Why study another microarray algorithm? Background info Methods Results Discussion Conclusion

Why study another microarray algorithm? Study of microarray data continues  Still unclear on what the data means  Still unclear on how the genome works Confirm existing knowledge about gene networks using existing datasets Proof of concept in a new algorithm using existing knowledge and datasets This algorithm actually explains its reasoning

Background information What is a gene network? What is supervised learning? What are decision trees / classifiers? Why use classifiers?

What is a gene network? A model of a genes affecting other genes  What other genes affect a given gene  How other genes affect a given gene Positive, negative, complicated Several model types – graphs, nodes, edges  Boolean ( on – off )  Bayesian network ( conditional probability )  Differential equations ( derivatives, integrals )

Gene network - example

What is supervised learning? The paper was unclear on the subject  Perhaps a reference to the type of algorithm used  It may have involved human interaction with the software  Possibly, the software produced the classifiers in the form of a decision tree, then users interpreted the output into classification rules

What are decision trees / classifiers? Acyclic directed graph - tree  Each graph explains what other genes affect a specific gene  Inner nodes are gene products of other genes  Edges are thresholds of concentration of the gene products of the other genes – rules of the tree  Leaf nodes are effects on transcription of the specific gene Each graph is a classifier for a specific gene

Classifiers – model of gene networks Expression of gene is function of transcription Transcription of gene is in discrete states  Expressed more than average  Expressed less than average Transcription state affected by amount of other gene products (expression of other genes) Use yeast cell cycle data to test algorithm and previous knowledge to judge accuracy

Why use classifiers? The products affecting a specific gene are listed in the tree Allows for continuous values for concentrations Each additional dataset refines the decision information Decision trees are easy to read and interpret

Classifier - example

Methods Use induction algorithm to generate decision trees  Program called C4.5 Apply program three ways  Regulation of target gene as a function of other genes at same time (simultaneous)  Regulation of target gene as a function of other genes at previous times (time delay)  Regulation as a function of change of other genes (changes)

Results - given These genes and yeast datasets  Spellman, Cho, … Cdc28 Alpha-factor

Results – produced this

Results – with this accuracy

Discussion Some concern about the accuracy between 70% and 94% on systems with known interactions Does that imply that the microarray data is wrong or the algorithm is flawed?

Conclusions Decision trees and classifiers seem a better way to explain gene expression This paper did not do a good job of explaining how to make / use them Reference to the algorithm itself was almost specious