Putting genetic interactions in context through a global modular decomposition Jamal.



Motivation Genetic interactions provide a powerful perspective on how genes function, but the specific mechanisms that give rise to these interactions are not well understood. This requires a thorough study of genetic interaction networks in order to understand the structure of the network.

This study This study uses a data-mining approach to explore all block structures within this network.

Characteristics Genetic interaction: “Multiple genetic perturbations whose combination result in a phenotype that is unexpected given the phenotypes of the individual perturbations.” The redundancies and dependencies within a genetic network can provide a powerful means of functional characterization.

Unlike in the PPI network, there is no obvious functional interpretation of a single genetic interaction, whether negative or positive. A genetic interaction between two genes does not imply that they interact physically; it simply suggests that they share some kind of functional relationship.

Modular hypothesis Genes belong to different types of functional modules, for example protein complexes, pathways, etc.

Negative between-pathway model Defines negative interactions, which are thought to arise between functionally redundant pathways, such that deleting any pair of genes spanning the two pathways results in a significant reduction of fitness.

Positive within-pathway model Defines positive interactions, which arise when one deletion has already compromised a pathway and a second deletion in that same pathway does not result in any additional fitness defect.

Bi-clusters as block patterns in the network A bi-cluster consists of two sets of genes, which can be overlapping or disjoint. Every gene in one set is connected to every gene in the other set.
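The bi-cluster definition above can be checked directly on an edge list. The following is a minimal sketch, not code from the presentation; the function name `is_bicluster` and the gene labels in the usage note are hypothetical.

```python
def is_bicluster(edges, set_a, set_b):
    """Return True if every gene in set_a interacts with every gene in set_b.

    edges: iterable of (gene, gene) pairs, treated as undirected.
    set_a, set_b: the two gene sets; they may overlap.
    """
    # Store each undirected edge as a frozenset so (a, b) matches (b, a).
    edge_set = {frozenset(e) for e in edges}
    return all(
        frozenset((a, b)) in edge_set
        for a in set_a
        for b in set_b
        if a != b  # a gene shared by both sets is not paired with itself
    )
```

For example, with edges g1-g3, g1-g4, g2-g3 and g2-g4, the sets {g1, g2} and {g3, g4} form a bi-cluster, while {g1, g2} and {g3, g5} do not.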

Pu et al. (2008) designed an algorithm that starts from a random initial bi-cluster and then rediscovers the prominent bi-clusters many times. In this study, the authors instead employed an approach based on an algorithm from the field of association rule mining to find all biclusters of sufficient size.

Approach summary: bi-cluster discovery Recent data from Costanzo et al. (2010) was used in this study. The approach uses the Apriori algorithm from the field of association rule mining to discover all biclusters. Biclusters that can be explained by the degree distribution alone were filtered out using a non-parametric statistical assessment.

XMOD This approach, XMOD (eXhaustive Modular Discovery), is guaranteed to find all bi-partite graphs in which one part of the bi-partite graph acts as a functional unit.

Presence of degree-distribution-based bi-clusters Edges were randomized and bi-partite graphs were still obtained, suggesting that biologically meaningless bi-partite graphs can exist. A score is therefore computed for each bi-cluster; it is lower for biologically meaningful bi-clusters. Score: “the product of probabilities of each edge occurring independently conditioned on the degree of two interacting genes”
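As a rough illustration of such a score, the sketch below multiplies per-edge probabilities in log space. The slide does not say how the probability of an edge given the two degrees is computed, so the sketch assumes the common configuration-model approximation p(u, v) = min(1, d_u * d_v / 2m), where m is the total number of edges. That formula and the function name `independence_log_score` are assumptions, not the authors' definition.

```python
import math


def independence_log_score(degrees, num_edges, set_a, set_b):
    """Log of the product of per-edge probabilities for one bi-cluster.

    degrees: dict mapping gene -> number of interactions (must be > 0).
    num_edges: total number of edges in the network.
    ASSUMPTION: p(u, v) = min(1, d_u * d_v / (2 * num_edges)); the
    presentation does not state the exact probability model.
    """
    log_score = 0.0
    for a in set_a:
        for b in set_b:
            if a == b:
                continue  # no self-edges
            p = min(1.0, degrees[a] * degrees[b] / (2.0 * num_edges))
            log_score += math.log(p)  # sum of logs avoids underflow
    return log_score
```

Under this assumption, a bi-cluster among low-degree genes gets a very low score, because it is unlikely to arise from the degree distribution alone, whereas a block among hub genes scores higher.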

Filtered biclusters: a cutoff on the independence score is applied to separate out the biclusters with lower independence scores. Condensed biclusters: the biclusters that remain after removing those with >40% overlap.

Comparison with other techniques

Dataset The dataset from Costanzo et al. (2010) was used: 85,714 negative interactions and 35,858 positive interactions.

Association rule mining The Apriori algorithm (Agrawal, 1993) was used, through a standard implementation available from a website. Apriori was run on a binary set of positive interactions and, separately, on a binary set of negative interactions.
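To show how frequent-itemset mining yields bi-clusters, here is a small level-wise Apriori sketch in pure Python. It is not the implementation used in the study. It assumes each gene's set of interaction partners is treated as one transaction, so that a frequent itemset (one gene set) together with the genes supporting it (the other gene set) forms a bi-cluster. The function name and the parameters `min_support` and `min_items` are hypothetical.

```python
from itertools import combinations


def apriori_biclusters(neighbors, min_support, min_items):
    """Enumerate bi-clusters as frequent itemsets.

    neighbors: dict mapping gene -> set of interacting genes
               (ASSUMPTION: one 'transaction' per gene).
    Returns a dict mapping each frequent itemset (frozenset of genes) to
    the frozenset of genes that interact with every gene in it.
    """
    # Level 1: single genes with enough supporting genes.
    items = sorted({g for nbrs in neighbors.values() for g in nbrs})
    level = {}
    for item in items:
        supporters = frozenset(g for g, nbrs in neighbors.items() if item in nbrs)
        if len(supporters) >= min_support:
            level[frozenset([item])] = supporters

    results = {}
    size = 1
    while level:
        if size >= min_items:
            results.update(level)
        next_level = {}
        # Join step: merge pairs of frequent itemsets that differ in one item.
        for x, y in combinations(level, 2):
            cand = x | y
            if len(cand) != size + 1 or cand in next_level:
                continue
            # Apriori pruning: every subset of the candidate must be frequent.
            if any(frozenset(s) not in level for s in combinations(cand, size)):
                continue
            # Genes supporting the union are those supporting both parts.
            supporters = level[x] & level[y]
            if len(supporters) >= min_support:
                next_level[cand] = supporters
        level = next_level
        size += 1
    return results
```

The sketch returns every frequent itemset rather than only maximal ones, so nested bi-clusters appear in the output. A production run on a genome-scale network would need an optimized implementation.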

Randomizing the genetic interaction network The number of edges for each gene was preserved, but the targets were randomized. A gene cannot have an edge with itself.
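One common way to randomize targets while keeping every gene's edge count is repeated double-edge swapping. The slide does not name the procedure used, so the sketch below is an assumption, and the names `randomize_edges` and `degree_counts` are hypothetical.

```python
import random


def degree_counts(edges):
    """Count how many edges touch each gene."""
    counts = {}
    for a, b in edges:
        counts[a] = counts.get(a, 0) + 1
        counts[b] = counts.get(b, 0) + 1
    return counts


def randomize_edges(edges, num_swaps, seed=0):
    """Degree-preserving randomization by double-edge swaps.

    ASSUMPTION: the presentation only says degrees are preserved and
    self-edges are forbidden; edge swapping is one standard way to do that.
    """
    rng = random.Random(seed)
    edges = [tuple(e) for e in edges]
    present = {frozenset(e) for e in edges}
    for _ in range(num_swaps):
        i, j = rng.sample(range(len(edges)), 2)
        (a, b), (c, d) = edges[i], edges[j]
        # Proposed swap: a-b, c-d  becomes  a-d, c-b.
        if a == d or c == b:
            continue  # would create a self-edge
        new1, new2 = frozenset((a, d)), frozenset((c, b))
        if new1 in present or new2 in present or new1 == new2:
            continue  # would duplicate an existing edge
        present -= {frozenset((a, b)), frozenset((c, d))}
        present |= {new1, new2}
        edges[i], edges[j] = (a, d), (c, b)
    return edges
```

Each accepted swap changes which genes are connected but leaves every gene's degree unchanged, so the randomized network has the same degree distribution as the real one.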

Filtering random bi-clusters We found that 50% of the real negative biclusters and 6% of the real positive biclusters have scores below the 0.01 percentile of biclusters of the same size from the random networks. This resulted in 256,502 negative biclusters and 2,194 positive biclusters.
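The size-matched percentile filter can be sketched as an empirical comparison against the random scores. This is a minimal illustration; the function name `passes_filter` is hypothetical, and reading "below the 0.01 percentile" as "fewer than 0.01% of random scores are at or below the real score" is an assumption.

```python
from bisect import bisect_right


def passes_filter(real_score, random_scores_same_size, percentile=0.01):
    """Keep a real bi-cluster only if its score is extreme versus random ones.

    random_scores_same_size: scores of bi-clusters of the same size found
    in the randomized networks (must be non-empty).
    ASSUMPTION: 'below the 0.01 percentile' means fewer than 0.01% of the
    random scores are at or below the real score.
    """
    ranked = sorted(random_scores_same_size)
    # Fraction of random bi-clusters scoring at or below the real one.
    fraction = bisect_right(ranked, real_score) / len(ranked)
    return fraction < percentile / 100.0
```

With so small a percentile, many random bi-clusters of each size are needed before the threshold is meaningful: at least 10,000 scores are required to resolve a fraction of 0.0001.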

Removing overlap from biclusters We first arranged the biclusters in descending order by area. Then, beginning with the first bicluster A, we removed all biclusters whose area overlap with A was greater than 0.4, where overlap between biclusters A and B was calculated using the following formula:
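The overlap formula itself does not appear in this transcript, so the sketch below takes the overlap measure as a parameter. Its default, shared area divided by the area of the smaller bicluster, is purely an assumption and not the authors' formula. The sketch also assumes the removal step is repeated for each surviving bicluster in turn. All names are hypothetical.

```python
def area(bic):
    """Area of a bicluster given as (gene_set_1, gene_set_2)."""
    first, second = bic
    return len(first) * len(second)


def assumed_overlap(a, b):
    # ASSUMPTION: shared area over the smaller bicluster's area. The real
    # formula is missing from the transcript; swap in the correct one.
    shared = len(a[0] & b[0]) * len(a[1] & b[1])
    return shared / min(area(a), area(b))


def condense(biclusters, max_overlap=0.4, overlap_fn=assumed_overlap):
    """Greedy condensation: largest first, drop anything overlapping a kept one."""
    kept = []
    for bic in sorted(biclusters, key=area, reverse=True):
        # Keep this bicluster only if it does not overlap too much with
        # any larger bicluster that has already been kept.
        if all(overlap_fn(k, bic) <= max_overlap for k in kept):
            kept.append(bic)
    return kept
```

Processing in descending order of area means that when two biclusters overlap heavily, the larger one is always the one that survives.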

Evaluation of functional coherence The MEFIT network is based on coexpression data and does not use genetic interaction datasets.

Improvements?