DISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORKS. Ideker et al., Bioinformatics 2002. Presented by: Omrit Zemach, April 3, 2013. Seminar in Algorithmic Challenges in Analyzing Big Data in Biology and Medicine, TAU.
PROTEIN-PROTEIN INTERACTION: All living organisms consist of living cells. All cells comprise the same building blocks: DNA, RNA, and protein. Protein sequences are encoded in DNA. Proteins play major roles in all cellular processes.
Figure: DNA replication, transcription of DNA into mRNA, and translation of mRNA.
PROTEIN-DNA INTERACTIONS: A protein binds a molecule of DNA. These interactions regulate the biological function of DNA, usually the expression of a gene. Examples are transcription factors, which activate or repress gene expression.
GENE EXPRESSION: A gene is a sequence of DNA that encodes a protein. The process by which information from a gene is used in the synthesis of a functional protein is called gene expression. It is informative to measure gene expression under multiple conditions (experiments). A gene whose expression level changes between conditions is differentially expressed.
DNA chips / microarrays: simultaneous measurement of the expression levels of all genes.
MOTIVATION: Databases of protein-protein and protein-DNA interactions, as well as mRNA expression data, are widely available. The goal is to generate concrete hypotheses for the underlying mechanisms governing the observed changes in gene expression.
MOTIVATION: The yeast galactose utilization pathway was exposed to 20 perturbations. A molecular interaction network was constructed by screening a database of protein-protein and protein-DNA interactions and selecting 362 interactions linking genes that were differentially expressed under one or more perturbations. The changes in expression were then analyzed.
Conclusion: Pairs of genes linked in this network were more likely to have correlated expression profiles than genes chosen at random. However, the general task of associating gene expression changes with higher-order groups of interactions was not addressed.
DISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORKS: The paper introduces a method for searching the network to find 'active subnetworks'. Given multiple conditions, the method determines which conditions significantly affect gene expression in each subnetwork.
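Z-SCORE CALCULATION: Assign each gene $i$ a value $p_i$, the significance (p-value) of differential expression of gene $i$.
z-score for gene $i$: $z_i = \Phi^{-1}(1 - p_i)$
Aggregate z-score for a subnetwork $A$ of $k$ genes: $z_A = \frac{1}{\sqrt{k}} \sum_{i \in A} z_i$
Calibrating $z_A$ against the background distribution: $s_A = (z_A - \mu_k) / \sigma_k$, where $\mu_k$ and $\sigma_k$ are the mean and standard deviation of $z_A$ over randomly sampled gene sets of size $k$.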
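The z-score step can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function names and the p-values are made up, and the aggregate is taken as the sum of the gene z-scores divided by the square root of the subnetwork size, following the definition in the Ideker et al. paper.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def gene_z_score(p_value):
    """z_i = Phi^{-1}(1 - p_i); p_value must lie strictly between 0 and 1."""
    return STD_NORMAL.inv_cdf(1.0 - p_value)


def aggregate_z_score(p_values):
    """Aggregate z-score of a subnetwork: sum of gene z-scores over sqrt(k)."""
    z_scores = [gene_z_score(p) for p in p_values]
    return sum(z_scores) / sqrt(len(z_scores))


# Hypothetical p-values for a four-gene subnetwork (illustrative only).
subnetwork_p_values = [0.001, 0.01, 0.03, 0.2]
print(aggregate_z_score(subnetwork_p_values))
```

Small p-values map to large positive z-scores, so a subnetwork rich in significantly changed genes gets a high aggregate score.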
SCORING OVER MULTIPLE CONDITIONS: Extend the scoring system over multiple conditions. Create a matrix of z-scores (rows: the $m$ conditions; columns: genes). Produce $m$ different aggregate scores, one for each condition, and sort them from highest to lowest: $z_{A[1]} \ge \dots \ge z_{A[m]}$. Compute $r_A^{\max} = \max_j r_{A[j]}$.
Compute $r_{A[j]}$ for each $j = 1, \dots, m$ as follows:
$P_Z = 1 - \Phi(z_{A[j]})$ (the probability that any single condition has a z-score above $z_{A[j]}$)
$p_{A[j]} = \sum_{h=j}^{m} \binom{m}{h} P_Z^{h} (1 - P_Z)^{m-h}$ (the probability that at least $j$ of the $m$ conditions have scores above $z_{A[j]}$)
$r_{A[j]} = \Phi^{-1}(1 - p_{A[j]})$
$r_A^{\max} = \max_j r_{A[j]}$
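The rank-adjusted score can be sketched as follows. The sketch assumes that the probability that at least j of the m conditions exceed the j-th highest score is a binomial tail sum, which is what the slide's description implies. The function names and the clamping constant are illustrative choices, not part of the original method.

```python
from math import comb
from statistics import NormalDist

STD_NORMAL = NormalDist()


def rank_adjusted_scores(condition_scores):
    """Return [r_A[1], ..., r_A[m]] for the m per-condition aggregate scores."""
    m = len(condition_scores)
    ordered = sorted(condition_scores, reverse=True)  # highest score first
    adjusted = []
    for j, z in enumerate(ordered, start=1):
        p_z = 1.0 - STD_NORMAL.cdf(z)  # chance one condition scores above z
        # Binomial tail: chance that at least j of the m conditions score above z.
        p_aj = sum(
            comb(m, h) * p_z ** h * (1.0 - p_z) ** (m - h)
            for h in range(j, m + 1)
        )
        # Clamp (an assumption of this sketch) so that inv_cdf stays in (0, 1).
        p_aj = min(max(p_aj, 1e-15), 1.0 - 1e-15)
        adjusted.append(STD_NORMAL.inv_cdf(1.0 - p_aj))
    return adjusted


def max_rank_adjusted_score(condition_scores):
    """Return (r_A max, j): the best adjusted score and the number of conditions used."""
    adjusted = rank_adjusted_scores(condition_scores)
    best_index = max(range(len(adjusted)), key=adjusted.__getitem__)
    return adjusted[best_index], best_index + 1


# Hypothetical aggregate scores of one subnetwork under four conditions.
print(max_rank_adjusted_score([3.0, 0.0, 0.0, 0.0]))
```

The returned j indicates how many of the top-ranked conditions contribute to the maximal score, which is how the method identifies the conditions under which a subnetwork is active.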
[Figure: matrix of z-scores, showing the z-score of gene 1 under Conditions 1 to 4]
Steps: (1) compute the aggregate scores $z_{A1}, \dots, z_{Am}$, one per condition; (2) sort the aggregate scores; (3) compute $r_{A[1]}, \dots, r_{A[m]}$; (4) take $\max_j r_{A[j]}$; (5) calibrate the score against the background distribution.
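The calibration step can be illustrated with a Monte Carlo sketch: sample random gene sets of the same size as the subnetwork, score them, and standardize the subnetwork score against the resulting mean and standard deviation. For brevity the sketch calibrates the single-condition aggregate score. The sample count, the seed, and all function names are assumptions made for illustration.

```python
import random
from math import sqrt
from statistics import mean, stdev


def aggregate(z_values):
    """Aggregate z-score: sum of z-scores divided by sqrt(k)."""
    return sum(z_values) / sqrt(len(z_values))


def background_stats(all_gene_z, k, n_samples=1000, seed=0):
    """Mean and standard deviation of the aggregate score over random gene sets of size k."""
    rng = random.Random(seed)
    samples = [aggregate(rng.sample(all_gene_z, k)) for _ in range(n_samples)]
    return mean(samples), stdev(samples)


def corrected_score(subnetwork_z, all_gene_z, n_samples=1000, seed=0):
    """Standardize the subnetwork score against the random-set background."""
    mu_k, sigma_k = background_stats(all_gene_z, len(subnetwork_z), n_samples, seed)
    return (aggregate(subnetwork_z) - mu_k) / sigma_k


# Synthetic z-scores for 200 genes (illustrative data, not from the paper).
data_rng = random.Random(1)
genes = [data_rng.gauss(0.0, 1.0) for _ in range(200)]
top_five = sorted(genes)[-5:]
print(corrected_score(top_five, genes))
```

After calibration, a score near zero means the subnetwork looks like a random gene set of the same size, regardless of how many genes it contains.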
SIMULATED ANNEALING: A strategy for finding a maximum without getting trapped at a local one: we must sometimes select new points that do not improve the solution. Annealing is the gradual cooling of a liquid. Incorporate a temperature parameter into the maximization procedure. At high temperatures, explore the parameter space; at lower temperatures, restrict exploration.
SIMULATED ANNEALING STRATEGY: Start with some sample. Propose a change. Decide whether to accept the change.
SIMULATED ANNEALING STRATEGY: How do we decide whether to accept a change? Consider a decreasing series of temperatures. For each temperature, iterate these steps: propose an update and evaluate the function; accept updates that improve the solution; accept some updates that do not improve the solution. The acceptance probability depends on the "temperature" parameter.
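A generic version of this loop can be sketched as below. The acceptance rule used here, accepting a worsening move with probability exp(delta / T), is the standard Metropolis criterion and is an assumption of the sketch, as are the toy objective, the cooling schedule, and every name in the code.

```python
import math
import random


def simulated_annealing(score, propose, start, temperatures, steps_per_temp=100, seed=0):
    """Maximize `score`, returning the best state seen and its score."""
    rng = random.Random(seed)
    current, current_score = start, score(start)
    best, best_score = current, current_score
    for temperature in temperatures:  # decreasing series of temperatures
        for _ in range(steps_per_temp):
            candidate = propose(current, rng)
            candidate_score = score(candidate)
            delta = candidate_score - current_score
            # Always accept improvements; accept a worse state with
            # probability exp(delta / T), which shrinks as T decreases.
            if delta >= 0 or rng.random() < math.exp(delta / temperature):
                current, current_score = candidate, candidate_score
                if current_score > best_score:
                    best, best_score = current, current_score
    return best, best_score


# Toy problem (hypothetical): maximize -(x - 3)^2 over the integers.
temps = [10 * 0.9 ** i for i in range(50)]
best, best_score = simulated_annealing(
    score=lambda x: -(x - 3) ** 2,
    propose=lambda x, rng: x + rng.choice((-1, 1)),
    start=0,
    temperatures=temps,
)
print(best, best_score)
```

Tracking the best state separately from the current state means that a late uphill-then-downhill excursion cannot lose a good solution that was already visited.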
SEARCHING FOR HIGH-SCORING SUBNETWORKS VIA SIMULATED ANNEALING: Associate an active/inactive state with each node of the graph $G$. Let $G_W$ denote the working subgraph of $G$ induced by the active nodes.
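One way to turn this into a search is to toggle one node at a time and score the highest-scoring connected component of the working subgraph. The sketch below does so under several assumptions: the score is the uncalibrated aggregate z-score, the temperature decays geometrically, and the toy graph, z-scores, and function names are invented for illustration.

```python
import math
import random


def components(active, adjacency):
    """Connected components of the subgraph induced by the active nodes."""
    seen, found = set(), []
    for start in active:
        if start in seen:
            continue
        stack, component = [start], set()
        seen.add(start)
        while stack:
            node = stack.pop()
            component.add(node)
            for neighbor in adjacency[node]:
                if neighbor in active and neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        found.append(component)
    return found


def best_component(active, adjacency, z):
    """Highest-scoring component, scored as sum of z-scores over sqrt(size)."""
    best, best_score = set(), float("-inf")
    for component in components(active, adjacency):
        score = sum(z[n] for n in component) / math.sqrt(len(component))
        if score > best_score:
            best, best_score = component, score
    return best, best_score


def anneal_subnetwork(adjacency, z, n_iter=5000, t_start=1.0, t_end=0.01, seed=0):
    rng = random.Random(seed)
    nodes = sorted(adjacency)
    active = {n for n in nodes if rng.random() < 0.5}  # random initial states
    top, top_score = best_component(active, adjacency, z)
    score = top_score
    for i in range(n_iter):
        temperature = t_start * (t_end / t_start) ** (i / n_iter)
        node = rng.choice(nodes)
        active ^= {node}  # toggle the node between active and inactive
        component, new_score = best_component(active, adjacency, z)
        if new_score > score or rng.random() < math.exp((new_score - score) / temperature):
            score = new_score
            if score > top_score:
                top, top_score = component, score
        else:
            active ^= {node}  # reject the move: undo the toggle
    return top, top_score


# Toy path graph a-b-c-d-e with hypothetical gene z-scores.
graph = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c", "e"], "e": ["d"]}
z_scores = {"a": 3.0, "b": 2.5, "c": -4.0, "d": 0.1, "e": -3.0}
top, top_score = anneal_subnetwork(graph, z_scores)
print(sorted(top), top_score)
```

Because only connected components are scored, two high-scoring genes contribute to the same subnetwork only if the interaction network links them through active nodes.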
SUBNETWORKS ARE CONSISTENT WITH KNOWN REGULATORY CIRCUITS
SUBNETWORKS VERSUS GENE EXPRESSION CLUSTERS: Our approach groups genes subject to the constraints of the molecular interaction network. Subnetworks are scored over only a subset of conditions. Our approach groups genes only by the significance of change, while clustering methods group genes by both magnitude and direction of change. Our method leaves some genes unaffiliated with any subnetwork, unlike clustering, which assigns every gene to a distinct cluster.
FUTURE WORK: Investigating the discovered subnetworks in the laboratory. Accommodating new types of interaction networks (proteins and small molecules). Annotating each interaction with its directionality and compartment.