Conservation and Evolution of Cis-Regulatory Systems Tal El-Hay Computational Biology Seminar חנוכה תשס"ו December 2005.

Slides:



Advertisements
Similar presentations
Periodic clusters. Non periodic clusters That was only the beginning…
Advertisements

Chap. 6 Problem 2 Protein coding genes are grouped into the classes known as solitary (single) genes, and duplicated or diverged genes in gene families.
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Regulatory Motifs. Contents Biology of regulatory motifs Experimental discovery Computational discovery PSSM MEME.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Work Process Using Enrich Load biological data Check enrichment of crossed data sets Extract statistically significant results Multiple hypothesis correction.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Functional annotation and network reconstruction through cross-platform integration of microarray data X. J. Zhou et al
[Bejerano Fall10/11] 1 Any Project reflections?
BACKGROUND E. coli is a free living, gram negative bacterium which colonizes the lower gut of animals. Since it is a model organism, a lot of experimental.
Bas E. Dutilh Phylogenomics Using complete genomes to determine the phylogeny of species.
The Hardwiring of development: organization and function of genomic regulatory systems Maria I. Arnone and Eric H. Davidson.
Microarray analysis 2 Golan Yona. 2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i:
Promoter Analysis using Bioinformatics, Putting the Predictions to the Test Amy Creekmore Ansci 490M November 19, 2002.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Biological (genomic) information Dan Janies
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
Protein Interactions and Disease Audry Kang 7/15/2013.
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 3 A Story of Transcription.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Cis-regulatory element study in transcriptome Jin Chen CSE Fall
Paper in “Evolution and Development” The Evolution of Signaling Pathways In Animal Development Andr é Pires-daSilva & Ralf J. Sommer Nature Reviews Genetics,
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Analyzing transcription modules in the pathogenic yeast Candida albicans Elik Chapnik Yoav Amiram Supervisor: Dr. Naama Barkai.
* only 17% of SNPs implicated in freshwater adaptation map to coding sequences Many, many mapping studies find prevalent noncoding QTLs.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
whole-genome duplications and large segmental duplications… …seem to be a common feature in eukaryotic genome evolution …play a crucial role in the evolution.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Calculating branch lengths from distances. ABC A B C----- a b c.
Computational Genomics and Proteomics Lecture 8 Motif Discovery C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E.
Identification of Compositionally Similar Cis-element Clusters in Coordinately Regulated Genes Anil G Jegga, Ashima Gupta, Andrew T Pinski, James W Carman,
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Module Networks BMI/CS 576 Mark Craven December 2007.
Finding Motifs Vasileios Hatzivassiloglou University of Texas at Dallas.
1 What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute.
LECTURE PRESENTATIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry, Michael L. Cain, Steven A. Wasserman, Peter V. Minorsky, Robert.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
1 How do regulatory networks evolve? Module = group of genes co-regulated by the same regulatory system * Evolution of individual gene targets Gain or.
Last time … * Constraint on transcription factor binding sites Sites with the most ‘information content’ generally evolve slowest * Stabilizing selection.
Change in Pufs and their RNA InteractionsAnalogous change in transcription factors and their gene regulation Puf binding specificity tends to be conserved.
Genomes and their evolution
Evolutionary genomics can now be applied beyond ‘model’ organisms
Evolution of eukaryotic genomes
Genetics and Evolutionary Biology
Basics of Comparative Genomics
Genomes and Their Evolution
Genomes and Their Evolution
Relationship between Genotype and Phenotype
Gene Density and Noncoding DNA
Relationship between Genotype and Phenotype
SEG5010 Presentation Zhou Lanjun.
Volume 29, Issue 5, Pages (March 2008)
Basics of Comparative Genomics
Predicting Gene Expression from Sequence
Relationship between Genotype and Phenotype
Presentation transcript:

Conservation and Evolution of Cis-Regulatory Systems Tal El-Hay Computational Biology Seminar חנוכה תשס"ו December 2005

"Nothing in biology makes sense except in the light of evolution." "Nothing in biology makes sense except in the light of evolution." - Theodosius Dobzhansky Science, 23 December 2005 Evolution in Action

Molecular Evolution Wikipedia

How Genomes Evolve Random mutations Transposable DNA Gene duplication Divergence Whole genome duplication (+divergence) Recombination of exons Purifying selection Molecular Biology of the Cell, Alberts et al., 4 th ed.

“Relative dearth of species specific genes” “Seeming abundance of functionally homologous proteins” (A. P. Gasch et al., PLoS Biol., 2004)  Additional mechanisms for diversification Such as timing, location and level of proteins  Our focus – Gene Expression Evolution of Regulation

Outline of Questions Conserved and evolved properties Mechanisms of conservation and evolution Bridging Genotype and Phenotype

Cis-Regulatory Factors Composition Location Modules… chiken  A mouse  A mouse  1 Gene control regions for eye lens chrystallins Molecular Biology of the Cell, Alberts et al., 4 th ed.

Goal Explore evolution of gene expression regulation –Mainly through examination of cis-elements

The Fungul Family A. P. Gasch et al., PLoS Biol., 2004

Large Scale Analysis Identify 264 co-regulated gene groups in S. serevisiae Putative cis-regulatory elements –80 known consensus binding sites –597 elements by motif search with MEME Score enrichment of genes containing each putative element - 42 cis-elements in 35 unique groups Orthologous modules in other species Enrichment of orthologous modules

Conservation of S. cerevisiae motifs G1 phase cell cycle ACGCGMCB Amino acid biosynthesis TGACTMGcn4p Nitrogen source GATAA GATA factors Proteasome GGTGGCAAARpn4p

Novel Sequences Ribosomal proteins AGCCCTAA Ribosomal proteins GTGACTGT tRNA synthetases TGACTCAN

Positions of binding sites Non random distribution Similar across species No correlations in locations across species

Spacing between binding sites in Methionine Biosynthesis genes Small distance between Cbf1p and Met31/32p Conserved across species Independent of exact positions

Zooming In – The Proteasome The proteasome is regulated by Rpn4p Consensus sequence enriched across all hemiascomycete Slight differences between C. Albicans and S. Serevisiae  Exploring evolution of the sequence

Proteazome Cis-Element Evolution Procedure: Generate ‘meta matrix’ Species specific matrix Hierarchical clustering GGTGGCAAAW AGTGGCAAAN GGTGGCAAYA GRAGGCAAAA

Binding specificities of Rpn4p Sc element Common element CA element

Validation of specificities? Sc_Rpn4p Ca_Rpn4p hybrid A. GGTGGCAAAA B. GAAGGCAAAA C. AGTGGCAACA D. GGTGGCAAAA A E. AGTGGCAAAA C F. GGTGGCAACA G. CTGCATTTGG

Intermediate Summary - Conservation Conserved cis-elements in ‘un-align-able’ non coding regions Correspondence of conservation and evolutionary distance Did not observe position conservation Similar position distributions Found an example of conserved spacing –Interaction constraint between TF?

Intermediate Summary - Evolution Novel sequences in coregulated orthologous modules Conserved expression patterns with evolved regulation (e.g. Proteasome, Ribosome) Individual example for: –Addition of gene targets to a regulatory network (S1 phase cell cycle) –Coevolution in a regulatory network –Cooption of a regulatory system

Goals Learn mechanisms of evolution of regulation systems Integrating comparative expression and sequence analysis

Computational framework Identify conserved modules Derive orthologous modules Identify cis-elements profiles Reconstruction of evolution

Orthologous Modules in Yeast a.S-phase module b.Respiration c.Amino acid metabolism d.Ribosomal proteins synthesis e.Stress f.Ribosome biogenesis

Conserved and Diverged Regulatory Mechanisms Conserved cis-elements – Respiration module –Mlu cell cycle element ( ACGCGT ) –Bound by MBF complexes in both species Diverged - Ribosomal protein synthesis –RAP1 and IFHL in S. cerevisae ( TACATCCGTACAT & TCCGCCTAG resp.) –Homol-D box and Homol-E site in S. pombe ( TGTGACTG & ACCCTACCCTA ) Conserved and diverged – Ribosome biogenesis module

The evolution of the Ribosomal Regulatory Program Apparent redundancy of binding sites Switching from Homol-D to RAP1 Gradual Evolution in the IFHL Box

Evidence for Regulator Switching Evolution of Transcription factor –RAP1 is a submotif of telomeric repeat –Rap1p regulates telomer length –Association of events: Rap1p Gained of Trans activation domain, RAP1 joined RP module Interacation of RAP1 & Homol-D? –usually 2-6 base pairs apart –Conserved order

Gradual Evolution of IFHL Box TCTGCCTA AGGGCTATAGCCCT GCCCTA CCCTACCCTA Convergent domain duplication or Acquired dimerization domain?

Spatial Configuration HomolD-RAP1 Sites are tightly coupled Rap1-IFHL Variable probably to modified role of IFHL RAP1-RAP1 Sites are tightly coupled IFHL-IFHL Coupling mainly in A. gossypii and K. Waltii

Intermediate Summary - Modes of Evolution Conserved modules with diverged regulatory mechanisms Some changes via redundant intermediate programs

Goals Genotype and Phenotype

Background – Yeast Growth Some Yeast species are fermentative Others can employ only respiration Connection to whole genome duplication

Analysis of Genetic Basis of Phenotypic Diversity Examine gene expression program –S. Cerevisiae with 1000 expression profiles –C. Albicans with 198 profiles Motif search in related modules Validation of motif role Comparison of motif and phenotype evolution

Transcriptional Wiring Differences RPCytoplasmic ribosomal proteins rRNA rRNA processing genes MRP Mitochondrial RP STREnvironmental Stress response

Cis or Trans? Cis element search: PAC in rRNA of both species None for MRP in S. Serevisiae AATTTT in MRP, putative rRNA regulator Validation of cis-regulatory Role of AATTTT in MRP

Spatial Configuration of AATTTT Both species: Position is confined in RP and rRNA Not represented in STR C. Albicans : AATTTT Regulates also MRP  Rapid growth element

Loss or Gain of Binding Site? Loss of binding site in MRP associated with whole genome duplication

Evolution of Genome and Phenotype “Gene duplication can facilitate the evolution of new function” –by specialization of new coding sequences –Also by facilitating the evolution of gene expression

Summary of methodologies Integration of sequence and gene- expression –Finding orthologous modules –Finding orthologous binding motifs Exploiting phylogenetic trees –Find genotype change rules –Associate phenotype and genotype changes –Exploit gene expression data of extremes

Summary of principles Conservation of regulatory programs –Binding site conservation –Position and spacing Conservation of module with evolution of control –Loss and gain –Drift –Switching, could be explained via redundancy Evolution of regulatory programs -> Evolution of phenotype –Addition of gene targets –Cooption or loss of TF –Also facilitated by whole genome duplication

Points for thought How can simultaneous change of cis- elements happen? Evolution of elaborate wiring Evolution of other modes of regulation Is there gene expression data for more species? Look for conserved patterns?