Presentation is loading. Please wait.

Presentation is loading. Please wait.

Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems.

Similar presentations


Presentation on theme: "Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems."— Presentation transcript:

1 Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems Moscow, Russia CASB-20, UCDS, La Jolla, 13-14.III.2009

2 Plan Co-evolution of transcription factors and their binding motifs Evolution of regulatory systems and regulons

3 Regulators and their motifs Cases of motif conservation at surprisingly large distances Subtle changes at close evolutionary distances Correlation between contacting nucleotides and amino acid residues

4 NrdR (regulator of ribonucleotide reducases and some other replication-related genes): conservation at large distances

5 DNA motifs and protein-DNA interactions CRP PurR IHFTrpR Entropy at aligned sites and the number of contacts (heavy atoms in a base pair at a distance <cutoff from a protein atom)

6 The CRP/FNR family of regulators

7 Correlation between contacting nucleotides and amino acid residues CooA in Desulfovibrio spp. CRP in Gamma-proteobacteria HcpR in Desulfovibrio spp. FNR in Gamma-proteobacteria DD COOA ALTTEQLSLHMGATRQTVSTLLNNLVR DV COOA ELTMEQLAGLVGTTRQTASTLLNDMIR EC CRP KITRQEIGQIVGCSRETVGRILKMLED YP CRP KXTRQEIGQIVGCSRETVGRILKMLED VC CRP KITRQEIGQIVGCSRETVGRILKMLEE DD HCPR DVSKSLLAGVLGTARETLSRALAKLVE DV HCPR DVTKGLLAGLLGTARETLSRCLSRMVE EC FNR TMTRGDIGNYLGLTVETISRLLGRFQK YP FNR TMTRGDIGNYLGLTVETISRLLGRFQK VC FNR TMTRGDIGNYLGLTVETISRLLGRFQK TGTCGGCnnGCCGACA TTGTgAnnnnnnTcACAA TTGTGAnnnnnnTCACAA TTGATnnnnATCAA Contacting residues: REnnnR TG: 1 st arginine GA: glutamate and 2 nd arginine

8 The correlation holds for other factors in the family

9 The LacI family: subtle changes in motifs at close distances G A n CGCG GnGn GCGC

10 The LacI family: systematic analysis 1369 DNA-binding domains in 200 orthologous rows =35%, =71 а.о. 4484 binding sites, L=20н., =45% Calculate mutual information between columns of TF and site alignments Set threshold on mutual information of correlated pairs

11 Definitions Sites LAFDHDQILQMAQERLQGKVRYQP-IGFELLPEKFSLRQLQRMYETVLGRS---LDKRNF LAFDHNQILDYGYQRLRNKLEYSP-IAFEVLPELFTLNDLFQLYTTVLGED--FADYSNF tTAaTGgCTTTAtGcCACTAT LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHSKILAYGHRRLCNKLEYSP-VAFDVLPEYFTLNDLYQFYSTVLGAN--FSDYSNF LAFDHNQILDYGYQRLRNKLEYSP-IAFEVLPELFTLNDLFQLYTTVLGED--FADYSNF LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN—-FSDYSNF LSFDHNEILAYGHRRLRNKLEYSP-VAFEVLPEMFTLNDLYQLYTTVLGEN--FSDYSNF TTAaaGTAAtAaTTACCATAA AaAtTGTCTTTAtGcCACTAT TTATGGTAAATTcTACCATAA TTATgGTCAgTTTcACcAaAA tTAaTGgCTTTAtGcCACTAT TTaGTCgAAATAaccaACtAA TTATCGTCAtCtcGACGACAA TttAGGTAAgTTATACTTTTA Z-score Mutual information Protein alignment

12 Correlated pairs

13 Higher order correlations -ATIKDVAKRANVSTTTV- AATTGTGAGCGCTCACT SLSQ TL TQ

14 Not a phylogenetic trace

15 NrtR (regulator of NAD metabolism)

16 Comparison with the recently solved structure: correlated positions indeed bind the DNA (more exactly, form a hydrophobic cluster)

17 Catalog of events Expansion and contraction of regulons New regulators (where from?) Duplications of regulators with or without regulated loci Loss of regulators with or without regulated loci Re-assortment of regulators and structural genes … especially in complex systems Horizontal transfer

18 Regulon expansion, or how FruR has become CRA CRA (a.k.a. FruR) in Escherichia coli: –global regulator –well-studied in experiment (many regulated genes known) Going back in time: looking for candidate CRA/FruR sites upstream of (orthologs of) genes known to be regulated in E.coli

19 Common ancestor of gamma-proteobacteria icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria

20 Common ancestor of the Enterobacteriales icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria Enterobacteriales

21 Common ancestor of Escherichia and Salmonella icdA aceA aceB aceEF pckA ppsApykF adhE gpmApgk tpiA gapA pfkA fbp Fructose fruKfruBA eda edd epd Glucose ptsHI-crr Mannose manXYZ mtlD mtlA Mannitol Gamma-proteobacteria Enterobacteriales E. coli and Salmonella spp.

22 Regulation of amino acid biosynthesis in the Firmicutes Interplay between regulatory RNA elements and transcription factors Expansion of T-box systems (normally – RNA structures regulating aminoacyl-tRNA-synthetases)

23 Recent duplications and bursts: ARG-T-box in Clostridium difficile

24 … caused by loss of transcription factor AhrC

25 Duplications and changes in specificity: ASN/ASP/HIS T-boxes

26 Blow-up 1

27 Blow-up 2. Prediction Regulators lost in lineages with expanded HIS-T-box regulon??

28 … and validation conserved motifs upstream of HIS biosynthesis genes candidate transcription factor yerC co-localized with the his genes present only in genomes with the motifs upstream of the his genes genomes with neither YerC motif nor HIS-T-boxes: attenuators Bacillales (his operon) Clostridiales Thermoanaerobacteriales Halanaerobiales Bacillales

29 The evolutionary history of the his genes regulation in the Firmicutes

30 T-boxes: Summary / History

31 Life without Fur

32 Regulation of iron homeostasis (the Escherichia coli paradigm) Iron: essential cofactor (limiting in many environments) dangerous at large concentrations FUR (responds to iron): synthesis of siderophores transport (siderophores, heme, Fe 2+, Fe 3+ ) storage iron-dependent enzymes synthesis of heme synthesis of Fe-S clusters Similar in Bacillus subtilis

33 Regulation of iron homeostasis in α-proteobacteria Experimental studies: FUR/MUR: Bradyrhizobium, Rhizobium and Sinorhizobium RirA (Rrf2 family): Rhizobium and Sinorhizobium Irr (FUR family): Bradyrhizobium, Rhizobium and Brucella RirA Irr FeSheme RirA degraded Fur Fe Fur Iron uptak e systems Sideroph ore uptake Fe / Fe uptake Transcription factors 2+3+ Iron storage ferritins FeS synthesis Heme synthesis Iron-requiring enzymes [iron cofactor] IscR Irr [- Fe] [+Fe] [- Fe] [+Fe] [ Fe]- FeS FeS status of cell

34 Distribution of transcription factors in genomes Search for candidate motifs and binding sites using standard comparative genomic techniques

35 Regulation of genes in functional subsystems Rhizobiales Bradyrhizobiaceae Rhodobacteriales The Zoo (likely ancestral state)

36 Reconstruction of history Appearance of the iron-Rhodo motif Frequent co-regulation with Irr Strict division of function with Irr

37 All logos and Some Very Tempting Hypotheses: 1.Cross-recognition of FUR and IscR motifs in the ancestor. 2.When FUR had become MUR, and IscR had been lost in Rhizobiales, emerging RirA (from the Rrf2 family, with a rather different general consensus) took over their sites. 3.Iron-Rhodo boxes are recognized by IscR: directly testable 1 2 3

38 Summary and open problems Regulatory systems are very flexible –easily lost –easily expanded (in particular, by duplication) –may change specificity –rapid turnover of regulatory sites With more stories like these, we can start thinking about a general theory –catalog of elementary events; how frequent? –mechanisms (duplication, birth e.g. from enzymes, horizontal transfer) –conserved (regulon cores) and non-conserved (marginal regulon members) genes in relation to metabolic and functional subsystems/roles –(TF family-specific) protein-DNA recognition code –distribution of TF families in genomes; distribution of regulon sizes; etc.

39 People Andrei A. Mironov – software, algorithms Alexandra Rakhmaninova – SDP, protein-DNA correlations Anna Gerasimova (now at LBNL) – NadR Olga Kalinina (on loan to EMBL) – SDP Yuri Korostelev – protein-DNA correlations Olga Laikova – LacI Dmitry Ravcheev– CRA/FruR Dmitry Rodionov (on loan to Burnham Institute) – iron etc. Alexei Vitreschak – T-boxes and riboswitches Andy Jonson (U. of East Anglia) – experimental validation (iron) Leonid Mirny (MIT) – protein-DNA, SDP Andrei Osterman (Burnham Institute) – experimental validation Howard Hughes Medical Institute Russian Foundation of Basic Research Russian Academy of Sciences, program “Molecular and Cellular Biology”


Download ppt "Evolution of bacterial regulatory systems Mikhail Gelfand Research and Training Center “Bioinformatics” Institute for Information Transmission Problems."

Similar presentations


Ads by Google