[BejeranoWinter12/13] 1 MW 11:00-12:15 in Beckman B302 Prof: Gill Bejerano TAs: Jim Notwell & Harendra Guturu CS173 Lecture 8:

Slides:



Advertisements
Similar presentations
Prokaryotic Gene Regulation:
Advertisements

Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Methods to read out regulatory functions
3B1 Gene regulation results in differential GENE EXPRESSION, LEADING TO CELL SPECIALIZATION.
20,000 GENES IN HUMAN GENOME; WHAT WOULD HAPPEN IF ALL THESE GENES WERE EXPRESSED IN EVERY CELL IN YOUR BODY? WHAT WOULD HAPPEN IF THEY WERE EXPRESSED.
Understanding the Human Genome: Lessons from the ENCODE project
Four of the many different types of human cells: They all share the same genome. What makes them different?
1 Control of eukaryotic gene expression As usual, much more complicated than in prokaryotes. –Increased amount of DNA –Tight packing into nucleosomes –Physical.
[Bejerano Fall10/11] 1 Thank you for the midterm feedback! Projects will be assigned shortly.
[Bejerano Spr06/07] 1 TTh 11:00-12:15 in Clark S361 Profs: Serafim Batzoglou, Gill Bejerano TAs: George Asimenos, Cory McLean.
[Bejerano Aut07/08] 1 MW 11:00-12:15 in Redwood G19 Profs: Serafim Batzoglou, Gill Bejerano TA: Cory McLean.
Control of Gene Expression Big Idea 3: Living systems store, retrieve, transmit, and respond to info essential to life processes.
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
[Bejerano Fall09/10] 1 Thank you for the midterm feedback!
[Bejerano Aut08/09] 1 MW 11:00-12:15 in Beckman B302 Profs: Serafim Batzoglou, Gill Bejerano TAs: Cory McLean, Aaron Wenger.
[BejeranoFall13/14] 1 MW 12:50-2:05pm in Beckman B302 Profs: Serafim Batzoglou & Gill Bejerano TAs: Harendra Guturu & Panos.
[BejeranoWinter12/13] 1 MW 11:00-12:15 in Beckman B302 Prof: Gill Bejerano TAs: Jim Notwell & Harendra Guturu CS173 Lecture 7:
CS273A Lecture 5: Genes Enrichment, Gene Regulation I
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
[BejeranoFall13/14] 1 MW 12:50-2:05pm in Beckman B302 Profs: Serafim Batzoglou & Gill Bejerano TAs: Harendra Guturu & Panos.
[BejeranoFall13/14] 1 MW 12:50-2:05pm in Beckman B302 Profs: Serafim Batzoglou & Gill Bejerano TAs: Harendra Guturu & Panos.
Introduction to gene expression Seema Zargar. Lecture outline Introduction to all terms used in Gene expression.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
Introns and Exons DNA is interrupted by short sequences that are not in the final mRNA Called introns Exons = RNA kept in the final sequence.
Control of gene expression Unit but different cells have different functions and look and act differently! WHY? Different sets of genes are expressed.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Draw 8 boxes on your paper
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
Genetica per Scienze Naturali a.a prof S. Presciuttini 1. The logic of prokaryotic transcriptional regulation In addition to the sigma factors that.
ChIP-on-Chip and Differential Location Analysis Junguk Hur School of Informatics October 4, 2005.
Reconstruction of Transcriptional Regulatory Networks
Gene Regulation How does your body know when to make certain proteins? Unit 4 – Chapter 12-5.
The Lac Operon An operon is a length of DNA, made up of structural genes and control sites. The structural genes code for proteins, such as enzymes.
Introduction to Gene Expression
Thank you for the midterm feedback!
Eukaryotic Gene Expression. Introduction Every cell in a multi-cellular eukaryote does not express all its genes, all the time (usually only 3-5%) –Long-term.
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
CS173 Lecture 9: Transcriptional regulation III
[BejeranoFall15/16] 1 MW 1:30-2:50pm in Clark S361* (behind Peet’s) Profs: Serafim Batzoglou & Gill Bejerano CAs: Karthik Jagadeesh.
Finding genes in the genome
Welcome  In your journal write a paragraph explain what is a gene and what is gene expression?  Notes on Gene Expression Regulation  Quiz over.
Chapter 13: Gene Regulation. The Big Picture… A cell contains more genes than it expresses at any given time – why? Why are cells in multicellular organisms.
Gene regulation biology 1 lecture 13. Differential expression of genetic code in prokaryotes and eukaryotes Regulation at the transcription level How.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Regulating Gene Expression WITH OVER GENES IN EVERY CELL, HOW DOES THE CELL KNOW WHAT GENES TO EXPRESS AND WHEN TO EXPRESS THEM?
Gene Regulation, Part 2 Lecture 15 (cont.) Fall 2008.
Regulation of Gene Expression
Regulation of Prokaryotic and Eukaryotic Gene Expression
CS273A Lecture 6: Gene Regulation II MW 12:50-2:05pm in Beckman B100
Control of Gene Expression
Introduction to Genetic Analysis
Control of Gene Expression
Chapter 12.5 Gene Regulation.
Control of eukaryotic gene expression
Relationship between Genotype and Phenotype
CS273A Lecture 9: Gene Regulation II
Relationship between Genotype and Phenotype
Relationship between Genotype and Phenotype
Regulation of Gene Expression
12-5 Gene Regulation.
Review Warm-Up What is the Central Dogma?
Relationship between Genotype and Phenotype
Molecular Biology timeline RNA polymerase complex comprised of many proteins—scaffolds that bind TATA box (orange) plus enzymatic subunits. Basic.
Adam C. Wilkinson, Hiromitsu Nakauchi, Berthold Göttgens  Cell Systems 
The Human Genome Source Code
The Human Genome Source Code
The Human Genome Source Code
Gene Regulation A gene (DNA) is expressed when it is made into a functional product (protein/enzyme)
Relationship between Genotype and Phenotype
Presentation transcript:

[BejeranoWinter12/13] 1 MW 11:00-12:15 in Beckman B302 Prof: Gill Bejerano TAs: Jim Notwell & Harendra Guturu CS173 Lecture 8: Transcriptional regulation II

[BejeranoWinter12/13] 2 Announcements HW1 due today. Thoughts and comments? HW2 will be out by midnight Halfway feedback today

[BejeranoWinter12/13] 3 Announcements

[BejeranoWinter12/13] 4 TTATATTGAATTTTCAAAAATTCTTACTTTTTTTTTGGATGGACGCAAAGAAGTTTAATAATCATATTACATGGCATTACCACCATATA CATATCCATATCTAATCTTACTTATATGTTGTGGAAATGTAAAGAGCCCCATTATCTTAGCCTAAAAAAACCTTCTCTTTGGAACTTTC AGTAATACGCTTAACTGCTCATTGCTATATTGAAGTACGGATTAGAAGCCGCCGAGCGGGCGACAGCCCTCCGACGGAAGACTCTCCTC CGTGCGTCCTCGTCTTCACCGGTCGCGTTCCTGAAACGCAGATGTGCCTCGCGCCGCACTGCTCCGAACAATAAAGATTCTACAATACT AGCTTTTATGGTTATGAAGAGGAAAAATTGGCAGTAACCTGGCCCCACAAACCTTCAAATTAACGAATCAAATTAACAACCATAGGATG ATAATGCGATTAGTTTTTTAGCCTTATTTCTGGGGTAATTAATCAGCGAAGCGATGATTTTTGATCTATTAACAGATATATAAATGGAA AAGCTGCATAACCACTTTAACTAATACTTTCAACATTTTCAGTTTGTATTACTTCTTATTCAAATGTCATAAAAGTATCAACAAAAAAT TGTTAATATACCTCTATACTTTAACGTCAAGGAGAAAAAACTATAATGACTAAATCTCATTCAGAAGAAGTGATTGTACCTGAGTTCAA TTCTAGCGCAAAGGAATTACCAAGACCATTGGCCGAAAAGTGCCCGAGCATAATTAAGAAATTTATAAGCGCTTATGATGCTAAACCGG ATTTTGTTGCTAGATCGCCTGGTAGAGTCAATCTAATTGGTGAACATATTGATTATTGTGACTTCTCGGTTTTACCTTTAGCTATTGAT TTTGATATGCTTTGCGCCGTCAAAGTTTTGAACGATGAGATTTCAAGTCTTAAAGCTATATCAGAGGGCTAAGCATGTGTATTCTGAAT CTTTAAGAGTCTTGAAGGCTGTGAAATTAATGACTACAGCGAGCTTTACTGCCGACGAAGACTTTTTCAAGCAATTTGGTGCCTTGATG AACGAGTCTCAAGCTTCTTGCGATAAACTTTACGAATGTTCTTGTCCAGAGATTGACAAAATTTGTTCCATTGCTTTGTCAAATGGATC ATATGGTTCCCGTTTGACCGGAGCTGGCTGGGGTGGTTGTACTGTTCACTTGGTTCCAGGGGGCCCAAATGGCAACATAGAAAAGGTAA AAGAAGCCCTTGCCAATGAGTTCTACAAGGTCAAGTACCCTAAGATCACTGATGCTGAGCTAGAAAATGCTATCATCGTCTCTAAACCA GCATTGGGCAGCTGTCTATATGAATTAGTCAAGTATACTTCTTTTTTTTACTTTGTTCAGAACAACTTCTCATTTTTTTCTACTCATAA CTTTAGCATCACAAAATACGCAATAATAACGAGTAGTAACACTTTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGTATGA TAATGTTTTCAATGTAAGAGATTTCGATTATCCACAAACTTTAAAACACAGGGACAAAATTCTTGATATGCTTTCAACCGCTGCGTTTT GGATACCTATTCTTGACATGATATGACTACCATTTTGTTATTGTACGTGGGGCAGTTGACGTCTTATCATATGTCAAAGTTGCGAAGTT CTTGGCAAGTTGCCAACTGACGAGATGCAGTAACACTTTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGTATGATAATGT TTTCAATGTAAGAGATTTCGATTATCCACAAACTTTAAAACACAGGGACAAAATTCTTGATATGCTTTCAACCGCTGCGTTTTGGATAC CTATTCTTGACATGATATGACTACCATTTTGTTATTGTACGTGGGGCAGTTGACGTCTTATCATATGTCAAAGTCATTTGCGAAGTTCT TGGCAAGTTGCCAACTGACGAGATGCAGTTTCCTACGCATAATAAGAATAGGAGGGAATATCAAGCCAGACAATCTATCATTACATTTA AGCGGCTCTTCAAAAAGATTGAACTCTCGCCAACTTATGGAATCTTCCAATGAGACCTTTGCGCCAAATAATGTGGATTTGGAAAAAGA GTATAAGTCATCTCAGAGTAATATAACTACCGAAGTTTATGAGGCATCGAGCTTTGAAGAAAAAGTAAGCTCAGAAAAACCTCAATACA GCTCATTCTGGAAGAAAATCTATTATGAATATGTGGTCGTTGACAAATCAATCTTGGGTGTTTCTATTCTGGATTCATTTATGTACAAC CAGGACTTGAAGCCCGTCGAAAAAGAAAGGCGGGTTTGGTCCTGGTACAATTATTGTTACTTCTGGCTTGCTGAATGTTTCAATATCAA CACTTGGCAAATTGCAGCTACAGGTCTACAACTGGGTCTAAATTGGTGGCAGTGTTGGATAACAATTTGGATTGGGTACGGTTTCGTTG GTGCTTTTGTTGTTTTGGCCTCTAGAGTTGGATCTGCTTATCATTTGTCATTCCCTATATCATCTAGAGCATCATTCGGTATTTTCTTC TCTTTATGGCCCGTTATTAACAGAGTCGTCATGGCCATCGTTTGGTATAGTGTCCAAGCTTATATTGCGGCAACTCCCGTATCATTAAT GCTGAAATCTATCTTTGGAAAAGATTTACAATGATTGTACGTGGGGCAGTTGACGTCTTATCATATGTCAAAGTCATTTGCGAAGTTCT TGGCAAGTTGCCAACTGACGAGATGCAGTAACACTTTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGTATGATAATGTTT TCAATGTAAGAGATTTCGATTATCCACAAACTTTAAAACACAGGGACAAAATTCTTGATATGCTTTCAACCGCTGCGTTTTGGATACCT ATTCTTGACATGATATGACTACCATTTTGTTATTGTTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGTATGATAATGTTT TCAATGTAAGAGATTTCGATTATCCTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGTATGATAATGTTTTCAATGTAAGA GATTTCGATTATCCTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGTATGATAATGTTTTCAATGTAAGAGATTTCGATTA TCCTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGTATGATAATGTTTTCAATGTAAGAGATTTCGATTATCCTTATAGTT CATACATGCTTCAACTACTTAATAAATGATTGTATGATAATGTTTTCAATGTAAGAGATTTCGATTATCCTTATAGTTCATACATGCTT CAACTACTTAATAAATGATTGTATGATAATGTTTTCAATGTAAGAGATTTCGATTATCCTTATAGTTCATACATGCTTCAACTACTTAA TAAATGATTGTATGATAATGTTTTCAATGTAAGAGATTTCGATTATCCTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGT ATGATAATGTTTTCAATGTAAGAGATTTCGATTATCTTATAGTTCATACATGCTTCAACTACTTAATAAATGATTGTATGATAATAAAG

Gene Regulation [BejeranoWinter12/13] 5 Some proteins and non coding RNAs go “back” to bind DNA near genes, turning these genes on and off. Gene DNA Proteins Gene activation:

[BejeranoWinter12/13] 6 Transcription Activation contd.

Transcription Activation [BejeranoWinter12/13] Terminology: RNA polymerase Transcription Factor Transcription Factor Binding Site Promoter Enhancer Gene Regulatory Domain 7 TF DNA

[BejeranoWinter12/13] 8 Transcription activation “loop” Gene Transcription factors bind DNA, turn on or off different promoters and enhancers, which in-turn turn on or off different genes, some of which may themselves be transcription factors, which again changes the presence of TFs in the cell, the state of active promoters/enhancers etc. Proteins DNA transcription factor binding site

IFN beta enhancer [BejeranoWinter12/13]9

Transcription Measurements [BejeranoWinter12/13] Some measurement techniques: Chromatin Immunoprecipitation Transcription output: –Transfection –Transgenics –Genome Engineering Chromosome Conformation Capture 10

Transcription Activation Properties [BejeranoWinter12/13] Observed Properties: Most TF binding site basepair preferences are independent of each other. TFs can synergize to turn gene activity on. Behavior can change in different conditions. TFs bind to hundreds and thousands of different targets in a single condition. Enhancers complement in different tissues. 11

Gene Regulation is HOT Gene regulation is currently one of the hottest topics in the study of the human genome. Large projects are pouring lots of money to generate large descriptive datasets. The challenge now is to glean logic from these piles. [BejeranoWinter12/13] 12 Measured >100 TFs in >70 cellular conditions. How does TF binding determine its output: gene expression?

System output measurements Measure non/coding gene expression! 1. First generation mRNA (cDNA) and EST sequencing: [BejeranoWinter12/13] 13 In UCSC Browser:

2. Gene Expression Microarrays (“chips”) [BejeranoWinter12/13] 14

3. RNA-seq “Next” (2nd) generation sequencing. [BejeranoWinter12/13] 15

[BejeranoWinter12/13] 16 Gene Finding II: technology dependence Challenge: “Find the genes, the whole genes, and nothing but the genes” We started out trying to predict genes directly from the genome. When you measure gene expression, the challenge changes: Now you want to build gene models from your observations. These are both technology dependent challenges. The hybrid: what we measure is a tiny fraction of the space-time state space for cells in our body. We want to generalize from measured states and improve our predictions for the full compendium of states.

4. Spatial-temporal maps generation [BejeranoWinter12/13] 17

Gene Expression Causality Measuring gene expression over time provides sets of genes that change their expression in synchrony. But who regulates whom? Some of the necessary regulators may not change their expression level when measured, and yet be essential. “Reading” enhancers can provide gene regulatory logic: If present(TF A, TF B, TF C) then turn on nearby gene X [BejeranoWinter12/13] 18

Some Computational Challenges in Gene Regulation Transcription factor binding site discovery Technology-dependent challenge in constructing the correct binding site model (e.g. motif) from the measurements. Eg, ChIP produces sequences of bp. Your motif of length 4-20 is there somewhere. Find the most enriched model in the set of sequences you obtained. Methods range between full enumeration, heuristic/probabilistic searches, and hybrids. [BejeranoWinter12/13] 19

Transcription factor motif discovery: different technologies SELEX = Systematic Evolution of Ligands by Exponential Enrichment PBM = Protein Binding Microarrays [BejeranoWinter12/13] 20

Transcription factor binding site prediction Given the genome, and possibly some cell measurements predict (all and nothing but) the binding sites of a given transcription factor (in a/all context/s). [BejeranoWinter12/13] 21

Enhancer Prediction How do TFs “sum” together to provide the activity of an enhancer? A network of genes? [BejeranoWinter12/13] 22

Enhancer Prediction Given a sequence of DNA predict: Is it an enhancer? Ie, can it drive gene expression? If so, in which cells? At which times? Driven by which transcription factor binding sites? Given a set of different enhancers driving expression in the same population of cells: Do they share any logic? If so what is it? Can you generalize this logic to find new enhancers? [BejeranoWinter12/13] 23

Biology is empirical: you predict, and you measure! Measuring is great. It allows you to check your assumptions and improve your models until you get it. Some difficulties associated with gene regulation: Single cell measurements are rare. You most often measure some “average” over a population of cells. The population of cells is seldom in sync (same state). The closer a population of cells is to its in vivo state the less homogeneous it is. The closer a population of cells is to its in vivo state the harder (time, effort, money) it is to measure it. [BejeranoWinter12/13] 24

Biology is empirical, you predict, and you measure! Some more difficulties associated with gene regulation: A family of TFs often has very similar binding motifs Expression pattern may be different (but unknown to you). Family members may have different protein-protein interaction (PPI) domains which are also important. The genome is pleiotropic ( = good for all contexts). If an enhancer you are studying is in fact good for multiple contexts they will be overlaid on each other in sequence and make prediction (and disentanglement) harder. [BejeranoWinter12/13] 25

Transcription Factors Large “fan outs” revisited TFs reproducibly bind to thousands of genomic locations almost anywhere we’ve looked. Gene regulation forms a dense network. However, when such a TF is perturbed (over expressed or silenced) only a fraction of the genes it binds next to change their expression levels. [BejeranoWinter12/13] 26

Genomics vs. Genetics Last but not least – genomics is descriptive. It can show you “everything”. Eg: all the location a given transcription factor is bound to the genome (reproducibly) in a given cell state. Which of these bindings actually matters? [BejeranoWinter12/13] 27 frequency effect on cell adverse effect observable in experiments adverse effect on cell no or near no effect FunctionAssay Binds reproduciblyRelatively easy Changes expression of nearby genes Hard Affects cell/organism function/fitness Very hard Affects cell/organism but not where/when I looked for it (pleiotropy) Harder still

Transcription factors “rule” [BejeranoWinter12/13] 28 Cellular reprogramming is done by adding to the cell large quantities of a small number of the “right” TFs. These somehow “reset” cell state. We have learned (in a dish) to: 1 control differentiation 2 reverse differentiation 3 hop between different states

[BejeranoWinter12/13] 29 Transcription Regulation is not just about activation

Transcriptional Repression [BejeranoWinter12/13] An equally important but less visible part of transcription (tx) regulation is transcriptional repression (that lowers/ablates tx output). Transcription factors can bind key genomic sites, preventing/repelling the binding of –The RNA polymerase machinery –Activating transcription factors (including via competitive binding) Some transcription factors have stereotypical roles as activators or repressors. Likely many can do both (in different contexts). DNA can be bent into 3D shape preventing enhancer – promoter interactions. Activator and co-activator proteins can be modified into inactive states. Note: repressor thus can relate to specific DNA sequences or proteins. 30

Transcriptional Output Prediction [BejeranoWinter12/13] All these can increase or decrease tx output: Adding/repressing different proteins Modifying DNA bases Adding genomic context Changing cellular context Repression logic is harder to tease out. (need positive controls) 31

[BejeranoWinter12/13] 32 Transcription can only happen in open Chromatin Chromatin / Proteins DNA / Proteins Genome packaging in fact provides a critical layer of gene regulation.

Gene Activation / Repression via Chromatin Remodeling A dedicated machinery opens and closes chromatin. Interactions with this machinery turns genes and/or gene regulatory regions like enhancers and repressors on or off (by making the genomic DNA in/accessible) [BejeranoWinter12/13] 33

Insulators [BejeranoWinter12/13] Insulators are DNA sequences that when placed between target gene and enhancer prevent enhancer from acting on the gene. Known insulators contain binding sites for a specific DNA binding protein (CTCF) that is involved in DNA 3D conformation. However, CTCF fulfills additional roles besides insulation. I.e, the presence of a CTCF site does not ensure that a genomic region acts as an insulator. 34 TSS1 TSS2 Insulator

[BejeranoWinter12/13] 35 Cis-Regulatory Components Low level (“atoms”): Promoter motifs (TATA box, etc) Transcription factor binding sites (TFBS) Mid Level: Promoter Enhancers Repressors/silencers Insulators/boundary elements Cis-regulatory modules (CRM) Locus control regions (LCR) High Level: Epigenetic domains / signatures Gene expression domains Gene regulatory networks (GRN)

Signal Transduction Everything we discussed so far happens within the cell. But cells talk to each other, copiously. [BejeranoWinter12/13] 36

[BejeranoWinter12/13] 37 Gene Regulation II Chromatin / Proteins DNA / Proteins Extracellular signals To be continued…

[BejeranoWinter12/13] 38 (On Mondays) ask students to stack the chairs without wheels at the back of the room at the end of class.

[BejeranoWinter12/13] 39