Functional Non-Coding DNA Part I Non-coding genes and non-coding elements of coding genes BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG.

Slides:



Advertisements
Similar presentations
QUESTIONS: 1. Name the DNA for which transcription requires SP1. 2. What is the evidence as seen in the autoradiogram? 3. What type of transcription factor.
Advertisements

Introduction to genomes & genome browsers
Control of Gene Expression
Ch 17 Gene Expression I: Transcription
Regulation of eukaryotic gene sequence expression Lecture 6.
Lecture 4: DNA transcription
Chromosome structure and chemical modifications can affect gene expression
Eukaryotic Gene Regulation. Introduction Difference between eukaryotic and prokaryotic DNA Regulation at chromosome level Regulation at the transcription.
Genome Browsers UCSC (Santa Cruz, California) and Ensembl (EBI, UK)
RNA Molecules and RNA Processing Functions and Modifications of RNA Molecules.
Chapter 15 Noncoding RNAs. You Must Know The role of noncoding RNAs in control of cellular functions.
Chapter 11 Gene Expression and Epigenetics
UTR motifs and microRNA analysis 曾 大 千 助 理 教 授 10/28/2008.
(CHAPTER 12- Brooker Text)
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG
Relationship between Genotype and Phenotype
Regulation of eukaryotic gene sequence expression
Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species.
NcRNAs What Genomes are Telling Us ncrna.ppt. ncRNA genes are difficult to discover! small an annotational and statistical concern no ORFs and no polyadenylation.
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
Introns and Exons DNA is interrupted by short sequences that are not in the final mRNA Called introns Exons = RNA kept in the final sequence.
Regulation of Gene Expression
Eukaryotic Gene Expression The “More Complex” Genome.
Regulation of Gene Expression Eukaryotes
GENE REGULATION ch 18 CH18 Bicoid is a protein that is involved in determining the formation of the head and thorax of Drosophila.
More regulating gene expression. Combinations of 3 nucleotides code for each 1 amino acid in a protein. We looked at the mechanisms of gene expression,
Intro to RNA-seq July 13, Goal of the course To be able to effectively design, and interpret genomic studies of gene expression. We will focus on.
RNA interference Definition: RNA interference (RNAi) is a mechanism where the presence of certain fragments.
MBP1007/ Nucleic Acids A functional mRNA: The cytoplasmic story Objectives (1) To discuss the iNUTS and iBOLTS of how mRNAs function in the cytoplasm.
Marco Magistri , Journal Club. A non-coding RNA (ncRNA) is any RNA molecule that is not translated into a protein “Structural genes encode proteins.
Molecular Biology in a Nutshell (via UCSC Genome Browser) Personalized Medicine: Understanding Your Own Genome Fall 2014.
How Genes Work Ch. 12.
Fig.1.8 DNA STRUCTURE 5’ 3’ Antiparallel DNA strands Hydrogen bonds between bases DOUBLE HELIX 5’ 3’
LECTURE CONNECTIONS 14 | RNA Molecules and RNA Processing © 2009 W. H. Freeman and Company.
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Chap. 8 Problem 1 Mechanisms of post-transcriptional gene control of protein coding genes are shown in Fig The most commonly used mechanism is the.
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Control of Gene Expression Chapter Proteins interacting w/ DNA turn Prokaryotic genes on or off in response to environmental changes  Gene Regulation:
Transcription and mRNA Modification
Control of Gene Expression Chapter DNA RNA Protein replication (mutation!) transcription translation (nucleotides) (amino acids) (nucleotides) Nucleic.
AP Biology Control of Prokaryotic (Bacterial) Genes.
Control of Eukaryotic Genome
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
RNA-ligand interactions and control of gene expression
Javad Jamshidi Fasa University of Medical Sciences, November 2015 Gene Structure and Transcriptio n.
GENE REGULATION RESULTS IN DIFFERENTIAL GENE EXPRESSION, LEADING TO CELL SPECIALIZATION Eukaryotic DNA.
IB Saccharomyces cerevisiae - Jan Major model system for molecular genetics. For example, one can clone the gene encoding a protein if you.
CFE Higher Biology DNA and the Genome Transcription.
Non-Coding RNA Helen Nordquist November 13, 2015.
Genes in ActionSection 2 Section 2: Regulating Gene Expression Preview Bellringer Key Ideas Complexities of Gene Regulation Gene Regulation in Prokaryotes.
Homework #2 is due 10/17 Bonus #1 is due 10/24 Office hours: M 10/ :30am 2-5pm in Bio 6.
MiRNAs and siRNAs 5 th March 2013 Saeideh Jafarinejad 4/22/2013 Rhumatology Research Center Lab(RRC lab)
Gene structure and function
CAMPBELL BIOLOGY IN FOCUS © 2014 Pearson Education, Inc. Urry Cain Wasserman Minorsky Jackson Reece Lecture Presentations by Kathleen Fitzpatrick and Nicole.
Chapter 15. I. Prokaryotic Gene Control  A. Conserves Energy and Resources by  1. only activating proteins when necessary  a. don’t make tryptophan.
Mestrado Integrado em Medicina Biologia Celular e Molecular II
Chapter 13 Regulatory RNA Introduction  RNA functions as a regulator by forming a region of secondary structure (either inter- or intramolecular)
Gene Regulation, Part 2 Lecture 15 (cont.) Fall 2008.
Regulatory RNAs. RNA DNA mRNA rRNA tRNA snRNA snoRNAmicroRNA siRNAribozymes Protein synthesis Splicing of mRNA Processing of rRNA Regulation of gene expression.
Eukaryotic Gene Regulation
miRNA genomic organization, biogenesis and function
Regulation of Gene Expression
Long Noncoding RNA in Prostate, Bladder, and Kidney Cancer
Regulation of Gene Expression by Eukaryotes
Regulation of Gene Expression
“Gene Expression Regulation”
Nat. Rev. Gastroenterol. Hepatol. doi: /nrgastro
mRNA Degradation and Translation Control
Derek de Rie and Imad Abuessaisa Presented by: Cassandra Derrick
Presentation transcript:

Functional Non-Coding DNA Part I Non-coding genes and non-coding elements of coding genes BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG

What Does ‘Functional Non-Coding DNA’ Mean? DNA whose sequence affects transcripts made from DNA in some way Could affect transcription levels, splicing or sequestering of RNA Three main ways to identify functional non- coding elements – Sequence characteristics – favored bases – Genomic conservation – Epigenetic marks and open chromatin especially outside of genes

Types of Non-Coding Elements Non-coding RNAs – miRNAs, lncRNAs, etc Non-coding gene elements – UTRs, splice sites, poly-adenylation sites, splice sites and regulating element, RNA-binding sites DNA elements outside genes – our main focus – Promoters – Enhancers/Silencers – Insulators

Types of Non-Coding RNA microRNAs Silencing RNAs Small nuclear/nucleolar RNAs Piwi-Interacting RNAs Long Non-Coding RNAs Circular RNAs Still other RNAs??? Comprehensive data base at

Micro-RNAs Micro-RNAs are small non-coding RNA molecules, about 21– 25 nucleotides in length They are processed from much longer genes, or from introns within mRNA, by several molecular pathways Micro-RNAs base-pair with complementary sequences within mRNA molecules, often in 3’ or 5’ UTR. miRNA binding usually results in gene repression either via translational stalling or by triggering mRNA degradation Image by Charles Mallery, U of Miami

Micro-RNAs The human genome encodes over 1500 miRNAs, which are believed to affect more than half of human genes miRNAs are abundant in many cell types – Thousands of copies per cell of some miRNAs – Those within gene introns share regulation miRNAs are well-conserved across vertebrates – No orthologs between plant and animal miRNAs – miRBase is the comprehensive repository of micro- RNAs

Other Short RNAs: siRNA Small interfering RNAs are double-stranded with an overhang They are processed by some of the same machinery as miRNAs and have some of the same effects

Other Short RNAs: piRNA Piwi-Interacting RNAs are longer base single-stranded RNAs – PIWI (P-element Induces Wimpy Testis) protein Over 50,000 sequences known in mouse – They are the largest class of nc-RNA They seem to play an ancient role in defense against retro-viruses and transposons

Other Short RNAs: snRNAs & snoRNAs Small nuclear RNAs (snRNAs) are typically ~ 150 bases long, and associate with protein – Many conserved copies of each snRNA gene – U1-U6 snRNAs key parts of splicing machinery Small nucleolar RNAs (snoRNAs) – Guide chemical modifications of other RNAs – Prader-Willi syndrome results from deletion of region containing 29 copies of SNORD116 on chr 15q11 U6 snRNA

Long Non-Coding RNAs Many long (>200bp) stretches of genome are transcribed and have epigenetic marks like those of protein-coding genes Most of these are spliced RNAs with two (or more) exons GENCODE v15 has 13.5K lncRNA See also – Derrien et al, Genome Research 2012 – Lee, Science 2012 From Derrien et al Genome Res 2012

Many lncRNAs Induce Silencing Coat nearby gene(s) and silence them Xist binds to gene clusters first Xist binds disparate parts of chromosome Many lncRNA are antisense to genes Some lncRNAs maintain pluripotency of stem cells From Jeannie Lee lab (Harvard) website

Long Non-Coding RNAs - 2 Most lncRNAs are expressed in only a few tissues Most human lncRNAs are specific to the primate lineage From Derrien et al Genome Res 2012

Circular RNAs Several thousand non-coding RNAs apparently form circular structures Many form complexes with AGO and seem to absorb attached miRNAs, blocking processing CDR1 has 70 conserved binding sites for mir7

Functional Pseudo-Genes Pseudo-genes are copies of genes that are decaying and rarely (never) make proteins Some pseudo-genes act to absorb negative regulators of the original gene – eg. SRGAP2B

How to Identify Non-Coding RNAs? Short (and long) RNA transcriptomes Promoter chromatin marks for independent (non-embedded) miRNAs and lncRNAs

DEMO: Display HOTAIR & XIST Tracks in UCSC Browser

Non-Coding Elements of Genes TSS 5' UTRs Introns Splicing regulation sites 3' UTRs Termination/Poly-adenylation sites

Transcription Start Sites Transcription of most genes may initiate at several distinct clusters of locations with distinct promoters for each TSS Two major types of metazoan TSS: CG-rich broad TSS, and narrow (often tissue-specific) TSS

Transcription Start Sites Transcription often starts at CG within promoter

5’ Untranslated Regions First exon often contains dozens to thousands of bases before Start codon (median 150) Sometimes contains regulatory sequences, e.g. binding sites for RNA binding proteins, and translation initiators

Splice Regulatory Sites Splicing is achieved through binding of spliceosome to recognition sequences on nascent RNA molecule

Splice Regulatory Sites Tissue-specific splice regulatory sites are highly conserved From Merkin et al Science 2012

Splicing Patterns Evolve in All Tissues Except Brain From Merkin et al Science 2012

Non-Coding Elements in Coding Exons Many regulatory sites occur within coding exons, esp. toward 5’ end These constrain some codons as much as protein sequence Many human SNPs break TFBS but have little effect on protein (AFAWK) From Stergachis et al Science 2013

3’ Untranslated Regions Longest exon is usually 3’UTR (>1000 nt) Typically 1/3 – 1/2 of a gene is in 5’ & 3’ UTRs 3’UTR has binding sites for miRNAs and RNA binding proteins AU-rich elements (AREs) stabilize mRNA Proteins recognize complex secondary structure GRIK4 3’UTR secondary structure is conserved

RNA Binding-Protein Sites mRNAs are usually further processed (e.g. transported or sequestered) RNA binding proteins recognize specific motifs within secondary structure of 3’ or 5’ UTR These sites are often highly conserved From Ray et al Nature 2013

Poly-adenylation/Termination Sites Transcripts can be terminated and poly-adenylated at sites with specific sequences Most genes have alternate poly-adenylation sites Median lengths of 3’UTR are 250 & 1773 bp (mouse)

Poly-adenylation/Termination Sites Rapidly proliferating cells express gene isoforms with short 3’ UTRs Neurons typically have longer 3’ UTRs Elkon et al, NRG 2013Types of alternate poly-adenylation

DEMO: GAPDH and GABRA1 in UCSC Browser