Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species.

Slides:



Advertisements
Similar presentations
Genomics – The Language of DNA Honors Genetics 2006.
Advertisements

Chromatin Structure & Genome Organization. Overview of Chromosome Structure Nucleosomes –~200 bp DNA in 120 Å diameter coil –3.4 Å /bp x 200 = 680 Å –680/120.
The Organization of Cellular Genomes Complexity of Genomes Chromosomes and Chromatin Sequences of Genomes Bioinformatics As we have discussed for the last.
Chap. 6 Problem 2 Protein coding genes are grouped into the classes known as solitary (single) genes, and duplicated or diverged genes in gene families.
Gene Expression Chapter Eleven. What is Gene Expression? When a gene is expressed – that gene’s protein product is made: 1.DNA is transcribed into RNA.
Describe the structure of a nucleosome, the basic unit of DNA packaging in eukaryotic cells.
Lecture #8Date _________ n Chapter 19~ The Organization and Control of Eukaryotic Genomes.
Genome Organization & Protein Synthesis and Processing in Plants
3.1 An overview of genetic possesses 3.2 The basis of hereditary 3.3 DNA replication 3.4 RNA and protein synthesis 3.5 Gene expression.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
© 2006 W.W. Norton & Company, Inc. DISCOVER BIOLOGY 3/e
chromosome organization, what about genome organization?
Chapter 11 Gene Expression and Epigenetics
(CHAPTER 12- Brooker Text)
CHAPTER 3 GENE EXPRESSION IN EUKARYOTES (cont.) MISS NUR SHALENA SOFIAN.
RNA Ribonucleic Acid.
Transcription Nicky Mulder Acknowledgements: Anna Kramvis for lecture material (adapted here)
How Proteins are Made. I. Decoding the Information in DNA A. Gene – sequence of DNA nucleotides within section of a chromosome that contain instructions.
NcRNAs What Genomes are Telling Us ncrna.ppt. ncRNA genes are difficult to discover! small an annotational and statistical concern no ORFs and no polyadenylation.
Chapter 19: Eukaryotic Genomes Most gene expression regulated through transcription/chromatin structure Most gene expression regulated through transcription/chromatin.
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
Transcription Transcription is the synthesis of mRNA from a section of DNA. Transcription of a gene starts from a region of DNA known as the promoter.
Introns and Exons DNA is interrupted by short sequences that are not in the final mRNA Called introns Exons = RNA kept in the final sequence.
Eukaryotic Gene Expression The “More Complex” Genome.
Human Genetics The Human Genome 1.
Eukaryotic Gene Control. Developmental pathways of multicellular organisms: All cells of a multicellular organism start with the same complement of DNA.
Regulation of Gene Expression Eukaryotes
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
Fig Genome = Genic + Intergenic (or non-genic) Eukaryotic genomes: composition of human genome.
Genome Organization & Evolution. Chromosomes Genes are always in genomic structures (chromosomes) – never ‘free floating’ Bacterial genomes are circular.
From Gene to Protein A.P. Biology. Regulatory sites Promoter (RNA polymerase binding site) Start transcription DNA strand Stop transcription Typical Gene.
AP Biology Control of Eukaryotic Genes.
AP Biology Control of Eukaryotic Genes.
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Control of Gene Expression Chapter Proteins interacting w/ DNA turn Prokaryotic genes on or off in response to environmental changes  Gene Regulation:
Lecture 10 Genes, genomes and chromosomes
Control of Eukaryotic Genome
Eukaryotic Genomes: The Organization and Control.
Eukaryotic Gene Structure. 2 Terminology Genome – entire genetic material of an individual Transcriptome – set of transcribed sequences Proteome – set.
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
Chapter 3 The Interrupted Gene.
11 Gene function: genes in action. Sea in the blood Various kinds of haemoglobin are found in red blood cells. Each kind of haemoglobin consists of four.
Gene Regulation In 1961, Francois Jacob and Jacques Monod proposed the operon model for the control of gene expression in bacteria. An operon consists.
RNA and Gene Expression BIO 224 Intro to Molecular and Cell Biology.
How many genes are there?
Lesson Four Structure of a Gene. Gene Structure What is a gene? Gene: a unit of DNA on a chromosome that codes for a protein(s) –Exons –Introns –Promoter.
Chapter 19 The Organization & Control of Eukaryotic Genomes.
IB Saccharomyces cerevisiae - Jan Major model system for molecular genetics. For example, one can clone the gene encoding a protein if you.
CFE Higher Biology DNA and the Genome Transcription.
Unit-II Synthetic Biology: Protein Synthesis Synthetic Biology is - A) the design and construction of new biological parts, devices, and systems, and B)
Chapter 15, Part I. Topic Outline Translation Prokaryotic Gene Regulation Eukaryotic Gene Regulation Mutations Cancer.
Gene structure and function
Human Molecular Genetics Institute of Medical Genetics.
Aim: How is DNA organized in a eukaryotic cell?. Why is the control of gene expression more complex in eukaryotes than prokaryotes ? Eukaryotes have:
CAMPBELL BIOLOGY IN FOCUS © 2014 Pearson Education, Inc. Urry Cain Wasserman Minorsky Jackson Reece Lecture Presentations by Kathleen Fitzpatrick and Nicole.
Chapter 12 Protein Synthesis. Central Dogma: DNA  RNA  Protein (the flow of genetic information)
Control of Eukaryotic Genes (Ch. 19) The BIG Questions… How are genes turned on & off in eukaryotes? How do cells with the same genes differentiate to.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
Gene Regulation, Part 2 Lecture 15 (cont.) Fall 2008.
Gene Expression: Prokaryotes and Eukaryotes AP Biology Ch 18.
Eukaryotic Gene Regulation
Eukaryotic Gene Structure
SGN23 The Organization of the Human Genome
Gene Regulation Ability of an organisms to control which genes are present in response to the environment.
Eukaryote Regulation and Gene Expression
Epigenetics Study of the modifications to genes which do not involve changing the underlying DNA
Eukaryotic Genomes: The Organization and Control.
Gene Density and Noncoding DNA
T--A--C--A--A--G--T--A--C-- T--T--G--T--T--T--C--T--T--A--A—A
Chapter 6: Transcription and RNA Processing in Eukaryotes
Presentation transcript:

Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species.

Genome Organization G

C value paradox: The amount of DNA in the haploid cell of an organism is not related to its evolutionary complexity or number of genes.

Highly Repeated Sequences

There are different classes of eukaryotic DNA based on sequence complexity.

Amount of DNA in a Genome Does Not Correlate with Complexity 105 106 107 108 109 1010 1011 1012 basepairs

How many genes do humans have? Original estimate was between 50,000 to 100,000 genes We now think humans have ~ 20,000 genes How does this compare to other organisms? Mice have ~30,000 genes Pufferfish have ~35,000 Nematodes (C. elegans), have ~19,000 Yeast (S. cerevisiae) has ~6,000 The microbe responsible for tuberculosis has ~4,000

Single Copy Sequences Exome

Even the Amount of DNA a Gene Spans Differs Among Species

Problems? Some gene products are RNA (tRNA, rRNA, others) instead of protein Some nucleic acid sequences that do not encode gene products (noncoding regions) are necessary for production of the gene product (protein or RNA). Eukaryotic genes are complex!

Gene Identification Open reading frames Sequence conservation Database searches Synteny Sequence features CpG islands Evidence for transcription ESTs, microarrays Gene inactivation Transformation, RNAi

Unique genes

Noncoding regions Regulatory regions Introns RNA polymerase binding site Transcription factor binding sites Introns Polyadenylation [poly(A)] sites

Splice Sites Eukaryotes only Removal of internal parts of the newly transcribed RNA. Takes place in the cell nucleus Splice sites difficult to predict

One gene, many proteins via alternative splicing , 3’ cleavage and polyadenlyation

Exon Shuffling

Trans-Splicing in Higher Eukaryotes 21 Gingeras, Nature (2009) 461, 206-211

Non-contiguous Transcription Generates An Enormous Number of Possible Transcripts • Trans-splicing exists in higher eukaryotes as well as in lower ones like Trypanosomes Six 2-exon co-linear combinations from four exons Blue: only co-linear Red: all combinations 325 combinations of 3-exons, non-colinear • Reassortment of exons coding for ncRNA or protein domains could dramatically increase number of functional products beyond the number of ‘genes’ 22 Gingeras, Nature (2009) 461, 206-211

Why genome size isn’t the only concern (size doesn’t matter?) More sophisticated regulation of expression? Proteome vastly larger than genome? Alternate splicing RNA editing Postranslational modifications? Cellular location? Moonlighting

Gene families E.g. globins, actin, myosin Clustered or dispersed Pseudogenes

Pseudogenes Nonfunctional copies of genes Formed by duplication of ancestral gene, or reverse transcription (and integration) Not expressed due to mutations that produce a stop codon (nonsense or frameshift) or prevent mRNA processing, or due to lack of regulatory sequences

Duplicated genes Encode closely related (homologous) proteins Formed by duplication of an ancestral gene followed by mutation Five functional genes and two pseudogenes

Paralogs vs Orthologs Different members of the globin gene family are paralogs, having evolved one from another through gene duplication. Paralogs are separated by a gene duplication event. Each specific gene family member (e.g. a specific gene in human) is an ortholog of the same family member in another species (e.g. mouse). Both evolved from an ancestral globin gene. Orthologs are separated by a speciation event. It is not always easy to distinguish true orthologs from paralogs , especially in polyploid organisms!

Protein - coding sequences less than 1.5% of the genome in humans!

Noncoding RNAs (ncRNA) Do not have translated ORFs Small Not polyadenylated

Functions of Known lncRNAs • Transcriptional interference -lncRNA transcription turns off transcription of nearby gene • Initiation of chromatin remodeling - lncRNA transcription turns on transcription of nearby gene • Promoter inactivation - lncRNA binds to TFIIB and to promoter DNA • Activation of an accessory protein - lncRNA binds to allosteric effector protein TLS and inhibits histone acetyltransferase, decreasing transcription 34 Ponting et al, Cell (2009) 136, 629-641

Functions of Known lncRNAs • Activation of transcription factors - binding of lncRNA to Dlx2 activates Dlx5/6 activity • Oligomerization of an accessory protein - lncRNA induces heat shock factor trimerization • Transport of transcription factors -lnRNA NRON keeps NFAT out of nucleus • Epigenetic silencing of gene clusters -Xist RNA inactivates X chromosome • Epigenetic repression of genes in trans -HOTAIR binds PRC2, leading to methylation and silencing of several genes in HOXD locus Ponting et al, Cell (2009) 136, 629-641 35

ncRNA ~97-98% of the transcriptional output of the human genome is ncRNA Introns Transfer RNAs (tRNA) ~ 500 tRNA genes in human genome Ribosomal RNAs Tandem arrays on several chromosomes 150-200 copies of 28S – 5.8S – 18S cluster 200-300 copies of 5S cluster

Genome Organization - ncRNA The level of transcription from human chromosomes 21 and 22 is an order of magnitude higher than can be accounted for by known or predicted exons Almost half of all transcripts from well-constructed mouse cDNA libraries are ncRNAs (identified because they do not code for an open reading frame of larger than 100 codons)

Repeat sequences – 50% or more of the genome

Repetitive DNA Moderately repeated DNA Simple-sequence DNA Tandemly repeated rRNA, tRNA and histone genes (gene products needed in high amounts) Large duplicated gene families Mobile DNA Simple-sequence DNA Tandemly repeated short sequences Found in centromeres and telomeres (and others) Used in DNA fingerprinting to identify individuals

Segmental duplications Found especially around centromeres and telomeres Often come from nonhomologous chromosomes Many can come from the same source Tend to be large (10 to 50 kb) Unique to humans?

Repetitive DNA - Segmental duplications

Mobile DNA Moves within genomes Most of the moderately repeated DNA sequences found throughout higher eukaryotic genomes L1 LINE is ~5% of human DNA (~50,000 copies) Alu is ~5% of human DNA (>500,000 copies) Some encode enzymes that catalyze movement

Repetitive DNA – Highly repetitive satellite DNA