Download presentation
Presentation is loading. Please wait.
Published byEaster Berry Modified over 6 years ago
1
Human genome: gene structure, function & regulation Jeroen Breckpot
Interuniversity course in human genetics October 2017
2
Chapter 2. Thompson & Thompson
DNA DNA structure was discovered by Watson, Crick, Wilkins and Franklin in 1953 DNA is composed of dexoyribonucleotides which form a double helix with a sugar-phosphate backbone. Complementary bases A and T, and G and C form the rungs of the DNA ladder by means of hydrogen bonds. Chapter 2. Thompson & Thompson Genetics in Medicine
3
From base pair to chromosome
top-down view side view In eukaryotic cells DNA is not free flowing, but it is wrapped around proteins, the histones, like a wire of a classic telephone. A chromosome is a single piece of coiled double-stranded DNA with all associated proteins. JV
4
Chromosome territories
While chromosomes do appear as condensed, elongated structures during the process of cell division, for most of a cell's lifetime, chromosomes do not look anything like this (Figure 1). In fact, some cell types, such as neurons, never undergo cell division, and their chromosomes therefore never take on a condensed state. So, what do chromosomes look like in the nucleus of a cell between cell divisions? Figure 1 : Chromosome territories in the chicken. a) Stained, diploid spread of chicken metaphase chromosomes. b) Same spread after multicolor FISH with pseudocolored chromosomes. c) Probes were detected using antibodies. d) Mid-plane light optical section through a chicken fibroblast nucleus shows mutually exclusive chromosome territories with homologous chromosomes seen in separate locations. Chromosome territories in the chicken
5
The central dogma Genotype Phenotype DNA mRNA protein phenotype
nucleus - cytoplasm cytoplasm, organelles, extracellular tissues, organs, organism Genotype Phenotype
6
The central dogma Genotype Phenotype DNA mRNA protein phenotype
nucleus - cytoplasm cytoplasm, organelles, extracellular tissues, organs, organism Genotype Phenotype DNA sequence = 3,324,592,091 bp (3 billion) (hg19) coding genes: 20,769 short non coding genes: 9,079 long non coding genes: 13,564 pseudogenes: 14,165
7
features of a typical human gene
DEFINITION: a gene is a region of DNA that controls a discrete hereditary characteristic, usually corresponding to a single mRNA which will be translated into a protein (coding genes). Some genes encode a functional RNA molecule which is not translated into a polypeptide (non-coding genes) exon exon exon intron
8
features of a typical human gene
a gene includes: actual coding sequences adjacent nucleotide sequences for proper expression of the gene exon exon exon intron These adjacent nucleotide sequences provide the start and stop signal for the synthesis of mRNA. At the 5’ end of each gene lies a promotor region that is responsible for the proper initiation of transcription. Other regulatory elements (enhancers, silencers and locus control regions) can lie at the 5’ or 3’ of a gene, or can be intronic, and sometimes lie a significant distance away from the coding portion of a gene. That means that the genomic environment of a gene is an important feature of its regulation (mutations in remote regions may affect gene expression). At the 3’ end of the gene lies an UTR that contains a signal for the addition of adenosine residues (poly A tail) to the end of the mature mRNA. initiator codon termination codon in eukaryotes, the genes have their coding sequences (exons) interrupted by non-coding sequences (introns)
9
features of a pseudogene
DEFINITION DNA sequences that closely resemble known genes but are afunctional nonprocessed pseudogenes ‘dead’ genes: ‘duplicates’ which were inactivated by mutations in coding or regulatory sequences processed pseudogenes formed by retrotransposition: reverse transcription of RNA followed by integration in genome → lack of introns
10
features of non-coding genes
DEFINITION DNA sequences that encode an untranslated functional RNA product miRNA
11
Junk DNA In humans, genes constitute only about 2-3% of DNA, the rest is ‘junk’ DNA: gene-dense "urban centers“ predominantly composed of G & C versus gene-poor "deserts" rich in A and T: GC- and AT-rich regions usually can be seen through a microscope as light and dark bands on chromosomes. CpG islands : stretches C and G repeats adjacent to gene-rich areas, forming a barrier between the genes and the junk DNA. CpG islands help regulate gene activity.
12
Junk DNA junk DNA: LINE and SINE: “copy and paste” LINE: begins with an UTR, followed by 2 ORF, one encoding an RNA binding protein, and one encoding an endonuclease and a reverse transcriptase, machinery for retrotransposition SINE: do not encode a functional reverse transcriptase protein: they integrate at chromosomal breaks e.g. Alu repeats junk DNA: repeated sequences not coding for proteins: >50% of human genome reshape humane genome: creating entirely new genes, and modifying and reshuffling existing genes. during the past 50 million years: decrease seems to have occurred in the rate of accumulation of repeats
13
Junk DNA: Alu repeats FISH with Alu sonde marks gene-rich regions in human metaphase chromosomes Alu sequences are SINEs (short interspersed nuclear elements) that are enriched in intragenic regions most abundant transposable elements in the human genome. Modern Alu elements are about 300 base pairs long and are therefore classified as short interspersed elements (SINEs) among the class of repetitive DNA elements. The typical structure is 5'Part A- A5TACA6 -Part B - PolyA Tail - 3', where Part A and Part B are similar nucleotide sequences. DNA counterstain = red Alu sequences = green
14
The central dogma Transcription 1. Initiation of transcription
DNA mRNA protein phenotype Transcription 1. Initiation of transcription basal initiation complex transcription factors 2. mRNA synthesis 3. mRNA processing
15
Initiation of transcription
hetero-chromatin euchromatin In interphase DNA is condensed -> nucleosome
16
assembling basal initiation complex
Initiation of transcription assembling basal initiation complex RNA polymerase + TF RNA polymerase I: rRNA RNA polymerase II: mRNA, miRNA, snRNA, siRNA RNA polymerase III: tRNA, 5S rRNA,... Eukaryotic transcription takes place in the nucleus of the cell and proceeds in three sequential stages: initiation, elongation, and termination.[1] At the core of the transcriptional machinery that catalyzes the transcription is an RNA polymerase. Three types… The complex from the closed state to the open state results in the separation of the two DNA strands and the positioning of the template strand to the active site of the RNA polymerase. Without the need of a primer, RNA polymerase can initiate the synthesis of a new RNA chain using the template DNA strand Each RNA polymerase has its own promotor characteristics and transcription factors (some are shared)
17
Initiation of transcription
TATA box Conserved promotor sequence rich in adenines and thymines: 25 to 30 bp upstream from the start site of transcription Gene = UGT1A1
18
Initiation of transcription
assembling the enhanceome Eukaryotic genes also contain regulatory sequences beyond the core promoter. These cis-acting control elements bind transcriptional activators or repressors to increase or decrease transcription from the core promoter = enhancers, silencers, insulators….
19
Initiation of transcription
DNA binding domains
20
mRNA synthesis Transcription by RNAP II is initiated at the transcriptional start point at the 5’end: RNA synthesis proceeds in a 5’ to 3’ direction Template DNA strand is read from 3’ tot 5’: 5’ to 3’ strand of nontranscribed DNA = coding or sense DNA strand 3’ to 5’ strand of transcribed DNA = non-coding or antisense strand Similar to DNA syntesis, except: RNA polymerase no primer no proofreading Uracil (U) instead of T
21
mRNA processing The primary RNA transcript is processed by addition of a chemical cap structure to the 5’ end of the RNA and cleavage of the 3’ end at a specfic point downstream of the the of the coding information: the cleavage is followed by addition of a poly A tail to the 3’ end of the RNA. AAUAAA
22
mRNA splicing consensus sequence for an intron 3’ acceptor site
5’ donor site transesterification Within the intron, a donor site (5' end of the intron), a branch site (near the 3' end of the intron) and an acceptor site (3' end of the intron) are required for splicing. The splice donor site includes an almost invariant sequence GU at the 5' end of the intron, the splice acceptor site at the 3' end terminates the intron with an almost invariant AG sequence. The branch point contains an adenosine in a region rich of C and G. The consensus sequence for an intron (in IUPAC nucleic acid notation) is: A-G-[cut]-G-U-R-A-G-U (donor site) ... intron sequence ... Y-U-R-A-C (branch sequence nucleotides upstream of acceptor site) ... Y-rich-N-C-A-G-[cut]-G (acceptor site) Splicing is catalyzed by the spliceosome, a large RNA-protein complex composed of five small nuclear ribonucleoproteins (snRNPs, pronounced 'snurps'). This type of splicing is termed canonical splicing or termed the lariat pathway, which accounts for more than 99% of splicing. By contrast, when the intronic flanking sequences do not follow the GU-AG rule, noncanonical splicing is said to occur (see "minor spliceosome" below). major spliceosome complex: 5 snRNPs (small nuclear ribonucleoproteins: U1, U2, U4, U5, U6) canonical splicing or lariat pathway (GU ... AG) 99% of splicing consensus sequence for an intron
23
This results in a mature mRNA with a missing section of an exon.
splice site mutations splice site mutations can activate a cryptic splice site in part of the transcript that usually is not spliced point mutations in the underlying DNA or errors during transcription can activate a cryptic splice site in part of the transcript that usually is not spliced. This results in a mature messenger RNA with a missing section of an exon. This results in a mature mRNA with a missing section of an exon.
24
alternative splicing DEFINITION
the splicing process can create a range of unique proteins by varying the exon composition of the same mRNA. A B 1 2 3 4 the splicing process can create a range of unique proteins by varying the exon composition of the same mRNA. This phenomenon is then called alternative splicing.
25
alternative splicing: enhancers
26
alternative splicing: repressors
28
post-transcriptional mRNA modification
29
The central dogma Translation 1. Initiation of translation
DNA mRNA protein phenotype Translation 1. Initiation of translation 2. protein synthesis 3. protein modifications
30
Initiation of translation
Eukaryotic mRNAs possess a 5’end cap and are polyadenylated. The 5’ cap interacts with the initiation complex. mRNA is translated starting from codon AUG (Methionin) mRNA strand is read in direction from 5’ to 3’ Initiation: The ribosome assembles around the target mRNA. The first tRNA is attached at the start codon. Elongation: The tRNA transfers an amino acid to the tRNA corresponding to the next codon. Translocation: The ribosome then moves (translocates) to the next mRNA codon to continue the process, creating an amino acid chain. Termination: When a stop codon is reached, the ribosome releases the polypeptide.
31
5’cap and UTR interact with pre-initiation complex:
mRNA 5’ cap 3’ polyA 40S ribosomal subunit eIF4 eIF3 Met tRNA eIF2 5’cap and UTR interact with pre-initiation complex: 40S ribosomal subunit eIF3 eukaryotic initiation factor 3 eIF2 + initiator tRNA with Methionine (start codon AUG) eIF4 (eIF4A, eIF4E, eIF4F, eIF4G) Eukaryotic initiation factor eIF4 scans along mRNA from 5’ cap to find the start codon AUG: bases around the initiating AUG influence the efficiency of initiation (Kozak consensus sequence) The complete ribosome (80S) then commences translation elongation, during which the sequence between the 'start' and 'stop' codons is translated from mRNA into an amino acid sequence—thus, a protein is synthesized Elongation: Complete ribosomal unit (80S) = 60S + 40S Aminoacyl-tRNA
32
eIF4 mutations & Parkinson disease 18
33
Initiation of translation
34
Elongation: protein synthesis
35
Elongation: protein synthesis
Codon: 4³ = 64 codons Start codon: AUG (Met) Stop codon: UGA UAA UAG 20 naturally occurring amino acids: Met = AUG (start) 60 codons for 19 other amino acids
36
Protein synthesis A protein is a linear polymer of amino acids linked together by peptide bonds. average protein length 200 amino acids protein functions: structure, catalysis of reactions, … Complex structure: four structural levels Proteins have a complex structure which can be thought of as having four structural levels.
37
Protein modifications
Glycosylation, methylation, phosphorylation,…
38
DNA variants
39
Levels of Control Regulation of transcription: 1. DNA methylation
mRNA protein phenotype Regulation of transcription: 1. DNA methylation 2. Histone modification
40
Levels of Control: DNA methylation
abnormal hypermethylation of CpG islands can cause cancer, e.g. transcriptional silencing of tumor suppressor genes: target for gene therapy? DNA methyl transferase DNA methylation occurs mainly at the C5 position of CpG dinucleotides: maintenance methylation activity is necessary to preserve DNA methylation after every cellular DNA replication cycle: DNMT1. de novo methylation: installing methylation patterns early in development DNMT3a and DNMT3b: DNA methylation can stably alter the expression of genes in cells during cell division and differentiate from embryonic stem cells into specific tissues. DNA methylation is typically removed during zygote formation and re-established through successive cell divisions during development.
41
DNA methylation related disease
42
DNA methylation related disease
FRAGILE X SYNDROME CGG trinucleotide expansion > 200 (‘full mutation’) in FMR1 gene (on X chromosome) causes hypermethylation of the FMR1 promotor: inactivation of FMR1 expression Phenotype depends on # CGG repeats & methylation status moderate intellectual disability in affected males and mild intellectual disability in affected females Males with full mutation: large head long face prominent forehead and chin protruding ears large testes after puberty behavioral abnormalities & autism.
43
Levels of Control: DNA methylation
Effect of DNA methylation on gene transcription: the methylation of DNA itself physically impede the binding of transcriptional factors to the gene methylated DNA may be bound by methyl-CpG-binding domain proteins: MBDs. MBD proteins recruit histone deacetylases and other chromatin remodeling proteins > histone modification: forming compact, inactive heterochromatin.
44
MBD proteins related disease
RETT SYNDROME Developmental regression: onset 6 to 18 months Severe ID & autism Epilepsy Ataxia Behavioral problems Stereotyped hand movements Acquired microcephaly Loss of methyl-CpG-binding protein 2 (MeCP2) (on X chromosome) has been implicated in girls with Rett syndrome. MECP2 is an of MBD protein, which can act as a transcriptional repressor. MECP2 duplications cause severe ID in boys.
45
Levels of Control: chromatin remodeling
Lysine acetylation by HATs: reduces electrostatic attraction between the histone and the negatively charged DNA backbone, loosening the chromatin structure = EUCHROMATIN (<> HDAC)
46
Histone acetylation related disease
47
Levels of Control: chromatin remodeling
Lysine methylation by histone methyl transferase: induces euchromatin: H3K4, H3K36, H3K79 induces heterochromatin: H3K9, H3K27, H4K20
48
Histone methylation related disease
facial gestalt short stature microcephaly feeding problems oligodontia high/cleft palate fetal pads lax joints cardiac defects renal defects ID hypotonia frequent infections Kabuki syndrome MLL2 (KMT2D) loss-of-function mutations in 50-70% of KS patients KMT2D is a histone methyltransferase that targets lysine 4 of histone H3 (H3K4) to promote an open chromatin state.
49
Closing remark gene expression potential histone modifications
DNA mRNA protein phenotype gene expression potential histone modifications DNA methylation copy number variation regulatory elements SNVs & mutations RNA interference
50
The End
52
mRNPs: co-transcriptional assembly and remodelling from transcription to translation and degradation
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.