How many genes are there?

Slides:



Advertisements
Similar presentations
Genomics – The Language of DNA Honors Genetics 2006.
Advertisements

The Organization of Cellular Genomes Complexity of Genomes Chromosomes and Chromatin Sequences of Genomes Bioinformatics As we have discussed for the last.
Genomics, Genetics and Biochemistry
Retroviruses And retroposons
Retroviruses and Retroposons Chapter Introduction Figure 22.1.
Eukaryotes and Prokaryotes Key Differences in Protein Synthesis.
Prof. Drs. Sutarno, MSc., PhD.. Biology is Study of Life Molecular Biology  Studying life at a molecular level Molecular Biology  modern Biology The.
GENETIC-CONCEPTS.
ECE 501 Introduction to BME
Genes. Outline  Genes: definitions  Molecular genetics - methodology  Genome Content  Molecular structure of mRNA-coding genes  Genetics  Gene regulation.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Genomes summary 1.>930 bacterial genomes sequenced. 2.Circular. Genes densely packed Mbases, ,000 genes 4.Genomes of >200 eukaryotes (45.
(CHAPTER 12- Brooker Text)
Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species.
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
An Overview of Protein Synthesis. Genes A sequence of nucleotides in DNA that performs a specific function such as coding for a particular protein.
Plant of the Day! Rafflesia arnoldii (Euphorbiaceae)
Organisation of DNA in prokaryotes and eukaryotes
Chapter 2 Genes Encode RNAs and Polypeptides
HAPLOID GENOME SIZES (DNA PER HAPLOID CELL) Size rangeExample speciesEx. Size BACTERIA1-10 Mb E. coli: Mb FUNGI10-40 Mb S. cerevisiae 13 Mb INSECTS.
Eukaryotic Gene Expression The “More Complex” Genome.
Human Genetics The Human Genome 1.
Chapter 5 Genome Sequences and Gene Numbers. 5.1Introduction  Genome size vary from approximately 470 genes for Mycoplasma genitalium to 25,000 for human.
Chapter 10 genome, gene expression; genes as units of inheritance transmission of heritable characteristics; gene regulation, eukaryote chromosomes, alleles.
Chapter 2: From genes to Genomes. 2.1 Introduction.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
What does the word Promoter mean? It is the place at which RNA Pol II binds. But the word is incorrectly used to describe Enhancers plus Promoter.
Genetics: Chromosome Organization. Chromosomes: Structures that contain the genetic material (DNA) Genome – complete set of genetic material in a particular.
Chapter 11 Phage strategies.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Genomes & their evolution Ch 21.4,5. About 1.2% of the human genome is protein coding exons. In 9/2012, in papers in Nature, the ENCODE group has produced.
Used for detection of genetic diseases, forensics, paternity, evolutionary links Based on the characteristics of mammalian DNA Eukaryotic genome 1000x.
Chapter 21 Eukaryotic Genome Sequences
Protein Synthesis Part 1: Transcription. DNA is like a book of instructions written with the alphabet A, T, G, and C. Genes are specific sequences of.
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Lecture 10 Genes, genomes and chromosomes
BioSci D145 lecture 1 page 1 © copyright Bruce Blumberg All rights reserved Organization and Structure of Genomes (contd) Genome size –i.e. total.
Chapter 1 Introduction.
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
Chapter 2 From Genes to Genomes. 2.1 Introduction We can think about mapping genes and genomes at several levels of resolution: A genetic (or linkage)
Chapter 3 The Interrupted Gene.
11 Gene function: genes in action. Sea in the blood Various kinds of haemoglobin are found in red blood cells. Each kind of haemoglobin consists of four.
Genomics Chapter 18.
RNA and Gene Expression BIO 224 Intro to Molecular and Cell Biology.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
IB Saccharomyces cerevisiae - Jan Major model system for molecular genetics. For example, one can clone the gene encoding a protein if you.
Unit 4: Genetic Information, Variation and Relationships between Organisms Lesson 1 Genetic Organisation IN PROKARYOTIC CELLS, DNA MOLECULES ARE SHORT,
Chapter 13 Test Review.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
Molecular structure of gene and chromosome Gene: In molecular terms, a gene is the entire DNA sequence required for synthesis of functional protein or.
Aim: How is DNA organized in a eukaryotic cell?. Why is the control of gene expression more complex in eukaryotes than prokaryotes ? Eukaryotes have:
 DNA- genetic material of eukaryotes.  Are highly variable in size and complexity.  About 3.3 billion bp in humans.  Complexity- due to non coding.
The genome of prokaryotes and eukaryotes- nuclear and extranuclear genetic organization.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
MCB 7200: Molecular Biology
Chromosome Structure and
Chapter 5 The Content of the Genome
Chromosome Structure and
Genomes Genes and Alleles
Today… Review a few items from last class
Evolution of eukaryote genomes
Chapter 4 The Interrupted Gene.
Evolutionary genetics
Chapter 6 Genome Sequences and Gene Numbers
BSC1010: Intro to Biology I K. Maltz Chapter 21.
Unit 1: 1.1 Structure of DNA Organisation of DNA
The Structure of the Genome
The Content of the Genome
Genome Sequences and Gene Number
The Content of the Genome
Presentation transcript:

How many genes are there? Chapter 3: How many genes are there?

3.1 Introduction Total number of genes at four levels: genome is the complete set of genes of an organism transcriptome is the complete set of genes expressed under particular conditions proteome is the complete set of proteins Proteins may function independently or as part of multiprotein assemblies Identify the coding potential of a genome directly: open reading frames transcriptome: mRNAs proteome: all the proteins

3.2 Why are genomes so large? C Value

Figure 3.1 DNA content of the haploid genome is related to the morphological complexity of lower eukaryotes, but varies extensively among the higher eukaryotes. The range of DNA values within a phylum is indicated by the shaded area.

Figure 3.2 The minimum genome size found in each phylum increases from prokaryotes to mammals.

Figure 3.3 The genome sizes of some common experimental animals.

Figure 3.4 The proportions of different sequence components vary in eukaryotic genomes. The absolute content of nonrepetitive DNA increases with genome size, but reaches a plateau at ~2 X109 bp.

3.3 Total gene number is known for several organisms

Figure 3.5 Genome sizes and gene numbers are known from complete sequences for several organisms (Arabidopsis, Drosophila, and man are estimated from partial data). Lethal loci are estimated from genetic data.

Figure 3.6 ~20% of Drosophila genes code for proteins concerned with maintaining or expressing genes, ~20% for enzymes, <10% for proteins concerned with the cell cycle or signal transduction. Half of the genes of Drosophila code for products of unknown function.

Figure 3. 7 Because many genes are duplicated Figure 3.7 Because many genes are duplicated. the number of different gene families is much less than the total number of genes.

Figure 3.8 The fly genome can be divided into genes that are (probably) present in all eukaryotes, additional genes that are (probably) present in all multicellular eukaryotes, and genes that are more specific to subgroups of species that include flies.

3.4 How many genes are essential?

Figure 3.9 Genome sizes and gene numbers are known from complete sequences for several organisms (Arabidopsis, Drosophila, and man are estimated from partial data). Lethal loci are estimated from genetic data.

3.5 How many genes are expressed?

Figure 3.10 Hybridization between excess mRNA and cDNA identifies several components in chick oviduct cells, each characterized by the Rot½ of reaction.

Figure 3.11 HDA analysis allows change in expression of each gene to be measured. Each square represents one gene (top left is first gene on chromosome I, bottom right is last gene on chromosome XVI). Change in expression relative to wild type is indicated by red (reduction), whte (no change) or blue (increase). High-Density oligonucleotide Arrays

3.6 Organelles have DNA

Figure 3.12 Mitochondrial genomes have genes coding for (mostly complex 1-4) proteins, rRNAs, and tRNAs.

3.8 Mitochondrial DNA codes for few proteins

Figure 3.13 Human mitochondrial DNA has 22 tRNA genes, 2 rRNA genes, and 13 protein-coding regions. 14 of the 15 protein-coding or rRNA-coding regions are transcribed in the same direction. 14 of the tRNA genes are expressed in the clockwise direction and 8 are read counter clockwise.

Figure 3. 14 The mitochondrial genome of S Figure 3.14 The mitochondrial genome of S. cerevisiae contains both interrupted and uninterrupted protein-coding genes, rRNA genes, and tRNA genes (positions not indicated). Arrows indicate direction of transcription.

3.9 The chloroplast genome codes for ~100 proteins and RNAs

Figure 3.15 The chloroplast genome codes for 4 rRNAs, 30 tRNAs, and ~50 proteins.

3.10 Summary

The sequences comprising a eukaryotic genome can be classified in three groups: nonrepetitive sequences: unique; moderately repetitive sequences: dispersed repeated a small number of times in the form of related but not identical copies; highly repetitive sequences: short and usually repeated as a tandem array.

The proportions of the types of sequence: characteristic for each genome, although larger genomes tend to have a smaller proportion of nonrepetitive DNA. The complexity of any class describes the length of unique sequences in it; the repetition frequency describes the number of times each sequence is repeated. The C-value paradox describes the discrepancy between coding potential and DNA content in eukaryotic genomes

Most structural genes are located in nonrepetitive DNA Most structural genes are located in nonrepetitive DNA. The complexity of nonrepetitive DNA is a better reflection of the complexity of the organism than the total genome complexity; nonrepetitive DNA reaches a maximum complexity of ~2 x109 bp.

The total number of genes: <1000 for Mycoplasma and intracellular parasites, 20004000 for bacteria >6000 for yeast >12,000 for insects >100,000 for mammals.

Genes are expressed at widely varying levels Genes are expressed at widely varying levels. There may be 105 copies of mRNA for an abundant gene whose protein is the principal product of the cell, 103 copies of each mRNA for <10 moderately abundant messages, and <10 copies of each mRNA for >10,000 scarcely expressed genes. Overlaps between the mRNA populations of cells of different phenotypes are extensive; the majority of mRNAs are present in most cells.

not all genes are essential (lethal genes: the existence of devastating effects when they are mutated). The numbers of nonessential genes and essential genes could be comparable. yeast: only 60% of genes appear to be essential; D. melanogaster: <5000 essential genes. We do not understand how nonessential genes are maintained; they may provide selective advantages that are not evident.

NonMendelian inheritance is explained by the presence of DNA in organelles in the cytoplasm. Mitochondria and chloroplasts both represent membrane-bounded systems in which some proteins are synthesized within the organelle, while others are imported. The organelle genome is usually a circular DNA that codes for all of the RNAs and for some of the proteins that are required.

Mitochondrial genomes vary greatly in size from the 16 kb minimalist mammalian genome to the 570 kb genome of higher plants. It is assumed that the larger genomes code for additional functions. Chloroplast genomes range from 120~200 kb. Those that have been sequenced have a similar organization and coding functions. In both mitochondria and chloroplasts, many of the major proteins contain some subunits synthesized in the organelle and some subunits imported from the cytosol.

Mammalian mtDNAs are transcribed into a single transcript from the major coding strand, and individual products are generated by RNA processing. Rearrangements occur in mitochondrial DNA rather frequently in yeast; and recombination between mitochondrial or between chloroplast genomes has been found. There are some tantalizing homologies between mitochondrial and chloroplast genomes.