Outline of Talk Protein Genes Protein Genes SNPs SNPs Haplotypes Haplotypes Finding a Disease Locus Finding a Disease Locus.

Slides:



Advertisements
Similar presentations
Overview Transcription Detail Another Transcription Animation.
Advertisements

Click Here to Begin Your Lab
Chapter 13- RNA and Protein Synthesis
Transcription and Translation
Transcription and Translation
Proteins are made by decoding the Information in DNA Proteins are not built directly from DNA.
Concepts and Applications Eighth Edition
DNA.
Protein Synthesis 12-3.
Lab Make-Ups If you missed the strawberry lab or the candy dye lab you need to Borden ASAP to sign-up. The Candy dye lab kit will be returning to Fred.
From DNA to Proteins Chapter 13. Byssus: Marvelous Mussel Adhesive Mussels live in the surf zone and are subjected to continual pounding by waves Mussel.
GENE EXPRESSION. Gene Expression Our phenotype is the result of the expression of proteins Different alleles encode for slightly different proteins Protein.
Molecular Biology in a Nutshell (via UCSC Genome Browser) Personalized Medicine: Understanding Your Own Genome Fall 2014.
RNA Structure Like DNA, RNA is a nucleic acid. RNA is a nucleic acid made up of repeating nucleotides.
RNA AND PROTEIN SYNTHESIS
7. Protein Synthesis and the Genetic Code a). Overview of translation i). Requirements for protein synthesis ii). messenger RNA iii). Ribosomes and polysomes.
Chapter 11 DNA and Genes.
Cell Division and Gene Expression
Chapter 14.  Ricin (found in castor-oil plant used in plastics, paints, cosmetics) is toxic because it inactivates ribosomes, the organelles which assemble.
Chapter 14 Genetic Code and Transcription. You Must Know The differences between replication (from chapter 13), transcription and translation and the.
Chapter 17 From Gene to Protein. Protein Synthesis  The information content of DNA  Is in the form of specific sequences of nucleotides along the DNA.
Protein Synthesis. RNA (RIBONUCLEIC ACID)  Nucleic acid involved in the synthesis of proteins  Subunits are nucleotides  Nucleotides are composed of.
Lesson Four Structure of a Gene. Gene Structure What is a gene? Gene: a unit of DNA on a chromosome that codes for a protein(s) –Exons –Introns –Promoter.
From DNA to Proteins Chapter 13. Same two steps produce all proteins: 1) DNA is transcribed to form RNA –Occurs in the nucleus –RNA moves into cytoplasm.
©1998 Timothy G. Standish From DNA To RNA To Protein Timothy G. Standish, Ph. D.
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
From DNA to Proteins Chapter 13. Central Dogma DNA RNA Protein.
Example 1 DNA Triplet mRNA Codon tRNA anticodon A U A T A U G C G
G U A C G U A C C A U G G U A C A C U G UUU UUC UUA UCU UUG UCC UCA
Protein Synthesis Translation e.com/watch?v=_ Q2Ba2cFAew (central dogma song) e.com/watch?v=_ Q2Ba2cFAew.
Figure 17.4 DNA molecule Gene 1 Gene 2 Gene 3 DNA strand (template) TRANSCRIPTION mRNA Protein TRANSLATION Amino acid ACC AAACCGAG T UGG U UU G GC UC.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
Chapter 13- RNA and Protein Synthesis
F. PROTEIN SYNTHESIS [or translating the message]
From DNA to Protein.
Translation PROTEIN SYNTHESIS.
Whole process Step by step- from chromosomes to proteins.
Please turn in your homework
RNA and Protein Synthesis
The blueprint of life; from DNA to Protein
Where is Cytochrome C? What is the role? Where does it come from?
Ch 10: Protein Synthesis DNA to RNA to Proteins
What is Transcription and who is involved?
Ch. 17 From Gene to Protein Thought Questions
Gene Expression: From Gene to Protein
From Gene to Protein The information content of DNA is in the form of specific sequences of nucleotides The DNA inherited by an organism leads to specific.
Overview: The Flow of Genetic Information
Section Objectives Relate the concept of the gene to the sequence of nucleotides in DNA. Sequence the steps involved in protein synthesis.
Protein Synthesis Translation.
Overview: The Flow of Genetic Information
Usually, dominant alleles are recipes for functional proteins.
Transcription You’re made of meat, which is made of protein.
Gene Expression: From Gene to Protein
Chapter 17 From Gene to Protein.
SC-100 Class 25 Molecular Genetics
From DNA to Proteins Chapter 13.
RNA & Protein Synthesis
Warm Up 3 2/5 Can DNA leave the nucleus?
Today’s notes from the student table Something to write with
Transcription and Translation
Central Dogma and the Genetic Code
Protein Synthesis.
Bellringer Please answer on your bellringer sheet:
Protein Synthesis: Transcription & Translation
Protein Synthesis.
DNA, RNA, Amino Acids, Proteins, and Genes!.
How does DNA control our characteristics?
Transcription and Translation
12-3: RNA and Protein Synthesis (part 1)
12.2 Replication of DNA DNA replication is the process of copying a DNA molecule. Semiconservative replication - each strand of the original double helix.
Presentation transcript:

Outline of Talk Protein Genes Protein Genes SNPs SNPs Haplotypes Haplotypes Finding a Disease Locus Finding a Disease Locus

Size of the Genomes bacteria yeast round worm fruit fly flowering plant

The Human Genome

What the letters stand for DNA has four chemical subunits, called nucleotide bases abbreviated A, C, G, T. GATTACA en.wikipedia.org/wiki/Nucleotide

Whats in the Genome? Chromosomes – 23 pairs Chromosomes – 23 pairs – Genes Protein genes RNA genes MicroRNA genes – Repeats Tandem repeats Inverted repeats Transposons Segmental duplications – Regulatory regions Promoters Transcription factor binding sites

Protein Genes A protein gene contains the genetic code for a protein. The production of protein involves transcription (copying DNA to RNA) and translation (using RNA code to produce a protein).

tion/polysome_em.html r_Beatty3.jpg Transcription Translation

Finding Protein Genes Before the sequencing of genomes, protein genes were found experimentally. Now, new genes are predicted computationally using a gene model.

Finding Protein Genes Before the sequencing of genomes, protein genes were found experimentally. Now, new genes are predicted computationally using a gene model.

Finding Protein Genes Before the sequencing of genomes, protein genes were found experimentally. Now, new genes are predicted computationally using a gene model.

Finding Protein Genes Before the sequencing of genomes, protein genes were found experimentally. Now, new genes are predicted computationally using a gene model.

Finding Protein Genes Before the sequencing of genomes, protein genes were found experimentally. Now, new genes are predicted computationally using a gene model.

Building a Gene Model Gene models for prediction are based on the structure of genes in DNA and their messenger RNAs (mRNAs). This includes exons, introns, promoters, and the polyadenylation signal.

Exons In this example, EXONS are uppercase and introns are lowercase. Exons contain the code for a protein, introns interrupt the exons. Before translation, introns are removed from the messenger RNA. DNA: …ACTGCTACAGtctattgaGAACAACATAGtcacgaacttaacgtgca GTTTAACAGCACGtctcgaagggca… RNA (before removal of introns): …ACUGCUACAGucuauugaGAACAACAUAGucacgaacuuaacg ugcaGUUUAACAGCACGucucgaagggca… RNA (after removal of introns): …ACUGCUACAGGAACAACAUAGGUUUAACAGCACG…

The sequence of an exon contains codons. Each codon is a triplet of nucleotides which codes for a single amino acid. Amino acids are the building blocks of a protein. Finding Exons

Genetic Code. Each codon specifies one of twenty amino acids. Three codons are stop codons, which specify the end of translation.

An open reading frame (ORF), is a sequence of codons that does not contain a stop codon. Open Reading Frame (ORF) alaninethreonine glutamic acid leucinearginineserineSTOP!

Sequence: acggacucuagccuaaugugacgacugacauagguaaauucgcuc Even though this sequence contains stop codons, they are not present in all reading frames. frame +1 acg gac ucu agc cua aug uga cga cug aca uag gua aau ucg cuc frame +2 a cgg acu cua gcc uaa ugu gac gac uga cau agg uaa auu cgc uc frame +3 ac gga cuc uag ccu aau gug acg acu gac aua ggu aaa uuc gcu c Very short ORFs are unlikely. Finding Exons

Finding Introns Introns usually start at a G – T boundary and end at an A – G boundary.

Sequence:acggacucuagccuaaugugacgacugacauagguaaauucgcuc A gene can contain open reading frames connected across stop codons by an intron frame +1 acg gac ucu agc cua aug uga cga cug aca uag gua aau ucg cuc frame +3 ac gga cuc uag ccu aau gug acg acu gac aua ggu aaa uuc gcu c Finding Exons

How many genes are there? Estimates pre 2000: 100,000 based on estimates of required number of genes to account for human complexity pre 2000: 100,000 based on estimates of required number of genes to account for human complexity 2001: 30,000 – 40,000 based on first draft of human genome 2001: 30,000 – 40,000 based on first draft of human genome 2003: 23,000 – 24,500 based on gene prediction computer programs 2003: 23,000 – 24,500 based on gene prediction computer programs Why so low? alternate splicing of exons alternate splicing of exons complex regulatory mechanisms complex regulatory mechanisms inability to predict genes which are unlike those seen before inability to predict genes which are unlike those seen before

RNA Genes RNA genes do not code for proteins. Instead, the RNA molecule itself is functional in the cell. Examples include: 1.Ribosomal RNA – these molecules form the major component of the protein building machinery 2.Transfer RNA – work with ribosomal RNA to insert correct amino acids into growing proteins 3.MicroRNA – a newly discovered class of RNA which helps regulate gene expression.

Ribosome m.nih.gov/Class/N AWBIS/Modules/R NA/images/fig_rna 12.jpg

tion/polysome_em.html r_Beatty3.jpg Transcription Translation

RNA Genes MicroRNAs are short and show little or no conservation of sequence. Unlike protein genes, RNA genes do not contain codons or open reading frames. But, they do contain inverted repeats.

Inverted Repeats (IRs) RNA G A C U U G A U C A A G U C G A C U U G A U C A A G U C complemented reversed Two patterns, one the reverse complement of the other

IR Nomenclature Left arm Right arm Spacer RNA G A C U U G A U C A A G U C G A C U U G A U C A A G U C

Stem-Loop Structure CAGUUCAGGUCAAGUC Spacer Left arm Right arm Structure forms by pairing of complementary bases

MicroRNA MicroRNAs come from a precursor that contains a stem-loop.

Detection of Approximate Inverted Repeats Human Chr. 3 ~173,291,101 AAGACTTGAA CAACTTTTAA ACATAAGATC AATTATTTCA AGTAGATTCC CTTTTTCATT CACAATCACA TTCTCACAGA CACAGTCCCA GTTTCTACCT GACTGAGATG CAGTAAGGAA TCTGATTATA ACACTCATTG ATTATAACAC TCATTGAATT TATGGATTCC TTACTGCATC TCATTCAGGT AGAAAAAGGG ACTGTGTCTG TGAGAATGTG ATTGTGAATG AAAAAGATGG AATATGTGTA TTTTTGAGTG TCTATGGAAG AGCTTCTGAC AAGAGAGAGG AAGATTAGGT AAAATGAAAT ATCGCCGTCG GCATTTCCCC CTACGT AAGACTTGAA CAACTTTTAA ACATAAGATC AATTATTTCA AGTAGATTCC CTTTTTCATT CACAATCACA TTCTCACAGA CACAGTCCCA GTTTCTACCT GACTGAGATG CAGTAAGGAA TCTGATTATA ACACTCATTG ATTATAACAC TCATTGAATT TATGGATTCC TTACTGCATC TCATTCAGGT AGAAAAAGGG ACTGTGTCTG TGAGAATGTG ATTGTGAATG AAAAAGATGG AATATGTGTA TTTTTGAGTG TCTATGGAAG AGCTTCTGAC AAGAGAGAGG AAGATTAGGT AAAATGAAAT ATCGCCGTCG GCATTTCCCC CTACGT

Detection of Approximate Inverted Repeats Human Chr. 3 ~173,291,101 AAGACTTGAA CAACTTTTAA ACATAAGATC AATTATTTCA AGTAGATTCC CTTTTTCATT CACAATCACA TTCTCACAGA CACAGTCCCA GTTTCTACCT GACTGAGATG CAGTAAGGAA TCTGATTATA ACACTCATTG ATTATAACAC TCATTGAATT TATGGATTCC TTACTGCATC TCATTCAGGT AGAAAAAGGG ACTGTGTCTG TGAGAATGTG ATTGTGAATG AAAAAGATGG AATATGTGTA TTTTTGAGTG TCTATGGAAG AGCTTCTGAC AAGAGAGAGG AAGATTAGGT AAAATGAAAT ATCGCCGTCG GCATTTCCCC CTACGT AAGACTTGAA CAACTTTTAA ACATAAGATC AATTATTTCA AGTAGATTCC CTTTTTCATT CACAATCACA TTCTCACAGA CACAGTCCCA GTTTCTACCT GACTGAGATG CAGTAAGGAA TCTGATTATA ACACTCATTG ATTATAACAC TCATTGAATT TATGGATTCC TTACTGCATC TCATTCAGGT AGAAAAAGGG ACTGTGTCTG TGAGAATGTG ATTGTGAATG AAAAAGATGG AATATGTGTA TTTTTGAGTG TCTATGGAAG AGCTTCTGAC AAGAGAGAGG AAGATTAGGT AAAATGAAAT ATCGCCGTCG GCATTTCCCC CTACGT Arms are 72 nt long, spacer is 42bp long

The Problem: Find the Inverted Repeat Human Chr. 3 ~173,291,101 AAGACTTGAA CAACTTTTAA ACATAAGATC AATTATTTCA AGTAGATTCC ATGG AATATGTGTA TTTTTGAGTG TCTATGGAAG AGCTTCTGAC AAGAGAGAGG AAGATTAGGT AAAATGAAAT ATCGCCGTCG GCATTTCCCC CTACGT AAGACTTGAA CAACTTTTAA ACATAAGATC AATTATTTCA AGTAGATTCC CTTTTTCATT CACAATCACA TTCTCACAGA CACAGTCCCA GTTTCTACCT GACTGAGATG CAGTAAGGAA TCTGATTATA ACACTCATTG ATTATAACAC TCATTGAATT TATGGATTCC TTACTGCATC TCATTCAGGT AGAAAAAGGG ACTGTGTCTG TGAGAATGTG ATTGTGAATG AAAAAGATGG AATATGTGTA TTTTTGAGTG TCTATGGAAG AGCTTCTGAC AAGAGAGAGG AAGATTAGGT AAAATGAAAT ATCGCCGTCG GCATTTCCCC CTACGT

Single Nucleotide Polymorphisms (SNPs) A SNP is a single position in the genome (a locus) that is not the same in all people. Some people have one type of nucleotide and other people have a different nucleotide. Differences in the population at a single locus are called polymorphisms and the individual types are called alleles. SNPs are found experimentally acattcct acgttatt SNPs

Haplotypes A haplotype is a collection of SNP alleles on a single chromosome in an individual. Shown are SNPS on two chromosomes in each individual. acgttcatacattcattcgttcatacagatatacattcct acattcct atagtccaacagtcca tcattcatacattcaa

Haplotypes A haplotype is a collection of SNP alleles on a single chromosome in an individual. Homozygous (same alleles) acgttcatacattcattcgttcatacagatatacattcct acattcct atagtccaacagtcca tcattcatacattcaa

Haplotypes A haplotype is a collection of SNP alleles on a single chromosome in an individual. Heterozygous (different alleles) acgttcatacattcattcgttcatacagatatacattcct acattcct atagtccaacagtcca tcattcatacattcaa

Haplotypes A haplotype is a collection of SNP alleles on a single chromosome in an individual. Rare alleles acgttcatacattcattcgttcatacagatatacattcct acattcct atagtccaacagacca tcattcatacattcaa

Haplotypes A haplotype is a collection of SNP alleles on a single chromosome in an individual. Strong linkage (usually occur together) acgttcatacattcattcgttcatacagatatacattcct acattcct atagtccaacagtcca tcattcatacattcaa

Linkage Analysis SNPs and haplotypes are used to identify regions of the genome that cause disease. The technique is called linkage analysis and evidence of a connection is called linkage disequilibrium (LD). acattcat`atagtcca acagatattcattcat momdad acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child recombination and inheritance

Linkage Analysis SNPs and haplotypes are used to identify regions of the genome that cause disease. The technique is called linkage analysis and evidence of a connection is called linkage disequilibrium (LD). acattcat`atagtcca acagatattcattcat momdad acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child recombination in the mothers chromosomes

Linkage Analysis SNPs and haplotypes are used to identify regions of the genome that cause disease. The technique is called linkage analysis and evidence of a connection is called linkage disequilibrium (LD). acattcat`atagtcca acagatattcattcat momdad acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child recombination in the fathers chromosomes

Linkage Analysis SNPs and haplotypes are used to identify regions of the genome that cause disease. The technique is called linkage analysis and evidence of a connection is called linkage disequilibrium (LD). acattcat`atagtcca acagatattcattcat momdad acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child two to three crossovers per chromosome per generation

Linkage Analysis Key point: Alleles that are physically close together tend to be inherited together because the chance of a crossover between them is small. They exhibit strong linkage. acattcat`atagtcca acagatattcattcat momdad acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child

Finding an Unknown Disease Locus The location on the genome of many diseases is unknown. SNPs and haplotypes are being used to search for disease loci using linkage analysis. acattcat`atagtcca acagatattcattcat mom dad has disease acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child has disease

Linkage Analysis – Dominant Model Assume the disease is caused by a dominant allele, meaning one copy is enough to cause the disease. acattcat`atagtcca acagatattcattcat mom dad has disease acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child has disease SNP alleles in father that are not in mother

Linkage Analysis – Dominant Model Assume the disease is caused by a dominant allele, meaning one copy is enough to cause the disease. acattcat`atagtcca acagatattcattcat mom dad has disease acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child has disease SNP allele in child, inherited from father with disease

Linkage Analysis – Dominant Model Assume the disease is caused by a dominant allele, meaning one copy is enough to cause the disease. acattcat`atagtcca acagatattcattcat mom dad has disease acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child has disease SNP allele and disease are linked indicating possible disease locus.

Linkage Analysis – Recessive Model Assume the disease is caused by a recessive allele, meaning two copies are required to cause the disease. acattcatatagtcca acagatattcattcat mom dad has disease acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child has disease homozygous SNP alleles in father that are heterozygous in mother

Linkage Analysis – Recessive Model Assume the disease is caused by a recessive allele, meaning two copies are required to cause the disease. acattcat`atagtcca acagatattcattcat mom dad has disease acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child has disease homozygous SNP allele in child, identical to fathers

Linkage Analysis – Recessive Model Assume the disease is caused by a recessive allele, meaning two copies are required to cause the disease. acattcat`atagtcca acagatattcattcat mom dad has disease acagtcc aaccaaggttcccca`a`aaccaaggttcccca`a`acagacat child has disease SNP allele and disease are linked indicating possible disease locus.

BMI = weight/height 2 in kg/m 2, BMI > 25 overweight, BMI > 30 obese

Other Differences – Microdeletions A microdeletion is the loss of a small piece of DNA, perhaps as small as 1000 bases. These pieces can contain genes, parts of genes or regulatory regions. acattcct atgttt gcgcat acactcct microdeletions

Other Differences – Microdeletions A microdeletion is the loss of a small piece of DNA, perhaps as small as 1000 bases. These pieces can contain genes, parts of genes or regulatory regions. acattcct atgttt gcgcat acactcct heterozygous

Other Differences – Microdeletions A microdeletion is the loss of a small piece of DNA, perhaps as small as 1000 bases. These pieces can contain genes, parts of genes or regulatory regions. acattcct atgttt gcgcat acactcct homozygous

Other Differences – Microdeletions A microdeletion is the loss of a small piece of DNA, perhaps as small as 1000 bases. These pieces can contain genes, parts of genes or regulatory regions. acattcct atgttt gcgcat acactcct miscalled homozygous

Apparent Inheritance Inconsistency SNPs and haplotypes are used to identify regions of the genome that cause disease. The technique is called linkage analysis and evidence of a connection is called linkage disequilibrium (LD). ataagaaccccac acatccacccctccac momdad ccca ccccccaac`c`ccccccaac`c`acatccac child

Apparent Inheritance Inconsistency SNPs and haplotypes are used to identify regions of the genome that cause disease. The technique is called linkage analysis and evidence of a connection is called linkage disequilibrium (LD). ataagaaccccac acatccacccctccac momdad ccca ccccccaac`c`ccccccaac`c`acatccac child a a + t t a t by Mendelian inheritance

Apparent Inheritance Inconsistency SNPs and haplotypes are used to identify regions of the genome that cause disease. The technique is called linkage analysis and evidence of a connection is called linkage disequilibrium (LD). ataagaaccccac acatccacccctccac momdad ccca ccccccaac`c`ccccccaac`c`acatccac child cluster of inconsistencies suggests a microdeletion.

Microdeletions Hundreds of microdeletion haplotypes have been discovered recently. They may be a major contributor to human differences and disease.

Resources UCSC Human Genome Browser National Center for Biotechnology Information (NCBI) PubMed