Limitations of genome projects Windowjhgjhddoorhubbahubbastairduh 10 7 3.10 9 What do proteins do for a living?

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
Recombinant DNA Technology
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. CHAPTER 18 LECTURE SLIDES.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
DNA Sequencing and Gene Analysis
Bacterial Physiology (Micr430)
1 Characterization, Amplification, Expression Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis of DNA.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Cloning, genomes, and proteomes
Announcements: Proposal resubmissions are due 4/23. It is recommended that students set up a meeting to discuss modifications for the final step of the.
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 5 A Story of Transcription.
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 1 A Story of Transcription.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Proteome.
Fine Structure and Analysis of Eukaryotic Genes
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Lecture 5: Challenges in the post- genomic era The tiger leg leaf frog Photo: Zig Leszccynski Image: courtesy Rainforest Alliance.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
PHYSICAL MAPPING AND POSITIONAL CLONING. Linkage mapping – Flanking markers identified – 1cM, for example Probably ~ 1 MB or more in humans Need very.
歐亞書局 PRINCIPLES OF BIOCHEMISTRY Chapter 9 DNA-Based Information Technologies.
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
CO 10.
ASSIGNING GENE FUNCTION BY EXPERIMENTAL ANALYSIS
Lecture 5 Post-genomics. Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
es/by-sa/2.0/. Large Scale Approaches to the Study of Protein Levels and Activity Prof:Rui Alves
Lecture 5 Post-genomics. Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems.
1. Bacterial genomes - genes tightly packed, no introns... HOW TO FIND GENES WITHIN A DNA SEQUENCE? Scan for ORFs (open reading frames) - check all 6 reading.
Finish up array applications Move on to proteomics Protein microarrays.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Literature reviews revised is due4/11 (Friday) turn in together: revised paper (with bibliography) and peer review and 1st draft.
Biotechnology.
19.1 Techniques of Molecular Genetics Have Revolutionized Biology
Protein-protein interactions “The Interactome” Yeast two-hybrid analysis Yeast two-hybrid analysis Protein chips Protein chips Biochemical purification/Mass.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
MCB 317 Genetics and Genomics Topic 11 Genomics. Readings Genomics: Hartwell Chapter 10 of full textbook; chapter 6 of the abbreviated textbook.
Chapter 7 Analyzing DNA and gene structure, variation and expression 1.Sequencing and genotyping DNA Standard/manual DNA sequencing using dideoxynucleotide.
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
From Genomes to Genes Rui Alves.
Post-genomics. Post-genomics Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome)
Chapter 11: Functional genomics
Central dogma: the story of life RNA DNA Protein.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Lecture 18 – Functional Genomics Based on chapter 8 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
A New Strategy of Protein Identification in Proteomics Xinmin Yin CS Dept. Ball State Univ.
Two powerful transgenic techniques Addition of genes by nuclear injection Addition of genes by nuclear injection Foreign DNA injected into pronucleus of.
目录 The Principle and Application of Common Used Techniques in Molecular Biology chapter 18.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
Finding genes in the genome
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Techniques of Molecular Biology
The Transcriptional Landscape of the Mammalian Genome
Lecture 8 A toolbox for mechanistic biologists (continued)
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
Today… Review a few items from last class
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics II
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
From Mendel to Genomics
Genome Annotation and the Human Genome
Presentation transcript:

Limitations of genome projects Windowjhgjhddoorhubbahubbastairduh What do proteins do for a living?

(A) Identifying genes from the sequence (B) Gene expression profiling (C) Genome activity studies Genomes2 by TA Brown; chapter 7 Post-genomics

(A) Hunting genes from the sequence 2 broad approaches 1) Ab initio method (computational) 2) Experimental method

Ab initio method (computational) Scanning ORFs (open reading frames) – initiation or termination codons  Codon bias found in specific species  Exon-intron boundaries  Upstream control sequences – e.g conserved motifs in transcription factor binding regions  CpG islands Homology searches

Ab initio method (computational)….. Software for automated annotation of genes like GENSCAN, Genie, GENEBUILDER etc are being used. These scan for special features like 1)Scanning ORFs (open reading frames) – initiation or termination codons 5’- ATGACGCATGATCGAGGAT –3’ 3’ – TACTGCGTACTAGCTCCTA –5’ AAC TAA ATG CCT CTA TCC

Ab initio method (computational)…  Codon bias found in specific species Not all codons used at same frequency e.g.human leucine mainly coded by CTG and rarely by TTA or CTA  Exon-intron boundaries (splice sites) 5’-AG GTAAGT-3’ hit and miss affair  Upstream control sequences – e.g conserved motifs in transcription factor binding regions  CpG islands

experimental method Experimental evaluation based on the use of transcribed RNA to locate exons and entire genes from DNA fragment.

experimental method 2 main strategies  Hybridisation approaches – Northern Blots, cDNA capture / cDNA select, Zoo blots  Transcript mapping: RT-PCR, exon trapping etc In this method, known DNA databases are searched to find out whether the test sequence is similar to any other known genes, suggesting an evolutionary relationship.

Northern BlotZoo Blot Fig 7.4: Genomes 2 Fig 7.5: Genomes 2

RT-PCRExon trapping Fig 7.: Genomes 2 Fig 7.8: Genomes 2

(B) Gene expression profiling COMPUTATIONAL APPROACH Homology searches for either - Orthologous genes (homologues in different organisms with common ancestor) - Paralogous genes (genes in the same organism, e.g. multigene families)

(B) Gene expression profiling….. EXPERIMENTAL APPROACH gene inactivation methods (knockouts, RNAi, site-directed mutagenesis, transposon tagging, genetic footprinting etc) Gene overexpression methods (knock-ins, transgenics, reporter genes etc)

(C) Genome activity studies Gene expression needs to be complemented by Transcriptome analysis Proteome analysis

The transcriptome mRNA Pre-r RNAPre-t RNAsn RNA sno RNA sc RNA t RNA tm RNA etc hn RNA Non-coding RNA (96%) coding RNA (4%) Total RNA r RNA All organismseukaryotes bacteria

The transcriptome complete collection of transcribed elements of the genome transcriptome maps will provide clues on Regions of transcription Transcription factor binding sites Sites of chromatin modification Sites of DNA methylation Chromosomal origins of replication

The transcriptome Analysis can be done by either SAGE (serial analysis of gene expression) technology Microarray technology

SAGE Shortcut to doing cDNA library screening SAGE tags identify mRNAs derived from known genes anonymous mRNAs, also known as expressed sequence tags (ESTs) mRNAs derived from currently unidentified genes Advantages Analyzes all transcripts (Transcriptome) without prior selection of known genes Provides quantitative data on both known and unknown genes Ideally suited for determining changes on gene expression as consequence of an experimental treatment (e.g. carcinogen, hormone)

SAGE

Microarrays – allows comparisons

Microarrays….

Proteomics

Nature (2003) March 13: Insight articles from pg 194

Proteomics Proteome projects - co-ordinated by the HUPO (Human Protein Organisation) Involve protein biochemistry on a high- throughput scale Problems  limited and variable sample material,  sample degradation,  abundance,  post-translational modifications,  huge tissue, developmental and temporal specificity as well as disease and drug influences. Nature (2003) March 13: Insight articles from pgs

Approaches in proteomics Nature (2003) March 13: Insight articles from pgs High throughput approach 1)Mass- spectrometry based 2)Array based 3)Structural proteomics 4)Informatics 5)Clinical proteomics

High throughput approaches in proteomics 1) Mass spectrometry-based proteomics: relies on the discovery of protein ionisation techniques. used for  protein identification and quantification,  profiling,  protein interactions and  modifications. Nature (2003) March 13: Insight articles from pgs

Mass spectrometry (MS) Nature (2003) March 13: Insight articles from pgs

Principle of MS Nature (2003) March 13: Insight articles from pgs oion source, omass analyser that measures mass-to-charge ratio (m/z) odetector that registers the number of ions at each m/z value Electrospray ionisation (ESI) matrix-assisted laser desortion/ionisation (MALDI) MALDI-MS - simple peptide mixtures whereas ESI-MS - for complex samples.

Principle of MALDI-TOF Fig 7.24 Genomes 2 by TA Brown pg 210 Matrix assisted laser desorption/ ionisation – time of flight

2) Array-based proteomics Nature (2003) March 13: Insight articles from pgs Based on the cloning and amplification of identified ORFs into homologous (ideally used for bacterial and yeast proteins) or sometimes heterologous systems (insect cells which result in post-translational modifications similar to mammalian cells). A fusion tag (short peptide or protein domain that is linked to each protein member e.g. GST) is incorporated into the plasmid construct.

Array based proteomics…. Nature (2003) March 13: Insight articles from pgs a. Protein expression and purification b. Protein activity: Analysis can be done using biochemical genomics or functional protein microarrays. c. Protein interaction analysis two-hybrid analysis (yeast 2-hybrid), FRET (Fluorescence resonance energy transfer), phage display etc d. Protein localisation: immunolocalisation of epitope-tagged products. E.g the use of GFP or luciferase tags

3) Structural proteomics ! Nature (2003) March 13: Insight articles from pgs a. Protein expression and purification b. Protein activity: Analysis can be done using biochemical genomics or functional protein microarrays. c. Protein interaction analysis two-hybrid analysis (yeast 2-hybrid), FRET (Fluorescence resonance energy transfer), phage display etc d. Protein localisation: immunolocalisation of epitope-tagged products. E.g the use of GFP or luciferase tags

PROTEIN INTERACTION MAPS FOR MODEL ORGANISMS Nature Reviews Molecular Cell Biology 2; (2001); doi: /

Challenges for the future – ‘physiome’ Nature Reviews Molecular Cell Biology 4; (2003)