Comparative Genome Analysis. Comparative yeast genomics Kellis et al (2003) Nature 423, 241 - 254.

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

Biological Motivation Gene Finding
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
ATG GAG GAA GAA GAT GAA GAG ATC TTA TCG TCT TCC GAT TGC GAC GAT TCC AGC GAT AGT TAC AAG GAT GAT TCT CAA GAT TCT GAA GGA GAA AAC GAT AAC CCT GAG TGC GAA.
Supplementary Fig.1: oligonucleotide primer sequences.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction Gene prediction methods Gene indices Mapping cDNA on.
Genome evolution There are both proximate and ultimate explanations in molecular biology Mutation continually generates variation in genome content and.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction methods Gene indices Mapping cDNA on genomic DNA Genome-genome.
1 Bioinformatics Research Center. 2 Talk overview 1.DNA and genes 2.Project idea 3.PriFi – finding primers based on a multiple alignment 4.GeMprospector.
The human genome Lander et al Venter et al Patil et al Gabriel et al Reich et al Sabeti et al Yu et al
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
chromosome organization, what about genome organization?
16 and 20 February, 2004 Chapter 9 Genomics Mapping and characterizing whole genomes.
1. 2 Talk overview Overall project scenario PriFi motivation PriFi algorithm description Web version Demo.
1 Speaker: Jakob Fredslund, computer scientist, post doc. at Bioinformatics Research Centre, Aarhus University.
Introduction to Molecular Biology. G-C and A-T pairing.
Reading the blueprint of life DNA sequencing. Introduction The blueprint of life is contained in the DNA in the nuclei of eukaryotic cells and simply.
Gene Structure and Identification
Fine Structure and Analysis of Eukaryotic Genes
International Livestock Research Institute, Nairobi, Kenya. Introduction to Bioinformatics: NOV David Lynn (M.Sc., Ph.D.) Trinity College Dublin.
1 EE381V: Genomic Signal Processing Lecture #13. 2 The Course So Far Gene finding DNA Genome assembly Regulatory motif discovery Comparative genomics.
Arabidopsis Genome Annotation TAIR7 Release. Arabidopsis Genome Annotation  Overview of releases  Current release (TAIR7)  Where to find TAIR7 release.
More on translation. How DNA codes proteins The primary structure of each protein (the sequence of amino acids in the polypeptide chains that make up.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
HC70AL Final Presentation Chris McQuilkin June 4 th, 2009.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
* only 17% of SNPs implicated in freshwater adaptation map to coding sequences Many, many mapping studies find prevalent noncoding QTLs.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Linkage Mapping of the Angiotensin I Converting Enzyme Gene in Pig V.Q. Nguyen 1, K.L. Glenn 2, B.E. Mote 2, and M.F. Rothschild 2 1 Department of Biological.
A.B. C. orf60(pOrf60) 042orf orf60(pOrf60-M5 ) orf60(pOrf60-M1) orf60(pOrf60-M4) 042orf60 042orf60(pOrf60-M5) orf60(pOrf60) 042orf60(pOrf60-M1)
Supplemental Table S1 For Site Directed Mutagenesis and cloning of constructs P9GF:5’ GAC GCT ACT TCA CTA TAG ATA GGA AGT TCA TTT C 3’ P9GR:5’ GAA ATG.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Anatomy of a Genome Project A.Sequencing 1. De novo vs. ‘resequencing’ 2.Sanger WGS versus ‘next generation’ sequencing 3.High versus low sequence coverage.
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
Current Topics in Evolutionary Genomics Wen-Hsiung Li and Justin Borevitz.
Bioinformatics Overview School of B&I TCD May 2010.
From Genomes to Genes Rui Alves.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Chapter 11: Functional genomics
Passing Genetic Notes in Class CC106 / Discussion D by John R. Finnerty.
Genes and Genomes. Genome On Line Database (GOLD) 243 Published complete genomes 536 Prokaryotic ongoing genomes 434 Eukaryotic ongoing genomes December.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
Comparative Genomics Methods for Alternative Splicing of Eukaryotic Genes Liliana Florea Department of Computer Science Department of Biochemistry GWU.
Supplementary materials
While replication, one strand will form a continuous copy while the other form a series of short “Okazaki” fragments Genetic traits can be transferred.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Chapter 3 The Interrupted Gene.
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
Finding genes in the genome
A high-resolution map of human evolutionary constraints using 29 mammals Kerstin Lindblad-Toh et al Presentation by Robert Lewis and Kaylee Wells.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
ORF Calling.
RNA and Protein Synthesis
Relationship between Genotype and Phenotype
Supplementary Figure 1 – cDNA analysis reveals that three splice site alterations generate multiple RNA isoforms. (A) c.430-1G>C (IVS 6) results in 3.
Gene architecture and sequence annotation
Genome Projects Maps Human Genome Mapping Human Genome Sequencing
Recitation 7 2/4/09 PSSMs+Gene finding
More on translation.
Cis-regulatory evolution of duplicate genes in yeasts
Genome organization and Bioinformatics
Phylogenetic footprinting and shadowing
Structure of the 5′ Portion of the Human Plakoglobin Gene
From Mendel to Genomics
INTRODUCTION TO MOLECULAR GENETICS
Relationship between Genotype and Phenotype
Shailaja Gantla, Conny T. M. Bakker, Bishram Deocharan, Narsing R
Presentation transcript:

Comparative Genome Analysis

Comparative yeast genomics Kellis et al (2003) Nature 423,

Genic tree Inter-Genic tree

Reading Frame Conservation Species 1 ATG GAC GAT AGT CTT CGA GCA AAA TAG Species 2 TTG GAC GGA TAG TCT TGA GCA AAA AAG Species 3 CTG GA- GAT AGT CTT CGA GCA AAA TAG Species GAT AGT -TT CGA GCA AAA TTA 6000 ORFs -> 5500 ORFs ORF no ORF }

Phylogenetic footprints

72 motifs found: 40 known, 32 new

Ancient genome duplication Kellis, Birren & Lander (2004) Nature 428,

Duplicated protein evolution Average evolution (n = 457) Accelerated evolution (1 out of 72) Gene conversion Dating the WGD

Transcriptional regulatory code Harbison et al. (2004) Nature 431,

On-off Graded response Combinatorial regulation Physical interaction between regulators

Requires other molecule (leucine precursor) Requires regulation by movement Concentration dependent affinity changes Interaction with other regulator changes affinity

Mapping: principles

Variation Phenotypic traits Molecular markers –Microsattelites –Indels –SNPs Markers are polymorphic and determine only one genomic region

Linkage

Marker development: A Comparative Approach

Legume - Genomics Aim: Development of Molecular Markers for legume genetics Approach: CATS - Comparative anchor tagged sequences Alignment of ESTs from multiple legume species Alignment of EST to genomic region Intron

CATS – comparative anchor tagged sequences Alignment of ESTs from multiple legume species Alignment of EST to genomic region Intron Identification of evolutionary conserved regions Design of primers for PCR amplification of intron in mapping parents for polymorphism acertainment

Copyright ©2004 by the National Academy of Sciences Choi, Hong-Kyu et al. (2004) Proc. Natl. Acad. Sci. USA 101, , Doyle & Luckow species Legume Taxonomy Genome

Looking for conserved regions exon intron exon intron Lotus genome Glycine EST Medicago EST Phaseolus? AGC..ATCGATCAGGACAGT..TGTAC..CCCAC..AT GGAGGAGGACAATAAGAGAC CTAAACTCTCTCTAG TAC..CCCAC..AT AGC..AT GGG..AATAC..CCCAC..AT TAC..CC CAC..AT

Alignment: Lotus, Glycine, Medicago introns Good marker region

Reasons for conservation 1.Purifying selection on proteins conserves positions 1 and 2 of a codon 2.Selection on RNA G/C content introduces Codon bias that conserves the third position High G/C: constitutively expressed genes Low G/C: regulated genes (defense etc) 3.Purifying selection on RNA secondary structure Regulatory RNA molecules

CATS marker development

ETs: Expressed transcripts (full length cDNAs) TC: Tentative clusters

Three-way conservation Lotus Medicago Glycine

Copyright ©2004 by the National Academy of Sciences Paterson, A. H. et al. (2004) Proc. Natl. Acad. Sci. USA 101, Phylogenetic tree of whole genome duplications

Number of homologes in Arabidopsis Number of three way conserved sequences Three way conserved sequences vs. Arabidopsis homologe count

Macrosynteny Copyright ©2004 by the National Academy of Sciences Choi, Hong-Kyu et al. (2004) Proc. Natl. Acad. Sci. USA 101,

Macrosynteny between M. truncatula and L. japonicus Copyright ©2004 by the National Academy of Sciences Choi, Hong-Kyu et al. (2004) Proc. Natl. Acad. Sci. USA 101, Phaseolus ? Leg148 Leg101 Leg178 Leg148

Microsynteny between Arabidopsis, M. truncatula and L. japonicus Copyright ©2004 by the National Academy of Sciences Choi, Hong-Kyu et al. (2004) Proc. Natl. Acad. Sci. USA 101,

Consensus comparative map data for six legume species Copyright ©2004 by the National Academy of Sciences Choi, Hong-Kyu et al. (2004) Proc. Natl. Acad. Sci. USA 101,