Alternative splicing: A playground of evolution Mikhail Gelfand Research and Training Center for Bioinformatics Institute for Information Transmission.

Slides:



Advertisements
Similar presentations
Quick Lesson on dN/dS Neutral Selection Codon Degeneracy Synonymous vs. Non-synonymous dN/dS ratios Why Selection? The Problem.
Advertisements

A very short introduction (in plants)
The Concept of Functional Constraint. The intensity of purifying selection is determined by the degree of intolerance characteristic of a site or a genomic.
Alternative splicing: A playground of evolution Mikhail Gelfand Research and Training Center for Bioinformatics Institute for Information Transmission.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
1 Alternative Splicing. 2 Eukaryotic genes Splicing Mature mRNA.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
FINAL EXAM: TAKE-HOME Assessment of Significance in Cancer Gene SNPs.
28-Way vertebrate alignment and conservation track in the UCSC Genome Browser Journal club Dec. 7, 2007.
Alternative Splicing As an introduction to microarrays.
Bioinformatics Alternative splicing Multiple isoforms Exonic Splicing Enhancers (ESE) and Silencers (ESS) SpliceNest Lecture 13.
The Influence of Alternative Splicing in Protein Structure The fact that gene number is not significantly different between mammals and some invertebrates.
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
RNA processing. RNA species in cells RNA processing.
Anum kamal(BB ) Umm-e-Habiba(BB ). Gene splicing “Gene splicing is the removal of introns from the primary trascript of a discontinuous gene.
Alternative Splicing. mRNA Splicing During RNA processing internal segments are removed from the transcript and the remaining segments spliced together.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Coding Domain Sequence Prediction and Alternative Splicing Detection in Human Malaria Gambiae Jun Li 1, Bing-Bing Wang 2, Jose M. Ribeiro 3, Kenneth D.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
The Biology and Genetic Base of Cancer. 2 (Mutation)
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
MPL Identification of alternative spliced mRNA variants related to cancers by genome-wide ESTs alignment KIM DAE SOO Oncogene Apr.
Anatomy of a Genome Project A.Sequencing 1. De novo vs. ‘resequencing’ 2.Sanger WGS versus ‘next generation’ sequencing 3.High versus low sequence coverage.
Click to edit Master title style Click to edit Master subtitle style CLICKER QUESTIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry,
Alternative splicing: A playground of evolution Mikhail Gelfand Research and Training Center for Bioinformatics Institute for Information Transmission.
Endogenous Retroviral promoter of the Human gene Kim Tae Hyung Oct 02,2004 MPL.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Alternative splicing: A playground of evolution Mikhail Gelfand Institute for Information Transmission Problems, RAS May 2004.
Alternative splicing: A playground of evolution Mikhail Gelfand Research and Training Center for Bioinformatics Institute for Information Transmission.
Background & Motivation Problem & Feature Construction Experiments Design & Results Conclusions and Future Work Exploring Alternative Splicing Features.
A Non-EST-Based Method for Exon-Skipping Prediction Rotem Sorek, Ronen Shemesh, Yuval Cohen, Ortal Basechess, Gil Ast and Ron Shamir Genome Research August.
Novel Peptide Identification using ESTs and Genomic Sequence Nathan Edwards Center for Bioinformatics and Computational Biology University of Maryland,
Annotation of Drosophila virilis Chris Shaffer GEP workshop, 2006.
Comparative Genomics Methods for Alternative Splicing of Eukaryotic Genes Liliana Florea Department of Computer Science Department of Biochemistry GWU.
MPL The DNA Sequence of chimpanzee chromosome 22 and comparative analysis with its human ortholog, chromosome 21 Bioinformatics Dae-Soo Kim.
SDPpred: a method for identification of amino acid residues that determine differences in functional specificity of homologous proteins and application.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Evolution of alternative splicing Mikhail Gelfand Institute for Information Transmission Problems, Russian Academy of Sciences Workshop “Gene Annotation.
Chapter 3 The Interrupted Gene.
Alternative splicing: A playground of evolution Mikhail Gelfand Research and Training Center for Bioinformatics Institute for Information Transmission.
A genetic polymorphism in the Drosophila insulin receptor suggests adaptation to climate variation across continents Annalise Paaby a, Mark Blacket b,
Novel Peptide Identification using ESTs and Genomic Sequence Nathan Edwards Center for Bioinformatics and Computational Biology University of Maryland,
A high-resolution map of human evolutionary constraints using 29 mammals Kerstin Lindblad-Toh et al Presentation by Robert Lewis and Kaylee Wells.
Alternative Splicing. mRNA Splicing During RNA processing internal segments are removed from the transcript and the remaining segments spliced together.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
Alternative Splicing. mRNA Splicing During RNA processing internal segments are removed from the transcript and the remaining segments spliced together.
Using DNA Subway in the Classroom
The Transcriptional Landscape of the Mammalian Genome
Genomes and Their Evolution
Pipelines for Computational Analysis (Bioinformatics)
Institute for Information Transmission Problems
Evolution of Genes with Novel Functions
GEP Annotation Workflow
The Functional Impact of Alternative Splicing in Cancer
Ab initio gene prediction
What are the Patterns Of Nucleotide Substitution Within Coding and
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Genome organization and Bioinformatics
Evolution of eukaryote genomes
Ensembl Genome Repository.
Functional Impact of Transposable Element using Bioinformatic Analysis
Identify D. melanogaster ortholog
The Functional Impact of Alternative Splicing in Cancer
Volume 13, Issue 24, Pages (December 2003)
Presentation transcript:

Alternative splicing: A playground of evolution Mikhail Gelfand Research and Training Center for Bioinformatics Institute for Information Transmission Problems RAS, Moscow, Russia

% of alternatively spliced human and mouse genes by year of publication Human (genome / random sample) Human (individual chromosomes) Mouse (genome / random sample) All genes Only multiexon genes Genes with high EST coverage

Evolution of alternative exon-intron structure –mammals: human, mouse, dog –dipteran insects: Drosophila melanogaster, D. pseudoobscura, Anopheles gambiae Evolutionary rate in constitutive and alternative regions –human / mouse –D. melanogaster / D. pseudoobscura –human-chimpanzee / human SNPs Functional consequences of alternative splicing: what does it do with proteins Plan

Alternative exon-intron structure in fruit flies and the malarial mosquito Same procedure (AS data from FlyBase) –cassette exons, splicing sites –also mutually exclusive exons, retained introns Follow the fate of D. melanogaster exons in the D. pseudoobscura and Anopheles genomes Technically more difficult: –incomplete genomes –the quality of alignment with the Anopheles genome is lower –frequent intron insertion/loss (~4.7 introns per gene in Drosophila vs. ~3.5 introns per gene in Anopheles)

Conservation of coding segments constitutive segments alternative segments D. melanogaster – D. pseudoobscura 97%75-80% D. melanogaster – Anopheles gambiae 77%~45%

Observations Alternative splicing is less conserved than constitutive one D.melanogaster - D.pseudoobscura –retained introns are the least conserved (are all of them really functional?) –mutually exclusive exons are as conserved as constitutive exons D.melanogaster – Anopheles gambiae –mutually exclusive exons are conserved exactly (no intron insertions – would disrupt regulation?) –cassette exons are the least conserved

The MacDonald-Kreitman test: evidence for positive selection in (minor isoform) alternative regions Human and chimpanzee genome mismatches vs human SNPs Exons conserved in mouse and/or dog Genes with at least 60 ESTs (median number) Fisher’s exact test for significance Pn/Ps (SNPs)Dn/Ds (genomes)diff.Signif. Const – Major – % Minor % Minor isoform alternative regions: More non-synonymous SNPs: Pn(alt_minor)=.12% >> Pn(const)=.06% More non-synonym. mismatches: Dn(alt_minor)=.91% >> Dn(const)=.37% Positive selection (as opposed to lower stabilizing selection): α = 1 – (Pa/Ps) / (Da/Ds) ~ 25% positions Similar results for all highly covered genes or all conserved exons

Alternative splicing avoids disrupting domains (and non-domain units) Data: SwissProt PROFAM PROSITE Control: fix the domain structure; randomly place alternative regions

Positive selection towards domain shuffling (not simply avoidance of disrupting domains by occurring between domains )

Short (<50 aa) alternative splicing events within domains target protein functional sites c) Prosite patterns unaffected Prosite patterns affected FT positions unaffected FT positions affected ExpectedObserved

An attempt of integration AS is often genome-specific –alternative exons and sites are less conserved (more often lost or gained) than constitutive ones … but still functional –Even NMD-inducing isoforms are conserved in at least one lineage –… especially those supported by multiple ESTs AS regions show evidence for decreased negative (stabilizing) selection –excess non-synonymous codon substitutions AS regions show evidence for positive (diversifying) selection –excess non-synonymous SNPs AS tends to shuffle domains and target functional sites in proteins Thus AS may serve as a testing ground for new functions without sacrificing old ones

Acknowledgements Authors Discussions –Vsevolod Makeev (GosNIIGenetika) –Eugene Koonin (NCBI) –Igor Rogozin (NCBI) –Dmitry Petrov (Stanford) –Dmitry Frishman (GSF, TUM) –Shamil Sunyaev (Harvard University Medical School) Data –King Jordan (NCBI) Support –Howard Hughes Medical Institute –INTAS –Russian Academy of Sciences (program “Molecular and Cellular Biology”) –Russian Fund of Basic Research Andrei Mironov (Moscow State University) Ramil Nurtdinov (Moscow State University) – human/mouse/dog Dmitry Malko (GosNIIGenetika) – drosophila/mosquito Ekaterina Ermakova (Moscow State University, IITP) – Kn/Ks Vasily Ramensky (Institute of Molecular Biology) – SNPs Irena Artamonova (GSF/MIPS) – human/mouse, plots Alexei Neverov (GosNIIGenetika) – functionality of isoforms