Analysis of Alternate Splicing in Human Genes  Name : Bingding Huang ( 黄炳顶 )   Address : Bioinformation center of Shanghai.

Slides:



Advertisements
Similar presentations
A very short introduction (in plants)
Advertisements

© Wiley Publishing All Rights Reserved. Using Nucleotide Sequence Databases.
An Introduction to Bioinformatics Finding genes in prokaryotes.
Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
Central dogma of genetics Lecture 4. The conversion of DNA to Proteins.
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction Gene prediction methods Gene indices Mapping cDNA on.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction methods Gene indices Mapping cDNA on genomic DNA Genome-genome.
Gene Expression And Regulation Bioinformatics January 11, 2006 D. A. McClellan
Alignment of mRNAs to genomic DNA Sequence Martin Berglund Khanh Huy Bui Md. Asaduzzaman Jean-Luc Leblond.
1 Alternative Splicing. 2 Eukaryotic genes Splicing Mature mRNA.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
Gene Finding Charles Yan.
Bioinformatics Alternative splicing Multiple isoforms Exonic Splicing Enhancers (ESE) and Silencers (ESS) SpliceNest Lecture 13.
The Influence of Alternative Splicing in Protein Structure The fact that gene number is not significantly different between mammals and some invertebrates.
Bioinformatics Original definition (1979 by Paulien Hogeweg): “application of information technology and computer science to the field of molecular biology”
Lecture 12 Splicing and gene prediction in eukaryotes
Eukaryotic Gene Finding
Biological Motivation Gene Finding in Eukaryotic Genomes
Doug Brutlag 2011 Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University School of Medicine Genomics, Bioinformatics.
Anum kamal(BB ) Umm-e-Habiba(BB ). Gene splicing “Gene splicing is the removal of introns from the primary trascript of a discontinuous gene.
Doug Brutlag 2011 Next Generation Sequencing and Human Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University.
Chapter 6 Gene Prediction: Finding Genes in the Human Genome.
International Livestock Research Institute, Nairobi, Kenya. Introduction to Bioinformatics: NOV David Lynn (M.Sc., Ph.D.) Trinity College Dublin.
©2000 Timothy G. Standish Revelation 18:4 4And I heard another voice from heaven, saying, Come out of her, my people, that ye be not partakers of her sins,
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
1 The Interrupted Gene. Ex Biochem c3-interrupted gene Introduction Figure 3.1.
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
Grupo 5. 5’site 3’site branchpoint site exon 1 intron 1 exon 2 intron 2 AG/GT CAG/NT.
Sequence & course material repository Annotation (sequences & evidence) Manuals (DNA, Subway, Apollo, JalView) Presentations.
MPL Identification of alternative spliced mRNA variants related to cancers by genome-wide ESTs alignment KIM DAE SOO Oncogene Apr.
DNA TO RNA Transcription is the process of creating a molecule that can carry the genetic blueprint for a particular protein coding gene from the DNA.
Genome Annotation Rosana O. Babu.
LECTURE CONNECTIONS 14 | RNA Molecules and RNA Processing © 2009 W. H. Freeman and Company.
Sackler Medical School
Biological databases Exercises. Discovery of distinct sequence databases using ensembl.
Gene Prediction: Similarity-Based Methods (Lecture for CS498-CXZ Algorithms in Bioinformatics) Sept. 15, 2005 ChengXiang Zhai Department of Computer Science.
Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.
Gene, Proteins, and Genetic Code. Protein Synthesis in a Cell.
Bioinformatics and Computational Biology
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
Genes and Genomes. Genome On Line Database (GOLD) 243 Published complete genomes 536 Prokaryotic ongoing genomes 434 Eukaryotic ongoing genomes December.
How can we find genes? Search for them Look them up.
Research about Alternative Splicing recently 楊佳熒.
Splice Site Recognition in DNA Sequences Using K-mer Frequency Based Mapping for Support Vector Machine with Power Series Kernel Dr. Robertas Damaševičius.
JIGSAW: a better way to combine predictions J.E. Allen, W.H. Majoros, M. Pertea, and S.L. Salzberg. JIGSAW, GeneZilla, and GlimmerHMM: puzzling out the.
Fgenes++ pipelines for automatic annotation of eukaryotic genomes Victor Solovyev, Peter Kosarev, Royal Holloway College, University of London Softberry.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Finding splice sites With particular interest given to the inhibitory dopamine receptor gene (D2)
CFE Higher Biology DNA and the Genome Transcription.
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
AceView Danielle and Jean Thierry-Mieg NCBI = global annotation of the whole human genome ● Restricted to the Gencode Regions ●
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
Using DNA Subway in the Classroom Genome Annotation: Red Line.
Basics of Genome Annotation Daniel Standage Biology Department Indiana University.
Bioinformatics Computing 1 CMP 807 – Day 4 Kevin Galens.
bacteria and eukaryotes
Using DNA Subway in the Classroom
Recurrent inversion breaking intron 1 of the factor VIII gene is a frequent cause of severe hemophilia A by Richard D. Bagnall, Naushin Waseem, Peter M.
Genomes and Their Evolution
Genome organization and Bioinformatics
Alternative RNA Splicing
Introduction to Alternative Splicing and my research report
Gene Structure.
Figure Genetic characterization of the novel GYG1 gene mutation (A) GYG1_cDNA sequence and position of primers used. Genetic characterization of the novel.
Gene Structure.
Presentation transcript:

Analysis of Alternate Splicing in Human Genes  Name : Bingding Huang ( 黄炳顶 )   Address : Bioinformation center of Shanghai Instutes for Biological Sciences  Date: 08/31/01

The human genome sequence can be thought of as a picture of the human organism. However,like an impressionist painting,the genome is a very large canvas whose details becomes fuzzy when you look closely. ---by Douglas L.Black

Index  Definition:What is the alternative splice gene?Especially in human genes.  Mechanism : How the alternative splice genes work?  Type : How many types the alternative splice genes are?  Database :current databases and how to establish the corresponding database?Materials and method?  Future work :

Definition

Alternative Splicing is a major cellular process by which functionally diverse proteins can be generated from the transcript of a single gene,often in tissue-specific,patterns. As many as 30% of human genes utilize this mechanism to generate mature mRNA with difference in exon composition at the 5’- end,within internal coding regions or at the 3’- end.

Mechanism

How to predict the donor and acceptor sites?  GT---AG Rule: The donor site of introns in most eukaryotes usually contains GT(GU in pre-mRNA)while the acceptor site contains AG;a pyrimidine-rich region at 3’;a shorter consensus sequence at 5’;a very weak sequence at the branch point;  algorithm and Software: GeneSplicer, NetGene2,HSPL,NNSplice,HMMGene, Spliceview,and so on. GeneSplicer NetGene2HSPLNNSpliceHMMGeneSpliceview  False or True: how to determine the false sites from true sites?

Current Database  Intron Sequence Information System (ISIS) : ISIS contains phylogenetic and protein homology categories, information about individual sequences and various bioinformatics analyses of taxonomical groupings of sequences Intron Sequence Information System (ISIS)  Exon-Intron Database : An exhaustive database of protein-coding intron-containing genes. Exon-Intron Database  Intron Sequence and Evolution Databases This includes a non-redundant database of nuclear, protein- encoding, genomic DNA sequences highlighting nuclear pre- mRNA introns Intron Sequence and Evolution Databases

Types of alternate splicing gene

Method 1,Data collection 2,Construction intron-EST homology dataset 3,Analysis of type and frequency of alternative splicing using intron-EST homolies 4,Construction of an exon-EST homology dataset 5,Analysis of exon skipping,exon extensions or shortening using exon-EST homologies

Data Collection  GenBank and UniGene: entries contaiing alternative splice are picked out and recorded in a list.  EST: Download ESTs database from NCBI  Swiss-port :

Intron-EST homologies  Contigs overlaping two flanking exons: a,cryptic exon; b, 5’exon extension; c, 3’exon extension; d,whole intron retention  Contigs overlaping one flanking exon: a,cryptic exon or exon extention; b,whole intron retention

Contigs overlaping two flanking exons

 cryptic exon: a1<b1 and a2<b2 The intron fragment was less than the size of the whole intron,from middle of the introncryptic exon  5’exon extension: a1<b1 and a2=b2 3’end of the intron was contained within in the EST contig 5’exon extension  3’exon extention: a1=b1 and a2<b2 5’end of the intron was contained within the EST conting 3’exon extention  Whole intron retention: |CG—GG|<10 where the whole intron was contained in the contig

Contigs overlaping one flanking exon

 Cryptic exon or exon extension: CG<GG—10  Whole intron retention: CG>=GG--10,

Exon—EST homologies  EST contigs with matches to non-consecutive exons(eg.exon1-exon3) 1,perfect exon skipping 2,exon skipping with exon truncation  EST contigs with matches to consecutive exons (eg.exon1-exon2) 1,exon trucation 2,cryptic exon or 3’or 5’ exon extension 3,whole intron retention 4.normal splicing

EST contigs with matches to non-consecutive exons(eg.exon1-exon3)  perfect exon skipping CG<1 and GG=<0  exon skipping with exon truncation CG 0

EST contigs with matches to consecutive exons(eg.exon1-exon2)

 exon trucation CG<(GG—IN)  cryptic exon or 3’or 5’ exon extension CG>(GG—IN)but (CG—(GG—IN))<(IN—10)  whole intron retention CG>(GG—IN)but (CG—(GG—IN))>=(IN—10)  Normal splicing CG=(GG-IN)

Thank you