Purposes: To demonstrate the tendency of proteins to become longer with increase of organism complexity To study domain architecture of proteins and to.

Slides:



Advertisements
Similar presentations
GBrowse at TAIR Philippe Lamesch TAIR curator. Seqviewer.
Advertisements

DNAStructureandReplication. Transformation: Robert Griffith (1928)
Eukaryotic Intron Loss Tobias Mourier & Daniel C. Jeffares.
Human Genome Project What did they do? Why did they do it? What will it mean for humankind? Animation OverviewAnimation Overview - Click.
First release of HOGENOM, a database of homologous genes from complete genome Equipe Bioinformatique et Génomique Evolutive Laboratoire de Biométrie et.
Design principle of biological networks—network motif.
Selection on codons OEB Degenerate Code.
Alternative splicing and evolution Daniel Jeffares.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Genomes and Genetic Architecture. Life on Earth.
The Human Genome The International Human Genome Consortium Initial sequencing and analysis of the human genome Nature, 409, February 15, (2001)
Model Organisms and Databases. Model Organisms Characteristics of model organisms in genetics studies –Genetic history well known –Short life cycle; large.
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 3 A Story of Transcription.
Comparative Genomics of the Eukaryotes
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Meiosis Organisms that reproduce sexually have specialized cells called gametes (sex cells) Gametes are the result of a type of cell division called meiosis.
Genomes School B&I TCD Bioinformatics May Genome sizes Completed eukaryotic nuclear genomes Type of organismSpeciesGenome size (10 6 base pairs)
Chapters 19 - Genetic Analysis of Development: Development Development refers to interaction of then genome with the cytoplasm and external environment.
1 Orthology and paralogy A practical approach Searching the primaries Searching the secondaries Significance of database matches DB Web addresses Software.
This presentation was originally prepared by C. William Birky, Jr. Department of Ecology and Evolutionary Biology The University of Arizona It may be used.
1 Gene Geography Dan Graur Department of Biology and Biochemistry 3c.
The Human Genome (part 1 of 2) Wednesday, November 5, 2003 Introduction to Bioinformatics ME: J. Pevsner
IGEM 101: Session 7 4/2/15Jarrod Shilts 4/5/15Ophir Ospovat.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
© 2015 W. H. Freeman and Company CHAPTER 1 The Genetics Revolution Introduction to Genetic Analysis ELEVENTH EDITION Introduction to Genetic Analysis ELEVENTH.
Genomfart Any general theory for genome evolution will have to account for: the unique natural history of various genetic elements, the population-genetic.
Comparative genomics Haixu Tang School of Informatics.
Using blast to study gene evolution – an example.
Phylogenetic prediction of gene function Daniel Barker Centre for Evolution, Genes and Genomics, School of Biology, University of St Andrews
Phylogenetic analysis taken from and es/MSAPhylogeny.htm.
Chapter 1 Introduction.
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
Biol729 – The kinomes of model organisms. Phylogenetic comparison of the human kinome with those of yeast ( S. cerevisiae), worm (C. elegans) and fly.
Chapters 19 - Genetic Analysis of Development:
Gene models and proteomes for Saccharomyces cerevisiae (Sc), Schizosaccharomyces pombe (Sp), Arabidopsis thaliana (At), Oryza sativa (Os), Drosophila melanogaster.
Chapter 11 Meiosis & Genetics What do you think meiosis makes?
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
E VOLUTION OF E UKARYOTIC G ENOMES G ENE 342 Lecture 13 – Comparative genomics.
Regulation of transcription in eukaryotes
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
What’s new in GO?. Priorities Annotation outreach Reference genomes User advocacy Ontology development Software.
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used.
Annotating with GO: an overview
Some of the organisms that are used as highly informative models to study gene action and development. (a) Escherichia coli is a common bacterium; (b)
Sequencing and personal genomics
Polo样激酶Plk3对p73转录活性的影响 及对其磷酸化位点的分析 桑梅香 河北医科大学第四医院/河北省肿瘤医院.
PBIO 4500/5500: Biotechnology and Genetic Engineering
EL: To find out what a genome is and how gene expression is regulated
Chapters 19 - Genetic Analysis of Development:
Prediction of Regulatory Elements for Non-Model Organisms Rachita Sharma, Patricia.
CHMI 2227E Biochemistry I Gene expression
BIOL 2416 Chapter 1: Genetics: An Introduction
Every living organism inherits a blueprint for life from its parents.
Evolution of eukaryote genomes
Chapters 19 - Genetic Analysis of Development:
Evolutionary Inference across Eukaryotes Identifies Specific Pressures Favoring Mitochondrial Gene Retention  Iain G. Johnston, Ben P. Williams  Cell.
NRGA1, a Putative Mitochondrial Pyruvate Carrier, Mediates ABA Regulation of Guard Cell Ion Channels and Drought Stress Responses in Arabidopsis  Chun-Long.
Linking transcriptional mediators via the GACKIX domain super family
CARPEL FACTORY, a Dicer Homolog, and HEN1, a Novel Protein, Act in microRNA Metabolism in Arabidopsis thaliana  Wonkeun Park, Junjie Li, Rentao Song,
Computational genomics
Centromeres Current Biology
Maria J.E. Koster, Berend Snel, H.Th. Marc Timmers  Cell 
Centromeres Current Biology
Controlling the Elongation Phase of Transcription with P-TEFb
Exploring a Putative Gene
The Ran GTPase: Theme and Variations
Nup153 is an M9‐containing mobile nucleoporin with a novel Ran‐binding domain The Nup153 zinc finger Ran‐binding domain (RBZ) is very similar to the zinc.
Correspondence Current Biology
Presentation transcript:

Purposes: To demonstrate the tendency of proteins to become longer with increase of organism complexity To study domain architecture of proteins and to date different domains

Homologous proteins Orthologs Paralogs Have evolved by vertical descent from a common ancestor and are presumed to have complete structural and functional correspondence Arise by duplication and domain shuffling within a genome and hence may have divergent functions New function We are interested in proteins that have changed their domain content, but preserved the same function

Human HRX protein PHD zf-CXXC SET FYRN FYRC BROMO

Plant PHD SET BROMO zf-CXXC FYRC FYRN zf-C4 RRM PostSET HMG PWWP TUDOR gi|15217143|gb|AAK92531.1|AF401284_1 trithorax 3 [Arabidopsis thaliana] (330 letters) gi|15233199|ref|NP_191733.1| putative protein [Arabidopsis thaliana] (902 letters) gi|6850313|gb|AAF29390.1|AC009999_10 Contains similarity to MLL proteinfrom Fugu rubripes gb|AF036382, and contains a PWWP PF|00855 and a SET PF|00856 domain. [Arabidopsis thaliana] (1193 letters) gi|15225109|ref|NP_180721.1| putative SET-domain transcriptional regulator [Arabidopsis thaliana] (186 letters) gi|15238735|ref|NP_200155.1| putative protein [Arabidopsis thaliana] (1040 letters) gi|16118405|gb|AAL12215.1| trithorax 4 [Arabidopsis thaliana] (285 letters) gi|15238953|ref|NP_199055.1| putative protein [Arabidopsis thaliana] (1421 letters) gi|15231914|ref|NP_187459.1| unknown protein [Arabidopsis thaliana] (764 letters)

Yeast Worm PHD SET BROMO zf-CXXC FYRC FYRN zf-C4 RRM PostSET HMG PWWP TUDOR gi|7493085|pir||T41282 probable transcription silencing protein - fission yeast (Schizosaccharomyces pombe) (920 letters) gi|6321911|ref|NP_011987.1| Gene has a 'SET' or 'TROMO' domain at its carboxyterminus like the trithorax gene family from human and Drosophila with postulated function in chromatin-mediated gene regulation.; Set1p [Saccharomyces cerevisiae] (1080 letters) Yeast Worm gi|17552318|ref|NP_498040.1| C26E6.9a.p [Caenorhabditis elegans] (1507 letters) + gi|17552316|ref|NP_498041.1| C26E6.9b.p [Caenorhabditis elegans] (739 letters) gi|17555046|ref|NP_499819.1| PHD-finger. (2 domains), SET domain [Caenorhabditis elegans] (2561 letters)

Fly PHD SET BROMO zf-CXXC FYRC FYRN zf-C4 RRM PostSET HMG PWWP TUDOR gi|17861882|gb|AAL39418.1| GM10003p [Drosophila melanogaster] (421 letters) gi|7511805|pir||T12687 ALR protein homolog - fruit fly (Drosophila melanogaster) (2422 letters) gi|7289568|gb|AAF45425.1| CG17396 gene product [Drosophila melanogaster] (177 letters) gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster] (3358 letters) gi|7291672|gb|AAF47094.1| CG5591 gene product [Drosophila melanogaster] (630 letters) gi|15292119|gb|AAK93328.1| LD39445p [Drosophila melanogaster] (700 letters) gi|10720313|sp|Q24742|TRX_DROVI Trithorax protein (3828 letters)

Human PHD SET BROMO zf-CXXC FYRC FYRN zf-C4 RRM PostSET HMG PWWP TUDOR gi|1170364|sp|Q03164|HRX_HUMAN Zinc finger protein HRX (ALL-1) (Trithorax-like protein) (3969 letters) gi|4336749|gb|AAD17932.1| myeloid/lymphoid leukemia 2 [Homo sapiens] (1010 letters) gi|6634011|dbj|BAA20763.2| KIAA0304 protein [Homo sapiens] (1900 letters) gi|14424624|gb|AAH09337.1|AAH09337 Similar to KIAA0304 gene product [Homo sapiens] (798 letters) gi|13938427|gb|AAH07353.1|AAH07353 Similar to KIAA0304 gene product [Homo sapiens] (140 letters) gi|4588363|gb|AAD26112.1|AF105280_1 myeloid/lymphoid leukemia 2 [Homo sapiens] (140 letters)

Human - continued PHD SET BROMO zf-CXXC FYRC FYRN zf-C4 RRM PostSET HMG PWWP TUDOR Human - continued gi|4505197|ref|NP_003473.1| mixed-lineage leukemia 2; ALL1-related gene [Homo sapiens] (5262 letters) gi|7512280|pir||T03455 ALR protein - human (4957 letters) gi|6683126|dbj|BAA20797.2| KIAA0339 protein [Homo sapiens] (1709 letters) gi|16163206|ref|XP_037523.2| KIAA1076 protein [Homo sapiens] (772 letters) gi|10864041|ref|NP_067053.1| mixed-lineage leukemia 3; ALR-like protein [Homo sapiens] (4025 letters) gi|10434227|dbj|BAB14179.1| unnamed protein product [Homo sapiens] (452 letters)

940Ma 1087Ma 1508Ma Protista PHD SET BROMO zf-CXXC FYRC FYRN zf-C4 RRM PostSET HMG PWWP TUDOR Vertebrates Homo sapiens 940Ma Arthropods Drosophyla melanogaster Drosophyla virilis 1087Ma Nematodes Caenorhabditis elegans Fungi Schizosaccharomyces pombe Saccharomyces cerevisiae 1508Ma Protista Plants Arabidopsis thaliana

Conclusion: Elongation of proteins within interspecific trx family is correlated with organism complexity Proteins elongate following domain duplication, shuffling and accretion