Stéphane HACQUARD (INRA NANCY) The secretome of Melampsora larici-populina First results Nancy, workshop Melampsora, august 2008 David JOLY (CFL QUEBEC)

Slides:



Advertisements
Similar presentations
Rust Pathogen Genomics and the Potential Impact on Crop Management Les J. Szabo USDA ARS Cereal Disease Laboratory University of Minnesota St. Paul, MN.
Advertisements

2 Unité de Biométrie et d’Intelligence Artificielle (UBIA) INRA
A Few More Things About B Cell Development
Introduction to Immunology
The Genome of Melampsora larici-populina, The Poplar Leaf Rust Tree/Microbe Interactions, INRA/Nancy University.
COMPARATIVE FUNGAL GENOMICS : KNOWN KNOWNS & KNOWN UNKNOWNS. Josh Herr, Plant Biology Interdepartmental Program.
IDENTIFICATION OF HOST FACTORS RECRUITED BY PLANT PATHOGENS Ester Buiate Physiology of Plant Health and Disease Instructor - Dr. Aardra Kachroo.
The genome sequence of Melampsora larici-populina the causal agent of the poplar rust disease Inventory and annotation of Mlp Signaling genes Mlp Summer.
Corrections. SEQUENCE 4 >seq4 MSTNNYQTLSQNKADRMGPGGSRRPRNSQHATASTPSASSCKEQQKDVEH EFDIIAYKTTFWRTFFFYALSFGTCGIFRLFLHWFPKRLIQFRGKRCSVE NADLVLVVDNHNRYDICNVYYRNKSGTDHTVVANTDGNLAELDELRWFKY.
Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
Identification of Novel Virulence-Associated Genes via Genome Analysis of Hypothetical Genes Sara Garbom, Åke Forsberg, Hans Wolf- Watz, and Britt-Marie.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Genome analysis and annotation. Genome Annotation Which sequences code for proteins and structural RNAs ? What is the function of the predicted gene products.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
MCSG Site Visit, Argonne, January 30, 2003 Genome Analysis to Select Targets which Probe Fold and Function Space  How many protein superfamilies and families.
Resistance Inherent capacity of a host plant to prevent or retard the development of an infectious disease Complete resistance vertical resistance Highly.
Stripe rust Haustorial cDNAs Chuntao Yin, Xianming Chen, Scot Hulbert Chuntao Yin Puccinia striiformis.
Genome Annotation BCB 660 October 20, From Carson Holt.
Predicting Function (& location & post-tln modifications) from Protein Sequences June 15, 2015.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
PAT project Advanced bioinformatics tools for analyzing the Arabidopsis genome Proteins of Arabidopsis thaliana (PAT) & Gene Ontology (GO) Hongyu Zhang,
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
The genome sequence of Melampsora larici-populina, the causal agent of the poplar rust disease M. larici-populina Transcriptome Mlp Summer workshop – INRA.
Genomics of Microbial Eukaryotes Igor Grigoriev, Fungal Genomics Program Head US DOE Joint Genome Institute, Walnut Creek, CA.
Analyzing transcription modules in the pathogenic yeast Candida albicans Elik Chapnik Yoav Amiram Supervisor: Dr. Naama Barkai.
Plant Immunology.
NCBI Review Concepts Chuong Huynh. NCBI Pairwise Sequence Alignments Purpose: identification of sequences with significant similarity to (a)
EXPLORING DEAD GENES Adrienne Manuel I400. What are they? Dead Genes are also called Pseudogenes Pseudogenes are non functioning copies of genes in DNA.
Figure S1_Yao Qin et al. Figure S1 Occurrence and distribution of trihelix family in different plant species. Red branches in the cladogram indicate that.
Module 3 Sequence and Protein Analysis (Using web-based tools) Working with Pathogen Genomes - Uruguay 2008.
The WRKY superfamily of plant transcription factors.
Melampsora Genome Annotation and Genome Structure Analysis First Annotation Workshop of the Melampsora Genome Consortium Yao-Cheng Lin Bioinformatics &
Introduction Sample Projects Resources Summary Future Plans Bioinformatics Support Information Session Karsten Hokamp TCD 3rd October, 2007.
Genomics of Microbial Eukaryotes Igor Grigoriev Fungal Genomics Program Head US DOE Joint Genome Institute, Walnut Creek, CA.
THE PUZZLING PROPERTIES OF THE PERMEASE (PPP) Kim Finer, Jennifer Galovich, Ruth Gyure, Dave Westenberg March 4, 2006.
Transposable elements in Melampsora larici-populina genome Marie-Pierre Oudot-Le Secq Melampsora Genome Consortium 2008 Summer Workshop Melampsora Genome.
Characterization of RDR Gene Expression Johnny R. Nunez and Lisa K. Johansen Community College of Denver and University of Colorado at Denver and Health.
UMR 1136 INRA/UHP Tree-microbes Interactions “Physiology and functional genomic of transporters” team Fungal pathogen : Melampsora larici-populina Populus.
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
Do not reproduce without permission 1 Gerstein.info/talks (c) (c) Mark Gerstein, 2002, Yale, bioinfo.mbb.yale.edu Gerstein Lab Aims in ModENCODE.
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
Lecture 1: Immunogenetics Dr ; Kwanama
August 20, 2007 BDGP modENCODE Data Production. BDGP Data Production Project Goals 21,000 RACE experiments 6,000 cDNA’s from directed screening and full.
Genome sequencing and annotation Comprehensive identification of virulence gene candidates by various means Bioinformatic prioritization of virulence gene.
Group discussion Name this protein. Protein sequence, from Aedes aegypti automated annotation >25558.m01330 MIHVQQMQVSSPVSSADGFIGQLFRVILKRQGSPDKGLICKIPPLSAARREQFDASLMFE.
1 Computational functional genomics Lital Haham Sivan Pearl.
Peptide-assisted annotation of the Mlp genome Philippe Tanguay Nicolas Feau David Joly Richard Hamelin.
Specific Defenses of the Host Part 2 (acquired or adaptive immunity)
InterPro Sandra Orchard.
1 Computational Approaches(1/7)  Computational methods can be divided into four categories: prediction methods based on  (i) The overall protein amino.
An EST library from Puccinia graminis f. sp. tritici reveals genes potentially involved in fungal differentiation Katja Broeker, Frank Bernard & Bruno.
Considerations for multi-omics data integration Michael Tress CNIO,
Protein families, domains and motifs in functional prediction May 31, 2016.
The mating type genes of Melampsora larici-populina
WRKY transcription factors in potato genome factors in potato genome
Protein families, domains and motifs in functional prediction
Uncovering the Protein Tyrosine Phosphatome in Cattle
Protein Families, Motifs & Domains.
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite.
Sequence based searches:
John Rathjen and group ANU
Hepatitis C Virus NS5A Protein–A Master Regulator?
Sequence analyses and evolutionary conservation of the CLDN10 gene and the identified CLDN10 variants. Sequence analyses and evolutionary conservation.
Secreted Fringe-like Signaling Molecules May Be Glycosyltransferases
Hepatitis C Virus NS5A Protein–A Master Regulator?
WRKY transcription factors in potato genome factors in potato genome
Figure 9. Categories of pha-siRNA-yielding genes
Identification of the GCS1 ortholog in Gonium pectorale.
General structure of RIFINs and STEVORs
Presentation transcript:

Stéphane HACQUARD (INRA NANCY) The secretome of Melampsora larici-populina First results Nancy, workshop Melampsora, august 2008 David JOLY (CFL QUEBEC)

I- Introduction Hahn and Mendgen 1997: haustorial ESTs from U. fabae  Early stages of host infection marked by the activation of many genes: PIGs (in Planta Induced Genes) Transport: genes encoding transporters: PIG2- HXT1 (Hahn et al, 1997; Voegele et al, 2001) Biosynthesis: genes involved in thiamine biosynthesis (THI1; THI4) (Sohn et al, 2000) Trafficking an signalling process : RTP1 (Kemen et al, 2005) Small secreted proteins ( SSP ) and the establishment of fungal biotrophy M. larici-populina: obligate biotrophic fungus completely dependant on living plant tissues for growth and development  Specialized infection structure: haustorium

I- Introduction Catanzariti et al, 2006: haustorial ESTs from M. lini Focus on the secreted proteins  Identification of 21 transcripts encoding Haustorially Expressed Secreted Protein (HESPs)  Among HESPs: AvrL567, expressed in haustoria and trigger specific hypersensitive response-like necrosis that is dependant on the co- expression of the L5, L6, L7 resistance genes (Dodds et al, 2004; Dodds et al, 2006) AvrL567 L9 L5 L6 L7

I- Introduction Kämper et al., 2006: Ustilago maydis secreted proteins  18.6% SP organized in 12 clusters Unaffected (7) Non pathogenic (1) Reduced (3) Clustering of genes encoding secreted proteinsVirulence of deletion mutants increased (1)  SP could trigger defense responses in the host plant (Avr genes) or manipulate host cell structure and fonction to facilitate infection  Deletion of individual clusters altered the virulence of U. maydis in five cases, ranging from a complete lack of symptoms to hypervirulence

What we need to know ?  Protein length  Amino acid composition of proteins  Subcellular localization  Presence / Absence of a signal peptide  Protein topology: transmembrane domains Softwares used PEPSTATS TargetP SignalP TMHMM A combination of different prediction softwares II- The secretome pipeline:

Why a pipeline ?  Limitation of online prediction program: - number of proteins - length of sequences - number of jobs  Local program: - faster than online services - can apply routine filter to the result  1 command for all computational methods II- The secretome pipeline:

Sequence filter Gene models Prediction of a signal peptide by TargetP At least two positive scores by SignalP (Nancy) No transmembrane domains but one domain allowed: N-terminal domain overlaping signal peptide Gene models  300 amino acids SUBCELLULAR LOCALIZATION TOPOLOGY LENGTH II- The secretome pipeline: 2 positive NN scores (Smax and D) plus HMM score by SignalP (Quebec)

III- The secretome of Melampsora larici-populina Gene models Prediction of a signal peptide by targetP At least two positive prediction methods of a signal peptide by signalP No transmembrane domains but one domain allowed: N-terminal domain overlaping signal peptide Gene models  300 amino acids Nancy (15,8%) 2125 (12,7%) 1848 (11%) 961 (5,7%) GhentQuebec 797 (4,8%)818 (4,9%) 1-SSP prediction 16694

 SSP predicted by the 3 pipelines - Nancy, Ghent, Quebec: 720  SSP predicted by 2 pipelines - Nancy, Quebec: 91 - Nancy, Ghent: 52 - Quebec, Ghent: 6  SSP predicted by only one pipeline - Nancy: 98 - Ghent: 15 - Quebec: 3 III- The secretome of Melampsora larici-populina 1- SSP prediction : 985 putative SSP in Melampsora genome MelampsoraPucciniaLaccaria Gene models SSP  300aa 985 (5,8%)1219 (5,9%)864 (4,2%)

III- The secretome of Melampsora larici-populina Among the 985 predicted SSP:  171 hits: hypothetical protein  76 show homology with:  234 contain more than 8 cysteines  320 show homology with Puccinia 2- Putative SSP homology Kemen et al. (2005) MPMI 18: Avirulence genes (AvrL567; AvrM) and Haustorially expressed secreted protein (HESP178, 327, 376, 379, 417, 570, 767, 897, c55, C63, c66, c49) from Melampsora lini. (Dodds et al., 2004; Catanzariti et al., 2006) - Rust transferred protein (RTP1) from Uromyces fabae (Kemen et al., 2005) - Thaumatin, GPI anchored CFEM domain protein, expansin, laccase …  %C: all genes: 1,38 / SSP: 2,64  only 247 show homology with nr

III- The secretome of Melampsora larici-populina 3- SSP families TRIBE-MCL: provides gene family identification based on all-vs-all sequence similarity results.  233 SSP belong to 97 families only composed of secreted proteins (largest:14 members, smallest: 2 members).  151 SSP belong to families composed of both secreted and non secreted proteins  601 SSP correspond to single gene model - large majority of uncharacterized families - Curation of mispredicted SSPs (NS => S or S => NS)

III- The secretome of Melampsora larici-populina 4- SSP annotation August 2008: 401 SSP manually curated (S + D)  114 new or modified gene models (29%)  287 gene models were correctly predicted (71%) EuGene: 103 gene models predicted (35%) fgenesh1: 110 gene models predicted (38%) fgenesh2: 201 gene models predicted (70%)  18 NS => S  Among 155 SSP manually curated corresponding to single model: 45 show homology with others gene models (blastp) or genomic regions (tblastn) => New families

III- The secretome of Melampsora larici-populina 4- SSP annotation Missed models…  One superfamily with more than 30 members  Conserved structure and cysteine residues  Around half were missed by gene predictors

III- The secretome of Melampsora larici-populina 5- SSP clusters Secreted Non secreted Scaffold 31: Scaffold 11: Cutinase Homology with RTP1 1kb Homology with HESP-417 Hypothetical protein 27 clusters of at least 3 SSP separeted by no more than 3 non secreted proteins.

6- SSP expression data III- The secretome of Melampsora larici-populina Melampsora cDNA libraries  Urediniospores library (Sanger ESTs) : - M. larici-populina spores and germlings (JGI, Nancy) 52,269 ESTs  Infected poplar leaves library (454 pyrosequencing ESTs) : - M. larici-populina 98AG31 infected ‘Beaupré’ leaves (compatible interaction) at 4 dpi and 7 dpi (50/50mixed) 4679 ESTs Melampsora 5533 ESTs poplar

6- SSP expression data III- The secretome of Melampsora larici-populina 985 SSP: 323 supported by EST (33%)  58 only supported by spores ESTs (sanger)  97 supported by both spores and infected leaves ESTs (Sanger and 454)  168 only supported by infected leaves ESTs (454)  58 SSP only supported by spores ESTs 19 homologies with nr, 5 hits against known proteins

III- The secretome of Melampsora larici-populina  97 SSP supported by both spores and infected leaves ESTs 6- SSP expression data - Most ESTs belong to spores: 51 homologies with nr, 17 hits against known proteins - Most ESTs belong to infected leaves:

III- The secretome of Melampsora larici-populina 6- SSP expression data  168 SSP only supported by infected leaves ESTs 47 homologies with nr, 15 hits against known proteins

III- The secretome of Melampsora larici-populina 6- SSP expression data Sp Sp Sp Sp Sp Sp Relative expression of M. larici- populina genes (2- ∆Ct ) to ElF and Tub reference genes by RT-qPCR. Time (hours post-inoculation)  Effectors previously described in Pucciniales are also expressed in M. larici-populina.  Specific transcripts profiles are observed at the different stages of the infectious process.  Interestingly, two genes encoding RTP1 homologues have distinct expression profiles.

III- The secretome of Melampsora larici-populina 7- SSP evolutionary constraints HGGLocBLASTX against Fungi nrE-valueCysω 205NSNo hitND NSNo hitND NS ref|XP_ | conserved hypothetical protein [Aspergillus terreus] 3e-07ND SNo hit S gb|AAS | proline-rich antigen [Chrysosporium lucknowense] 4e S gb|EAT | hypothetical protein [Phaeosphaeria nodorum] 7e S ref|XP_ | hypothetical protein [Ustilago maydis] 1e SNo hit SNo hit SNo hit SNo hit SNo hit S ref|XP_ | hypothetical protein [Ustilago maydis] 6e Positive selection screen on an EST data set…  Application to whole genome?

V- Conclusions 985 putative SSP in Melampsora genome (5.8%)  Homology with effectors previously described in Pucciniale  33% supported by ESTs  660 SSP specific of Melampsora  Most of SSP expressed during infection process Validation data: Biological approch - NimbleGen micro-array - RT-qPCR - immunolocalization

Acknowledgment NANCY: Emilie TISSERANT Benoit HILSELBERGER Marie Pierre OUDOT-LE SECQ Sébastien DUPLESSIS Francis MARTIN GHENT: Yao-Cheng LIN Yves van De Peer JGI sequencing and annotation teamQUEBEC: Nicolas FEAU Richard HAMELIN