5 th Spanish Human Proteome Project (SpHPP) Workshop La Cristalera, Miraflores de la Sierra, Madrid, November 5-6th, 2013 SpHPP Council, La Cristalera,

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

BiGCaT Bioinformatics Hunting strategy of the bigcat.
Protein Quantitation II: Multiple Reaction Monitoring
Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
Recombinant DNA Technology
Recombinant DNA technology
CHR 16 - Human Proteome Project Data dependent and inclusion list results on LTQ – (Orbitrap) Velos 27-28th August 2012 La Cristalera - Madrid PCB – Eliandre.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
SpHPP Bioinformatics. 3 rd CHr-16 Workshop. Miraflores de la Sierra December, 10 th -11 th 2012 Alberto Medina.
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
10 Genomics, Proteomics and Genetic Engineering. 2 Genomics and Proteomics The field of genomics deals with the DNA sequence, organization, function,
Bacterial Physiology (Micr430)
Medical Technology Department, Faculty of Science, Islamic University-Gaza MB M ICRO B IOLOGY Dr. Abdelraouf A. Elmanama Ph. D Microbiology 2008 Chapter.
Information Aspects of Nucleic Acids Measurement Technologies Description of nucleic acid measurement technologies Algorithmic, optimization, data analysis.
1 Characterization, Amplification, Expression Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis of DNA.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
General Microbiology (Micr300) Lecture 11 Biotechnology (Text Chapters: ; )
Definition The terms recombinant DNA technology, DNA cloning, molecular cloning, or gene cloning all refer to the same process: the transfer of a DNA.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Working Group 1 Planing 1st Semester Planing Overview General Plan Tasks Description Deliverables: 1st Semester 2013 WG1: Recruiting Groups.
Proteomics in Malaria Parasites: Packed with Potential! Janette Reader University of Pretoria Proteomics in Malaria Parasites: Packed with Potential! Janette.
Working Group 1 Mid-term Report
20.1 – 1 Look at the illustration of “Cloning a Human Gene in a Bacterial Plasmid” (Figure 20.4 in the orange book). If the medium used for plating cells.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
Biotechnology SB2.f – Examine the use of DNA technology in forensics, medicine and agriculture.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
20.1 – 1 Look at the illustration of “Cloning a Human Gene in a Bacterial Plasmid” (Figure 20.4 in the orange book). If the medium used for plating cells.
Recombinant protein expression. Other alternatives
Introduction The GPM project (The Global Proteome Machine Organization) Salvador Martínez de Bartolomé Bioinformatics support –
Human Proteome Project? Màster en bioquímica, biologia molecular i biomedicina Mòdul 4: Genòmica i Proteòmica Núria Colomé Calls.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Biotechnology pp WHAT IS IT?  Biotechnology : the application of technology to better use DNA and biology.
CS 790 – Bioinformatics Introduction and overview.
Finish up array applications Move on to proteomics Protein microarrays.
WG workplan 1.Expression of 60 missing proteins Adquisition of clones from the ASU collection 1.2. Subcloning in the pANT-cGST vector if necessary.
Microbial Biotechnology Philadelphia University
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
Copyright © 2009 Pearson Education, Inc. Genomics, Bioinformatics, and Proteomics Chapter 21 Lecture Concepts of Genetics Tenth Edition.
+ => Bioinformatics: from Sequence to Knowledge Outline: Introduction to bioinformatics The TAU Bioinformatics unit Useful bioinformatics issues and databases:
CS 461b/661b: Bioinformatics Tools and Applications Software Algorithm Mathematical Models Biology Experiments and Data.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Genomics and Forensics
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
FIRST APPROACH TO THE SHOTGUN UNKNOWN PROTEINS sHPP CHROMOSOME 16 MEETING August, 28, 2012 La Cristalera, Miraflores de la Sierra.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
A Molecular Toolkit AP Biology Fall The Scissors: Restriction Enzymes  Bacteria possess restriction enzymes whose usual function is to cut apart.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Molecular Cloning. Definitions   Cloning :   Obtaining a piece of DNA from its original source (Genome) and introducing it in a DNA vector   Sub-cloning:
Information Representation Working Group WG Meeting September 5, 2008.
What is proteomics? Richard Mbasu and Ben Richards.
Lydie Lane, HUPO meeting 2013, Yokohama Integration of proteomics data in.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
SRM working plan. La Cristalera, 5-6 November 2013.
Considerations for multi-omics data integration Michael Tress CNIO,
 Facilities Open House Functional Genomics Facility Molishree Joshi, Ph.D. 6/1/2015 Contact Information:
Virginia G. Piper Center for Personalized Diagnostics Chromosome 10 Joshua LaBaer, M.D., Ph.D. Konstantinos Petritis, Ph.D. Virginia G. Piper Center for.
Methods in Cell Biology Cont. Sept. 24, Science Bomb 2 Unc-22: encodes a myofilament in C. elegans.
Biotechnology.
Metagenomic Species Diversity.
Gene expression.
Thomas BOTZANOWSKI & Blandine CHAZARIN
Scientists use several techniques to manipulate DNA.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Transcriptional Signature of Histone Deacetylases in Breast cancer
Natalie Galles S. N. Bose Research Exchange Summer 2016
From Mendel to Genomics
Presentation transcript:

5 th Spanish Human Proteome Project (SpHPP) Workshop La Cristalera, Miraflores de la Sierra, Madrid, November 5-6th, 2013 SpHPP Council, La Cristalera, Nov 2013

32 research units organized in 5 WG: WG1. Protein expression and purification. Peptides. WG2. Development of quantitative S/MRM assays. WG3. Shotgun proteomics. Molecular profiles and PTMs. WG4. Bioinformatics WG5. Biology and disease. Neurodegenerative, cardiovascular, infectious, cancer, obesity, Rheumatic disorders. AIMS Chromosome 16 annotation. Selection of 3 cell lines based on published transcriptomic profiles for maximum chromosome 16 coverage Transcriptomic analysis of the selected cell lines. Shotgun proteomic analysis of the selected cell lines. Development of SRM methods for 200 proteins/year. Expression of missing proteins and development of SRM methods.

WG 1.-Protein Expression, Peptide Standard & Micro Array Team RU1a (Protein Micro Array and Protein Expression Team) Dr. Manuel Fuentes CIC/IBMCC (USAL/CSIC), Salamanca RU1a (Protein Micro Array and Protein Expression Team) Dr. Manuel Fuentes CIC/IBMCC (USAL/CSIC), Salamanca RU1c (Peptide Standard Team) Dr. Juan P Albar CNB-CSIC, ProteoRed, Madrid RU1c (Peptide Standard Team) Dr. Juan P Albar CNB-CSIC, ProteoRed, Madrid Scientific Researchers: M. González N. Dasilva P. Díez M. Pérez de Andrés Scientific Researchers: M. González N. Dasilva P. Díez M. Pérez de Andrés Scientific Researchers: M. Lombardía Scientific Researchers: M. Lombardía RU1b (Protein Expression Team 2) Dr. Concha Gil UCM-PCM, Madrid RU1b (Protein Expression Team 2) Dr. Concha Gil UCM-PCM, Madrid Scientific Researchers: M. L. Hernáez F. Clemente Scientific Researchers: M. L. Hernáez F. Clemente RU1e (Bioinformatics,PTM and cell lines) Dr. Félix Elortza CIC-BioGUNE Bilbao RU1e (Bioinformatics,PTM and cell lines) Dr. Félix Elortza CIC-BioGUNE Bilbao WG1. Composition RU1d (Peptide Standard Team) Dr. Eliandre Oliveira PCB Barcelona RU1d (Peptide Standard Team) Dr. Eliandre Oliveira PCB Barcelona

Missing proteins neXtProt: Organizing Protein Knowledge in the Context of Human Proteome Projects J. Proteome Res. 2013, 12, 293−298 Pascale Gaudet,† Ghislaine Argoud-Puy,† Isabelle Cusin,† Paula Duek,† Olivier Evalet,† Alain Gateau,† Anne Gleizes,† Mario Pereira,† Monique Zahn-Zabal,† Catherine Zwahlen,† Amos Bairoch,†,‡ and Lydie Lane*,†,‡ † CALIPHO group, SIB-Swiss Institute of Bioinformatics, ‡Department of Human Protein Sciences, Faculty of Medicine, University of Geneva, CMU-1, rue Michel Servet 1211 Geneva 4, Switzerland neXtProt Extends the Coverage of Identified Proteins in the Human Proteome The primary targets of the C-HPP are the so-called “missing proteins” that have not yet been identified by mass spectrometry nor detected by antibodies. An important aspect of the C-HPP work is to define which proteins have already been characterized, to focus on those that have yet to be detected. To help C-HPP in this task, neXtProt captures evidence for the existence of each protein based on criteria established by UniProtKB/Swiss-Prot in Five levels of evidence have been defined: (1) evidence at protein level (e.g., identification by mass spectrometry, detection by antibodies, sequence by Edman degradation, or tridimensional structure resolved), (2) evidence at transcript level (e.g., ESTs or full length mRNA), (3) inferred by homology (strong sequence similarity to known proteins in related species), (4) predicted and (5) uncertain (e.g., dubious sequences that are likely the products of erroneous translations of pseudogenes) WG1. Unknown protein definition

Our Quest for the “Missing Proteins”

neXtProt v ENSEMBL v73 UniProtKB v 2013_09 HPA v11 neXtProt v ENSEMBL v73 UniProtKB v 2013_09 HPA v genes 840 protein coding genes 143 missing proteins 2360 genes 840 protein coding genes 143 missing proteins 120 OMIM hits Obesity Neurodegenerative diseases Cancer 120 OMIM hits Obesity Neurodegenerative diseases Cancer Coverage of 73% gene products in Lymphoid cells Epitelial cells Fibroblasts Coverage of 73% gene products in Lymphoid cells Epitelial cells Fibroblasts Shotgun Proteomic analysis Shotgun Proteomic analysis Transcriptomic analysis Transcriptomic analysis 626 HPA antibodies for Chr16 proteins, 95 for missing Protein profile Gene expression profile Data integration Global and cell type specific outcomes Correlation proteome/transcriptome Proteome coverage and chromosome distribution Chr16 proteome coverage Global and cell type specific outcomes Correlation proteome/transcriptome Proteome coverage and chromosome distribution Chr16 proteome coverage Protein expression vectors for 64 Chr16 missing proteins

WG1. Activities 1.Bioinformatic analysis. Update the missing proteins data according to neXtProt. Definition of tissues and cell lines for biological validation and subcellular localization. 2. Protein expression. * Clone isolation. Gene sequencing. * Expression of proteins in a free cell system. * Purification of expressed proteins. 3. Development of SRM methods for expressed proteins. 4. SRM assays in biological matrices (proteomes or subproteomes of different tissues or cell lines). 5. Synthesize the appropriate peptides for developing quantitative MRM methods.

Description: Chromosome 16 report Name: nextprot_chromosome_16 Release: NeXtProt DATA

WG1. Unkown proteins clonning workflow Bacterial selection Plasmid isolation DNA sequencing

WG1. Unkown proteins cloning Chromosome Unkown proteins (260 before) Cloned in pANT7_cGST WG1 28 new proteins Digest with restriction enzymes Sequencing Expression (IVTT) and purification (GST) Mass spectrometry (MRM)

WG1. Unkown proteins expression List Price 1200 IVTT kit 230 Agarose beads Sample price: 40 euros

WG1. Unkown proteins analysis workflow I Expressed protein digestionSkyline peptide/transition selection MRM assay

WG1. Unkown proteins analysis workflow III MRM data checking and MSMS spectra validation

WG1. Unkown proteins results (11 missing protein from 28 expressed protein) 28 Proteins with selected peptides ( Peptides that matched the specifications to be candidate to use it for quantitation) 22 Proteins with at least three detected peptides in the MRM experiment 24 Proteins with identified peptides by MASCOT in MIDAS experiment.

WG1. Location of unkown proteins

WG1. Unknown proteins PA: protein array, WB: western blot, IF: immunofluorescency

Working plan for Missing Proteins WP Definition of the Chr16 missing proteins group. As indicated in neXtProt. Expression of 60 proteins/year. Development of SRM assays and test in biological matrices. Peptide synthesis for quantitative SRM methods. NAPPA and antibody arrays for B/D. Definition of the missing group according to neXtProt. Available NAPPA and antibody resources. Definition of tissue and cell lines for biological validation. October 2013 May 2014 Nov 2013 Dec 2013 April2014 March 2014 Feb2014 Jan 2014 Validation of SRM assays for previously synthetised proteins in biological matrices. Vector production and optimization (??proteins) Synthesis of peptides for missing proteins with no expression vector. Topics for discussion Expression only for missing proteins? For validation, select the tissue/cell line with higher gene expression. Recombinant protein purification? Criteria to select peptides for synthesis. Contact Josua Labaer if clones are not available in Salamanca. Development of SRM methods for new missing proteins using the recombinant form or peptides Validation in biological matrices. List of potential biomarkers from B/D groups Validation of SRM methods in three independent labs. Design of arrays for B/D WP. Biomarkers. Groups CNB UCM-PCM PCB CIC-BIOGUNE CIC-USAL Other groups might also participate in defining the best biological matrix and also in the interlaboratory validation process. Dec 2014

Working plan for Missing Proteins WP Definition of the Chr16 missing proteins group. As indicated in neXtProt. Expression of 60 proteins/year Development of SRM assays Test in biological matrices. Peptide synthesis for quantitative SRM methods. Antibody and NAPPA arrays for B/D groups (2 year, 2015) (Proyectos Manuel Fuentes, Francisco Blanco, Ignacio Casal, Concha Gil, ………….) Definition of the missing group according to neXtProt. 28 expressed proteins (11 proteins still missing proteins) Definition of tissue and cell lines for biological validation (Colaboration with Banco Nacional de ADN, Banco líneas celulares) Validation of SRM assays for previously synthetised proteins in biological matrices (3 labs- MRM group Synthesis of peptides for missing proteins with no expression vector Expression of 53 missing proteins (if clones are available) Development of SRM methods for new missing proteins using the recombinant form or peptides Validation in biological matrices. List of potential biomarkers from B/D groups October 2013 May 2014 Nov 2013 Dec 2013 April2014 March 2014 Feb2014 Jan Dec 2014

Working plan for Missing Proteins WP Definition of the Chr16 missing proteins group. As indicated in neXtProt. Expression of 60 proteins/year Development of SRM assays Test in biological matrices. Peptide synthesis for quantitative SRM methods. Antibody and NAPPA arrays for B/D groups (2 year, 2015) (Proyectos Manuel Fuentes, Francisco Blanco, Ignacio Casal, Concha Gil, ………….) Topics for discussion Expression only for missing proteins ?? For validation, select the tissue/cell line with higher gene expression. Recombinant protein purification? Criteria to select peptides for synthesis. Contact Josua Labaer if clones are not available in Salamanca (done) Groups CNB UCM-PCM PCB CIC-BIOGUNE CIC-USAL Other groups might also participate in defining the best biological matrix and also in the interlaboratory validation process.

WG1. Unknown proteins peptides

WG1. Unkown proteins results. In detail

Strategies to Find Missing Proteins Unusual tissues and cell types: olfactory epithelium, specific brain regions, testis, placenta, oviduct, lung Development stage: embryo, fetus Protein families with high homology: keratins, GPCRs, immunoglobulins, histocompatibility antigens—greater sequence coverage and accuracy, variety of methods Low/very low abundance, high turnover proteins: transcription factors—greater sensitivity; unfavorable physico-chemical properties: hydrophobic, basic, few tryptic sites Examine RNA-Seq data for evidence of gene expression and mRNA translation (RNC-mRNA)

66 cell types 48 tissues Clinical specimens