Knowledge Enabled Information and Services Science Glycomics project overview.

Slides:



Advertisements
Similar presentations
Knowledge Modeling and its Application in Life Sciences: A Tale of two ontologies Bioinformatics for Glycan Expression Integrated Technology Resource for.
Advertisements

Web Services for N-Glycosylation Process Integrated Technology Resource for Biomedical Glycomics NCRR/NIH Satya S. Sahoo, Amit P. Sheth, William S. York,
Semantic Web & Semantic Web Services: Applications in Healthcare and Scientific Research International IFIP Conference on Applications of Semantic Web.
Analysis of human haptoglobin, digest with trypsin and Glu-C – six putative N-motif peptides. Glycopeptide separation by hydrophilic interaction liquid.
N-Glycopeptide Identification from CID Tandem Mass Spectra using Glycan Databases and False Discovery Rate Estimation Kevin B. Chandler, Petr Pompach,
Knowledge Enabled Information and Services Science What can SW do for HCLS today? Panel at HCSL Workshop, WWW2007 Amit Sheth Kno.e.sis Center Wright State.
Semantic Web: promising technologies and current applications in Health care & Life Sciences Amit Sheth Thanks: Kno.e.sis team, collaborators at CCRC,
Semantic Web Technology in Support of Bioinformatics for Glycan Expression Amit Sheth Large Scale Distributed Information Systems (LSDIS) lab, Univ. of.
Semantics powered Bioinformatics Amit Sheth, William S. York, et al Large Scale Distributed Information Systems Lab & Complex Carbohydrate Research Center.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
Personal Data Management Why is this such an issue? Data Provenance Representing links v Representing data Identifying resources: Life Science Identifiers.
Protein Identification with Mascot Software (Laxmana Rao Y. and Gopalacharyulu P.V.)
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Mass Spectrometry. What are mass spectrometers? They are analytical tools used to measure the molecular weight of a sample. Accuracy – 0.01 % of the total.
FIGURE 5. Plot of peptide charge state ratios. Quality Control Concept Figure 6 shows a concept for the implementation of quality control as system suitability.
Scaffold Download free viewer:
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Evaluated Reference MS/MS Spectra Libraries Current and Future NIST Programs.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
Semantic Web applications in Financial Industry, Government, Health care and Life Sciences SWEG 2006, March 2006 Amit Sheth LSDIS Lab, Department of Computer.
Knowledge Enabled Information and Services Science GlycO.
Kno.e.sis Center, Wright State University,
Semantics Enabled Industrial and Scientific Applications: Research, Technology and Deployed Applications Part III: Biological Applications Keynote - the.
The dynamic nature of the proteome
Introduction The GPM project (The Global Proteome Machine Organization) Salvador Martínez de Bartolomé Bioinformatics support –
IPAW'08 – Salt Lake City, Utah, June 2008 Exploiting provenance to make sense of automated decisions in scientific workflows Paolo Missier, Suzanne Embury,
Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR
Common parameters At the beginning one need to set up the parameters.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
Phase II Additions to LSG Search capability to Gene Browser –Though GUI in Gene Browser BLAST plugin that invokes remote EBI BLAST service Working set.
Laxman Yetukuri T : Modeling of Proteomics Data
Semantic empowerment of Life Science Applications October 2006 Amit Sheth LSDIS Lab, Department of Computer Science, University of Georgia Acknowledgement:
Knowledge Enabled Information and Services Science SAWSDL: Tools and Applications Amit P. Sheth Kno.e.sis Center Wright State University, Dayton, OH Knoesis.wright.edu.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
XML Standards for Proteomics Data Andrew Jones, Dr Jonathan Wastling and Dr Ela Hunt Department of Computing Science and the Institute of Biomedical and.
FuGE: A framework for developing standards for functional genomics Andrew Jones School of Computer Science, University of Manchester Metabomeeting 2.0.
Applying Semantic Technologies to the Glycoproteomics Domain W. S York May 15, 2006.
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
Master headline RDFizing the EBI Gene Expression Atlas James Malone, Electra Tapanari
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
Bioinformatics Research Overview Outline Biomedical Ontologies oGlycO oEnzyO oProPreO Scientific Workflow for analysis of Proteomics Data Framework for.
Overview of Mass Spectrometry
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Proposed Research Problem Solving Environment for T. cruzi Intuitive querying of multiple sets of heterogeneous databases Formulate scientific workflows.
Data Management Support for Life Sciences or What can we do for the Life Sciences? Mourad Ouzzani
Interface for Glyco Vault Functionality and requirements. Initial proposal. Maciej Janik.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
1 DMS-DQS-SUPSC03-PRE-12-E © DEIMOS Space S.L., 2007 A Semantic Data Grid for Satellite Mission Quality Analysis Reuben Wright Deimos Space.
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
Prizms for Data Publication and Management Katie Chastain May 9, 2014.
High throughput biology data management and data intensive computing drivers George Michaels.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
2014 생화학 실험 (1) 6주차 실험조교 : 류 지 연 Yonsei Proteome Research Center 산학협동관 421호
Constructing high resolution consensus spectra for a peptide library
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
Yonsei Proteome Research Center Peptide Mass Finger-Printing Part II. MALDI-TOF 2013 생화학 실험 (1) 6 주차 자료 임종선 조교 내선 6625.
Accelerating Research in Life Sciences
LSDIS Lab, Department of Computer Science,
Amit Sheth LSDIS Lab & Semagix University of Georgia
Accelerating Research in Life Sciences
Collaborative RO1 with NCBO
High level view of the MAE algorithm.
Dorota F. Zielinska, Florian Gnad, Jacek R. Wiśniewski, Matthias Mann 
Service-enabling Biomedical Research Enterprise
Accelerating Research in Life Sciences
Tryptic glycopeptides of IGFBP-5 from T47D cells separated by HPLC detected by ESI-MS and sequenced by tandem MS.a, ESI-MS spectrum of combined fractions.
Kuen-Pin Wu Institute of Information Science Academia Sinica
Presentation transcript:

Knowledge Enabled Information and Services Science Glycomics project overview

Knowledge Enabled Information and Services Science Life Science Ontologies ProPreO An ontology for capturing process and lifecycle information related to proteomic experiments 398 classes, 32 relationships 3.1 million instances Published through the National Center for Biomedical Ontology (NCBO) and Open Biomedical Ontologies (OBO) Glyco An ontology for structure and function of Glycopeptides 573 classes, 113 relationships Published through the National Center for Biomedical Ontology (NCBO)

Knowledge Enabled Information and Services Science Two aspects of glycoproteomics: oWhat is it? → identification oHow much of it is there? → quantification Heterogeneity in data generation process, instrumental parameters, formats Need data and process provenance → ontology-mediated provenance Hence, ProPreO models both the glycoproteomics experimental process and attendant data ProPreO ontology

Knowledge Enabled Information and Services Science ProPreO population: transformation to rdf Scientific Data Computational Methods Ontology instances

Knowledge Enabled Information and Services Science “Protein RDF” chemical mass monoisotopic mass amino-acid sequence n-glycosylation concensus Protein Data amino-acid sequence Chemical Mass RDF Monoisotopic Mass RDF Amino-acid Sequence RDF “Peptide RDF” chemical mass monoisotopic mass amino-acid sequence n-glycosylation concensus parent protein Calculate Chemical Mass Calculate Monoisotopic Mass Determine N-glycosylation Concensus Key Protein Path Peptide Path amino-acid sequence Extract Peptide Amino-acid Sequence from Protein Amino-acid Sequence ProPreO population: transformation to rdf Scientific Data Computational Methods RDF

Knowledge Enabled Information and Services Science Semantic annotation of scientific/experimental data

Knowledge Enabled Information and Services Science parent ion m/z fragment ion m/z ms/ms peaklist data fragment ion abundance parent ion abundance parent ion charge ProPreO: Ontology-mediated provenance Mass Spectrometry (MS) Data

Knowledge Enabled Information and Services Science <parameter instrument=“micromass_QTOF_2_quadropole_time_of_flight_mass_spectrometer” mode=“ms-ms”/> Ontological Concepts ProPreO: Ontology-mediated provenance Semantically Annotated MS Data

Knowledge Enabled Information and Services Science Semantic annotation of Scientific Data Annotated ms/ms peaklist data <parameter instrument=“micromass_QTOF_2_quadropole_time_of_flight_mass_spectrometer” mode = “ms/ms”/>

Knowledge Enabled Information and Services Science N-GlycosylationProcessNGP N-Glycosylation Process (NGP) Cell Culture Glycoprotein Fraction Glycopeptides Fraction extract Separation technique I Glycopeptides Fraction n*m n Signal integration Data correlation Peptide Fraction ms datams/ms data ms peaklist ms/ms peaklist Peptide listN-dimensional array Glycopeptide identification and quantification proteolysis Separation technique II PNGase Mass spectrometry Data reduction Peptide identification binning n 1

Knowledge Enabled Information and Services Science Storage Standard Format Data Raw Data Filtered Data Search Results Final Output Agent Biological Sample Analysis by MS/MS Raw Data to Standard Format Data Pre- process DB Search (Mascot/ Sequest) Results Post- process (ProValt) OIOIOIOIO Biological Information Semantic Annotation Applications Semantic Web Process to incorporate provenance

Knowledge Enabled Information and Services Science Raw2mzXMLmzXML2PklPkl2pSplitMASCOT SearchProVault Raw mzXMLPklpSplit MACOT result ProVault result Experimental Data Semantic Annotation Metadata File SPARQL query-based User Interface Semantic Metadata Registry PROTEOMECOMMONS PROTEOMICS WORKFLOW Integrated Semantic Information and knowledge System (Isis) ProPreO ontology EXPERIMENTAL DATA Have I performed an error? Give me all result files from a similar organism, cell, preparation, mass spectrometric conditions and compare results. Is the result erroneous? Give me all result files from a similar organism, cell, preparation, mass spectrometric conditions and compare results.

Knowledge Enabled Information and Services Science Semantic Biological Web Service Registry Semantic Web Service

Knowledge Enabled Information and Services Science Gly | Asn | Gly | Ser moiety_2 moiety_ GLYDE-CT : GLYcan Data Exchange Based on a Connection Table Format

Knowledge Enabled Information and Services Science Data, ontologies, more publications at Biomedical Glycomics project web site: Thank You