Molecular Interactions 2013 Liverpool. PSICQUIC & PSICQUIC-view 2.5/2.6/2.7 Review of new implementation based on MITAB2.7 (2.6/2.5) Reference implementation.

Slides:



Advertisements
Similar presentations
Sandra Orchard EMBL-EBI Molecular Interactions
Advertisements

New tools for MIAPE Generation Emilio Salazar Doñate Bioinformatics Group CNB – CSIC.
© Copyright 2012 STI INNSBRUCK Apache Lucene Ioan Toma based on slides from Aaron Bannert
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title International Molecular Exchange Consortium - IMEx Sandra Orchard EMBL-EBI.
1 XML Data Management Course Outline and Organisation Werner Nutt.
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
The IntAct Database Sandra Orchard & Birgit Meldal.
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
Topic Denormalisation S McKeever Advanced Databases 1.
U of R eXtensible Catalog Team MetaCat. Problem Domain.
Information systems and databases Database information systems Read the textbook: Chapter 2: Information systems and databases FOR MORE INFO...
Overview of Search Engines
Federated Searching Pre-Conference Workshop - The federated searching cookbook Qin Zhu HP Labs Research Library February 18, 2007.
Concept demo System dashboard. Overview Dashboard use case General implementation ideas Use of MULE integration platform Collection Aggregation/Factorization.
NotesDuranceDescriptionTask Basic and advanced training throughout the project 15.4 – 15.6Reading android material and writing.
LexEVS 6.0 Overview Scott Bauer Mayo Clinic Rochester, Minnesota February 2011.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
Max Planck Institute for Psycholinguistics Tool development report H. Brugman MPI Nijmegen.
UIS Data Transformation and Validations As it pertains to the SDMX TWG EXL Initiative.
2131 Structured System Analysis and Design By Germaine Cheung Hong Kong Computer Institute Lecture 2 (Chapter 2) Information System Building Blocks.
Patient Empowerment for Chronic Diseases System Sifat Islam Graduate Student, Center for Systems Integration, FAU, Copyright © 2011 Center.
Python MySQL Database Access
Copyright OpenHelix. No use or reproduction without express written consent1.
1 XML Data Management Course Outline and Organisation Werner Nutt.
Dali JPA Tools. About Dali Dali JPA Tools is an Eclipse Web Tools Platform sub-Project Dali 1.0 is a part of WTP 2.0 Europa coordinated release Goal -
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Data Visualization Project B.Tech Major Project Project Guide Dr. Naresh Nagwani Project Team Members Pawan Singh Sumit Guha.
Distributed Aircraft Maintenance Environment - DAME DAME Workflow Advisor Max Ong University of Sheffield.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
File Systems and Databases Lecture 1. Files and Databases File: A collection of records or documents dealing with one organization, person, area or subject.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
Introduction to IntAct Pablo Porras Millán, IntAct
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
Iccha Sethi Serdar Aslan Team 1 Virginia Tech Information Storage and Retrieval CS 5604 Instructor: Dr. Edward Fox 10/11/2010.
Google Refine for Data Quality / Integrity. Context BioVeL Data Refinement Workflow Synonym Expansion / Occurrence Retrieval Data Selection Data Quality.
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
1 SHAWEL Sharable and Interactive Web-Lexicon Greg Gulrajani - Max-Planck-Institute in collaboration with David Harrison & Peter Wittenburg Max Planck.
EBI is an Outstation of the European Molecular Biology Laboratory. Gautier Koscielny VectorBase Meeting 08 Feburary 2012, EBI VectorBase Text Search Engine.
Talend MDM Web User Interface – Levels of customization
Interface for Glyco Vault Functionality and requirements. Initial proposal. Maciej Janik.
Protein interactions and Pathways Tutorial Wellcome trust summer school Jyoti Khadake.
Project Description 2 Indexing. Indexing Tokenize a text document, and attach to each token a list of locations that this token has appeared Sort and.
Presentation on Database management Submitted To: Prof: Rutvi Sarang Submitted By: Dharmishtha A. Baria Roll:No:1(sem-3)
1 CS 8803 AIAD (Spring 2008) Project Group#22 Ajay Choudhari, Avik Sinharoy, Min Zhang, Mohit Jain Smart Seek.
Apache Solr Dima Ionut Daniel. Contents What is Apache Solr? Architecture Features Core Solr Concepts Configuration Conclusions Bibliography.
Application architectures Advisor : Dr. Moneer Al_Mekhlafi By : Ahmed AbdAllah Al_Homaidi.
PROTEIN IDENTIFIER IAN ROBERTS JOSEPH INFANTI NICOLE FERRARO.
Extended Metadata Registries and Semantics (Part 2: Implementation) Karlo Berket Ecoterm IV Environmental Terminology Workshop April 18, 2007 Diplomatic.
Molecular Interaction Networks Service providers at the BioHackathon: - DIP (Lukasz Salwinski, UCLA) - STRING/STICH (Michael Kuhn, EMBL) - IntAct (Bruno.
GeneConnect Use Cases and Design August 3, GeneConnect Database IDs are linked by Direct Annotation, Inferred Annotation, or Sequence Alignment.
1 Using the Lucene Search Engine. 2 Team Phil Corcoran Project Leader 10 Years Software Telecoms, Finance, Manufacturing Reqs, Design, Test Derek O’ Keeffe.
Information Retrieval in Practice
FHIR and Relational Databases
Tools For Vertebrate Gene Naming
The Operations Portal and the Grid Operations Interoperability
Take a REST from manual searching: PDBe, programmatically
A&AI Component Diagram
Systems Biology Tools for working with BIND data
Searching and Indexing
The EBI Search RESTful API
Building Search Systems for Digital Library Collections
The Complex Portal Birgit Meldal
PRG 421 MART Knowledge is divine-- prg421mart.com.
Lecture 1 File Systems and Databases.
Getting Started With Solr
Springshare’s LibInsight: E-Journals/Databases Dataset
Presentation transcript:

Molecular Interactions 2013 Liverpool

PSICQUIC & PSICQUIC-view 2.5/2.6/2.7 Review of new implementation based on MITAB2.7 (2.6/2.5) Reference implementation updated and released – SOLR and Lucene implementations Usage of columns discussed to try and standardize across resources

PSICQUIC & PSICQUIC-view 2.5/2.6/2.7 Future plans Add ability to Sort SOLR indexing allows faceting - enable users to restrict/filter searches based on facets Update SOAP with new Rest protocols Organise websites by database ‘types’ – IMEx data – Internally-curated – Imported data – Text-mining/predicted

PSICQUIC -XML Lukasz Salwinski has XML based reference implementation (alpha) User directly queries record store, minimizes LOSSY transformation stages as transformation done on indexing Needs to enable additional download formats but all PSI formats already available Needs to test and gather statistics on query time and indexing Can deal with multi-protein complexes

Clustering To cluster or to merge? Sufficient to continue to cluster based on MITAB2.5 columns Continue to rely on identifiers for clustering – keeping isoforms and PRO chains as separate entities

Clustering Jose Villaveces - Demo of tool developed at Max Planck to enable clustering and scoring on the fly using PSICQUIC – library required update to cope with ever-increasing datasize Manuel Bernal Llinares– experience of working with MITAB2.7 – documentation needs improving, and sample code

JAMI Single JAVA API which operates over both MITAB and XML – ends requirement for redundant development Initial test-cases – development of Syntax checker and file enricher using web-services Code available for review and input

Protein complexes Birgit Meldal (EBI) – encyclopaedia of stable reference complexes Kim van Roey (EMBL) – annotation of transient complexes Colin Combes (U Edinburgh) – complex viewer Sylvie Ricard-Blum (MatrixDB) – extracellular matrix requires complexes as an annotation object Several shortfalls of XML2.5 schema identified as a result of this work

XML3.0 Need to properly describe complexes – knowledge rather than data so ‘experiment’ now not required Need to model ‘protein groups’ – enable capture of AP-MS expts. Opportunity to remove/deprecate unused items compromising parsing Opportunity to improve capture of additional biology in systematic manner

XML3.0 Specification needs to be written Prototype to be prepared Circulate both for criticism and feedback!!!