Worldwide Protein Data Bank www.wwpdb.org Common D&A Project Sequence Processing Modular Demo May 6, 2010 Project Deliverable.

Slides:



Advertisements
Similar presentations
Integration of MBSE and Virtual Engineering for Detailed Design
Advertisements

Making the System Operational
© 2009 Oracle Corporation Oracle APEX Forms Conversion Overview.
Coursework.  5 groups of 4-5 students  2 project options  Full project specifications on 3 rd March  Final deadline 10 th May 2011  Code storage.
Business logic for annotation workflow Tom Oldfield July 21, 2010.
Chapter 10: The Traditional Approach to Design
Systems Analysis and Design in a Changing World, Fifth Edition
Use Case & Use Case Diagram
HP Quality Center Overview.
Chapter 10 The Traditional Approach to Design
Software Delivery. Software Delivery Management  Managing Requirements and Changes  Managing Resources  Managing Configuration  Managing Defects 
User experience designer, User Interface Designer (UI), Information architect, Portal / Intranet development SharePoint WORK SAMPLES Highly confidential.
HORIZONT 1 ProcMan ® The Handover Process Manager Product Presentation HORIZONT Software for Datacenters Garmischer Str. 8 D München Tel ++49(0)89.
T-FLEX DOCs PLM, Document and Workflow Management.
Software Configuration Management
A Guide to Oracle9i1 Introduction To Forms Builder Chapter 5.
Translation Workflow The Big Picture  The execution of translation projects involves a lot of file transfers between the project members.
Configuration Management
1 1 Roadmap to an IEPD What do developers need to do?
Product Offering Overview CONFIDENTIAL AND PROPRIETARY Copyright ©2004 Universal Business Matrix, LLC All Rights Reserved The duplication in printed or.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
The Project AH Computing. Functional Requirements  What the product must do!  Examples attractive welcome screen all options available as clickable.
User Interface Mock Up for Sequence Processing Jasmine Young, Jawahar Swaminathan The following interface mock ups take into account the functionalities.
CSCI ClearQuest 1 Rational ClearQuest Michel Izygon - Jim Helm.
SecureAware Building an Information Security Management System.
Sage CRM Developers Course
Training Course 2 User Module Training Course 3 Data Administration Module Session 1 Orientation Session 2 User Interface Session 3 Database Administration.
Testing. Definition From the dictionary- the means by which the presence, quality, or genuineness of anything is determined; a means of trial. For software.
Managing Projects using Oracle Project Management (PJT) & SPREADSHEETS Neeraj Garg Vice President, Client Services.
Agenda Teams Responsibilities Timeline Homework and next steps.
This presentation is the property of Paradigm Information Systems It is confidential to the intended recipient for the purpose of evaluating FMS Any other.
Rational Unified Process Fundamentals Module 4: Disciplines II.
CoCreate OneSpace 2007 Training Model Manager 2007 User Training.
Volunteer Management System Presented by Team SE18-08S SE18-T08S - Jan 2012.
Worldwide Protein Data Bank wwPDB Common D&A Project January 28, 2010 Steering Committee Project Update.
Software Processes lecture 8. Topics covered Software process models Process iteration Process activities The Rational Unified Process Computer-aided.
Configuration Management (CM)
Chapter 7 IS630. Project Design  Technical Design & Specification Network and System Architecture & Design Software System Architecture & Design  Database.
Chapter 9 Moving to Design
ISM 5316 Week 3 Learning Objectives You should be able to: u Define and list issues and steps in Project Integration u List and describe the components.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Developing software and hardware in parallel Vladimir Rubanov ISP RAS.
Product Update March Copyright © IET Ltd 2008 Agenda  Release 7.7  VerifIEr.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Software Maintenance Speaker: Jerry Gao Ph.D. San Jose State University URL: Sept., 2001.
Worldwide Protein Data Bank wwPDB Common D&A Project November 24, 2009 November 24, 2009 Steering Committee Project Update.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Rational Unified Process Fundamentals Module 4: Core Workflows II - Concepts Rational Unified Process Fundamentals Module 4: Core Workflows II - Concepts.
Stages of design  High level design  High level data structure  Architecture  Low level design-code design  Algorithms  Low level data structures.
Worldwide Protein Data Bank wwPDB Common D&A Project Full Project Team Meeting Rutgers March 16-19, 2010.
Institute for the Protection and Security of the Citizen HAZAS – Hazard Assessment ECCAIRS Technical Course Provided by the Joint Research Centre - Ispra.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
1 CLASS – Simple NOAA Archive Access Portal SNAAP Eric Kihn and Rob Prentice NGDC CLASS Developers Meeting July 14th, 2008 Simple NOAA Archive Access Portal.
T Project Review Muuntaja I1 Iteration
6/6/ SOFTWARE LIFE CYCLE OVERVIEW Professor Ron Kenett Tel Aviv University School of Engineering.
/16 Final Project Report By Facializer Team Final Project Report Eagle, Leo, Bessie, Five, Evan Dan, Kyle, Ben, Caleb.
De Rigueur - Adding Process to Your Business Analytics Environment Diane Hatcher, SAS Institute Inc, Cary, NC Falko Schulz, SAS Institute Australia., Brisbane,
1 Process activities. 2 Software specification Software design and implementation Software validation Software evolution.
Systems Analysis and Design in a Changing World, Fourth Edition
Architecture Review 10/11/2004
Software Configuration Management
PLM, Document and Workflow Management
Applied Software Implementation & Testing
Overview of Workflows: Why Use Them?
Software Development Process Using UML Recap
Our Process CMSC 345, Version 1/04.
Presentation transcript:

Worldwide Protein Data Bank Common D&A Project Sequence Processing Modular Demo May 6, 2010 Project Deliverable

Worldwide Protein Data Bank Ligand Processing Ligand Processing Release Processing Geometry CK Validation Geometry CK Validation Calculated annotations (Bio Assem) Calculated annotations (Bio Assem) Corrections (water trans, pro- chiral ck) User Interface WFE/API Requirements Design Progress Tracking/ Status Sequence Processing Module 4.1, Delivered May 6, 2010 Annotation Pipeline

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Technical Deliverable Details Master Format. Finalization of Physical Data Exchange Extended API Tracking DB creation/support Extended Work Flow Engine (WFE) Work Flow Manager (WFM) Work Flow Manager User Interface (WFM UI) Annotator graphical interface for sequence module Integration of all components creating the Sequence Processing “module”

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Key Requirements Met  Complete and “correct” entries processed automatically  Sequence mutation – editing and visualization supported  Sequence mismatch – editing and visualization supported  Processing of very large structures, ie. Ribosome  Polymer processing, individual and in complex  Short peptide complex cross reference  Sequence matches sortable by % match  Annotator triggered global ALA/GLY substitutions  Support Self reference for cases with no Uniprot match.

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Future Enhancement List  Automation of “gap” recognition and processing*  Implementation of Uniprot isoform, variant searches for mismatched proteins.*  Validation and checks within the Sequence Editor  Modified residues – support one to many sequence alignments (ie. chromophore)  Chimera processing  Conconavalin A Example (alternate splicing) *PDBe code to be packaged for module integration

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Sequence Module Processing T1 - Initialzation Update workflow status Verify required data inputs Model file Taxonomy assignments T2 - Reference Sequence Search Determine unique polymers Run sequence database search Update reference sequence data files T3 - Assessment Check Author/Coordinate seequence conflicts Check sequence database assignments

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Sequence Module Processing T3 Assessment Succeeds Check author/coordinate sequence conflicts Check sequence database assignments T4 - Update Apply residue mapping Apply database references Create new version of model file T5 - End Update workflow status T3 Assessment Fails Check author/coordinate sequence conflicts Check sequence database assignments T6 -Sequence Editor Interactive residue- level modifications Reference database selections or self- reference Reset taxonomy Run sequence databases search by entity. Add reference sequence by ID Export residue mapping and reference assignments T4 - Update Apply residue mapping Apply database references Create new version of model file

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Under the covers… DP File System Archival Storage Deposition Data Set Id 1 Deposition Data Set Id 2 Depoisiton Data Set Id N Workflow Storage Deposition Data Set ID 1 Workflow Instance WF Inst ID 1WF Inst ID 2 WF Shared Storage WF Namespace A WF Namespace B Deposition Data Set ID 2 Workflow Instance WF Inst ID 3WF Inst ID 4 WF Shared Storage WF Namespace A WF Namespace B

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Data Management ToparchiveD_ D_000001_model_P1.cif.V 1 workflowD_000001instanceshared ToparchiveD_ D_000001_model_P1.cif.V1 D_000001_seqdb-match_P1.cif.V1 D_000001_seqdb-match_P2.cif.V1 D_000001_seqdb-match_P3.cif.V1 workflowD_000001instanceW_ D_000001_model_P1.cif.V1 D_000001_seqdb-match_P1.cif.V1 D_000001_seqdb-match_P2.cif.V1 D_000001_seqdb-match_P3.cif.V1 shared File Prior to Seq. Processing File After Seq. Processing Database Search

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Data Management Toparchive D_ D_000001_model_P1.cif.V1 D_000001_seqdb-match_P1.cif.V1 D_000001_seqdb-match_P2.cif.V1 D_000001_seqdb-match_P3.cif.V1 D_000001_seqdb-assign_P1.cif.V1 workflow D_ instanceW_ D_000001_model_P1.cif.V1 D_000001_seqdb-match_P1.cif.V1 D_000001_seqdb-match_P2.cif.V1 D_000001_seqdb-match_P3.cif.V1 W_ D_000001_model_P1.cif.V1 D_000001_seqdb-match_P1.cif.V1 D_000001_seqdb-match_P2.cif.V1 D_000001_seqdb-match_P3.cif.V1 D_000001_seqdb-assign_P1.cif.V1 shared File System After Seq. Processing Editor Task: New results returned to archival storage …

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Workflow Manager User Interface Workflow engine Session ID + workflowID Domain data archive (local) API Start/Stop Launch module UIs Depositions Remote data – Snap Mirror share Applications Status Data View system activity – Tracking DB Tasks Tracking DB System Architecture

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting THE DEMO  A brief walk about the WFM  The System at Work –Selection of a raw file within the WFM –Trigger Sequence Processing Interface  Processing options –Tracking by the WFM of the task status  Blessing of the output

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting System Extensibility: Set up for adding New Functionality ProcessRunner ActionRegistry actions.xml Plugin Modules FileUtils PdbxUtils FormatUtils UtilsBase

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Next Steps  Sequence Processing Module –Sequence Processing Module to go into targeted Testing –Modifications to be adopted as prioritized by the team and approved by the PI’s –User Manual development  Ligand Processing –Finalize requirements –Develop Design –Development Module with delivery target end of August

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Process Overview - Ligand Processing Step 1.0 Deposition Format Check Step 1.0 Deposition Format Check Step 3.0 Ligand Processing Step 3.0 Ligand Processing Step 2.0 Sequence Processing Step 2.0 Sequence Processing Step 4.0 Calculation of Derived data Step 4.0 Calculation of Derived data Step 5.0 Corrections Water trans pro- chiral ck Step 5.0 Corrections Water trans pro- chiral ck Step 6.0 Calculated Annotation - Biological Assembly Step 6.0 Calculated Annotation - Biological Assembly Step 7.0 Geometry Ck Validation Step 7.0 Geometry Ck Validation Step 8.0 Release processing Generate Files Step 8.0 Release processing Generate Files Step 9 Send to Authors Step 9 Send to Authors WFE,API, WFM Graphical User Interface

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Ligand Processing – Functional Requirements Annotator exchange – experience with, and analysis of, existing work flows Draft of new TO BE process – Level 1 Annotator Team elaborated - Level 2,3 Annotator Team created decision trees and SIPOCS for all process steps. Annotators documented key Use Cases Annotator Team mapped existing functional software components to the proposed workflow components. Annotator Team created interface mock ups for interactive components

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Ligand Processing – Technical Requirements and Design  Create Plan, identify resources  Tech Team to review the requirements  Review Functional software components  Capture technical requirements  Complete the draft design for the Ligand processing module  Develop module

Worldwide Protein Data Bank Common D&A Project March 2010 Project Team Meeting Project Team