MIAMExpress development October 2002 Mohammad shojatalab

Slides:



Advertisements
Similar presentations
Object Oriented Analysis And Design- IT0207 III Semester UNIT-IV.
Advertisements

Connecting to Databases. connecting to DB DB server typically a standalone application Server runs on localhost for smaller sites –i.e. Same machine as.
Connecting to Databases. relational databases tables and relations accessed using SQL database -specific functionality –transaction processing commit.
Introduction to the BinX Library eDIKT project team Ted Wen Robert Carroll
The ArrayExpress Gene Expression Database: a Software Engineering and Implementation Perspective Ugis Sarkans European Bioinformatics Institute.
Total Resource Management RulesManager New Features August 21, 2012 Al Johnson, VP RulesManager Architecht.
Configuration management
Software change management
Configuration management
Chapter 10: Designing Databases
2/16/2004Sergei Sadilov PH/SFT 1 Bonsai in Integration Testing of Geant4 This presentation will probably involve audience discussion, which will create.
JTX Overview Overview of Job Tracking for ArcGIS (JTX)
ESSnet on SDMX phase II Laura Vignola ISTAT Rome, 3-4 December 2012.
Visualisationmodule Catherine Leroy, Pierre Marguerite, Bhuwan Tiwari, Niran Abeygunawardena, Sergio Contrino, Anna Farne, Ele Holloway, Gaurab Mukherjee,
Fast Track to ColdFusion 9. Getting Started with ColdFusion Understanding Dynamic Web Pages ColdFusion Benchmark Introducing the ColdFusion Language Introducing.
Test Automation Best Practices
This chapter is extracted from Sommerville’s slides. Text book chapter
1 ArrayExpress and MAGE Jamboree II Ugis Sarkans, EBI.
EBI is an Outstation of the European Molecular Biology Laboratory. MAGE-TAB - The ArrayExpress Production Experience Helen Parkinson, PhD.
SQL Server to MySQL Database Migration SQLWays - Migration Software Presentation March 2009 Copyright (c) Ispirer Systems Ltd.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Peter Fox CSCI Week 9, October 27, 2008.
1 Update on ArrayExpress & standards Ugis Sarkans, EBI.
ProCol~ A jEdit Plugin for Remote Project Collaboration Justin Dieters Spring 2004 CS470 Final Presentation.
The Functional Genomics Experiment Model (FuGE) Andy Jones School of Computer Science and Faculty of Life Sciences, University of Manchester.
September 2003 Aix en Provence Jonathon Blake EMBL Biochemical Instrumentation.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
1 Software Development Configuration management. \ 2 Software Configuration  Items that comprise all information produced as part of the software development.
MAPLDDesign Integrity Concepts You Mean We’re Still Working On It? Sustaining a Design.
MIAMExpress development and local installation DESPRAD Meeting,November 2002 Mohammad shojatalab
The European Bioinformatics Institute MGED ontology for consistent annotation of microarray experiments Manchester Bioinformatics Week Ontologies Workshop1.
Presentation on SubmissionTrackingTool: by Anjan Sharma.
Configuration Management (CM)
1 MIAME The MIAME website: © 2002 Norman Morrison for Manchester Bioinformatics.
ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team European Bioinformatics Institute MGED.
Real Time Monitor of Grid Job Executions Janusz Martyniak Imperial College London.
(Chapter 10 continued) Our examples feature MySQL as the database engine. It's open source and free. It's fully featured. And it's platform independent.
Axel Naumann University of Nijmegen / NIKHEF, NL ROOT 2004 Users Workshop The Future of THtml Plans and Status of ROOT’s documentation facility.
1 maxdLoad The maxd website: © 2002 Norman Morrison for Manchester Bioinformatics.
CERN - IT Department CH-1211 Genève 23 Switzerland t DB Development Tools Benthic SQL Developer Application Express WLCG Service Reliability.
© 2006 IBM Corporation Agile Planning Web UI. © 2006 IBM Corporation Agenda  Overview of APT Web UI  Current Issues  Required Infrastructure  API.
The European Bioinformatics Institute MAGE-OM and ArrayExpress a brief introduction to the database model Helen Parkinson European Bioinformatics Institute.
ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team European Bioinformatics Institute MGED.
MIAMExpress and the development of annotation ontologies for gene expression experiments Ele Holloway Microarray Informatics European Bioinformatics Institute.
A plant-specific annotation and submission tool for the incorporation of Arabidopsis gene expression data into ArrayExpress, the EBI’s public DNA microarray.
Alvis Brazma, Johan Rung, Ugis Sarkans, Thomas Schlitt, Jaak Vilo European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge,
s1 201 Advance Database Systems Dr.Kwanchai Eurviriyanukul
1 Outline Standardization - necessary components –what information should be exchanged –how the information should be exchanged –common terms (ontologies)
Design - programming Cmpe 450 Fall Dynamic Analysis Software quality Design carefully from the start Simple and clean Fewer errors Finding errors.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
1 ArrayExpress Ugis Sarkans, EBI. 2 Overview Underlying standards –MIAME –MAGE* Data submission Data access –annotations –actual data –array design descriptions.
Mr. Justin “JET” Turner CSCI 3000 – Fall 2015 CRN Section A – TR 9:30-10:45 CRN – Section B – TR 5:30-6:45.
TEMBLOR mid-term review Participation in DESPRAD project Bernd Drescher Robert Wagner.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
Object storage and object interoperability
LCG Distributed Databases Deployment – Kickoff Workshop Dec Database Lookup Service Kuba Zajączkowski Chi-Wei Wang.
The ATLAS DAQ System Online Configurations Database Service Challenge J. Almeida, M. Dobson, A. Kazarov, G. Lehmann-Miotto, J.E. Sloper, I. Soloviev and.
Overview 16 Databases investigated 4 Systems MIAME Compliant –ArrayExpress, SMD, LAD and GeneX 2.x 2 Systems Support MAGE-ML import and export –ArrayExpress.
Final Fantasy ½: IOC Overview Team Q CSE 403 Winter ‘03 I’ve got something special for you.
CS 440 Database Management Systems Stored procedures & OR mapping 1.
Excel Services Displays all or parts of interactive Excel worksheets in the browser –Excel “publish” feature with optional parameters defined in worksheet.
ArrayExpress Ugis Sarkans EMBL - EBI
Web Technology Solutions
Using ArrayExpress.
GLAST Release Manager Automated code compilation via the Release Manager Navid Golpayegani, GSFC/SSAI Overview The Release Manager is a program responsible.
PRG 421 GUIDE Higher Education / prg421guide.com.
James Blankenship March , 2018
Lecture 1: Multi-tier Architecture Overview
Content of Presentation
Use of GEANT4 in CMS The OSCAR Project
Overview Activities from additional UP disciplines are needed to bring a system into being Implementation Testing Deployment Configuration and change management.
Presentation transcript:

MIAMExpress development October 2002 Mohammad shojatalab

Talk structure History Underlying concepts Design & Development Current status Future

History (March 2001 ~ present) Need for a submission tool for ArrayExpress Obviously It should be a web base tool It was supposed to be quick and dirty prototype Start about May 2001

Underlying concepts Based on MIAME concepts and questionnaire Submission of Experiment, Arrays, Protocols Avoiding free text as much as possible Using controlled vocabulary

Login Pending/New Experiment Sample1Sample2Sample3 Sample n Sample protocol Hybridisations Hyb protocol Array 1 Array 2 Array 3 Array n Scanning protocol Data 1 Data 2 Data 3 Data n Image analysis protocol Combined Experiment Data Transformation protocol Submit Final free text comment Extracts 1…n E1E1 E2E2 EnEn E1E1 E2E2 EnEn E1E1 E2E2 EnEn E1E1 E2E2 EnEn Extraction protocol

Design considerations Complex submission structure Long submission time (may be weeks) So it needs a database It meant to be fast, Open source and free Usable as a Lab Notebook Free database management system Web based submissions MAGE-ML file as output

Technologies & Tools Using MySQL DBMS for its database Using Perl CGI technology Using DBI, DBD::MySQL to interact with db Using javascript CVS as source code repository to keep track of changes and also incorporate changes made by other developers in to all developers working copy.

MIAMExpress GUI, MAGExpress, … Biology Layer Data acces dat a Physical data layer (Database, files,…) Biology;MIAME Layer Data access functions

Team work culture MIAMExpress team Biologists (Helen and curation team, External people) Development team; Myself,Niran (Jan 2002), Sergio (Jun 2002) “Everyone owns all the code so whenever something is busted everyone has a right and duty to fix it” “Successful culture has to accept that mistakes will happen”

Development Started at May 2001 Simple data model; around 30 tables Avoiding hard code in program Debug tools to assist developer Log functionality which write down user’s activities in a file to assist developer to find out what is wrong if an controlled error happened. Readable and maintainable code

Impact on data access layer and APIs New Requirement Requirement Analysis Impact analysis Essential information for development Impact on data model Impact on UIImpact on MAGExpress Development Test Development New Release Beta version

Requiremen t Analysis  Impact Analysis  Developmen t  Test  Steady state MIAMExpress development states

Apache HTTP Test Production Apache HTTP Developers CVS Infrastructure

Submissions Types Array submission: Array Description File (Excel sheet, tab delimited files) Experiment submission experiment design,samples,extractions,labelled extracts,protocols (Web based forms) Hybridization data files (Excel sheet, tab delimited files) combined data file (Excel sheet, tab delimited files)

MAGE-ML Creation; MAGExpress module MAGEstk: A set of APIs which are created from OM and are able to read MAGE-ML file and create object structure and vice versa To create MAGE-ML file from MIAMExpress we have to know which piece of data fits where in the model that means we need map MIAMExpress Data model(schema) to MAGE-OM Still we have problem because MAGE classes are quite abstract in terms of working with physical data

MAGE-ML Creation (2) To have a object oriented design we need to have a new set of classes which are derived from MAGE classes but are MIAMExpress specific Objects which are instantiated from these new set of classes know how get their attributes from MIAMExpress database Also they are inheriting all of the properties of their parents

MAGE-ML Creation; Example biosrc instantiated from original BioSource class mx_biosrc instantiated from MXBioSource class MXBioSource class is inherited from original BioSource class biosrc.go&Load_your_data -> ERROR!!! mx_biosrc.go&Load_your_data -> you have a loaded object from MIAMExpress Encapsulation of dirty works inside the object

How go&Load_your_data works? By calling our one or a sequence of appropriate biology(MIAME) APIs. Note that the method name in all Classes is the same but behaviour is context sensitive which is hard coded inside; For example ; for BioSource we say; mx_biosrc. go&Load_your_data; And also for labelled extract; Mx_label. go&Load_your_data and all others the like.

How we create MAGE-ML file We Create the top level object; experiment experiment.go&create_all_your_associations; which creates whole object structure experiment.go&Load_your_data; which load data to whole structure write MAGE-MLby starting from experiment top level object

Problem with new releases Usually new releases means new schema as well. that means we have to change the data access layer. that means we have to change our mapping model which map MIAMExpress schema to MAGE model. that means we have to migrate the existing data. that introduce data migration module; dmm dmm is have to be provided with new release

Missing bits behind the scene We are using MySQL which at least at the moment doesn’t support constraints and foreign keys, sub queries. They have promised to to solve these in their next releases soon. Persistent connection with database. Having an automated schema  MAGE mapping.

Current status Release 1.0 is ready soon (Dec 2002) With this we get the experiment submissions, and create the MAGE-ML file get the Array submissions and create the MAGE- ML file Related project ILSI specific MIAMExpress

Future KeyLargoExpress? ;) Organism specific Integrated with Curation tool ?? be able to work with a full MAGE-ML file? cover all missing pieces behind the scene Being implemented using Java related technologies?!

Questions