2nd ISA-TAB workshop Outcome/Summary (to date) Workshops on Data Standards (WODS) – EBI, Cambridge, UK 16th, 17th and 18th June 2008


2nd ISA-TAB workshop Outcome/Summary (to date)
Workshops on Data Standards (WODS) – EBI, Cambridge, UK
16th, 17th and 18th June 2008
This workshop is funded by the BBSRC Tools and Resources (WODS, BB/E025080/1), with contributions from EBI and NERC Bioinformatics Center

Monday, 16th Reviewing XSLT

XSLT issues discussed and solutions
• FuGE extensions’ status
  - ACTION (Andy): create a ‘portal’ on the FuGE page listing all the extensions with their status, contacts, links, examples etc.
    This can help maximize interactions and advertise the status of each extension (as MIBBI does for the checklists)
  - E.g. the RNAi group (which is building a FuGE extension in this domain) needs to describe a microtiter plate, and the work on the Array Design can be reused
    If there is a FuGE-derived MAGE-ML*, this group could ‘reuse’ the ADF part or other parts
    ACTION (Javier, Helen): explore encoding of the microplate representation in the data files and referencing it from Assay
  - *The problem is that there is no final decision yet on whether there will be an MGED extension of FuGE; ACTION (Helen): check with the MAGE list and the MGED board

XSLT issues discussed and solutions
• Namespace inconsistency
  - Should we have FuGE ‘controlled’ namespaces? The OBO Foundry is considering doing this for ontologies
  - ACTION (Andy): ask the list whether this is desirable
    Namespaces are a critical issue for XSL processing, not so much for other parsing methods
    This is probably a recommendation for those who wish to use XSLT for presentation purposes; a dedicated page will be set up on the ISA-TAB website to list such XSLT recommendations; ACTION (Philippe)
• Annotation overloading
  - Descriptions are used as term-gathering fields
  - We could recommend on-the-fly ‘term creation’ (collection of terms as supplied by users)?
  - ACTION (Andy, Ally): add recommendations on the FuGE wiki (explain the use of FuGE)
    Paper soon out on recommendations to extend FuGE

XSLT issues discussed and solutions
• Name attribute optionality
  - When the name is missing, the XSLT uses the ID, giving a less ‘human readable’ transformation
  - We could recommend that name is used when readability is required/preferred; ACTION (Philippe): modify the XSL templates
• The way to categorize assays is not in FuGE
  - How to code ‘technology’ and ‘endpoint’ to categorize the assay (InvestigationComponent)?
  - It can be done implicitly, but it would be useful to have these as explicit objects
  - However, as there will not be a FuGE v1.1, ‘workarounds’ for any issues or needs will be provided via recommendations; ACTION (Andy): add to the FuGE wiki
• Reagent information in ISA-TAB
  - Flow cytometry examples have more depth/granularity, e.g. all the reagents are listed; they have coded it via Material, even though FuGE recommends doing it via structured Protocol (see Gel-ML)
  - ACTION (Andy, Ally): point them to the design patterns on the FuGE wiki
  - ISA-TAB can (somehow) deal with it via the ‘Protocol Component’ field, just added
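The name/ID fallback described above can be sketched in a few lines. This is only an illustration in Python rather than XSLT; the element name `GenericMaterial`, the attribute names `identifier` and `name`, and the sample values are assumptions made for the example, not taken from the workshop scripts.

```python
import xml.etree.ElementTree as ET

# Hypothetical FuGE-like fragment: the second element has no 'name' attribute.
SAMPLE = """<Root>
  <GenericMaterial identifier="urn:example:mat:1" name="Liver sample"/>
  <GenericMaterial identifier="urn:example:mat:2"/>
</Root>"""


def display_label(element):
    """Prefer the human-readable 'name'; fall back to 'identifier' if it is missing or blank."""
    name = (element.get("name") or "").strip()
    if name:
        return name
    return element.get("identifier", "")


def labels(xml_text):
    """Return one display label per child element of the document root."""
    root = ET.fromstring(xml_text)
    return [display_label(child) for child in root]
```

Calling `labels(SAMPLE)` yields the name for the first element and the identifier for the second, which is the less readable case the recommendation is meant to avoid.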

XSLT scripts’ library next steps
• More FuGE ML files are needed to test the current scripts
  - ACTION (Ally): give Philippe more examples from Symba
  - ACTION (Philippe): send the script to Ally
  - ACTION (Philippe): set up an XSLT page on the ISA-TAB site to post all the scripts
• Then the scripts will be tested with FuGE extensions, e.g. GelML
  - More examples are needed to test, evaluate and finalize the scripts
  - ACTION (Frank, Philippe): collaboratively finalize the scripts for GelML
• Final comment on the scope of ISA-TAB in relation to FuGE
  - FuGE and other XML formats are more granular/expressive
  - We have to accept the fact that when transforming into ISA-TAB we will/may lose/compress some of the information

MAGE-TAB to ISA-TAB converter
• To be done by the AE team

Tuesday, 17th Reviewing the ISA-TAB

Investigation File – Changes and decisions (1)
• Add ‘Investigation PubMed ID’ and ‘Investigation Publication DOI’ to the Investigation section
  - Only for a paper describing work across Studies
• The ‘Studies’ section name needs to be singular (‘Study’)
• Comment point: the Study header is sufficient to ‘separate’ the sections; there is no need for a ‘start/end repeatable block’
• If developers want to add a ‘comment’ then this would be ‘#this is a comment thingy’
  - A comment must have # as the first char
  - In Study/Assay files, users add a ‘column comment’ instead (see Table 5 in spec v0.3)
• Create a new section ‘Study Publications’ where we group the publication’s attributes, moving the id, description and date under ‘Study Section’
• Create a new section ‘Study Design Descriptors’
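The comment rule above (a comment line has # as its first character) is simple enough to sketch. The function name and the sample lines below are hypothetical; this is a minimal illustration of the rule as stated on the slide, not the ISAcreator parser.

```python
def strip_comment_lines(lines):
    """Drop lines whose very first character is '#'; keep everything else unchanged."""
    return [line for line in lines if not line.startswith("#")]


# Hypothetical Investigation file fragment (tab-delimited rows).
EXAMPLE_LINES = [
    "#this is a comment thingy",
    "STUDY",
    "Study Title\tExample study",
    " #indented, so not a comment under the first-char rule",
]
```

Note that the indented line is kept: the rule is read literally here, so whitespace before the # means the line is not treated as a comment.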

Investigation File – Changes and decisions (2)
• All field names are case sensitive
  - Edit: every field name must have its first letter in upper case
  - Section headers go all upper case
  - This allows easy visualization when ‘imported’ into a spreadsheet
• The file will be interpreted as Unicode
• Any subsections within a repeatable block (Study) must remain within the block
  - But the order of the subsections within the block can vary
• Use the triplet (type, accession, source ref) consistently if an ontology/CV is used; if not, Name and Type are entered as free text (add example in the spec)
• Add ‘Protocol Parameter Name’, followed by ‘Protocol Parameter Type’ and ‘Protocol Parameter Type Term Accession’, ‘Protocol Parameter Type Term Source REF’
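The capitalization rules above lend themselves to a small check. The sketch below assumes that a row label written entirely in upper case is a section header and that any other label is a field name; that reading, the function names and the sample labels are illustrative assumptions, not the agreed validation rules.

```python
def check_label(label):
    """Return a list of problems found in one row label of an Investigation file."""
    problems = []
    stripped = label.strip()
    if not stripped:
        return ["empty label"]
    if stripped != label:
        problems.append("leading or trailing whitespace")
    if stripped.isupper():
        # All upper case: treated as a section header, nothing more to check.
        return problems
    if not stripped[0].isupper():
        problems.append("field name must start with an upper-case letter")
    return problems


def is_known_field(label, known_fields):
    """Case-sensitive lookup: 'study title' does not match 'Study Title'."""
    return label in known_fields
```

For instance, `check_label("study title")` reports the missing capital, while `check_label("STUDY")` and `check_label("Investigation PubMed ID")` report nothing.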

Investigation File – Changes and decisions (3)
• Correct the spec to allow multiple values for the ‘Protocol Parameter Type/Term/Source’ and ‘Study Design Type/Term/Source’ triplets
• Add ‘Protocol URI’ and ‘Protocol Version’ fields in the ‘Study Protocol’ subsection
  - The pointer to external file(s) allows users to provide these in the format they wish
  - The URI should be resolvable
  - Ultimately these requirements are up to the implementers, as is the choice to make e.g. other Protocol fields mandatory
• Remove the ‘Protocol Component Parameter, Instrument Component, Software’ triplets and the ‘Processing’ fields
• Add ‘Protocol Component Name’, ‘Protocol Component Types’, ‘Protocol Component Types Term Accession’, ‘Protocol Component Types Term Source REF’
  - Used for listing, e.g. instruments, software, reagents, operator
  - Semicolon separator; ACTION (Marco): provide examples of options
• Clean up all the field names and make them ‘unique’ by prefixing them with the name of the section, e.g. ‘Investigation PubMed ID’ vs ‘Study PubMed ID’
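One possible reading of the semicolon separator is sketched below: each cell holds a semicolon-separated list, and the n-th name is paired with the n-th type. The positional pairing, the function names and the component values are all assumptions for illustration; the slide leaves the actual options open pending examples.

```python
def split_values(cell):
    """Split a semicolon-separated cell into trimmed values; a blank cell gives an empty list."""
    if not cell.strip():
        return []
    return [part.strip() for part in cell.split(";")]


def pair_components(names_cell, types_cell):
    """Pair 'Protocol Component Name' values with 'Protocol Component Types' values by position."""
    names = split_values(names_cell)
    types = split_values(types_cell)
    if len(names) != len(types):
        raise ValueError(
            f"{len(names)} component names but {len(types)} component types"
        )
    return list(zip(names, types))
```

With hypothetical cells `"Scanner X; Feature extraction tool"` and `"instrument; software"`, this pairs the scanner with `instrument` and the tool with `software`, and it raises an error when the two lists differ in length.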

Study/Assay File – Recommendations
• The table represents a graph: each edge needs to appear at least once; nodes do not need to be repeated
• Microarray (technology) / Gene expression (endpoint) tab
  - Document how to represent the case when 2 different analysis protocols are applied to the same set of data files
  - In this case we follow MAGE-TAB by repeating the data file names vertically (only these; there is no need to repeat the previous columns), followed by a new analysis protocol and output data file names
• Factor Values can be referenced in both Study and Assay tabs
• But the same value cannot be in both tabs; examples to be added
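The table-as-graph idea can be sketched as follows. This simplified illustration treats every non-empty cell as a node and every pair of neighbouring non-empty cells in a row as an edge; protocol columns are left out, and the row contents and file names are hypothetical.

```python
def edges_from_rows(rows):
    """Collect the set of directed edges implied by the rows of a node table.

    Empty cells are skipped, so a row may start part-way along the graph
    without repeating the earlier nodes.
    """
    edges = set()
    for row in rows:
        path = [cell for cell in row if cell]
        edges.update(zip(path, path[1:]))
    return edges


# Two analyses applied to the same raw data file: the second row repeats
# only the data file name, followed by the new output file.
EXAMPLE_ROWS = [
    ["source1", "sample1", "raw1.txt", "norm_a.txt"],
    ["", "", "raw1.txt", "norm_b.txt"],
]
```

Because the result is a set, an edge that appears in several rows is stored once, which matches the requirement that each edge appears at least once while nodes need not be repeated.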

Tuesday, 17th Tools and implementations

Scripts and tools’ plans
• From ISA-TAB to FuGE ML
  - To be done (Phil, Ally wants this ;-)
  - Map the ISAcreator Java model to FuGE general elements
  - Ally to help checking/validating the mapping
• From FuGE ML to ISA-TAB
  - Work in progress, XSLT under development
  - Philippe, Frank, Ally, Nigel and Andy
• ISA-TAB creation
  - ISAcreator (will be open source)
  - Other tools from participating systems…
• ISA-TAB validation
  - Common, minimum validation rules/scripts to be defined/developed (e.g. structure, case sensitivity)
  - Use part of the ISAcreator configuration as a library
    -> Google doc with list of basic rules (to be identified when creating the v1 spec)
    -> The ISAcreator config code will be stripped down to the basic rules and posted on the ISA-TAB sf site (SVN)
• ISA-TAB and MAGE-TAB
  - Helen and Susanna to talk to MGED, ref to NIH grant

Wednesday, 18th Next steps and publication plans

Release plans
• Release candidate 1, ISA-TAB v1
  - Philippe, Marco to edit/add all the agreed changes in the spec -> done by June 27th
  - Dave, Ally, Kieran to check and review -> done by July 18th
  - All to read and comment/suggest -> wiki pages will be set up on the ISA-TAB site to facilitate discussion -> all comments received by end of August
  - Philippe to fix the current ISA-TAB examples to reflect the new spec -> Release candidate 1, ISA-TAB v1 out by mid Sept
• This version will include details on the fields in the Investigation file and the list of fields allowed in the Study and Assay files
  - The specific Assay files defined by the participating communities will be listed, and new ones can be added without having to release new versions

Pending issue
• Reference system for SEND and CDISC (SDTM)
  - Take this discussion to the ISA-TAB list with interested parties, Michael and Steve in particular
  - Subject ID in SDTM should be the same as Source Name in ISA-TAB (or add another subject ID column?); then add the file as external
  - Each SDTM file has a Study ID, Domain ID, Subject ID (2 types of these, probably we can use the USUBJID) and IDVAR (column) and IDVARVAL (column value)

Publication and next workshop
• Publication – suggested content
  - Rationale and use case for ISA-TAB
  - History from MAGE-TAB to ISA-TAB
  - Present it as a format (and interface to a format), not a ‘standard’
    -> Describe scripts making it ‘interoperable’ with other formats
  - Examples of implementations to date
    -> Tools/systems that have output/input in this format
    -> Also (simply) more real examples from communities posted on this site
  - Start writing ~end of this year, to submit ~early next year; journal to be decided later
• Next workshop would be a users meeting (in 2009)
  - To fix minor issues, recommendations, ambiguities, sharing development approaches, components etc.