An Introduction to Designing and Executing Workflows with Taverna Part 2 – Importing and exporting data Norman Morrison University of Manchester Credits:

Slides:



Advertisements
Similar presentations
Citavi – Adding References – Articles from EBSCOhost Databases
Advertisements

An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester.
1 CA202 Spreadsheet Application Combining Data from Multiple Sources Lecture # 6.
Newsletter Plugin The newsletter plugin allows you to create and send newsletters to a managed list or multiple lists of users. Your users can subscribe.
A Visual Follow-Along Guide to the Instructions of the NBTA Modular Hotel RFP.
BioMoby and Taverna Tutorial. Downloading Taverna ► Taverna can be obtained from:
Quark QuarkXPress 4 Foundation Level Course. What is QuarkXPress? This courseware teaches the fundamentals of QuarkXPress 4.1. It is a page layout application.
Microsoft Access 2007 Microsoft Access 2007 Introduction to Database Programs.
With Alex Conger – President of Webmajik.com FrontPage 2002 Level I (Intro & Training) FrontPage 2002 Level I (Intro & Training)
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
Unit J: Creating a Database Microsoft Office Illustrated Fundamentals.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.
4/22/2017 5:36 PM EViews Training Creating Workfiles.
Creating a Web Site to Gather Data and Conduct Research.
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
XP New Perspectives on Integrating Microsoft Office XP Tutorial 2 1 Integrating Microsoft Office XP Tutorial 2 – Integrating Word, Excel, and Access.
Introduction to Taverna, an environment For designing and executing workflows Franck Tanoh University of Manchester.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
For Version 4.0 and later Lattice3D Reporter Tutorial For Version 4.0 and later LATTICE TECHNOLOGY, INC.
 Objectives Objectives  Introduction Introduction  Exploring Excel Exploring Excel  Navigating a Worksheet Navigating a Worksheet  Workbook Workbook.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.
Performing statistical analyses using the Rshell processor Original material by Peter Li, University of Birmingham, UK Adapted by Norman.
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
Files Tutor: You will need ….
INTRODUCTION TO ACCESS. OBJECTIVES  Define the terms field, record, table, relational database, primary key, and foreign key  Create a blank database.
Kepler Exercise Deana Pennington University of New Mexico December 10, 2004.
Excel Basics. Differentiating between worksheets and spreadsheets Differentiating between workbooks and worksheets.
Chapter 2: Excel Basics and Formatting Spreadsheet-Based Decision Support Systems Prof. Name Position (123) University Name.
More Oracle SQL Scripts. Highlight (but don’t open) authors table, got o External data Excel, and make an external spreadsheet with the data.
How to Work With SURN Principal Academy Data For data downloaded from onlineobservationtools.com.
Designing, Executing and Sharing Workflows with Taverna 2.2 Katy Wolstencroft myGrid University of Manchester.
Exploring Taverna engine Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
Advanced Taverna Aleksandra Pawlik University of Manchester materials by Katy Wolstencroft, Aleksandra Pawlik, Alan Williams
Getting data out of XML These exercises provide an overview of how to use the native Taverna XPath services to get data out of XML.
An Introduction to Running, Reusing and Sharing Workflows with Taverna – part 2 Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
Exploring Taverna 2 Katy Wolstencroft myGrid University of Manchester.
Aleksandra Pawlik University of Manchester. Something that can be put into a workflow Well described - what the component does Behaves “well” - conforms.
Aleksandra Pawlik Alan Williams University of Manchester.
An Introduction to Designing, Executing and Sharing Workflows with Taverna BioVel Workshop 2011.
These exercises highlight the services that do not perform biological functions, but are vital for running life science workflows.
Designing, Executing and Sharing Workflows with Taverna 2.4 Different Service Types Katy Wolstencroft Helen Hulme myGrid University of Manchester.
Designing and Sharing Taverna Workflows: Exploring Taverna 2.1 Beta
Performing statistical analyses using the Rshell processor
Data Validation and Protecting Workbook
An Introduction to Designing and Executing Workflows with Taverna
Creating & Managing Workbooks
Microsoft Office Illustrated
Mail Merge Instructions (Yanick’s Version)
Exchanging Data with Other Programs
Advanced Data Import & Export Jeff Henrikson
Lesson 18 Getting Started with Excel Essentials
Microsoft Excel 101.
Taverna Tutorial exercise 2: REST services from BioCatalogue
Microsoft Excel 101.
An Introduction to Designing, Executing and Sharing Workflows with Taverna and myExperiment Katy Wolstencroft University of Manchester.
Microsoft Excel 101.
Shim (Helper) Services and Beanshell Services
Aleksandra Pawlik materials by Katy Wolstencroft
Overview of Contract Association Batch Upload
Help..
Amos Introduction In this tutorial, you will be briefly introduced to the student version of the SEM software known as Amos. You should download the current.
Scripts In Matlab.
Grauer and Barber Series Microsoft Excel Chapter One
Xpath service Getting data out of XML Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester 1.
REST Services Data and tools on the Web have been exposed in both WSDL and REST. Taverna provides a custom processor for accessing REST services Peter.
A drag and drop exercise can be created using Word quite easily using tables, text boxes and ensuring the document is saved properly.
Unit J: Creating a Database
An Introduction to Designing and Executing Workflows with Taverna
Excel Tips & Tricks July 18, 2019.
Presentation transcript:

An Introduction to Designing and Executing Workflows with Taverna Part 2 – Importing and exporting data Norman Morrison University of Manchester Credits: Aleksandra Pawlik and Katy Wolstencroft

 We can add input data into the workflow not only manually but also from a file. Go to myExperiment group and download a file called:  03B_species_1.txt  Click run workflow again but instead of selecting Set value select Set file location and navigate to where you saved the 03B_species_1.txt file

Instead of downloading the file we can point the workflow to the file’s URL (if we know it). Let’s run the workflow again but this time select “Set URL” and paste in: ad/03B_species_1.txt

 So far we have used simple text files, but it is also possible to use Spreadsheets as sources of input data. In order to do that we will need to add a Spreadsheet tool to our workflow.  From the myExperiment group download the file:  03C_species_list_1.xls  Open it on your machine and see what it contains (the list of the species name is in cells B3 to B6)  From the Service Templates select the Spreadsheet Import tool right-click on it and add it to the workflow

 In the pop up window set the correct range for columns and rows (untick the box “all rows”)

 We need to delete the input port for the workflow (right click on it and select Delete)  The Spreadsheet tool expects as an input the URL (or path) to the file. The best way to feed in that URL/path is to add a service called “Text constant”

 Where it says “Add your own value here” enter: nt.org/files/1108/version s/1/download/03C_spec ies_list_1.xls nt.org/files/1108/version s/1/download/03C_spec ies_list_1.xls  If you prefer you can insert the full path to your local file  Then Apply and Close

 Connect the Text constant with the Spreadsheet Import tool  Connect the Spreadsheet Import tool with the input to the GBIF service

 When we run the service, we can see that there are four values for the results (as there were 4 species names that we read from the spreadsheet). Taverna implicitly iterated over these 4 input values and processed them.

 Taverna allows you to save results in different formats and also allows you to save intermediate workflow results (which is very useful when you run a large workflow)  You can save all result values:  Taverna allows you to save values in a variety of formats

 You can also save each single value separately:  In order to save intermediate values, in the results tab select the part of the workflow which you want to save the values for, then in the results window you should see these values and you will be able to save them

 A shim is a service that doesn’t perform an experimental function, but acts as a connector, or glue, when 2 experimental services have incompatible outputs and inputs  A shim can be any type of service – WSDL, soaplab etc. Many are simple Beanshell scripts  Shims can also be used to preprocess data that are input into the workflow and we will use one of these shims for this exercise

 Create a directory called “data”  Copy over the files which we used for the previous exercise in to this directory:  03B_species_1.txt  03C_species_list_1.xls  From the myExperiment group download the following files to the same directory:  03D_species_2.txt  03E_species_list_2.xls

 Let’s assume you’re regularly having to deal with data in different formats - one of them is spreadsheet (csv or xls).  You know that the spreadsheet files always have the species names in column B starting from row 3 up to row 100 (some rows may be empty).  You can automate your workflow to pull the species names from all of these spreadsheets in a specified directory at once using a shim service.

 Delete the Text constant service in your workflow  From the Available Services select Local Services io and List Files by Extension

 Connect the shim service with the Spreadsheet tool  Right click on “file extension” and enter xls  Right click on directory, click constant vaue and enter the path to the Directory you just created caled “Data”.

 We need to reconfigure the Spreadsheet service  We’ll set the rows from 3 to 100  And make the service ignore the blank rows

 Run the workflow  When we look at the results we can see that Taverna  read the species names from both spreadsheets  ignored the text files  found the values for them using the GBIF service