Martin Senger [using also few slides from the presentations from the Galaxy Developers Conference 2011]

Slides:



Advertisements
Similar presentations
My EBSCOhost Tutorial Tutorial support.ebsco.com.
Advertisements

EPrints Web Configuratio n Management. SQL database Web server Scripts to configure repository activities Configuration files EPrints - the Administrator's.
Rclis in vision and reality Thomas Krichel
Grid Wizard Enterprise Basic Tutorial Using Web Control Panel.
Drupal Basics Part 1 An Overview Login Information Edit the Homepage Using the theme Agricultural Communications Services Integrated Media Training Sessions.
With Folder HelpDesk for Outlook, support centres and other helpdesks can work efficiently with support cases inside Microsoft Outlook. The support tickets.
A Toolbox for Blackboard Tim Roberts
Managing Data with iPlant Introduction to Uploading, Downloading, Sharing, and Metadata in the Data Store.
University of Nevada, Reno College of Business Administration What are we going to learn 9/27 – 9/29? 1. Answer questions about MS Access queries. 2. Understand.
Many kinds of clients and servers This work is licensed under a Creative Commons Attribution-Noncommercial- Share Alike 3.0 License. Skills: none IT concepts:
COMS E Cloud Computing and Data Center Networking
Different Streaming Technologies. Three major streaming technologies include:
WMS: Democratizing Data
UML exam advice. Minimal, yet sufficient UML course 80% of modeling can be done with 20% of the UML. Which 20% was that again? We’re supposed to be “Use.
© InLoox ® InLoox PM Web App product presentation The Online Project Software.
Mastering the Basics of Adobe Acrobat Pro ENGINEERING RESEARCH ADMINISTRATION.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
© 2009 IBM Corporation 1 RTC ClearQuest Importer and Synchronizer Lorelei Ngooi – RTC ClearQuest Synchronizer Lead.
Innovative Broadband Services C hanging the way we present information: Digital Signs : Effective business promotion Nikolaos Desypris, Chief Executive.
Oracle iLearning/Tutor Integration Jan  Oracle iLearning Overview  Oracle Tutor Overview  Benefits of integration  Manual integration process.
GMOD in the Cloud Genome Informatics November 3, 2011 Scott Cain GMOD Project Coordinator Ontario Institute for Cancer Research
Customized cloud platform for computing on your terms !
Implementation of HUBzero as a Knowledge Management System in a Large Organization HUBBUB Conference 2012 September 24 th, 2012 Gaurav Nanda, Jonathan.
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
Genomics Virtual Lab: analyze your data with a mouse click Igor Makunin School of Agriculture and Food Sciences, UQ, April 8, 2015.
Part 1. Persistent Data Web applications remember your setting by means of a database linked to the site.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
University of Illinois at Urbana-Champaign A Unified Platform for Archival Description and Access Christopher J. Prom, Christopher A. Rishel, Scott W.
…using Git/Tortoise Git
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
GCRC Meeting 2004 BIRN Coordinating Center Software Development Vicky Rowley.
Introduction To BlueMix By: Ryan
CS562 Advanced Java and Internet Application Introduction to the Computer Warehouse Web Application. Java Server Pages (JSP) Technology. By Team Alpha.
Trilinos Framework: Next Steps Thursday November 9, :45-9:30 a.m. Jim Willenbring Mike Phenow.
Web-based Front End for Kraken Jing Ai Jingfei Kong Yinghua Hu.
Globus.org/genomics Globus Galaxies Science Gateways as a Service Ravi K Madduri, University of Chicago and Argonne National Laboratory
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
: Information Retrieval อาจารย์ ธีภากรณ์ นฤมาณนลิณี
PIER USER TRAINING 1 Class Date and Location. Instructor Name Instructor Contact Info 2 PIER User Training 1.
Integrating Laserfiche and SharePoint PO108 Alex Wilson and Jessica Huang.
Using Galaxy to build and run data processing pipelines Jelle Scholtalbers / Charles Girardot GBCS Genome Biology Computational Support.
Galaxy in Production Nate Coraor Galaxy Team Penn State University.
//liveVirtualacademy2011/ What’s New for ASP.NET 4.5 and Web Development in Visual Studio 11 Developer Preview Γιώργος Καπνιάς MVP, MCT, MCDP, MCDBA, MCTS,
Business Applications– Using Java _____ Presented by Priya Saha.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Deploying Elastic Java EE Microservices in the Cloud with Docker
Scientific Data Processing Portal and Heterogeneous Computing Resources at NRC “Kurchatov Institute” V. Aulov, D. Drizhuk, A. Klimentov, R. Mashinistov,
Core ELN Training: Office Web Apps (OWA)
University of Chicago and ANL
CyVerse Discovery Environment
Data Bridge Solving diverse data access in scientific applications
Scalable systems.
Tools and Services Workshop
Store, Share, Sync and Collaborate
MCU cluster Cristian Alexe 18 October 2010.
Data uploading and sharing with CyVerse
MIK 2.1 DBNS - introduction to WS-PGRADE, 2013
SRA Submission Pipeline
Unit 27: Network Operating Systems
What’s New in Colectica 5.3 Part 1
Principles of report writing
Alexa Programming.
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Galaxy API.
Presentation transcript:

Martin Senger [using also few slides from the presentations from the Galaxy Developers Conference 2011]

1. What Galaxy can do... (or could do) 2. Show me where I can try for myself 3. What can we do to make our Galaxy better...and what this is not a detailed tutorial how to use Galaxy a way to convince you that I understand everything about Galaxy

 A web-based interface to the command-line tools (of any kind) and their combinations (“workflows”)  Galaxy performs analysis interactively through the web, on arbitrarily large datasets  Galaxy remembers what it did - history  Flexibility to include anybody’s command-line tools  by writing wrappers whose templates are available  An environment for sharing tools (or their wrappers)  “Tools Shed” repository

 Locally stored data  user-specific  shared between users  e.g. genome builds  Origin of data  uploaded data from your computer  using a web interface  using an FTP server  fetched from external databases (“datasources”)  only those that are “aware” of Galaxy  internally: two ways how to fetch data (async vs. sync.)  you need to be familiar with these databases and their UIs

1 2 3

Data have metadata allowing to use data only for those tools that recognize such data types Data have attributes annotate data convert data to a new format change data type

 Automated set of steps – perhaps each time with different input data (of the same type)  reproducibility (usable in publications)  reusability (sharing workflows with others)  created from the scratch (using a workflow editor) or from your history

An example – a workflow editor Thanks to:

user would not have done this from the command line on our cluster

If we have time (6mins) click here: Creating a workflow from your history

 Where are all these galaxies?  public servers  available immediately, free of charge   and few others, such as  usually limited resources  you cannot customize them to your special needs  KAUST/CBRC Galaxy   running on an internal cluster with limited resources  but we can do with it whatever we need to do  Galaxy in the Amazon clouds (CloudMan)  when you do not have infrastructure in house  when you have particular resource (cores, memory...) needs  when you need a customization  if you have a credit card  details in this presentation:  CloudManGalaxyOnTheCloud.pdf CloudManGalaxyOnTheCloud.pdf  Galaxy has also the RESTfull API for programmatic access (beta)

Image courtesy of need to:

 Data issues  add genome-wise data we (CBRC) need  add data usable for others (Core, students...)  Tools  make a subset of tools we really need and test them fully  consider to wrap other tools (not yet available by default)  Logistics  provide user-oriented courses  create a user group to share experience and to promote knowledge  monitor its stability and usage  Hardware/sysadmin issues  Install it on better hardware (in due time)  Change the current queue priority (a chicken-egg problem)  Add an ftp server

 Galaxy home page:   An overview presentation:  get&target=IntroductionSession.pdf