Transforming Science Through Data-driven Discovery Tools and Services Workshop Atmosphere Joslynn Lee – Data Science Educator Cold Spring Harbor Laboratory,

Presentation transcript:

Transforming Science Through Data-driven Discovery Tools and Services Workshop Atmosphere Joslynn Lee – Data Science Educator Cold Spring Harbor Laboratory, DNA Learning Center

Welcome to Atmosphere: Custom Cloud Computing for Life Sciences

What is Cloud Computing? Yet another round of jargon. In the simplest terms, cloud computing means storing and accessing data and programs over the Internet instead of on your computer's hard drive.

Why Cloud Computing? Yet another round of jargon. Biological research problems range from those requiring intense computation to those requiring little computation. The size of data sets varies from MB to GB to TB. It is advantageous to have a shared high performance computing (HPC) cluster and storage resources: BigDog at SIU! Atmosphere reduces the extensive time, resources, and overhead needed to set up analyses. Utilize virtual machines.

What is Cloud Computing? Important concept: Image, a template of a virtual machine containing an installed OS, software, and configuration. [Diagram labels: original system; local storage (files/data); image (file); document(s) (file); copied document(s) (file); new “clone” system.]

Working with ‘Big’ Data. Important concept: Instance, a launched image of a virtual machine. CyVerse Cloud (Disk + CPU + Memory) + (Image) = Atmosphere Instance (virtual machine). Anything that you would normally be able to do on your local laptop/desktop, you can do on a virtual machine in Atmosphere.
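
The image/instance distinction above can be sketched as a small data model. This is only an illustration, not Atmosphere's actual API: the class names, the `launch` function, the image name, and the disk sizes are all hypothetical. The 16-core / 128 GB figure is the maximum instance size quoted in the overview slide.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Image:
    """Template of a virtual machine: installed OS, software, configuration."""
    name: str
    os: str
    software: Tuple[str, ...] = ()


@dataclass(frozen=True)
class Resources:
    """Cloud resources assigned at launch (Disk + CPU + Memory)."""
    cpu_cores: int
    memory_gb: int
    disk_gb: int


@dataclass
class Instance:
    """A launched image: the template combined with cloud resources."""
    image: Image
    resources: Resources
    status: str = "active"


def launch(image: Image, resources: Resources) -> Instance:
    """Hypothetical helper: combine an image with resources into an instance."""
    return Instance(image=image, resources=resources)


# One image can be launched many times, each time with different resources.
example_image = Image(name="example-image", os="Ubuntu", software=("R", "Python"))
small = launch(example_image, Resources(cpu_cores=1, memory_gb=4, disk_gb=30))
large = launch(example_image, Resources(cpu_cores=16, memory_gb=128, disk_gb=100))
```

The image is frozen because it is a template; each instance built from it is a separate object with its own status and resources.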

Atmosphere Overview. Largest, easiest to use for Life Sciences. Choose an existing image or customize. Instances up to 16-Core / 128 GB RAM. Access via iCommands (shell) or VNC. Share your images with selected users, or make them public.

Atmosphere Overview. Connecting to your instance. Clients: VNC Viewer, PuTTY. Windows: VNC Viewer, PuTTY. Mac: VNC Viewer, shell/terminal. Linux: VNC Viewer, shell/terminal.
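
For the shell/terminal route on Mac and Linux, the connection is typically made with the standard `ssh` client. The sketch below only assembles that command line. It assumes the instance accepts SSH logins under your username at the instance's IP address; the username and the address (taken from a range reserved for documentation) are placeholders, and `ssh_command` is a hypothetical helper.

```python
import shlex


def ssh_command(username: str, ip_address: str) -> list:
    """Return the argument list for opening a shell on an instance."""
    return ["ssh", f"{username}@{ip_address}"]


# Placeholder values: substitute your own username and your instance's IP.
cmd = ssh_command("your_username", "203.0.113.10")
print(shlex.join(cmd))
```

On Windows, the same host and username would be entered in the PuTTY connection dialog instead.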

Atmosphere Overview. Connecting to your instance. Work in an on-demand Linux environment (most bioinformatics). Collaborate with students and colleagues on the same instance. Get Science Done: Reproducibility, Productivity. Multicore, high-memory images to run multithreaded applications. Move your analyses from your laptop to the cloud. Make data, workflows, and analyses available in a public image. Access previous software versions and images.

Hands-on Demo. Workshop packet: Atmosphere Cloud Computing, page 20. Hand out the usernames / passwords.

Atmosphere Overview. Hands-on demo: Atmosphere Cloud Computing. Launch and connect to Atmosphere (page 23). Connect to your instance (page 23). Connect via VNC (page 24). By the end of this demo, you should be able to: select and launch an instance; connect your instance to the Data Store; use an application in Atmosphere; understand how to pause, stop and terminate instances.

Atmosphere Overview. Log in to Atmosphere: scroll down to the icons. Sign in at the top right corner.

Atmosphere Overview. Key things to remember when you try this yourself. Images do not have automatic access to your Data Store: use Cyberduck to access the Data Store, or use iCommands. Users have monthly allocation limits: terminate or stop instances not in use; if a larger allocation is needed, contact support. All data on terminated instances will be destroyed: use Cyberduck or iCommands to transfer data off the instance. You may also create an EBS Volume (see documentation).
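
Since data on a terminated instance is destroyed, results need to be copied to the Data Store first. A minimal sketch of doing that with iCommands follows. It assumes iCommands is installed on the instance and that `iinit` has already been run to set up the connection. The helper name and both paths are hypothetical placeholders; `-K` (verify checksum) and `-r` (recursive) are standard `iput` options.

```python
import shlex
import subprocess


def iput_command(local_path: str, collection: str, recursive: bool = False) -> list:
    """Build an iput command that uploads local_path into a Data Store collection."""
    cmd = ["iput", "-K"]  # -K: verify the checksum after the transfer
    if recursive:
        cmd.append("-r")  # -r: upload a directory and everything under it
    cmd += [local_path, collection]
    return cmd


def run(cmd: list) -> None:
    """Execute the command; raises CalledProcessError if the transfer fails."""
    subprocess.run(cmd, check=True)


# Placeholder paths: a local results directory and a collection in your home.
upload = iput_command("results", "/iplant/home/your_username/analysis", recursive=True)
print(shlex.join(upload))
# run(upload)  # uncomment on an instance where iCommands is configured
```

Building the command separately from running it makes it easy to inspect exactly what will be transferred before terminating the instance.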

Atmosphere Overview. User perspectives and possible applications. Learned how to use the shell and how to work with Linux. Mastered using R to develop plots for his manuscript. Launches an image and has full sudo access to customize. Developed software with numerous R and Python library dependencies; she updates it regularly by making a new image. Linked several Atmosphere instances with Apache Hadoop. Worked with CyVerse to import an existing Amazon image. [User types shown: Bioinformatician, Core Facilities, Bench Scientist.]

Help: ask.iplantcollaborative.org. Detailed instructions with videos, manuals, and documentation in the CyVerse Wiki. Search by tag.

Executive Team: Parker Antin, Nirav Merchant, Eric Lyons, Matt Vaughn, Doreen Ware, Dave Micklos. CyVerse is supported by the National Science Foundation under Grant No. DBI and DBI. Transforming Science Through Data-driven Discovery