Low Cost, Scalable Proteomics Data Analysis Using Amazon's Cloud Computing Services and Open Source Search Algorithms Brian D. Halligan, Ph.D. Medical College of Wisconsin.

Presentation transcript:

Low Cost, Scalable Proteomics Data Analysis Using Amazon's Cloud Computing Services and Open Source Search Algorithms Brian D. Halligan, Ph.D. Medical College of Wisconsin ViPDAC

What is ViPDAC?
ViPDAC => Virtual Proteomics Data Analysis Cluster
One of the slowest parts of proteomics is data analysis.
Single-CPU machines analyze data much more slowly than instruments can generate it.
Computer clusters offer increased speed but are costly to implement and maintain.

Cloud Computing
Distributed or cloud computing lets you run compute-intensive tasks on virtual computers without having to own the hardware.
Amazon has built large-scale computing facilities that it offers for use on an hourly basis.
The cost of analysis using this system is very low, and the size of the cluster can expand, contract or even disappear based on need.

Amazon Web Services (AWS)
EC2 – Amazon Elastic Compute Cloud: “a web service that provides resizable compute capacity in the cloud. It is designed to make web-scale computing easier for developers.”
S3 – Amazon Simple Storage Service: “provides a simple web services interface that can be used to store and retrieve any amount of data, at any time, from anywhere on the web.”

Overview

Workflow

Time vs. Nodes

ViPDAC Costs per Run

Advantages of ViPDAC
Low cost
– No startup costs
– Low hourly usage costs
– No cost when not in use
Scalable
– Everyone is first in line
– Launch as few or as many worker nodes as needed
– Fast costs the same as slow
– 1 instance for 20 hrs = 20 instances for 1 hr
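The "fast costs the same as slow" point is simple instance-hour arithmetic, which the following minimal sketch illustrates. The function name `run_cost`, the hourly rate, and the rule that partial hours are rounded up are assumptions made for illustration; none of them come from the slides. The sketch also assumes the work divides evenly across nodes, which real runs only approximate.

```python
import math


def run_cost(instances, hours_each, rate_per_instance_hour):
    """Cost of one run when billing is per instance-hour.

    Assumption: a partial hour is billed as a full hour.
    """
    billed_hours = math.ceil(hours_each)
    return instances * billed_hours * rate_per_instance_hour


# Hypothetical price in dollars per instance-hour (not taken from the slides).
RATE = 0.10

slow = run_cost(instances=1, hours_each=20, rate_per_instance_hour=RATE)
fast = run_cost(instances=20, hours_each=1, rate_per_instance_hour=RATE)

# Both runs consume 20 instance-hours, so they cost the same.
print(slow == fast)
```

Under hour-rounded billing, adding nodes stops being free once the per-node time drops below a full hour, because each node is still billed for that hour.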

Advantages of ViPDAC
Secure
– Data is stored and transferred in a secure system
– Your data/database does not leave your control and is not seen or shared with others
Stable
– AMI can be cloned and saved
– Consistent data analysis for long term projects
– SOP across laboratories

Advantages of ViPDAC
Cost Accounting
– Very easy to determine the cost of a single run with ViPDAC compared to a physical cluster
Freedom to experiment
– Can perform complex analysis on a dataset without blocking routine analysis
– Custom interface or analysis

Acknowledgments
Joey F. Geiger
Andrew K. Vallejos
Simon N. Twigger
Andrew S. Greene – MCW NHLBI Proteomics Center
