IPDPS - April 20, 2010 Jie Li 1, Deb Agarwal 2, Marty Humphrey 1, Keith Jackson 2, Catharine van Ingen 3, Youngryel Ryu 4 University of Virginia eScience.

Slides:



Advertisements
Similar presentations
System Center 2012 R2 Overview
Advertisements

Page 1 Ricardo Villalobos Windows Azure Architect Evangelist Microsoft Corporation Designing, Building, and Deploying Windows Azure applications.
Microsoft Cloud Futures 2010 April 9, 2010 Jie Li 1, Youngryel Ryu 2, Deb Agarwal 3, Keith Jackson 3, Marty Humphrey 1, Catharine van Ingen 4 University.
Overview Of Microsoft New Technology ENTER. Processing....
12 → Wed Umbraco User Group Kristofer Liljeblad Windows Azure Kristofer Liljeblad, Microsoft Developer Evangelist
MS CLOUD DB - AZURE SQL DB Fault Tolerance by Subha Vasudevan Christina Burnett.
Inside Windows Azure Virtual Machines Vijay Rajagopalan Microsoft Corporation.
Joey Snow Technical Evanglist Microsoft Corporation SESSION CODE: WSV310.
Cloud Computing Systems Lin Gu Hong Kong University of Science and Technology Sept. 21, 2011 Windows Azure—Overview.
Name Title Microsoft Windows Azure: Migrating Web Applications.
University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn 1 Quinn Hart 1 Bertram Ludäscher 1 Yogesh Simmhan.
Free, online, technical courses Take a free online course. Microsoft Virtual Academy.
Windows Azure Storage Services Saranya Sriram, Technology Evangelist, Microsoft, India.
VM Role (PaaS)Virtual Machine (IaaS) StorageNon-Persistent StoragePersistent Storage Easily add additional storage DeploymentBuild VHD offsite and upload.
Migrating Applications to Windows Azure Virtual Machines Michael Washam Senior Technical Evangelist Microsoft Corporation.
Platform as a Service PaaS
Cloud MapReduce : a MapReduce Implementation on top of a Cloud Operating System Speaker : 童耀民 MA1G Authors: Huan Liu, Dan Orban Accenture.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Components of Windows Azure - more detail. Windows Azure Components Windows Azure PaaS ApplicationsWindows Azure Service Model Runtimes.NET 3.5/4, ASP.NET,
Microsoft Virtual Academy. Microsoft Virtual Academy.
Migrating Business Apps to Windows Azure Marc Müller Principal Consultant, 4tecture GmbH
Getting Started with Windows Azure Name Title Microsoft Corporation.
A Performance Evaluation of Azure and Nimbus Clouds for Scientific Applications Radu Tudoran KerData Team Inria Rennes ENS Cachan 10 April 2012 Joint work.
Monitoring Latency Sensitive Enterprise Applications on the Cloud Shankar Narayanan Ashiwan Sivakumar.
Larisa kocsis priya ragupathy
Austin code camp 2010 asp.net apps with azure table storage PRESENTED BY CHANDER SHEKHAR DHALL
Mostafa Abdollahi Mazandaran University Of Science And Technology January 2011.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
Overview of Cloud Computing Sven Rosvall ACCU
Operating System for the Cloud Runs applications in the cloud Provides Storage Application Management Windows Azure ideal for applications needing:
Microsoft Azure SoftUni Team Technical Trainers Software University
Windows Azure Migrating Applications and Workloads Speaker Title Organization.
Windows Azure Virtual Machines Anton Boyko. A Continuous Offering From Private to Public Cloud.
Virtual techdays INDIA │ august 2010 Cloud Computing – What and How ? Sandeep J Alur │ Microsoft India.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
Efficient Live Checkpointing Mechanisms for computation and memory-intensive VMs in a data center Kasidit Chanchio Vasabilab Dept of Computer Science,
Zvezdan Pavković. Storage Non-Persistent Storage Persistent Storage Easily add additional storage. Networking Internal and Input Endpoints configured.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
 Mike Martin  Architect  MEET Member  Crew Member of Azug  Windows Azure Insider  Windows Azure MVP  
Windows Azure Custom Software Development Mobile Middleware Windows Azure Compute Dipl.-Ing. Damir Dobric Lead Architect daenet
Parallel Applications And Tools For Cloud Computing Environments CloudCom 2010 Indianapolis, Indiana, USA Nov 30 – Dec 3, 2010.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Azure in a Day Training: Windows Azure Module 1: Windows Azure Overview Module 2: Development Environment / Portal – DEMO: Signing up for Windows Azure.
Enabling the Cloud OS Today  New high-density Web Sites with elastic cloud scaling and complete dev-ops experiences  New rich IaaS experience for self-service.
Microsoft Cloud Computing. Topics to be covered 1.Environmental Features of windows azure 2.What is Cloud Computing 3.Roles in Cloud Computing 4.Benefits.
Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
Building Cloud Solutions Presenter Name Position or role Microsoft Azure.
(re)-Architecting cloud applications on the windows Azure platform CLAEYS Kurt Technology Solution Professional Microsoft EMEA.
Building web applications with the Windows Azure Platform Ido Flatow | Senior Architect | Sela | This session.
1 Cloud Computing, CS An OS for Multicore and Cloud + Microsoft Azure Platform.
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
Windows Azure and iOS Chris Risner Windows Azure Technical Evangelist Microsoft
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
Deploying Web Application
Platform as a Service.
Logo here Module 3 Microsoft Azure Web App. Logo here Module Overview Introduction to App Service Overview of Web Apps Hosting Web Applications in Azure.
Windows Azure Migrating SQL Server Workloads
Job Management on Azure Batch/Scheduler
02 | Hosting Services in Windows Azure
Windows Azure 講師: 李智樺, Ruddy Lee
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Outline Virtualization Cloud Computing Microsoft Azure Platform
Windows Azure Cloud.
Saranya Sriram Developer Evangelist | Microsoft
Developing for Windows Azure
5 Azure Services Every .NET Developer Needs to Know
Building global and highly-available services using Windows Azure
02 – Cloud Services Bret Stateham | Senior Technical Evangelist​
Presentation transcript:

IPDPS - April 20, 2010 Jie Li 1, Deb Agarwal 2, Marty Humphrey 1, Keith Jackson 2, Catharine van Ingen 3, Youngryel Ryu 4 University of Virginia eScience Group 1 Lawrence Berkeley National Lab 2 Microsoft Research 3 University of California, Berkeley 4 1

 Background  AzureMODIS Framework Overview  Dynamic Scalability & Fault Tolerance  Evaluation  Conclusions & Future Work 2

 Increasing data availability for science discoveries ◦ Growing data size from large scientific instruments ◦ Emerging large-scale inexpensive ground-based sensors  Computational models with increasing complexities and precisions Raw Data Scientific Results ? 3 Resources? Apps &Tools?

 Moderate Resolution Imaging Spectroradiometer Satellites: ◦ Viewing the entire Earth's surface every 1 to 2 days ◦ Acquiring data in 36 spectral bands ◦ Multiple data products (Atmosphere, Land, Ocean etc.) ◦ Important for understanding global environment and earth system models 4

 Data Collection ◦ Multiple FTP sites for MODIS source data ◦ Metadata maintained separately  Data Heterogeneity ◦ Different time granularities and imaging resolutions ◦ Two different project types: “Swath” and “Sinusoidal”  Data Management ◦ Current use case: 10 years of data covering US continent ◦ 5 TB source data (~600,000 files) ◦ 2 TB timeframe- and space-aligned harmonized data ◦ ~50000 CPU hours of parallel computation 5

 A MODIS Data Processing Framework in Microsoft Windows Azure cloud computing platform ◦ Leverage scalability of cloud infrastructure and services ◦ Dynamic, on-demand resource provisioning ◦ Automate data processing tasks to eliminate barriers ◦ A generic Reduction Service to run arbitrary analysis executables MODIS Source Data Scientific Results Windows Azure Cloud Computing Platform AzureMODIS Service Framework 6

 Background  AzureMODIS Framework Overview  Dynamic Scalability & Fault Tolerance  Evaluation  Conclusions & Future Work 7

 Hosted Services ◦ Web Role: Host web applications via an HTTP and/or an HTTPS endpoint ◦ Worker Role: Host user-customized code/applications  Storage Services ◦ Blob service: Storage for entities in the form of binary bits ◦ Queue Service: A reliable, persistent queue model for message-based communication between instances ◦ Table Service: Structured storage in the form of tables, with simple query support 8

9 1. Scientist submits requests for computation on the web portal 2. The request is received and processed by the service monitor 3. Service Workers query the metadata in Azure tables to download source 4. The specified source data are uploaded to the Azure blob storage 5. The heterogeneous sources are reprojected into uniform format 6. Scientist uploads arbitrary executables to work on the uniform data 7. A single download link to the results is sent back to the scientist

10

11 UserWeb Portal (Web Role) Job Request … Job Queue Service Monitor (Worker Role) ReductionJobStatus Table Persist ReductionTaskStatus Table … Dispatch Task Queue Parse & Persist GenericWorker (Worker Role) … … Points to Sinusoidal Land Source Storage Reprojected Data Storage Reduction Result Storage Download Link to Results

 Blob storage level ◦ Each data file (blob) has a global unique identifier ◦ (Pre-)download and cache all source files in blob storage ◦ (Pre-)compute reprojection results for reuse across computations  Local machine level ◦ Each small size instance has ~250GB local storage ◦ Cache large size data files for reuse  Cost-related Trade offs ◦ Data re-generation cost VS. Blob storage cost ◦ For our case, data re-computation is too expensive 12

 Scientists upload their analysis binary tools upon request for the reduction service  Benefits ◦ Scientists can easily debug and refine scientific models in their code ◦ Separate system code debugging from science code debugging  A 2 nd reduction stage to support more comprehensive computation flows 13

 Project Background  AzureMODIS Framework Overview  Dynamic Scalability & Fault Tolerance  Evaluation  Conclusions & Future Work 14

 Use the Azure Management API to dynamically scale up/down instances according to work loads  Dynamic instance shutdown could be a problem ◦ Azure decides which instance to shutdown ◦ Instances may be shutdown during task execution  Currently, computing instance usage are charged by hours ◦ Use CPU hours wisely when applying dynamic scaling strategies 15

 In contrast, the shutdown time for the instances is small (usually within 3 minutes) 16 Instance Start Up Time (Test Date: March 31, 2010)

 Tasks can fail for many reasons ◦ Broken or missing source data files — Unrecoverable ◦ Reduction tool may crash due to code bug — Unrecoverable ◦ Failures caused by system instability — Recoverable  Customized task retry policies ◦ Task with timeout failures will be resent to the task queue ◦ Task with exceptions caught will be immediately resent ◦ Task canceled after 2 retries (Totally 3 executions)  Why not just use queue message visibility settings for failure recovery? 17

18

 Project Background  AzureMODIS Framework Overview  Dynamic Scalability & Fault Tolerance  Evaluation  Conclusions & Future Work 19

DesktopAzure Instance Capacity CPU: Intel Core2Duo 3.0GHZ Memory: 4GB Hard Disk: 1TB SATA Network: 1Gbps Ethernet OS: Windows 7 (32-bit) CPU: 1.6GHZ X64 equivalent processor Memory: 2GB Local Storage: 250GB Network: 100Mbps OS: Windows 2008 Server x64 (64-bit) MOD04_L2MOD06_L2MYD11_L instances instances instances Desktop Table 3. Processing time for 1500 reprojection tasks (Unit: hours) Table 2. Capacity of desktop machine and a single Azure instance 20 Fig. 1 Performance speedups over a single desktop

21  Accumulated time for data transfer from/to Azure blob storage increases as #VM increases

 Project Background  AzureMODIS Framework Overview  Dynamic Scalability & Fault Tolerance  Evaluation  Conclusions & Future Work 22

 Cloud computing provides new capabilities and opportunities for data-intensive eScience research  Dynamic scalability is powerful, but instance start up overhead is not trivial  Built-in fault tolerance & diagnostic features are important in the face of common failures in large- scale cloud applications and systems 23

 Scale up computations from US continent to the global scale  Develop and evaluate a generic dynamic scaling mechanism with AzureMODIS  Evaluate the similarities/differences between our framework and other generic parallel computing frameworks such as MapReduce 24

Thank you! & Questions? 25