JamesRH 2007.06.11  7 major AWS Services (http://aws.amazon.com)http://aws.amazon.com  Amazon E-Commerce Service (ECS)  Amazon.

Slides:



Advertisements
Similar presentations
Running Your Startup on Amazon Web Services Alex Iskold Founder/CEO AdaptiveBlue Feature Writer ReadWriteWeb.
Advertisements

Creating HIPAA-Compliant Medical Data Applications with Amazon Web Services Presented by, Tulika Srivastava Purdue University.
Cloud Service Models and Performance Ang Li 09/13/2010.
Web Scale Computing Mike Culver Amazon Web Services.
Amazon S3 – An SOA By: Minam Ulhaq.
STANFORD UNIVERSITY INFORMATION TECHNOLOGY SERVICES IT Services Storage And Backup Low Cost Central Storage (LCCS) January 9,
Making Cloud Storage Provenance- Aware Kiran-Kumar Muniswamy-Reddy, Peter Macko, and Margo Seltzer Harvard School of Engineering and Applied Sciences.
Bill Boebel, CTO of Webmail.us & Mark Washenberger, SW Engineer at Webmail.us Creating an Archiving Service with Amazon S3.
Amazon Web Services (aws) B. Ramamurthy. Introduction  Amazon.com, the online market place for goods, has leveraged the services that worked for their.
University of Notre Dame
© 2013 A. Haeberlen, Z. Ives Cloud Storage & Case Studies NETS 212: Scalable & Cloud Computing Fall 2014 Z. Ives University of Pennsylvania 1.
Amazon Web Services and Eucalyptus
Amazon CloudFront An introductory discussion. What is Amazon CloudFront? 5/31/20122© e-Zest Solutions Ltd. Amazon CloudFront is a web service for content.
Webscale Computing Mike Culver Amazon Web Services.
The Zebra Striped Network Filesystem. Approach Increase throughput, reliability by striping file data across multiple servers Data from each client is.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
File Systems (2). Readings r Silbershatz et al: 11.8.
Platform as a Service (PaaS)
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
Google App Engine Google APIs OAuth Facebook Graph API
Amazon EC2 Quick Start adapted from EC2_GetStarted.html.
Google App Engine Danail Alexiev Technical Trainer SoftAcad.bg.
Windows Azure Storage Services Saranya Sriram, Technology Evangelist, Microsoft, India.
Introduction to Amazon Web Services (AWS)
“Salesforce” - meet Amazon Cloud Upload unlimited files sizes to your salesforce Get Started On Demand Files Storage for Salesforce Cloud Storage, a Salesforce.
JICSIT/ITAIC 2011 Keynote 1 Yinong Chen (Ph.D.) Arizona State University, Tempe, Arizona Service-Oriented Computing.
CIS 451: Web Services Dr. Ralph D. Westfall March, 2009.
EE616 Technical Project Video Hosting Architecture By Phillip Sutton.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Larisa kocsis priya ragupathy
Microsoft Visual Studio 2010 Muhammad Zubair MS (FAST-NU) Experience: 5+ Years Contact:- Cell#:
Webscale Computing Mike Culver Amazon Web Services.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 3: Operating-System Structures System Components Operating System Services.
1 NETE4631 Working with Cloud-based Storage Lecture Notes #11.
Amit Warke Jerry Philip Lateef Yusuf Supraja Narasimhan Back2Cloud: Remote Backup Service.
Aaron Nathan CS6464 Final Project vapor_disk. Idea Provide a “real time” backup that is widely and easily available Simple to use Works in Windows.
AWS Amazon Web Services Georges Akpoly CS252. Overview of AWS Amazon Elastic Compute Cloud (EC2) Amazon Simple Storage Service (S3) Amazon Simple Queue.
HADOOP DISTRIBUTED FILE SYSTEM HDFS Reliability Based on “The Hadoop Distributed File System” K. Shvachko et al., MSST 2010 Michael Tsitrin 26/05/13.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
Web Server Design Week 15 Old Dominion University Department of Computer Science CS 495/595 Spring 2010 Martin Klein 4/21/10.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
The overview How the open market works. Players and Bodies  The main players are –The component supplier  Document  Binary –The authorized supplier.
Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
CloudBerry Explorer for S3. CB Explorer Free to use Browse and manage files PowerShell functions Open and edit files  CloudBerry Explorer is an easy.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
Fault – Tolerant Distributed Multimedia Streaming Web Application By Nirvan Sagar – Srishti Ganjoo – Syed Shahbaaz Safir
Presenter: Yue Zhu, Linghan Zhang A Novel Approach to Improving the Efficiency of Storing and Accessing Small Files on Hadoop: a Case Study by PowerPoint.
Compute and Storage For the Farm at Jlab
Platform as a Service (PaaS)
Course: Cluster, grid and cloud computing systems Course author: Prof
Amazon Web Services (aws)
Platform as a Service (PaaS)
AWS Integration in Distributed Computing
Data Bridge Solving diverse data access in scientific applications
Amazon Storage- S3 and Glacier
Amazon Web Services Submitted By- Section - B Group - 4
Windows Azure Migrating SQL Server Workloads
Web Caching? Web Caching:.
AWS COURSE DEMO BY PROFESSIONAL-GURU. Amazon History Ladder & Offering.
The Google File System Sanjay Ghemawat, Howard Gobioff and Shun-Tak Leung Google Presented by Jiamin Huang EECS 582 – W16.
Google App Engine Danail Alexiev
CDISC SHARE API v1.0 CAC Update 22 February 2018
Building a Database on S3
AWS Cloud Computing Masaki.
Web Server Design Week 16 Old Dominion University
Web Server Design Week 16 Old Dominion University
Enterprise Java Beans.
Presentation transcript:

JamesRH

 7 major AWS Services (  Amazon E-Commerce Service (ECS)  Amazon Elastic Compute Cloud Beta (EC2)  Amazon Historical Pricing  Amazon Mechanical Turk Beta  Amazon Simple Storage Service (S3)  Amazon Simple Queue Service (SQS)  Alexa Web Services  Originally planning to write against both EC2 & S3 but EC2 beta was closed until recently  There are EC2 scaling issues (confirmed by Bezos at Web2.0)  S3 Scaling well:  5B objects  Peak transaction rate:  Day: 923M  Second: 16,67  This demo app was written to learn the S3 interfaces, their strengths, and weaknesses 4/17/20152

 Went to aws.amazon.com  Signed into Amazon account  Same one used for Amazon shopping  Fill in additional contact information  agreed to terms and conditions  AWS provides Access Key & Secret Key  Tested using  Simple web app supporting most AWS operations via SOAP  Failed with: “Client.NotSignedUp”  showed up later complaining about credit card out of date  I corrected the payment issue and was up and running  User provisioning is fast and efficient  Overall, quick and easy process 4/17/20153

 Blob store with per blob metadata  Blobs supported up to 5GB  ~36.4 hour upload via Comcast “broadband”  Metadata up to 4k in name/value pairs (2k of user metadata)  Up to 100 ACLs on each object (part of system metadata)  Read, write, read_ACP, write_ACP, and full_control  Owner has full control and is not changeable  Each blob stored in a bucket  Bucket name space shared by ALL S3 users  All buckets have a single unchanging owner  Each blob has a URI identifier in a flat per-bucket namespace  Directory hierarchy simulated  Get(key=“C:/X/Y/Z.txt”) gets the file  Get(key=“C:/X”) gets all objects with leading “C:/X/” in URI  Get(key=“C:/X/”, separator=“/”) gets all objects with leading “C:/X/” but no subsequent “/” 4/17/20154

 Client libraries offered as samples for:  Java SOAP, Java REST, C# SOAP, C# REST, Ruby REST, Python REST, & Perl REST  BitTorent also supported  Docs at protocol layer rather than APIs  API implementations uneven without docs  C# SOAP has max send of 1M (can use DIME attachment for larger)  C# REST doesn’t support direct streaming  Source provided so quality & functionality issues all have solution 4/17/20155

conn = new AWSAuthConnection(key, secretKey); res = conn.delete(Bucket, key, null); Soap request Headers: - 10T01CX2MA2M9WD25002 NbT52AjYRzhHdA490B+T20KBYYg= T20:10:28.000Z 10T01CX2MA2M9WD25002-Archiver C:/JamesRH/dev/AWS/tst/test.tst*1 SOAP response Headers No Content 4/17/20156 REST Access Key:AWS : Base64(hmac-sha1(VERB, Content-MD5, Content type, Date, AMZ headers, resourceKey )) SOAP Model Doesn’t sign

 S3 Archiver is a version store  Registered directories & sub-directories contents are stored in S3  Each new file version is stored separately  Any version of any file can be restored in place  Multiple versions of a file can be restored with version suffixes on filename  Any file version can be deleted from S3  Roughly 3,500 lines of C# written against C# REST interface 4/17/20157  Performance:  12,220 files, 12,500 versions totaling 5.1 GB synchronized in under 30 seconds (local- only operation)  Actual transfers very slow ranging from roughly 1GB/hour to as low as 100 MB/hour (28 KBs)  Issues mostly network bandwidth-related on the perf side and standard internet flakiness on the reliability side

 Files stored in S3 with version numbers  C:\JamesRH\OpsFriendlyApps.docsx*10  Metadata as name/value pairs with each file:  Version  Size  Hash  Last Write  Create Time  Access Time  Archive Time  Fetching metadata for 1,000s of files slow so have a cache stored in S3  S3_Archiver.xml  Large scale testing required real data so implemented encryption  Single file operations slow and very uneven in performance so need to run all non-metadata-only operations asynchronously  On scale-up to 1000s of files S3_Archiver.xml gets large and slow so implemented compression  On scale-up to 10s of 1000s of files S3_Archiver.xml still slow to transfer so check hash and avoid transfers when possible 4/17/20158 C:/user/file1*1 C:/user/file1*2 C:/user/file2*1 C:/file3*1 cache cache*C:/usr/libary1 cache*C:/usr/library2

4/17/20159

 Error rate ~4 in 1,000 and similar across different networks:  Comcast:  Recv: 43 to KBs  Send: 5 to 52 KBs  Microsoft:  Recv : 317 to 10,935 KBs  Send: 437 to 5,556 KBs  Home Large File:  Send Fast/Slow ratio: 9.8x  Recv Fast/Slow ratio: 12.9x  Recv/Send ratio: 10.8x  Work Large File:  Send Fast/Slow ratio: 12.7x  Recv Fast/Slow ratio: 34.4x  Recv/Send ratio: 1.9x  Conclusion:  Send to receive difference ranges from a factor of 2 to an order of magnitude  2 orders of magnitude spread in throughput  Internet error rate high  5+ orders of magnitude higher latency than local disk 4/17/ Source: one week hourly perf & reliability probe of S3 at 1k, 10k, 100k, 1m, & 25m

4/17/ Kbs 4 errors Errors show as negative throughput

4/17/ Kbs Errors show as negative throughput 1 error

4/17/ Kbs Errors show as negative throughput 2 errors 3 errors

4/17/ Kbs Errors show as negative throughput 1 error

Summary of This Month's Activity as of March 14, 2007 Billing Cycle for this Report: March 1 - March 31, 2007 Usage Charges Rate Usage Totals Amazon Simple Storage Service $0.20 per GB of data transferred (details) GB $3.04 details $0.15 per GB-Month of storage used (details) GB-Mo $0.04 details Total:$3.08 All web services are sold by Amazon Digital Services, Inc. Summary of Last Month's Statement: February 1 - February 28, 2007 Total Charges for February 1 - February 28, 2007: $0.07  Amazingly inexpensive:  Developed a 3,000 line interactive system storing 1,000s of files and several GB stored with many test cycles at $3.11 4/17/201515

 Current bandwidth price (through May 31, 2007)  $0.20 / GB – uploaded  $0.20 / GB - downloaded  New bandwidth price (effective June 1, 2007)  $0.10 per GB - all data uploaded  $0.18 per GB - first 10 TB / month data downloaded  $0.16 per GB - next 40 TB / month data downloaded  $0.13 per GB - data downloaded / month over 50 TB  Data transferred between Amazon S3 and Amazon EC2 will remain free  New request-based price (effective June 1, 2007)  $0.01 per 1,000 PUT or LIST requests  $0.01 per 10,000 GET and all other requests  No charge for delete requests  Storage will continue to be charged at $0.15 / GB-month used 4/17/201516