Case Study: Amazon AWS. CSE Cloud Compu0ng Prof. Douglas Thain University of Notre Dame

Similar documents
Case Study: Amazon AWS. CSE Cloud Computing Prof. Douglas Thain University of Notre Dame

Today s Objec4ves. Data Center. Virtualiza4on Cloud Compu4ng Amazon Web Services. What did you think? 10/23/17. Oct 23, 2017 Sprenkle - CSCI325

AWS: Basic Architecture Session SUNEY SHARMA Solutions Architect: AWS

CIT 668: System Architecture. Amazon Web Services

TPP On The Cloud. Joe Slagel

Enroll Now to Take online Course Contact: Demo video By Chandra sir

Introduction to Amazon Web Services

Amazon Web Services (AWS) Training Course Content

Training on Amazon AWS Cloud Computing. Course Content

AWS Storage Gateway. Amazon S3. Amazon EFS. Amazon Glacier. Amazon EBS. Amazon EC2 Instance. storage. File Block Object. Hybrid integrated.

How Cloud is working as a disruptor to shake up middleware design EVOLVE OR DIE! Billy Newport

AWS Storage Gateway. Not your father s hybrid storage. University of Arizona IT Summit October 23, Jay Vagalatos, AWS Solutions Architect

Amazon Web Services (AWS) Solutions Architect Intermediate Level Course Content

Amazon Web Services Training. Training Topics:

At Course Completion Prepares you as per certification requirements for AWS Developer Associate.

Advanced Architectures for Oracle Database on Amazon EC2

BERLIN. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

Basics of Cloud Computing Lecture 2. Cloud Providers. Satish Srirama

AWS Solutions Architect Associate (SAA-C01) Sample Exam Questions

Elastic Compute Service. Quick Start for Windows

Basics of Cloud Computing Lecture 2. Cloud Providers. Satish Srirama

Introduction to Cloud Computing

Big Data Infrastructure & Technologies Big Data Intro & Cloud Computing.

DISTRIBUTED SYSTEMS [COMP9243] Lecture 8a: Cloud Computing WHAT IS CLOUD COMPUTING? 2. Slide 3. Slide 1. Why is it called Cloud?

SAA-C01. AWS Solutions Architect Associate. Exam Summary Syllabus Questions

EE 660: Computer Architecture Cloud Architecture: IaaS

Amazon Web Services. Block 402, 4 th Floor, Saptagiri Towers, Above Pantaloons, Begumpet Main Road, Hyderabad Telangana India

We are ready to serve Latest IT Trends, Are you ready to learn? New Batches Info

Introducing Amazon Elastic File System (EFS)

Exam Questions AWS-Certified- Developer-Associate

Pass4test Certification IT garanti, The Easy Way!

TestkingPass. Reliable test dumps & stable pass king & valid test questions

How can you implement this through a script that a scheduling daemon runs daily on the application servers?

How to go serverless with AWS Lambda

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

PrepAwayExam. High-efficient Exam Materials are the best high pass-rate Exam Dumps

ElasterStack 3.2 User Administration Guide - Advanced Zone

Introduction to Amazon Web Services. Jeff Barr Senior AWS /

Next Generation Storage for The Software-Defned World

CIT 668: System Architecture

MySQL In the Cloud. Migration, Best Practices, High Availability, Scaling. Peter Zaitsev CEO Los Angeles MySQL Meetup June 12 th, 2017.

LINUX, WINDOWS(MCSE),

OnCommand Cloud Manager 3.2 Deploying and Managing ONTAP Cloud Systems

Opera&ng Systems: Principles and Prac&ce. Tom Anderson

Zadara Enterprise Storage in

Xen and the Art of Virtualiza2on

CIT 668: System Architecture

Cloud Computing 1. CSCI 4850/5850 High-Performance Computing Spring 2018

FutureGrid 101. Part 2: Ge*ng Started Craig Stewart

AWS Solution Architect Associate

Vagrant and Ansible. Two so2ware tools to create and manage your custom VMs

POSTGRESQL ON AWS: TIPS & TRICKS (AND HORROR STORIES) ALEXANDER KUKUSHKIN. PostgresConf US

Background. IBM sold expensive mainframes to large organiza<ons. Monitor sits between one or more OSes and HW

Big Data Infrastructures & Technology

Cloud Computing and Hadoop Distributed File System. UCSB CS170, Spring 2018

THE DEFINITIVE GUIDE FOR AWS CLOUD EC2 FAMILIES

Big Data Infrastructures & Technology. Big Data Intro & Cloud Computing

Information Systems for Big Data. Big Data Intro & Cloud Computing

CLOUD COMPUTING. Rajesh Kumar. DevOps Architect.

GETTING STARTED WITH NUODB

CS15-319: Cloud Computing. Lecture 3 Course Project and Amazon AWS Majd Sakr and Mohammad Hammoud

Background. IBM sold expensive mainframes to large organiza<ons. Monitor sits between one or more OSes and HW

Backtesting in the Cloud

AWS Course Syllabus. Linux Fundamentals. Installation and Initialization:

Virtualization Overview. Joel Jaeggli AFNOG SS-E 2013

Amazon AWS-Solution-Architect-Associate Exam

CSE 344 Introduc/on to Data Management. Sec/on 9: AWS, Hadoop, Pig La/n Yuyin Sun

Amazon Elastic Compute Cloud

Project Blackbird. U"lizing Condor and HTC to address archiving online courses at Clemson on a weekly basis. Sam Hoover

Building a Big IaaS Cloud. David /

It is recommended to complete the tutorial using a web browser from the same operating system as your Putty or SSH client (e.g. Ubuntu terminal).

Cloud Computing /AWS Course Content

Deep Dive on Amazon Elastic File System

MapReduce. Cloud Computing COMP / ECPE 293A

Installation of Informatica Services on Amazon EC2

Scaling Massive Content Stores in the Cloud. CloudExpo New York June Alfresco Founder & CTO

HPE Digital Learner AWS Certified SysOps Administrator (Intermediate) Content Pack

BERLIN. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

Oracle IaaS, a modern felhő infrastruktúra

Elas%c Load Balancing, Amazon CloudWatch, and Auto Scaling Sco) Linder

About Intellipaat. About the Course. Why Take This Course?

CONTAINERIZING JOBS ON THE ACCRE CLUSTER WITH SINGULARITY

Oracle VM Workshop Applica>on Driven Virtualiza>on

EXAM - AWS-Solution-Architect- Associate. AWS Certified Solutions Architect - Associate. Buy Full Product

AWS Storage Optimization. AWS Whitepaper

USING NGC WITH GOOGLE CLOUD PLATFORM

Amazon Glacier. Developer Guide API Version

Introduction to cloud computing

Deploying High Availability and Business Resilient R12 Applications over the Cloud

High School Technology Services myhsts.org Certification Courses

The Packer Book. James Turnbull. April 20, Version: v1.1.2 (067741e) Website: The Packer Book

Testbed-12 TEAM Engine Virtualization User Guide

Testing in AWS. Let s go back to the lambda function(sample-hello) you made before. - AWS Lambda - Select Simple-Hello

VMware vcloud Air User's Guide

Non-disruptive, two node high-availability (HA) support keeps you operating against unplanned storage failures in the cloud

2013 AWS Worldwide Public Sector Summit Washington, D.C.

Virtualization. Introduction. Why we interested? 11/28/15. Virtualiza5on provide an abstract environment to run applica5ons.

Western Michigan University

AUTOMATING IBM SPECTRUM SCALE CLUSTER BUILDS IN AWS PROOF OF CONCEPT

8/3/17. Encryption and Decryption centralized Single point of contact First line of defense. Bishop

Transcription:

Case Study: Amazon AWS CSE 40822 Cloud Compu0ng Prof. Douglas Thain University of Notre Dame

Cau3on to the Reader: Herein are examples of prices consulted in spring 2016, to give a sense of the magnitude of costs. Do your own research before spending your own money!

Several Historical Trends Shared U0lity Compu0ng 1960s MULTICS Concept of a Shared Compu0ng U0lity 1970s IBM Mainframes rent by the CPU-hour. (Fast/slow switch.) Data Center Co-loca0on 1990s-2000s Rent machines for months/years, keep them close to the network access point and pay a flat rate. Avoid running your own building with u0li0es! Pay as You Go Early 2000s - Submit jobs to a remote service provider where they run on the raw hardware. Sun Cloud ($1/CPU-hour, Solaris +SGE) IBM Deep Capacity Compu0ng on Demand (50 cents/hour) Virtualiza0on 1960s OS-VM, VM-360 Used to split mainframes into logical par00ons. 1998 VMWare First prac0cal implementa0on on X86, but at significant performance hit. 2003 Xen paravirtualiza0on provides much perf, but kernel must assist. Late 2000s Intel and AMD add hardware support for virtualiza0on.

Virtual-* Allows for the Scale of Abstrac3on to Increase Over Time Run one process within certain resource limits. Op Sys has virtual memory, virtual CPU, and virtual storage (file system). Run mul0ple processes within certain resource limits. Resource containers (Solaris), virtual servers (Linux), virtual images (Docker) Run an en0re opera0ng system within certain limits. Virtual machine technology: VMWare, Xen, KVM, etc. Run a set of virtual machines connected via a private network. Virtual networks (SDNs) provision bandwidth between virtual machines. Run a private virtual architecture for every customer. Automated tools replicate virtual infrastructure as needed.

Amazon AWS Grew out of Amazon s need to rapidly provision and configure machines of standard configura0ons for its own business. Early 2000s Both private and shared data centers began using virtualiza0on to perform server consolida0on 2003 Internal memo by Chris Pinkham describing an infrastructure service for the world. 2006 S3 first deployed in the spring, EC2 in the fall 2008 Elas0c Block Store available. 2009 Rela0onal Database Service 2012 DynamoDB Does it turn a profit?

Terminology Instance = One running virtual machine. Instance Type = hardware configura0on: cores, memory, disk. Instance Store Volume = Temporary disk associated with instance. Image (AMI) = Stored bits which can be turned into instances. Key Pair = Creden0als used to access VM from command line. Region = Geographic loca0on, price, laws, network locality. Availability Zone = Subdivision of region the is fault-independent.

EC2 Pricing Model Free Usage Tier On-Demand Instances Start and stop instances whenever you like, costs are rounded up to the nearest hour. (Worst price) Reserved Instances Pay up front for one/three years in advance. (Best price) Unused instances can be sold on a secondary market. Spot Instances Specify the price you are willing to pay, and instances get started and stopped without any warning as the marked changes. (Kind of like Condor!) hnp://aws.amazon.com/ec2/pricing/

Free Usage Tier 750 hours of EC2 running Linux, RHEL, or SLES t2.micro instance usage 750 hours of EC2 running Microsop Windows Server t2.micro instance usage 750 hours of Elas0c Load Balancing plus 15 GB data processing 30 GB of Amazon Elas0c Block Storage in any combina0on of General Purpose (SSD) or Magne0c, plus 2 million I/Os (with Magne0c) and 1 GB of snapshot storage 15 GB of bandwidth out aggregated across all AWS services 1 GB of Regional Data Transfer

Reserved Instance Example

Surprisingly, you can t scale up that large.

Simple Storage Service (S3) A bucket is a container for objects and describes loca0on, logging, accoun0ng, and access control. A bucket can hold any number of objects, which are files of up to 5TB. A bucket has a name that must be globally unique. Fundamental opera0ons corresponding to HTTP ac0ons: hnp://bucket.s3.amazonaws.com/object POST a new object or update an exis0ng object. GET an exis0ng object from a bucket. DELETE an object from the bucket LIST keys present in a bucket, with a filter. A bucket has a flat directory structure (despite the appearance given by the interac0ve web interface.)

Easily Integrated into Web Applica3ons <form action="http://examplebucket.s3.amazonaws.com/" method="post" enctype="multipart/form-data"> <input type="input" name="key" value="user/user1/" /> <input type="hidden" name="acl" value="public-read" /> <input type="hidden" name="success_action_redirect" value="http://examplebucket.s3.amazonaws.com/successful_upload.html" />... <input type="text" name="x-amz-credential value="akiaiosfodnn7example/20130806/us-east-1/s3/aws4_request" />... <input type="submit" name="submit" value="upload to Amazon S3" /> </form> hnp://docs.aws.amazon.com/amazons3/latest/api/sigv4-post-example.html

Bucket Proper3es Versioning If enabled, POST/DELETE result in the crea0on of new versions without destroying the old. Lifecycle Delete or archive objects in a bucket a certain 0me aper crea0on or last access or number of versions. Access Policy Control when and where objects can be accessed. Access Control Control who may access objects in this bucket. Logging Keep track of how objects are accessed. No0fica0on Be no0fied when failures occur.

S3 Weak Consistency Model Direct quote from the Amazon developer API: Updates to a single key are atomic. Amazon S3 achieves high availability by replica0ng data across mul0ple servers within Amazon's data centers. If a PUT request is successful, your data is safely stored. However, informa0on about the changes must replicate across Amazon S3, which can take some 0me, and so you might observe the following behaviors: A process writes a new object to Amazon S3 and immediately anempts to read it. Un0l the change is fully propagated, Amazon S3 might report "key does not exist." A process writes a new object to Amazon S3 and immediately lists keys within its bucket. Un0l the change is fully propagated, the object might not appear in the list. A process replaces an exis0ng object and immediately anempts to read it. Un0l the change is fully propagated, Amazon S3 might return the prior data. A process deletes an exis0ng object and immediately anempts to read it. Un0l the dele0on is fully propagated, Amazon S3 might return the deleted data.

Always read the fine print.

Elas3c Block Store An EBS volume is a virtual disk of a fixed size with a block read/write interface. It can be mounted as a filesystem on a running EC2 instance where it can be updated incrementally. Unlike an instance store, an EBS volume is persistent. (Compare to an S3 object, which is essen0ally a file that must be accessed in its en0rety.) Fundamental opera0ons: CREATE a new volume (1GB-1TB) COPY a volume from an exis0ng EBS volume or S3 object. MOUNT on one instance at a 0me. SNAPSHOT current state to an S3 object.

EBS is approx. 3x more expensive by volume and 10x more expensive by IOPS than S3.

Use Glacier for Cold Data Glacier is structured like S3: a vault is a container for an arbitrary number of archives. Policies, accoun0ng, and access control are associated with vaults, while an archive is a single object. However: All opera0ons are asynchronous and no0fied via SNS. Vault lis0ngs are updated once per day. Archive downloads may take up to four hours. Only 5% of total data can be accessed in a given month. Pricing: Storage: $0.01 per GB-month Opera0ons: $0.05 per 1000 requests Data Transfer: Like S3, free within AWS. S3 Policies can be set up to automa0cally move data into Glacier.

Durability Amazon claims about S3: Amazon S3 is designed to sustain the concurrent loss of data in two facili0es, e.g. 3+ copies across mul0ple available domains. 99.999999999% durability of objects over a given year. Amazon claims about EBS: Amazon EBS volume data is replicated across mul0ple servers in an Availability Zone to prevent the loss of data from the failure of any single component. Volumes <20GB modified data since last snapshot have an annual failure rate of 0.1% - 0.5%, resul0ng in complete loss of the volume. Commodity hard disks have an AFR of about 4%. Amazon claims about Glacier is the same as S3: Amazon S3 is designed to sustain the concurrent loss of data in two facili0es, e.g. 3+ copies across mul0ple available domains PLUS periodic internal integrity checks. 99.999999999% durability of objects over a given year. Beware of oversimplified arguments about low-probability events!

Architecture Center Ideas for construc0ng large scale infrastructures using AWS: hnp://aws.amazon.com/architecture/

Command Line Setup Go to your profile menu (your name) in the upper right hand corner, select Security Creden0als and Con0nue to Security Creden0als Select Access Keys Select New Access Key and save the generated keys somewhere. Edit ~/.aws/config and set it up like this: [default] output = json region = us-west-2 aws_access_key = XXXXXX aws_secret_access_key = YYYYYYYYYYYY Now test it: aws ec2-describe-instances Note the syntax here is different from how it was given in the web console! AWSAccessKey=XXXXXX AWSSecretAccessKey=YYYYYYYYY

S3 Command Line Examples aws s3 mb s3://bucket... cp localfile s3://bucket/key mv s3://bucket/key s3://bucket/newname ls s3://bucket rm s3://bucket/key rb s3://bucket aws s3 help aws s3 ls help

EC2 Command Line Examples aws ec2 describe-instances run-instances --image-id ami-xxxxx -- count 1 --instance-type t1.micro --key-name keyfile stop-instances --instance-id i-xxxxxx aws ec2 help aws ec2 start-instances help

Warmup: Get Started with Amazon Skim through the AWS documenta0on. Sign up for AWS at hnp://aws.amazon.com (Skip the IAM management for now) Apply the service credit you received by email. Create and download a Key-Pair, save it in your home directory. Create a VM via the AWS Console Connect to your newly-created VM like this: ssh -i my-aws-keypair.pem ec2-user@ip-address-of-vm Create a bucket in S3 and upload/download some files.

Demo Time h_p://aws.amazon.com