Designing Fault-Tolerant Applications

Similar documents
Fault-Tolerant Computer System Design ECE 695/CS 590. Putting it All Together

Principal Solutions Architect. Architecting in the Cloud

BERLIN. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

Design Patterns for the Cloud. MCSN - N. Tonellotto - Distributed Enabling Platforms 68

Training on Amazon AWS Cloud Computing. Course Content

Enroll Now to Take online Course Contact: Demo video By Chandra sir

LINUX, WINDOWS(MCSE),

Introduction to Cloud Computing

AWS Solution Architecture Patterns

ARCHITECTING WEB APPLICATIONS FOR THE CLOUD: DESIGN PRINCIPLES AND PRACTICAL GUIDANCE FOR AWS

DISTRIBUTED SYSTEMS [COMP9243] Lecture 8a: Cloud Computing WHAT IS CLOUD COMPUTING? 2. Slide 3. Slide 1. Why is it called Cloud?

Amazon Web Services (AWS) Solutions Architect Intermediate Level Course Content

Scaling on AWS. From 1 to 10 Million Users. Matthias Jung, Solutions Architect

AWS Well Architected Framework

Introduction to Amazon Web Services. Jeff Barr Senior AWS /

Amazon Web Services Training. Training Topics:

AWS_SOA-C00 Exam. Volume: 758 Questions

Amazon Web Services (AWS) Training Course Content

CS15-319: Cloud Computing. Lecture 3 Course Project and Amazon AWS Majd Sakr and Mohammad Hammoud

How to host and manage enterprise customers on AWS: TOYOTA, Nippon Television, UNIQLO use cases

PrepAwayExam. High-efficient Exam Materials are the best high pass-rate Exam Dumps

Building Web-Scale Applications with AWS

AWS Solution Architect Associate

Cloud Computing /AWS Course Content

Amazon Web Services and Feb 28 outage. Overview presented by Divya

HPE Digital Learner AWS Certified SysOps Administrator (Intermediate) Content Pack

Better, Faster, Stronger web apps with Amazon Web Services. Senior Technology Evangelist, Amazon Web Services

PracticeDump. Free Practice Dumps - Unlimited Free Access of practice exam

How can you implement this through a script that a scheduling daemon runs daily on the application servers?

CIT 668: System Architecture

Introduction to Database Services

CIT 668: System Architecture. Amazon Web Services

Automate best practices and operational health for your AWS resources with Trusted Advisor and AWS Health

Security: Michael South Americas Regional Leader, Public Sector Security & Compliance Business Acceleration

Amazon Web Services. Block 402, 4 th Floor, Saptagiri Towers, Above Pantaloons, Begumpet Main Road, Hyderabad Telangana India

AWS Course Syllabus. Linux Fundamentals. Installation and Initialization:

About Intellipaat. About the Course. Why Take This Course?

Amazon AWS-Solution-Architect-Associate Exam

Aurora, RDS, or On-Prem, Which is right for you

AWS Administration. Suggested Pre-requisites Basic IT Knowledge

Servers fail, who cares? (Answer: I do, sort of) Gregg Ulrich, #netflixcloud #cassandra12

BERLIN. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

#AWSSummit. Démarrer sur AWS. L élasticité et les outils de gestions

How Netflix Leverages Multiple Regions to Increase Availability: Isthmus and Active-Active Case Study

Introduction to cloud computing

Microservices on AWS. Matthias Jung, Solutions Architect AWS

High School Technology Services myhsts.org Certification Courses

Intro to Netflix Chaos Monkey

Amazon Web Services. Amazon Web Services

Microservices Architekturen aufbauen, aber wie?

Advanced Architectures for Oracle Database on Amazon EC2

Migrating Existing Applications to AWS. Matt Tavis Principal Solutions Architect

Security Aspekts on Services for Serverless Architectures. Bertram Dorn EMEA Specialized Solutions Architect Security and Compliance

Srinath Vaddepally.

HOW TO PLAN & EXECUTE A SUCCESSFUL CLOUD MIGRATION

Deploying High Availability and Business Resilient R12 Applications over the Cloud

AWS Solutions Architect Associate (SAA-C01) Sample Exam Questions

AWS: Basic Architecture Session SUNEY SHARMA Solutions Architect: AWS

Security & Compliance in the AWS Cloud. Amazon Web Services

Oracle WebLogic Server 12c on AWS. December 2018

Lassoing the Clouds: Best Practices on AWS. Brian DeShong May 26, 2017

Security & Compliance in the AWS Cloud. Vijay Rangarajan Senior Cloud Architect, ASEAN Amazon Web

Security on AWS(overview) Bertram Dorn EMEA Specialized Solutions Architect Security and Compliance

Cloud and Storage. Transforming IT with AWS and Zadara. Doug Cliche, Storage Solutions Architect June 5, 2018

CIT 668: System Architecture

SAA-C01. AWS Solutions Architect Associate. Exam Summary Syllabus Questions

Amazon Aurora Relational databases reimagined.

Lassoing the Clouds: Best Practices on AWS. Brian DeShong May 26, 2017

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

CLOUD AND AWS TECHNICAL ESSENTIALS PLUS

AWS London Loft: CloudFormation Workshop

WHITEPAPER AMAZON ELB: Your Master Key to a Secure, Cost-Efficient and Scalable Cloud.

Lean & Mean on AWS: Cost-Effective Architectures. Constantin Gonzalez, Solutions Architect, AWS

At Course Completion Prepares you as per certification requirements for AWS Developer Associate.

VMware Cloud on AWS Adoption in the Enterprise

Architekturen für die Cloud

CPET 581 Cloud Computing: Technologies and Enterprise IT Strategies

POSTGRESQL ON AWS: TIPS & TRICKS (AND HORROR STORIES) ALEXANDER KUKUSHKIN. PostgresConf US

Introduction to Amazon Cloud & EC2 Overview

Cloud Computing. Amazon Web Services (AWS)

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo

Expert Reference Series of White Papers. Introduction to Amazon Auto Scaling

ActiveNET. #202, Manjeera Plaza, Opp: Aditya Park Inn, Ameerpetet HYD

Highly Available Database Architectures in AWS. Santa Clara, California April 23th 25th, 2018 Mike Benshoof, Technical Account Manager, Percona

ARCHITECTURAL DESIGN ON AWS: 3 COMMONLY MISSED BEST PRACTICES

Cloud Computing ECPE 276. AWS Hosted Services

We are ready to serve Latest IT Trends, Are you ready to learn? New Batches Info

Cloud Computing Economics

Expected Learning Outcomes Introduction To AWS

AWS Interview Questions and Answers

Amazon Web Services Course Outline

A Comparative Study of Amazon Web Service and Windows Azure

Building a Microservices Platform, Patterns and Best Practices

lab Highly Available and Fault Tolerant Architecture for Web Applications inside a VPC V1.01 AWS Certified Solutions Architect Associate lab title

Cloud Providers more AWS, Aneka

Amazon Aurora Deep Dive

Red Hat Storage Server for AWS

Running MySQL on AWS. Michael Coburn Wednesday, April 15th, 2015

Amazon Web Services. Foundational Services for Research Computing. April Mike Kuentz, WWPS Solutions Architect

Network Security & Access Control in AWS

Transcription:

Designing Fault-Tolerant Applications Miles Ward Enterprise Solutions Architect

Building Fault-Tolerant Applications on AWS White paper published last year Sharing best practices We d like to hear your best practices as well http://media.amazonwebservices.com/aws_building_fault_tolerant_applications.pdf

AWS Fault-Tolerant Building Blocks Two approaches: 1) AWS services that are inherently fault-tolerant and highly available: Amazon Simple Storage Service (S3) Amazon SimpleDB Amazon SQS, SNS, SES, CloudWatch, CloudFront, and more. 2) AWS services that offer tools and features to design faulttolerant and highly available systems: Amazon Elastic Compute Cloud (EC2) Availability Zones, Elastic IPs, EBS, etc. Flexible to trade off budget vs. time to recovery Amazon Relational Database Service (RDS) Multi-AZ Deployments Backup/Restore

Amazon EC2 Architecture Amazon Machine Image (AMI) Region Availability Zone EC2 Instance Ephemeral Storage CloudWatch Auto Scaling Security Group(s) Elastic IP Address Elastic Block Storage EBS Snapshot Amazon S3 EBS Snapshot Load Balancing

EC2 Features AMI Packaged, reusable functionality On-Instance Storage Lifetime tied to instance lifetime AFR like standard hard disk (around 5%) EBS Volumes Lifetime independent of any particular EC2 instance Redundant within an AZ AFR is 0.1% to 0.5% Incorporate volume mappings into your architecture Use EBS snapshot backups

EC2 Features Elastic IP Addresses Map to any EC2 instance within a given Region Detach from failed instance; map to replacement Auto Scaling Two ways to use it: Respond to changing conditions by adding or terminating EC2 instances (attach to CloudWatch metrics) Maintain a fixed number of instances running, replacing them if they fail or become unhealthy Reserved Instances Guarantees capacity for when it s needed

EC2 Features CloudWatch Alarms

EC2 Features Elastic Load Balancing Distributes incoming traffic across multiple instances Sends traffic only to healthy instances

Amazon EC2 Regions and Availability Zones US East (Northern Virginia) EU (Dublin) Availability Zone A Availability Zone B Availability Zone C Availability Zone D Availability Zone A Availability Zone B Amazon EC2 Regions: US East (Northern Virginia) / US West (Northern California) / EU (Ireland) / Asia Pacific (Singapore) / Asia Pacific (Tokyo)

Availability Zone Characteristics and Advice Distinct physical locations Low-latency network connections between AZs Independent power, cooling, network, security Always partition app stacks across 2 or more AZs Elastic Load Balance across instances in multiple AZs

Proper Use of Multiple Availability Zones Centralized Services (S3 Backups, SimpleDB, etc) Availability Zone A Database Server or RDS DB Instance App Server Availability Zone B Database Server or RDS DB Instance App Server Web Server Web Server Requests and Health Checks Elastic Load Balancer Incoming Requests

Region Characteristics and Advice Regions are: Functionally separate Composed of 2 or more AZs Connected via the public internet Use regions to: Have functionality geographically close to customers Comply with national laws and practices Implement a DR strategy

RDS Fault-Tolerant Features Multi-AZ Deployments Synchronous replication across AZs Automatic fail-over to standby replica Automated Backups Enables point-in-time recovery of the DB instance Retention period configurable Snapshots User initiated full backup of DB New DB can be created from snapshots

AWS Architectural Guidance

Design For Failure Basic Principles Avoid single points of failure Assume everything fails, and design backwards Goal: Applications should continue to function even if the underlying physical hardware fails or is removed or replaced. Design your recovery process Trade off business needs vs. cost of high-availability

Design For Failure Use AWS Building Blocks Use Elastic IP addresses for consistent and remappable routes Use multiple Amazon EC2 Availability Zones (AZs) Replicate data across multiple AZs Example: Amazon RDS Multi-AZ mode Use real-time monitoring (Amazon CloudWatch) Use Amazon Elastic Block Store (EBS) for persistent file systems Take EBS Snapshots and use S3 for backups

Build Loosely Coupled Systems Copyright 2011 Amazon Web Services Use independent components Design everything as a Black Box Load-balance and scale clusters Think about graceful degradation Amazon SQS as Buffers Tight Coupling Loose Coupling using Queues Controller A Controller A Controller B Q Q Q Controller B Controller C Controller C

Implement Elasticity Don t assume health or fixed location of components Use designs that are resilient to reboot and re-launch Bootstrap your instances Who am I am and what is my role? Enable dynamic configuration Use configurations in SimpleDB for bootstrapping Use Auto Scaling Use Elastic Load Balancing on each tier

Implementing Elasticity Elastic Load Balancing, CloudWatch, and AutoScaling Elastic Load Balancing Utilization Auto Scaling CloudWatch Metrics

Use a Chaos Monkey Copyright 2011 Amazon Web Services From the Netflix blog: Simple monkey: Kill any instance in the account Complex monkey: Kill instances with specific tags Introduce other faults (e.g. connectivity via Security Group) Human monkey: Kill instances from the AWS Management Console http://techblog.netflix.com/2010/12/5-lessons-weve-learned-using-aws.html

AWS Architecture Center aws.amazon.com/architecture White papers: Cloud architectures Building fault-tolerant applications Web hosting best practices Leveraging different storage options AWS security best practices

Thank You!