CPET 581 Cloud Computing: Technologies and Enterprise IT Strategies

Similar documents
Introduction to Cloud Computing

DISTRIBUTED SYSTEMS [COMP9243] Lecture 8a: Cloud Computing WHAT IS CLOUD COMPUTING? 2. Slide 3. Slide 1. Why is it called Cloud?

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

LINUX, WINDOWS(MCSE),

CIT 668: System Architecture. Amazon Web Services

Training on Amazon AWS Cloud Computing. Course Content

At Course Completion Prepares you as per certification requirements for AWS Developer Associate.

Enroll Now to Take online Course Contact: Demo video By Chandra sir

Amazon Web Services (AWS) Solutions Architect Intermediate Level Course Content

Amazon Web Services. Block 402, 4 th Floor, Saptagiri Towers, Above Pantaloons, Begumpet Main Road, Hyderabad Telangana India

AWS Administration. Suggested Pre-requisites Basic IT Knowledge

Certificate of Registration

Amazon Web Services Training. Training Topics:

Cloud Computing. Amazon Web Services (AWS)

Launch and Scale your Social Game in the Cloud with AltEgo, Amazon Web Services and RightScale

AWS Solution Architect Associate

Cloud Computing 4/17/2016. Outline. Cloud Computing. Centralized versus Distributed Computing Some people argue that Cloud Computing. Cloud Computing.

CLOUD AND AWS TECHNICAL ESSENTIALS PLUS

Amazon Web Services (AWS) Training Course Content

Amazon AWS-Solution-Architect-Associate Exam

Introduction to. Amazon Web Services. Thilina Gunarathne Salsa Group, Indiana University. With contributions from Saliya Ekanayake.

Designing Fault-Tolerant Applications

Fault-Tolerant Computer System Design ECE 695/CS 590. Putting it All Together

About Intellipaat. About the Course. Why Take This Course?

Basics of Cloud Computing Lecture 2. Cloud Providers. Satish Srirama

AWS: Basic Architecture Session SUNEY SHARMA Solutions Architect: AWS

Project Presentation

Basics of Cloud Computing Lecture 2. Cloud Providers. Satish Srirama

Principal Solutions Architect. Architecting in the Cloud

Basics of Cloud Computing Lecture 2. Cloud Providers. Satish Srirama

Amazon Web Services. A Short Guide. SLUUG R. Scott Granneman Jans Carton

AWS Concepts and Lab Intro

CIT 668: System Architecture

ThoughtSpot on AWS Quick Start Guide

Introduction to cloud computing

ARCHITECTING WEB APPLICATIONS FOR THE CLOUD: DESIGN PRINCIPLES AND PRACTICAL GUIDANCE FOR AWS

AWS Solution Architecture Patterns

ActiveNET. #202, Manjeera Plaza, Opp: Aditya Park Inn, Ameerpetet HYD

AWS_SOA-C00 Exam. Volume: 758 Questions

Cloud & AWS Essentials Agenda. Introduction What is the cloud? DevOps approach Basic AWS overview. VPC EC2 and EBS S3 RDS.

Amazon Linux: Operating System of the Cloud

AWS Solution Architect (AWS SA)

Cloud Computing /AWS Course Content

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe

MATE-EC2: A Middleware for Processing Data with Amazon Web Services

AWS Solutions Architect Associate (SAA-C01) Sample Exam Questions

HOW TO PLAN & EXECUTE A SUCCESSFUL CLOUD MIGRATION

CS15-319: Cloud Computing. Lecture 3 Course Project and Amazon AWS Majd Sakr and Mohammad Hammoud

How can you implement this through a script that a scheduling daemon runs daily on the application servers?

Andorid Storage Options

Programming model and implementation for processing and. Programs can be automatically parallelized and executed on a large cluster of machines

AWS Well Architected Framework

BERLIN. 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved

Cloud Computing 1. CSCI 4850/5850 High-Performance Computing Spring 2018

Splunk & AWS. Gain real-time insights from your data at scale. Ray Zhu Product Manager, AWS Elias Haddad Product Manager, Splunk

SAA-C01. AWS Solutions Architect Associate. Exam Summary Syllabus Questions

Cloud Providers more AWS, Aneka

HPE Digital Learner AWS Certified SysOps Administrator (Intermediate) Content Pack

2013 AWS Worldwide Public Sector Summit Washington, D.C.

Startups and Mobile Apps on AWS. Dave Schappell, Startup Business Development Manager, AWS September 11, 2013

FAQs. Business (CIP 2.2) AWS Market Place Troubleshooting and FAQ Guide

Introduction to Amazon Web Services

Cloud Computing Economics

Office Hours Time: Monday/Wednesday 12:45PM-1:45PM (SB237D) More Information: Office Hours Time: Monday/Thursday 3PM-4PM (SB002)

Amazon. Exam Questions AWS-Certified-Solutions-Architect- Professional. AWS-Certified-Solutions-Architect-Professional.

Quick start guide for Infscape UrBackup Appliance on Amazon Web Services

Cloud Computing Economies of Scale

How to host and manage enterprise customers on AWS: TOYOTA, Nippon Television, UNIQLO use cases

High School Technology Services myhsts.org Certification Courses

Backtesting in the Cloud

PrepAwayExam. High-efficient Exam Materials are the best high pass-rate Exam Dumps

CPET 581 Cloud Computing: Technologies and Enterprise IT Strategies. Lecture 1

KillTest *KIJGT 3WCNKV[ $GVVGT 5GTXKEG Q&A NZZV ]]] QORRZKYZ IUS =K ULLKX LXKK [VJGZK YKX\OIK LUX UTK _KGX

CPET 565/CPET 499 Mobile Computing Systems. Lecture 8. Data Dissemination and Management. 2 of 3

Welcome to the New Era of Cloud Computing

AWS 101. Patrick Pierson, IonChannel

ArcGIS 10.3 Server on Amazon Web Services

AWS Course Syllabus. Linux Fundamentals. Installation and Initialization:

Amazon Elastic Compute Cloud (EC2)

The Windows Azure Platform: A Perspective

The Orion Papers. AWS Solutions Architect (Associate) Exam Course Manual. Enter

Migrating Existing Applications to AWS. Matt Tavis Principal Solutions Architect

Large Scale Computing Infrastructures

The Windows Azure Platform: A Perspective

Cloud Infrastructure

Java in der Cloud Aktuelle Möglichkeiten und Entwicklungen

Microservices on AWS. Matthias Jung, Solutions Architect AWS

CLOUD COMPUTING. Rajesh Kumar. DevOps Architect.

Lean & Mean on AWS: Cost-Effective Architectures. Constantin Gonzalez, Solutions Architect, AWS

Immersion Day. Getting Started with AWS Lambda. August Rev

Cloud Computing Introduction & Offerings from IBM

Introduction To Cloud Computing

SAP VORA 1.4 on AWS - MARKETPLACE EDITION FREQUENTLY ASKED QUESTIONS

Security Aspekts on Services for Serverless Architectures. Bertram Dorn EMEA Specialized Solutions Architect Security and Compliance

AWS Interview Questions and Answers

Amazon Web Services. Amazon Web Services

Demystifying the Cloud With a Look at Hybrid Hosting and OpenStack

TestkingPass. Reliable test dumps & stable pass king & valid test questions

Microservices Architekturen aufbauen, aber wie?

AWS Agility + Splunk Visibility = Cloud Success. Splunk App for AWS Demo. Laura Ripans, AWS Alliance Manager

Transcription:

CPET 581 Cloud Computing: Technologies and Enterprise IT Strategies Lecture 8 Cloud Programming & Software Environments: High Performance Computing & AWS Services Part 2 of 2 Spring 2015 A Specialty Course for Purdue University s M.S. in Technology Graduate Program: IT/Advanced Computer App Track Paul I-Hai Lin, Professor Dept. of Computer, Electrical and Information Technology Purdue University Fort Wayne Campus 1 References 1. Chapter 6. Cloud Programming and Software Environments, Book Distributed and Cloud Computing, by Kai Hwang, Geoffrey C. Fox a,d Jack J. Dongarra, published by Mogan Kaufmman/ Elsevier Inc. 2 1

Topics High Performance Computation Parallel Matrix Multiplication Computational Complexity and Analysis Parallel Programming on Amazon Web Service (AWS) Amazon Platforms and Service Offerings AWS Elastic Compute Cloud (EC2) AWS Simple Storage Services (S3) AWS Elastics Bock Store (EBS) AWS Simple DS 3 Parallel Matrix Multiplication Matrix, https://en.wikipedia.org/wiki/matrix_(mathematics) 4 2

Parallel Matrix Multiplication Matrix, https://en.wikipedia.org/wiki/matrix_(mathematics) 5 Parallel Matrix Multiplication 6 3

Computational Complexity Analysis 7 When n is very Large Computational Cost Reading and storing large number of input and output matrix elements demand excessive I/O time and memory space Data reference locality demands many duplications of the row and column vectors to local processors The Map functions in MapReduce model. Dot products can be done on the Reduce Nodes in parallel blocks identified by keys Demand large-scale shuffle and exchange sorting and grouping operations over all intermediate <key, value> pairs, even externally in and out of disks. The task fork out from the master server to all available Map and Reduce servers (workers) may result in scheduling overhead. 8 4

Ideas of Parallel Matrix Multiplication Each time unit counts the time to carry out the dot product of two n-element vectors. (repeated multiply-and-add operations over a row vector of A and a column vector of B). In the sequential execution, it take n 2 time units to generate the n 2 output elements in the product matrix C. Here, the example matrix has an order n = 1,024. If you partition the matrix into 16 equal blocks (64 x 64 each). Then, only 256n output elements are generated in each block. Thus 16 blocks can be handled by 16 VM instances in parallel. In theory, the total execution time should be shortened to 1/16 of the total sequential execution time, if all communication and memory-access overheads are ignored. 9 Ideas of Parallel Matrix Multiplication Input Matrix partitioning by row vectors of matrix A and by column vectors of matrix B or by row vector of the transposed matrix B T Dot Product Parallelization into Blocks affect the Reduce speed and efficiency in the computation section of the entire MapReduce process. Matrix C 10 5

Parallel Matrix Multiplication (cont.) Similarly, if you use 64 VM instances, you should expect a 1/64 execution time. Use up to the maximum number of 128 machine instances, if it is allowed in your assigned Amazon account. In the extreme case of using n 2 instances (1 M or 2 20 instances), you may end up with only one time unit to complete the total execution. That is not allowed in the AWS platform, realistically speaking. 11 Hadoop and Amazon Elastic MapReduce A software platform originally developed by Yahoo to enable user write and run applications over vast distributed data. Attractive Features in Hadoop: Scalable Economical: an open-source MapReduce Efficient Reliable 12 6

AWS Usage Growth Bandwidth consumed by Amazon Web Services Bandwidth consumed by Amazon s global websites 13 14 7

The AWS Platform 15 Major Service Modules for IaaS on the AWS Platform Amazon Application Services Simple DB EC2 SQS S3 Launch Controller Controller Monitor Controller Shutdown Controller Billing Controller Amazon Physical Infrastructure 16 8

Amazon Web Services (AWS) The AWS Infrastructure Platform Monitoring Amazon CloudWatch Management AWS Management Console Tools AWS Toolkit for Eclipse Isolated Networks Amazon Virtual Private Cloud Parallel Processing Amazon Elastic MapReduce Content Delivery Amazon CloudFront Messaging Amazon Simple Queue Service (SQS) Payments Amazon Flexible Payments Service (FPS) On-Demand Workforce Amazon Mechanical Turk Compute Amazon Elastic Compute Cloud (EC2) -Elastic Load Balancing -Auto Scaling Storage Amazon Simple Storage Service (S3) -AWS Import/Export Database Amazon RDS Amazon SimpleDB (Courtesy of AWS, 2012) 1-17 Amazon EC2 Execution Environment Amazon Machine Image Public AMIs Private AMIs Paid AMIs Create an AMI Create Key Pair Launch Configure Firewall Elastic IP address Elastic Block Store Virtualization Layer Compute Storage Server 18 9

Amazon Machine Images (AMI) Image Type Private Public Paid Definition Images created by you, which are private by default. You can grant access to other users to launch your private images. Images created by users and released to the Amazon Web Services community, so anyone can launch instances based on them and use them any way they like. The Amazon Web Services Developer Connection Web site lists all public images. You can create images providing specific functions that can be launched by anyone willing to pay you per each hour of usage on top of Amazon charges. AMI is a packaged server environment in EC2, based on Linux running any user software or application. AMIs are the templates for VM instances. Elastic IP address is specially reserved for EC2. Elastic Block Store offers persistent storage for EC2 instances. 1-19 AWS Virtual Private Cloud (VPC) 20 10

Amazon S3 for Storage Provisioning Object is the basic unit of data U s e r Bucket for storing objects R E S T I n t e r f a c e B u c k e t S O A P I n t e r f a c e O b j e c t K e y V a l u e M e t a - d a t a A c c e s s C o n t r o l Key for data object retrieval Object is attributes to values, metadata, and access control V i r t u a l i z a t i o n L a y e r 21 1-22 11

1-23 Python Code Solution by Risheng Wang, USC, 2011 Input Files for left Matrix A and right Matrix B (Courtesy of R. Wang, USC, 2011) 1-24 12

Input Files for left Matrix A and right Matrix B 1-25 The Output File for Matrix C 1-26 13

The Output File for Matrix C Block number is the key 1-27 NB = No. of blocks in each row (or in each transposed column) 1-28 14

ib = row index of each block jb = column index of the block NB = No. of blocks in each row ib * NB + jb = Block number 1-29 1-30 15

1-31 Multiply-and-Add (dot product) 1-32 16

A submatrix (128x128) for each of 64 = 8x8 blocks, if the block size is 128 elements 1-33 1-34 17

1-35 Results on Computing Time and Communication Time 1-36 18

Some Observations : Block size is very sensitive to the speedup performance and implementation efficiency of the MapReduce process. The optimal choice should match with the cache size of the server nodes used. The speedup is slowed down by many overhead factors, such as data I/O and replication times, intermediate < key, value> matching, storing and retrieval, sorting and grouping, and the parallel task scheduling overheads, etc. The optimal number of server or VM instances is a direct function of the matrix order (n), effective dot product computing using GPU subcluster, and the reduction of all sorts of delays caused by parallelism handling, communication latency, memory and I/O overheads, etc. 1-37 19