CS / Cloud Computing. Recitation 11 November 5 th and Nov 8 th, 2013

Similar documents
CS / Cloud Computing. Recitation 9 October 22 nd and 25 th, 2013

CS / Cloud Computing. Recitation 7 October 7 th and 9 th, 2014

/ Cloud Computing. Recitation 8 October 18, 2016

/ Cloud Computing. Recitation 10 March 22nd, 2016

/ Cloud Computing. Recitation 9 March 17th and 19th, 2015

CS / Cloud Compu1ng. Recita1on 8 March 4 th and 6 th, 2014

CS / Cloud Computing. Recitation 3 September 9 th & 11 th, 2014

/ Cloud Computing. Recitation 6 October 2 nd, 2018

/ Cloud Computing. Recitation 7 October 10, 2017

/ Cloud Computing. Recitation 5 February 14th, 2017

/ Cloud Computing. Recitation 5 September 27 th, 2016

/ Cloud Computing. Recitation 5 September 26 th, 2017

/ Cloud Computing. Recitation 3 Sep 13 & 15, 2016

CS / Cloud Computing. Recitation 8 October 14 th and 16 th, 2014

/ Cloud Computing. Recitation 8 March 1 st, 2016

1

/ Cloud Computing. Recitation 7 February 24th & 26th, 2015

1

/ Cloud Computing. Recitation 9 March 15th, 2016

/ Cloud Computing. Recitation 13 April 12 th 2016

/ Cloud Computing. Recitation 2 January 19 & 21, 2016

/ Cloud Computing. Recitation 13 April 14 th 2015

/ Cloud Computing. Recitation 15 December 6 th 2016

/ Cloud Computing. Recitation 2 September 5 & 7, 2017

Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect

Big Data Infrastructure CS 489/698 Big Data Infrastructure (Winter 2016)

Building High Performance Apps using NoSQL. Swami Sivasubramanian General Manager, AWS NoSQL

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

AWS Solutions Architect Associate (SAA-C01) Sample Exam Questions

Cloud Storage with AWS: EFS vs EBS vs S3 AHMAD KARAWASH

Bigtable: A Distributed Storage System for Structured Data. Andrew Hon, Phyllis Lau, Justin Ng

Office Hours Time: Monday/Wednesday 12:45PM-1:45PM (SB237D) More Information: Office Hours Time: Monday/Thursday 3PM-4PM (SB002)

Presented by Nanditha Thinderu

QLIK INTEGRATION WITH AMAZON REDSHIFT

PrepAwayExam. High-efficient Exam Materials are the best high pass-rate Exam Dumps

Big Data Infrastructure CS 489/698 Big Data Infrastructure (Winter 2017)

Assignment 3 ITCS-6010/8010: Cloud Computing for Data Analysis

ΕΠΛ 602:Foundations of Internet Technologies. Cloud Computing

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

Best Practices and Performance Tuning on Amazon Elastic MapReduce

CSCE 315 Fall Team Project 3

CloudExpo November 2017 Tomer Levi

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

At Course Completion Prepares you as per certification requirements for AWS Developer Associate.

Welcome to the New Era of Cloud Computing

NOSQL DATABASE SYSTEMS: DECISION GUIDANCE AND TRENDS. Big Data Technologies: NoSQL DBMS (Decision Guidance) - SoSe

CSE 444: Database Internals. Lectures 26 NoSQL: Extensible Record Stores

YCSB++ Benchmarking Tool Performance Debugging Advanced Features of Scalable Table Stores

Big Data with Hadoop Ecosystem

Frankfurt 26 & 27 September 2018

References. What is Bigtable? Bigtable Data Model. Outline. Key Features. CSE 444: Database Internals

CIB Session 12th NoSQL Databases Structures

COMP 401 COURSE OVERVIEW

Comparing SQL and NOSQL databases

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

Towards a Real- time Processing Pipeline: Running Apache Flink on AWS

ARCHITECTING WEB APPLICATIONS FOR THE CLOUD: DESIGN PRINCIPLES AND PRACTICAL GUIDANCE FOR AWS

Assignment Tutorial.

Benchmarking Cloud Serving Systems with YCSB 詹剑锋 2012 年 6 月 27 日

YCSB++ benchmarking tool Performance debugging advanced features of scalable table stores

Enroll Now to Take online Course Contact: Demo video By Chandra sir

Next-Generation Cloud Platform

Cassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent

Cassandra - A Decentralized Structured Storage System. Avinash Lakshman and Prashant Malik Facebook

2013 AWS Worldwide Public Sector Summit Washington, D.C.

CS 655 Advanced Topics in Distributed Systems

San Jose State University College of Science Department of Computer Science CS185C, NoSQL Database Systems, Section 1, Spring 2018

Typical size of data you deal with on a daily basis

CSC 261/461 Database Systems. Fall 2017 MW 12:30 pm 1:45 pm CSB 601

Extreme Computing. NoSQL.

Rails on HBase. Zachary Pinter and Tony Hillerson RailsConf 2011

CS 245: Principles of Data-Intensive Systems. Instructor: Matei Zaharia cs245.stanford.edu

BigTable. CSE-291 (Cloud Computing) Fall 2016

Introduction to BigData, Hadoop:-

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

Netflix OSS Spinnaker on the AWS Cloud

CS 6343: CLOUD COMPUTING Term Project

Amazon Web Services. Block 402, 4 th Floor, Saptagiri Towers, Above Pantaloons, Begumpet Main Road, Hyderabad Telangana India

How can you implement this through a script that a scheduling daemon runs daily on the application servers?

CSE-E5430 Scalable Cloud Computing Lecture 9

Experiment-Driven Evaluation of Cloud-based Distributed Systems

CA485 Ray Walshe NoSQL

Distributed Systems [Fall 2012]

A Glimpse of the Hadoop Echosystem

8/3/17. Encryption and Decryption centralized Single point of contact First line of defense. Bishop

Welcome to CS 241 Systems Programming at Illinois

San Jose State University College of Science Department of Computer Science CS185C, Introduction to NoSQL databases, Spring 2017

When does RDBMS representation make sense When do other representations make sense. Prerequisites: CS 450/550 Database Concepts

MapReduce and Friends

CSE 544 Principles of Database Management Systems

Cassandra Design Patterns

Amazon ElastiCache 8/1/17. Why Amazon ElastiCache is important? Introduction:

CS / Cloud Computing

Scaling Up HBase. Duen Horng (Polo) Chau Assistant Professor Associate Director, MS Analytics Georgia Tech. CSE6242 / CX4242: Data & Visual Analytics

Time Series Storage with Apache Kudu (incubating)

CS 398 ACC Streaming. Prof. Robert J. Brunner. Ben Congdon Tyler Kim

How To Rock with MyRocks. Vadim Tkachenko CTO, Percona Webinar, Jan

Getting to know. by Michelle Darling August 2013

/ Cloud Computing. Recitation 13 April 17th 2018

Data Informatics. Seon Ho Kim, Ph.D.

Transcription:

CS15-319 / 15-619 Cloud Computing Recitation 11 November 5 th and Nov 8 th, 2013

Announcements Encounter a general bug: Post on Piazza Encounter a grading bug: Post Privately on Piazza Don t ask if my answer is correct Don t post code on Piazza Search before posting Post feedback on OLI

Piazza Questions IAM Role tagreader You only need to assign this IAM role to instances running some specific AMIs (such as the one in the 619 extra project) For those AMIs, we will specify it in the write up For the others, you only need set the tags Details in @738 on Piazza Website problem Open default port 80 Permission to access DynamoDB service

Project 3, Module 4 Reflections DynamoDB vs. MySQL (Checkpoint question) Queries on any column other than key will require expensive scan operations in DynamoDB. DynamoDB can return only 1MB max for a query, which is a limitation for certain queries that could potentially return many rows. Learn to use DynamoDB's API Learn how DynamoDB can be configured for a web server Be cost conscious about using DynamoDB Set up the alarm for AWS costs

Module to Read UNIT 4: Cloud Storage Module 12: Cloud Storage Module 13: Case Studies: Distributed File Systems Module 14: Case Studies: NoSQL Databases Module 15: Case Studies: Cloud Object Storage Quiz 4: Cloud Storage UNIT 5: Distributed Programming and Analytics Engines for the Cloud Module 16: Introduction to Distributed Programming for the Cloud

Project 3 Files vs. Databases File vs. Database Vertical Scaling in Databases Vertical Scaling Horizontal Scaling in Databases Horizontal Scaling Working with NoSQL: DynamoDB / Hbase Amazon DynamoDB DynamoDB vs. HBase

HBase HBase NoSQL Database, based on Google s BigTable Unit 4 Module 14 Data Model (row:string, column:string, time:int64) -> string Rows are ordered and accessed by row key Columns are grouped into sets called column families, which form the basic unit for access control

HBase example Column family defined upon table creation Data can be very sparse

DynamoDB vs. HBase Data Model Key-value vs. Column oriented Key-value Non Hadoop vs. Hadoop Proprietary vs. Open source Cost DynomoDB: Provisioned Throughput Capacity Hbase: Instance + EMR Limitations: DynamoDB: Item size: 64 KB Query result: 1 MB See here for more

YCSB Yahoo! Cloud Serving Benchmark A standard benchmark for evaluating the performance of different key-value and cloud serving stores In this project, we will use a workload which consists of 50% reads and writes to a database More details How to interpret YCSB summary result

Project 3 Module 5 Your Task Run YCSB Benchmark against DynamoDB and Hbase Compare performance and cost Start early (Many experiments required) Terminate DynamoDB and HBase Cluster when done or face a grade penalty.

Project 3: Upcoming Deadlines Project 3 Working with NoSQL: DynamoDB / HBase DynamoDB vs. HBase Checkpoint Available now Due 11/10/13 11:59 PM Unit 4: UNIT 4: Cloud Storage Module 12: Cloud Storage Module 13: Case Studies: Distributed File Systems Module 14: Case Studies: NoSQL Databases Module 15: Case Studies: Cloud Object Storage Quiz 4: Cloud Storage Checkpoint Available now Due 11/7/13 11:59 PM

Upcoming Deadlines Quiz 4 Due THIS WEEK! Thursday 11/07/2013 11:59PM Pittsburgh Late submissions are NOT accepted Timed 180 minutes once started Remember to click SUBMIT before the deadline

Demo Outline 1. DynamoDB (no demo) Remember to delete the table after finishing the experiments 2. Hbase Provision EMR cluster with Hbase Modify Security Group Connect to EMR cluster nodes Create table Run YCSB from launchpad (no demo)

Recommendations Not a difficult module, however, Time consuming Can be very expensive and jeopardize your grade if you are not being careful You need to do some reading and thinking to answer the checkpoint quiz questions