Racs-EV-Java. Gary Zibrat (gdz4)

Size: px
Start display at page:

Download "Racs-EV-Java. Gary Zibrat (gdz4)"

Transcription

1 Racs-EV-Java Gary Zibrat (gdz4)

2 Motivation Large organizations want to store data in the cloud (e.g. Library of Congress, Netflix, Reddit) Not only does do users pay per byte of data currently in the cloud, but also per byte of data transferred to and from.

3 Current Prices Storage Transfer out Put Request Get Request Microsoft $0.024 per GB/month $0.080 per GB $ per 1,000 transactions $ per 1,000 transactions Amazon $ per GB/month $.080 per GB $0.005 per 1,000 requests $ per 1,000 requests Google $0.026 per GB/month (flat rate) $0.080 per GB $0.01 per 1,000 requests $0.001 per 1,000 requests

4 Current Prices Example (500TB) Storage Transfer out (all data) Put Request Get Request Microsoft Amazon Google $ per year $ per year $ per year $ ~0 ~0 $ ~0 ~0 $ ~0 ~0 Moving from Amazon to Microsoft would cost roughly 2.75 years worth of storage! Large customers can t leave due to slight price increases.

5 riginal Racs EBK 1 R A EBK 2 C S

6 riginal Racs R A C E B K 1 E B K 1 E B K 2 E B K 2 S E B K 1 E B K 2

7 riginal Racs Since files are split up, cloud computation requires reassembling the files. nly part of the file may be in the same provider as where a user wishes to do cloud computation. R A C S E B K 1 E B K 1 E B K 1 E B K 2 E B K 2 E B K 2 Cloud Compute on EBK 1 Cloud Compute on EBK 2

8 Racs-EV EBK 1 R A EBK 2 EBK 3 C S

9 Racs-EV R EBK 1 A C EBK 2 S EBK 3

10 Racs-EV Now the files are still distributed evenly, but don t need to be reassembled for cloud computation which saves on transfer fees and transfer time. R A C S EBK 1 EBK 2 EBK 3 Cloud Compute on EBK 1 Cloud Compute on EBK 2 Cloud Compute on EBK 3

11 Racs-EV R EBK 1 A C EBK 2 S Parity of 1 and 2

12 Current Prices Example (625TB) Storage Transfer out (all data) Storage(625)- Storage(500) Microsoft Amazon Google $ per year $ per year $ per year $ $36864 per year $ $44544 per year $ $39936 per year Costs are for with 5 providers and the parity file turned on. Transfer out doesn t include extra 125 TB since parities aren t transfer.

13 verview - Multiple Proxies Clients C1 Repositories R1 C2 ZooKeeper R2 C3 R3 ZooKeeper is a distributed coordination system. For, it is used to get locks and reliably store meta-data.

14 -EV API PUT (Bucket, Key, Data) GET (Bucket, Key) PUTAT (Bucket, Key, Data, Repo) LCATE(Bucket, Key) DELETE (Bucket, Key) PUTS(Buckets...,Keys,Datas ) GETS(Buckets,Keys )

15 Simplified Put Clients C1 Repositories R1 C2 ZooKeeper R2 C3 R3

16 Simplified Put Clients F4 F3 P2 Repositories C1 R1 C2 ZooKeeper R2 F1 F2 P1 C3 R3 Files are grouped by size (in object groups) to reduce overhead of parity object. Group size is the same as the number of repositories.

17 Simplified Put Clients Repositories C1 R1 F4 F2 C2 ZooKeeper R2 F3 P1 C3 R3 P2 F1 Each repository will contain only one object from each group to allow for fault tolerance.

18 Problems with (-EV) String Xor for 16 MB string

19 Problems with (-EV) String Xor for 16 MB string (x) denotes number of threads

20 String Xor for 16 MB and 128 MB strings on 4 core 2 threads machine. Regular Python crashed with memory problems (Numpy did too on 32 cores)

21 Challenges with -EV Eight different objects to modify on put Data on cloud bjectgroup Parity File bjectgroup freelist (keeps track of groups with space) Key to object group mapping Previous key objectgroup (remove key from group) Previous key data on cloud (remove it) Previous parity file

22 Solution Eight different objects to modify on put Data on cloud bjectgroup Parity File bjectgroup freelist (keeps track of groups with space) Key to object group mapping Previous key objectgroup (remove key from group) Previous key data on cloud (remove it) Previous parity file

23 Solution (or not) Locking alone doesn t solve the problems Could lose connection at any point The lock could be lost at any point. if(lock.isacquired()) then modifydata() isn t atomic Similar problems to updating hard drive Things have to be done in a particular order

24 Solutions with -EV 1. Turns out this order is pretty good a. Data on cloud b. bjectgroup c. Parity File d. bjectgroup Freelist (keeps track of groups with space) e. Key to object group mapping f. Previous key objectgroup (remove key from group) g. Previous key data on cloud (remove it) h. Previous parity file

25 Solutions with -EV 1. Turns out this order is pretty good Register Intent b. Data on cloud c. bjectgroup d. Parity File e. bjectgroup Freelist (keeps track of groups with space) f. Key to object group mapping (Deregister Intent, Register Intent to delete, and part f) atomically g. Previous key objectgroup (remove key from group) h. Previous key data on cloud (remove it) i. Previous parity file

26 if(lock.isacquired()) then... This isn t atomic. Requires versioning Lock Download Note version Modify Upload with version Version same: Success Version different: Failure Unlock

27 The Data 3 Repos 3 EC2 Instances 2 cores 2 threads/core 1.7 GHz 8 GB ram Clients: 9 Clients 20 Files of each size per client Clients send then wait for response

28 The Data Part 2 3 Repos 3 EC2 Instances 2 cores 2 threads/core 1.7 GHz 8 GB ram Clients: 18 Clients 20 Files of each size per client Clients send then wait for response

29 Future Plans Always room for optimization to faster Cloud computation

30 Future Plans - Cloud Computation Send same computation request to all proxies R1 C1 R2 R3 Zookeeper not shown; Connections from each R to each not shown

31 Future Plans - Cloud Computation will see which requests belong to its repo R1 C1 R2 R3 Zookeeper not shown; Connections from each R to each not shown

32 Future Plans - Cloud Computation will see which requests belong to its repo R1 C1 R2 R3 Zookeeper not shown; Connections from each R to each not shown

33 Future Plans - Cloud Computation Client will receive back the results R1 C1 R2 R3 Zookeeper not shown; Connections from each R to each not shown

34 Demo/Questions

RACS: Extended Version in Java Gary Zibrat gdz4

RACS: Extended Version in Java Gary Zibrat gdz4 RACS: Extended Version in Java Gary Zibrat gdz4 Abstract Cloud storage is becoming increasingly popular and cheap. It is convenient for companies to simply store their data online so that they don t have

More information

Cloud Computing and Hadoop Distributed File System. UCSB CS170, Spring 2018

Cloud Computing and Hadoop Distributed File System. UCSB CS170, Spring 2018 Cloud Computing and Hadoop Distributed File System UCSB CS70, Spring 08 Cluster Computing Motivations Large-scale data processing on clusters Scan 000 TB on node @ 00 MB/s = days Scan on 000-node cluster

More information

FAWN as a Service. 1 Introduction. Jintian Liang CS244B December 13, 2017

FAWN as a Service. 1 Introduction. Jintian Liang CS244B December 13, 2017 Liang 1 Jintian Liang CS244B December 13, 2017 1 Introduction FAWN as a Service FAWN, an acronym for Fast Array of Wimpy Nodes, is a distributed cluster of inexpensive nodes designed to give users a view

More information

Programming model and implementation for processing and. Programs can be automatically parallelized and executed on a large cluster of machines

Programming model and implementation for processing and. Programs can be automatically parallelized and executed on a large cluster of machines A programming model in Cloud: MapReduce Programming model and implementation for processing and generating large data sets Users specify a map function to generate a set of intermediate key/value pairs

More information

Zombie Apocalypse Workshop

Zombie Apocalypse Workshop Zombie Apocalypse Workshop Building Serverless Microservices Danilo Poccia @danilop Paolo Latella @LatellaPaolo September 22 nd, 2016 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

More information

The New Economics of Cloud Storage

The New Economics of Cloud Storage The New Economics of Cloud Storage How Wasabi Hot Cloud Storage Compares With the Economics of Amazon Web Services, Google Cloud and Microsoft Azure Executive Overview Wasabi is fundamentally transforming

More information

Welcome to the New Era of Cloud Computing

Welcome to the New Era of Cloud Computing Welcome to the New Era of Cloud Computing Aaron Kimball The web is replacing the desktop 1 SDKs & toolkits are there What about the backend? Image: Wikipedia user Calyponte 2 Two key concepts Processing

More information

Getting Started with Amazon EC2 and Amazon SQS

Getting Started with Amazon EC2 and Amazon SQS Getting Started with Amazon EC2 and Amazon SQS Building Scalable, Reliable Amazon EC2 Applications with Amazon SQS Overview Amazon Elastic Compute Cloud (EC2) is a web service that provides resizable compute

More information

MATE-EC2: A Middleware for Processing Data with Amazon Web Services

MATE-EC2: A Middleware for Processing Data with Amazon Web Services MATE-EC2: A Middleware for Processing Data with Amazon Web Services Tekin Bicer David Chiu* and Gagan Agrawal Department of Compute Science and Engineering Ohio State University * School of Engineering

More information

How to go serverless with AWS Lambda

How to go serverless with AWS Lambda How to go serverless with AWS Lambda Roman Plessl, nine (AWS Partner) Zürich, AWSomeDay 12. September 2018 About myself and nine Roman Plessl Working for nine as a Solution Architect, Consultant and Leader.

More information

Cloud Computing. Technologies and Types

Cloud Computing. Technologies and Types Cloud Computing Cloud Computing Technologies and Types Dell Zhang Birkbeck, University of London 2017/18 The Technological Underpinnings of Cloud Computing Data centres Virtualisation RESTful APIs Cloud

More information

Serverless Architecture Hochskalierbare Anwendungen ohne Server. Sascha Möllering, Solutions Architect

Serverless Architecture Hochskalierbare Anwendungen ohne Server. Sascha Möllering, Solutions Architect Serverless Architecture Hochskalierbare Anwendungen ohne Server Sascha Möllering, Solutions Architect Agenda Serverless Architecture AWS Lambda Amazon API Gateway Amazon DynamoDB Amazon S3 Serverless Framework

More information

Integrated hardware-software solution developed on ARM architecture. CS3 Conference Krakow, January 30th 2018

Integrated hardware-software solution developed on ARM architecture. CS3 Conference Krakow, January 30th 2018 Integrated hardware-software solution developed on ARM architecture CS3 Conference Krakow, January 30th 2018 Why Object Storage Data doubles every 2 year...growing at a faster pace and is mainly unstructured

More information

TrafficDB: HERE s High Performance Shared-Memory Data Store Ricardo Fernandes, Piotr Zaczkowski, Bernd Göttler, Conor Ettinoffe, and Anis Moussa

TrafficDB: HERE s High Performance Shared-Memory Data Store Ricardo Fernandes, Piotr Zaczkowski, Bernd Göttler, Conor Ettinoffe, and Anis Moussa TrafficDB: HERE s High Performance Shared-Memory Data Store Ricardo Fernandes, Piotr Zaczkowski, Bernd Göttler, Conor Ettinoffe, and Anis Moussa EPL646: Advanced Topics in Databases Christos Hadjistyllis

More information

Introduction to Amazon Web Services

Introduction to Amazon Web Services Introduction to Amazon Web Services Introduction Amazon Web Services (AWS) is a collection of remote infrastructure services mainly in the Infrastructure as a Service (IaaS) category, with some services

More information

MongoDB - a No SQL Database What you need to know as an Oracle DBA

MongoDB - a No SQL Database What you need to know as an Oracle DBA MongoDB - a No SQL Database What you need to know as an Oracle DBA David Burnham Aims of this Presentation To introduce NoSQL database technology specifically using MongoDB as an example To enable the

More information

David Mandelberg Thanks to Steve Kent, Andrew Chi (BBN), and Kotikalapudi Sriram (NIST) for valuable input.

David Mandelberg Thanks to Steve Kent, Andrew Chi (BBN), and Kotikalapudi Sriram (NIST) for valuable input. RPKI Rsync Performance Test Update David Mandelberg Thanks to Steve Kent, Andrew Chi (BBN), and Kotikalapudi Sriram (NIST) for valuable input. 1 Background Resource Public Key Infrastructure

More information

Voldemort. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation

Voldemort. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation Voldemort Smruti R. Sarangi Department of Computer Science Indian Institute of Technology New Delhi, India Smruti R. Sarangi Leader Election 1/29 Outline 1 2 3 Smruti R. Sarangi Leader Election 2/29 Data

More information

Bigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI Presented by Xiang Gao

Bigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI Presented by Xiang Gao Bigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI 2006 Presented by Xiang Gao 2014-11-05 Outline Motivation Data Model APIs Building Blocks Implementation Refinement

More information

Google Cloud Platform for Systems Operations Professionals (CPO200) Course Agenda

Google Cloud Platform for Systems Operations Professionals (CPO200) Course Agenda Google Cloud Platform for Systems Operations Professionals (CPO200) Course Agenda Module 1: Google Cloud Platform Projects Identify project resources and quotas Explain the purpose of Google Cloud Resource

More information

Tasktop Sync - Installation Primer. Tasktop Sync - Installation Primer

Tasktop Sync - Installation Primer. Tasktop Sync - Installation Primer Tasktop Sync - Installation Primer 1 Contents Overview... 3 Hardware Requirements... 3 Supported Operating Systems... 3 Java Runtime Environment... 3 Networking... 3 Hardware Sizing for Deployment Scenarios...

More information

Flat Datacenter Storage. Edmund B. Nightingale, Jeremy Elson, et al. 6.S897

Flat Datacenter Storage. Edmund B. Nightingale, Jeremy Elson, et al. 6.S897 Flat Datacenter Storage Edmund B. Nightingale, Jeremy Elson, et al. 6.S897 Motivation Imagine a world with flat data storage Simple, Centralized, and easy to program Unfortunately, datacenter networks

More information

Cloudian Sizing and Architecture Guidelines

Cloudian Sizing and Architecture Guidelines Cloudian Sizing and Architecture Guidelines The purpose of this document is to detail the key design parameters that should be considered when designing a Cloudian HyperStore architecture. The primary

More information

PREGEL AND GIRAPH. Why Pregel? Processing large graph problems is challenging Options

PREGEL AND GIRAPH. Why Pregel? Processing large graph problems is challenging Options Data Management in the Cloud PREGEL AND GIRAPH Thanks to Kristin Tufte 1 Why Pregel? Processing large graph problems is challenging Options Custom distributed infrastructure Existing distributed computing

More information

Data Clustering on the Parallel Hadoop MapReduce Model. Dimitrios Verraros

Data Clustering on the Parallel Hadoop MapReduce Model. Dimitrios Verraros Data Clustering on the Parallel Hadoop MapReduce Model Dimitrios Verraros Overview The purpose of this thesis is to implement and benchmark the performance of a parallel K- means clustering algorithm on

More information

Kernel Level Speculative DSM

Kernel Level Speculative DSM Motivation Main interest is performance, fault-tolerance, and correctness of distributed systems Present our ideas in the context of a DSM system We are developing tools that Improve performance Address

More information

6. Results. This section describes the performance that was achieved using the RAMA file system.

6. Results. This section describes the performance that was achieved using the RAMA file system. 6. Results This section describes the performance that was achieved using the RAMA file system. The resulting numbers represent actual file data bytes transferred to/from server disks per second, excluding

More information

Tech Brief Wasabi for Consumers & Small Businesses

Tech Brief Wasabi for Consumers & Small Businesses Wasabi for Consumers & Small Businesses Low-cost cloud file storage and data protection Executive Overview Wasabi is fundamentally transforming cloud storage with hot cloud storage the corporate world

More information

COSC 2P91. Bringing it all together... Week 4b. Brock University. Brock University (Week 4b) Bringing it all together... 1 / 22

COSC 2P91. Bringing it all together... Week 4b. Brock University. Brock University (Week 4b) Bringing it all together... 1 / 22 COSC 2P91 Bringing it all together... Week 4b Brock University Brock University (Week 4b) Bringing it all together... 1 / 22 A note on practicality and program design... Writing a single, monolithic source

More information

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe Cloud Programming Programming Environment Oct 29, 2015 Osamu Tatebe Cloud Computing Only required amount of CPU and storage can be used anytime from anywhere via network Availability, throughput, reliability

More information

Data Centers and Cloud Computing

Data Centers and Cloud Computing Data Centers and Cloud Computing CS677 Guest Lecture Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet

More information

Data Centers and Cloud Computing. Slides courtesy of Tim Wood

Data Centers and Cloud Computing. Slides courtesy of Tim Wood Data Centers and Cloud Computing Slides courtesy of Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet

More information

Lecture 18: Reliable Storage

Lecture 18: Reliable Storage CS 422/522 Design & Implementation of Operating Systems Lecture 18: Reliable Storage Zhong Shao Dept. of Computer Science Yale University Acknowledgement: some slides are taken from previous versions of

More information

COSC 6385 Computer Architecture. Storage Systems

COSC 6385 Computer Architecture. Storage Systems COSC 6385 Computer Architecture Storage Systems Spring 2012 I/O problem Current processor performance: e.g. Pentium 4 3 GHz ~ 6GFLOPS Memory Bandwidth: 133 MHz * 4 * 64Bit ~ 4.26 GB/s Current network performance:

More information

Using Cloud Services behind SGI DMF

Using Cloud Services behind SGI DMF Using Cloud Services behind SGI DMF Greg Banks Principal Engineer, Storage SW 2013 SGI Overview Cloud Storage SGI Objectstore Design Features & Non-Features Future Directions Cloud Storage

More information

Department of Computer Science San Marcos, TX Report Number TXSTATE-CS-TR Clustering in the Cloud. Xuan Wang

Department of Computer Science San Marcos, TX Report Number TXSTATE-CS-TR Clustering in the Cloud. Xuan Wang Department of Computer Science San Marcos, TX 78666 Report Number TXSTATE-CS-TR-2010-24 Clustering in the Cloud Xuan Wang 2010-05-05 !"#$%&'()*+()+%,&+!"-#. + /+!"#$%&'()*+0"*-'(%,1$+0.23%(-)+%-+42.--3+52367&.#8&+9'21&:-';

More information

Data Centers and Cloud Computing. Data Centers

Data Centers and Cloud Computing. Data Centers Data Centers and Cloud Computing Slides courtesy of Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet

More information

Automated Control for Elastic Storage Harold Lim, Shivnath Babu, Jeff Chase Duke University

Automated Control for Elastic Storage Harold Lim, Shivnath Babu, Jeff Chase Duke University D u k e S y s t e m s Automated Control for Elastic Storage Harold Lim, Shivnath Babu, Jeff Chase Duke University Motivation We address challenges for controlling elastic applications, specifically storage.

More information

Deep Dive Amazon Kinesis. Ian Meyers, Principal Solution Architect - Amazon Web Services

Deep Dive Amazon Kinesis. Ian Meyers, Principal Solution Architect - Amazon Web Services Deep Dive Amazon Kinesis Ian Meyers, Principal Solution Architect - Amazon Web Services Analytics Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure

More information

Lecture 2: Memory Systems

Lecture 2: Memory Systems Lecture 2: Memory Systems Basic components Memory hierarchy Cache memory Virtual Memory Zebo Peng, IDA, LiTH Many Different Technologies Zebo Peng, IDA, LiTH 2 Internal and External Memories CPU Date transfer

More information

AWS Storage Gateway. Not your father s hybrid storage. University of Arizona IT Summit October 23, Jay Vagalatos, AWS Solutions Architect

AWS Storage Gateway. Not your father s hybrid storage. University of Arizona IT Summit October 23, Jay Vagalatos, AWS Solutions Architect AWS Storage Gateway Not your father s hybrid storage University of Arizona IT Summit 2017 Jay Vagalatos, AWS Solutions Architect October 23, 2017 The AWS Storage Portfolio Amazon EBS (persistent) Block

More information

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES May, 2017 Contents Introduction... 2 Overview... 2 Architecture... 2 SDFS File System Service... 3 Data Writes... 3 Data Reads... 3 De-duplication

More information

SCAM Portfolio Scalability

SCAM Portfolio Scalability SCAM Portfolio Scalability Henrik Eriksson Per-Olof Andersson Uppsala Learning Lab 2005-04-18 1 Contents 1 Abstract 3 2 Suggested Improvements Summary 4 3 Abbreviations 5 4 The SCAM Portfolio System 6

More information

Unifer Documentation. Release V1.0. Matthew S

Unifer Documentation. Release V1.0. Matthew S Unifer Documentation Release V1.0 Matthew S July 28, 2014 Contents 1 Unifer Tutorial - Notes Web App 3 1.1 Setting up................................................. 3 1.2 Getting the Template...........................................

More information

AWS Lambda: Event-driven Code in the Cloud

AWS Lambda: Event-driven Code in the Cloud AWS Lambda: Event-driven Code in the Cloud Dean Bryen, Solutions Architect AWS Andrew Wheat, Senior Software Engineer - BBC April 15, 2015 London, UK 2015, Amazon Web Services, Inc. or its affiliates.

More information

Immersion Day. Getting Started with AWS Lambda. August Rev

Immersion Day. Getting Started with AWS Lambda. August Rev Getting Started with AWS Lambda August 2016 Rev 2016-08-19 Table of Contents Overview... 3 AWS Lambda... 3 Amazon S3... 3 Amazon CloudWatch... 3 Handling S3 Events using the AWS Lambda Console... 4 Create

More information

REQUEST. Introduction Thank you with your. of the rest. that time, preparation. What You. Will Need 4x _1258

REQUEST. Introduction Thank you with your. of the rest. that time, preparation. What You. Will Need 4x _1258 REQUEST NAS4T/ NAS4R INSTALLATION Introduction Thank you for purchasing a ReQuest Serious Play NAS device.. You will find that this NAS performs better with your ReQuest server than most devices you can

More information

CSE 373 OCTOBER 23 RD MEMORY AND HARDWARE

CSE 373 OCTOBER 23 RD MEMORY AND HARDWARE CSE 373 OCTOBER 23 RD MEMORY AND HARDWARE MEMORY ANALYSIS Similar to runtime analysis MEMORY ANALYSIS Similar to runtime analysis Consider the worst case MEMORY ANALYSIS Similar to runtime analysis Rather

More information

Map-Reduce. Marco Mura 2010 March, 31th

Map-Reduce. Marco Mura 2010 March, 31th Map-Reduce Marco Mura (mura@di.unipi.it) 2010 March, 31th This paper is a note from the 2009-2010 course Strumenti di programmazione per sistemi paralleli e distribuiti and it s based by the lessons of

More information

Stateless Datacenter Load Balancing with Beamer

Stateless Datacenter Load Balancing with Beamer Stateless Datacenter Load Balancing with Beamer Vladimir Olteanu, Alexandru Agache, Andrei Voinescu, Costin Raiciu University Politehnica of Bucharest Thanks to Datacenter load balancing Datacenter load

More information

Cloud Computing & Visualization

Cloud Computing & Visualization Cloud Computing & Visualization Workflows Distributed Computation with Spark Data Warehousing with Redshift Visualization with Tableau #FIUSCIS School of Computing & Information Sciences, Florida International

More information

Problem 1.R1: How Many Bits?

Problem 1.R1: How Many Bits? CSC 495 Problem Set 1 Due Tuesday, January 17 Problem 1.R1: How Many Bits? Required Problem Points: 50 points Background When a number is stored in a primitive type, like an int or long variable, it always

More information

Reshaping Text Data for Efficient Processing on Amazon EC2. Gabriela Turcu, Ian Foster, Svetlozar Nestorov

Reshaping Text Data for Efficient Processing on Amazon EC2. Gabriela Turcu, Ian Foster, Svetlozar Nestorov Reshaping Text Data for Efficient Processing on Amazon EC2 Gabriela Turcu, Ian Foster, Svetlozar Nestorov Outline Motivation Goals: Determine empirically simple application performance model Statically

More information

The Google File System

The Google File System The Google File System Sanjay Ghemawat, Howard Gobioff and Shun Tak Leung Google* Shivesh Kumar Sharma fl4164@wayne.edu Fall 2015 004395771 Overview Google file system is a scalable distributed file system

More information

Running MySQL on AWS. Michael Coburn Wednesday, April 15th, 2015

Running MySQL on AWS. Michael Coburn Wednesday, April 15th, 2015 Running MySQL on AWS Michael Coburn Wednesday, April 15th, 2015 Who am I? 2 Senior Architect with Percona 3 years on Friday! Canadian but I now live in Costa Rica I see 3-10 different customer environments

More information

Why SaaS isn t Backup

Why SaaS isn t Backup EBOOK LOGO HERE Why SaaS isn t Backup Yes, You need to backup your cloud data. 1 One of the most business friendly innovations in recent years has been the proliferation of cloud apps like Google Apps,,

More information

CS / Cloud Computing. Recitation 3 September 9 th & 11 th, 2014

CS / Cloud Computing. Recitation 3 September 9 th & 11 th, 2014 CS15-319 / 15-619 Cloud Computing Recitation 3 September 9 th & 11 th, 2014 Overview Last Week s Reflection --Project 1.1, Quiz 1, Unit 1 This Week s Schedule --Unit2 (module 3 & 4), Project 1.2 Questions

More information

Two hours - online. The exam will be taken on line. This paper version is made available as a backup

Two hours - online. The exam will be taken on line. This paper version is made available as a backup COMP 25212 Two hours - online The exam will be taken on line. This paper version is made available as a backup UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE System Architecture Date: Monday 21st

More information

Distributed Systems. 15. Distributed File Systems. Paul Krzyzanowski. Rutgers University. Fall 2017

Distributed Systems. 15. Distributed File Systems. Paul Krzyzanowski. Rutgers University. Fall 2017 Distributed Systems 15. Distributed File Systems Paul Krzyzanowski Rutgers University Fall 2017 1 Google Chubby ( Apache Zookeeper) 2 Chubby Distributed lock service + simple fault-tolerant file system

More information

Succinct Data Structures: Theory and Practice

Succinct Data Structures: Theory and Practice Succinct Data Structures: Theory and Practice March 16, 2012 Succinct Data Structures: Theory and Practice 1/15 Contents 1 Motivation and Context Memory Hierarchy Succinct Data Structures Basics Succinct

More information

CS /30/17. Paul Krzyzanowski 1. Google Chubby ( Apache Zookeeper) Distributed Systems. Chubby. Chubby Deployment.

CS /30/17. Paul Krzyzanowski 1. Google Chubby ( Apache Zookeeper) Distributed Systems. Chubby. Chubby Deployment. Distributed Systems 15. Distributed File Systems Google ( Apache Zookeeper) Paul Krzyzanowski Rutgers University Fall 2017 1 2 Distributed lock service + simple fault-tolerant file system Deployment Client

More information

Hadoop File System S L I D E S M O D I F I E D F R O M P R E S E N T A T I O N B Y B. R A M A M U R T H Y 11/15/2017

Hadoop File System S L I D E S M O D I F I E D F R O M P R E S E N T A T I O N B Y B. R A M A M U R T H Y 11/15/2017 Hadoop File System 1 S L I D E S M O D I F I E D F R O M P R E S E N T A T I O N B Y B. R A M A M U R T H Y Moving Computation is Cheaper than Moving Data Motivation: Big Data! What is BigData? - Google

More information

COSC3330 Computer Architecture Lecture 20. Virtual Memory

COSC3330 Computer Architecture Lecture 20. Virtual Memory COSC3330 Computer Architecture Lecture 20. Virtual Memory Instructor: Weidong Shi (Larry), PhD Computer Science Department University of Houston Virtual Memory Topics Reducing Cache Miss Penalty (#2) Use

More information

COMP283-Lecture 3 Applied Database Management

COMP283-Lecture 3 Applied Database Management COMP283-Lecture 3 Applied Database Management Introduction DB Design Continued Disk Sizing Disk Types & Controllers DB Capacity 1 COMP283-Lecture 3 DB Storage: Linear Growth Disk space requirements increases

More information

Unlimited Scalability in the Cloud A Case Study of Migration to Amazon DynamoDB

Unlimited Scalability in the Cloud A Case Study of Migration to Amazon DynamoDB Unlimited Scalability in the Cloud A Case Study of Migration to Amazon DynamoDB Steve Saporta CTO, SpinCar Mar 19, 2016 SpinCar When a web-based business grows... More customers = more transactions More

More information

Foxtel Broadband on ADSL Service Description

Foxtel Broadband on ADSL Service Description Foxtel Broadband & Home Phone Agreement Foxtel Broadband on ADSL Service Description Effective from 23 February, 2016 CONTENTS 1. ABOUT THIS SERVICE DESCRIPTION 1 2. FOXTEL BROADBAND ON ADSL 1 3. PLANS

More information

what is cloud computing?

what is cloud computing? what is cloud computing? (Private) Cloud Computing with Mesos at Twi9er Benjamin Hindman @benh scalable virtualized self-service utility managed elastic economic pay-as-you-go what is cloud computing?

More information

Distributed Systems. 15. Distributed File Systems. Paul Krzyzanowski. Rutgers University. Fall 2016

Distributed Systems. 15. Distributed File Systems. Paul Krzyzanowski. Rutgers University. Fall 2016 Distributed Systems 15. Distributed File Systems Paul Krzyzanowski Rutgers University Fall 2016 1 Google Chubby 2 Chubby Distributed lock service + simple fault-tolerant file system Interfaces File access

More information

CS 102. Big Data. Spring Big Data Platforms

CS 102. Big Data. Spring Big Data Platforms CS 102 Big Data Spring 2016 Big Data Platforms How Big is Big? The Data Data Sets 1000 2 (5.3 MB) Complete works of Shakespeare (text) 1000 3 (~5-500 GB) Your data 1000 4 (10 TB) Library of Congress (text)

More information

Cumulus: Filesystem Backup to the Cloud

Cumulus: Filesystem Backup to the Cloud Cumulus: Filesystem Backup to the Cloud 7th USENIX Conference on File and Storage Technologies (FAST 09) Michael Vrable Stefan Savage Geoffrey M. Voelker University of California, San Diego February 26,

More information

The City School Liaquat Campus Worksheet Module 3 Class 7 Name Class/Sec Date

The City School Liaquat Campus Worksheet Module 3 Class 7 Name Class/Sec Date The City School Liaquat Campus Worksheet Module 3 Class 7 Name Class/Sec Date 1. A Network consists of 2. A Point-to-Point Topology 3. The Internet is 1. Select the correct answer one computer computers

More information

Scaling Massive Content Stores in the Cloud. CloudExpo New York June Alfresco Founder & CTO

Scaling Massive Content Stores in the Cloud. CloudExpo New York June Alfresco Founder & CTO Scaling Massive Content Stores in the Cloud CloudExpo New York June 2016 @johnnewton Alfresco Founder & CTO Alfresco Customers Government Financial Services Healthcare Manufacturing Corporate Somewhere

More information

www.powerretrieve.solutions Copyright 2015 1 What is PowerRetrieve? Turning your documents into data Imagine a world where using a few words, dates or letter combinations you could search through the content

More information

Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect

Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect Igor Roiter Big Data Cloud Solution Architect Working as a Data Specialist for the last 11 years 9 of them as a Consultant specializing

More information

Microservices without the Servers: AWS Lambda in Action

Microservices without the Servers: AWS Lambda in Action Microservices without the Servers: AWS Lambda in Action Dr. Tim Wagner, General Manager AWS Lambda August 19, 2015 Seattle, WA 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Two

More information

BANKVIC APP FREQUENTLY ASKED QUESTIONS

BANKVIC APP FREQUENTLY ASKED QUESTIONS BANKVIC APP FREQUENTLY ASKED QUESTIONS TABLE OF CONTENTS TABLE OF CONTENTS... 1 ABOUT THE BANKVIC APP... 2 GETTING STARTED... 3 SECURITY... 4 FEATURES & FUNCTIONALITY... 5 PAYMENTS & TRANSFERS... 6 CARD

More information

CIT 668: System Architecture. Amazon Web Services

CIT 668: System Architecture. Amazon Web Services CIT 668: System Architecture Amazon Web Services Topics 1. AWS Global Infrastructure 2. Foundation Services 1. Compute 2. Storage 3. Database 4. Network 3. AWS Economics Amazon Services Architecture Regions

More information

CSE506: Operating Systems CSE 506: Operating Systems

CSE506: Operating Systems CSE 506: Operating Systems CSE 506: Operating Systems Block Cache Address Space Abstraction Given a file, which physical pages store its data? Each file inode has an address space (0 file size) So do block devices that cache data

More information

4 Myths about in-memory databases busted

4 Myths about in-memory databases busted 4 Myths about in-memory databases busted Yiftach Shoolman Co-Founder & CTO @ Redis Labs @yiftachsh, @redislabsinc Background - Redis Created by Salvatore Sanfilippo (@antirez) OSS, in-memory NoSQL k/v

More information

Streaming Log Analytics with Kafka

Streaming Log Analytics with Kafka Streaming Log Analytics with Kafka Kresten Krab Thorup, Humio CTO Log Everything, Answer Anything, In Real-Time. Why this talk? Humio is a Log Analytics system Designed to run on-prem High volume, real

More information

CLOUD-SCALE FILE SYSTEMS

CLOUD-SCALE FILE SYSTEMS Data Management in the Cloud CLOUD-SCALE FILE SYSTEMS 92 Google File System (GFS) Designing a file system for the Cloud design assumptions design choices Architecture GFS Master GFS Chunkservers GFS Clients

More information

Cloud Computing Security: From Single to Multi-Clouds

Cloud Computing Security: From Single to Multi-Clouds 2012 45th Hawaii International Conference on System Sciences Cloud Computing Security: From Single to Multi-Clouds Abstract: The use of cloud computing has increased rapidly in many organizations. Cloud

More information

Brooke Roegge. Digital Information Specialist Minnesota Dept. of Employment and Economic Development

Brooke Roegge. Digital Information Specialist Minnesota Dept. of Employment and Economic Development Editing Existing Items in CONTENTdm s Project Client Brooke Roegge Digital Information Specialist Minnesota Dept. of Employment and Economic Development November 14, 2011 Open an existing collection in

More information

EC-Bench: Benchmarking Onload and Offload Erasure Coders on Modern Hardware Architectures

EC-Bench: Benchmarking Onload and Offload Erasure Coders on Modern Hardware Architectures EC-Bench: Benchmarking Onload and Offload Erasure Coders on Modern Hardware Architectures Haiyang Shi, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda {shi.876, lu.932, panda.2}@osu.edu The Ohio State University

More information

BACKUP APP V7 CLOUUD FILE BACKUP & RESTORE GUIDE FOR WINDOWS

BACKUP APP V7 CLOUUD FILE BACKUP & RESTORE GUIDE FOR WINDOWS V7 CLOUUD FILE BACKUP & RESTORE GUIDE FOR WINDOWS Table of Contents 1 Overview... 1 1.1 About This Document... 7 2 Preparing for Backup and Restore... 8 2.1 Hardware Requirement... 8 2.2 Software Requirement...

More information

TPP On The Cloud. Joe Slagel

TPP On The Cloud. Joe Slagel TPP On The Cloud Joe Slagel Lecture topics Introduc5on to Cloud Compu5ng and Amazon Web Services Overview of TPP Cloud components Setup trial AWS and use of the new TPP Web Launcher for Amazon (TWA) Future

More information

Backup APP v7. Office 365 Exchange Online Backup & Restore Guide for Mac OS X

Backup APP v7. Office 365 Exchange Online Backup & Restore Guide for Mac OS X Backup APP v7 Office 365 Exchange Online Backup & Restore Guide for Mac OS X Revision History Date Descriptions Type of modification 5 Apr 2017 First Draft New Table of Contents 1 Overview... 1 About This

More information

OPENVZ SERVERS PLANS AND FEATURES

OPENVZ SERVERS PLANS AND FEATURES OPENVZ SERVERS PLANS AND FEATURES TABLE OF CONTENTS WHY CHOOSE OPENVZ VPS? 3 OPENVZ VIRTUAL PRIVATE SERVERS 4 BONUS FEATURES WITH HEPSIA 5 BONUS FEATURES WITH CPANEL & DIRECTADMIN 6 MANAGED SERVICES 7

More information

Distributed Filesystem

Distributed Filesystem Distributed Filesystem 1 How do we get data to the workers? NAS Compute Nodes SAN 2 Distributing Code! Don t move data to workers move workers to the data! - Store data on the local disks of nodes in the

More information

RAID SEMINAR REPORT /09/2004 Asha.P.M NO: 612 S7 ECE

RAID SEMINAR REPORT /09/2004 Asha.P.M NO: 612 S7 ECE RAID SEMINAR REPORT 2004 Submitted on: Submitted by: 24/09/2004 Asha.P.M NO: 612 S7 ECE CONTENTS 1. Introduction 1 2. The array and RAID controller concept 2 2.1. Mirroring 3 2.2. Parity 5 2.3. Error correcting

More information

About us. How we help?

About us. How we help? Go to Top About us Mobile Device Manager Plus is a mobile device management solution developed by ManageEngine. Mobile Device Manager Plus provides admins the power to perform device management from a

More information

HyperDex. A Distributed, Searchable Key-Value Store. Robert Escriva. Department of Computer Science Cornell University

HyperDex. A Distributed, Searchable Key-Value Store. Robert Escriva. Department of Computer Science Cornell University HyperDex A Distributed, Searchable Key-Value Store Robert Escriva Bernard Wong Emin Gün Sirer Department of Computer Science Cornell University School of Computer Science University of Waterloo ACM SIGCOMM

More information

Tutor Lead Session Image Storage and the Cloud.

Tutor Lead Session Image Storage and the Cloud. Tutor Lead Session Image Storage and the Cloud. Storage There are many different types of storage when it comes to backing up your photos and keeping them safe. This guide will explain a few of the different

More information

Random Access Memory (RAM)

Random Access Memory (RAM) Best known form of computer memory. "random access" because you can access any memory cell directly if you know the row and column that intersect at that cell. CS1111 CS5020 - Prof J.P. Morrison UCC 33

More information

Rate Switch Keying Guide

Rate Switch Keying Guide NFI ONLINE Rate Switch Keying Guide For intermediary use only This guide will assist you with keying a Rate Switch in NFI Online. Note: The following application types can t be completed on NFI Online

More information

Podcasting Creating Audio Podcasts

Podcasting Creating Audio Podcasts Podcasting Creating Audio Podcasts HOW TO MAKE PODCASTS...2 Delivering Your Podcasts...2 Domain Names...2 Hosting Your Files...2 Show Note Hosting...3 FREE & EASY & UPGRADEABLE PODBEAN...3 Registering

More information

Developing on DragonBoard

Developing on DragonBoard Developing on DragonBoard Getting Started with APQ8060 and Pragmatux+Android Bill Gatliff bgat@billgatliff.com Ryan Kuester rkuester@insymbols.com 1 2 CPU Daughterboard APQ8060 ARMv7 Dual core 1.5 GHz

More information

Scientific Workflows and Cloud Computing. Gideon Juve USC Information Sciences Institute

Scientific Workflows and Cloud Computing. Gideon Juve USC Information Sciences Institute Scientific Workflows and Cloud Computing Gideon Juve USC Information Sciences Institute gideon@isi.edu Scientific Workflows Loosely-coupled parallel applications Expressed as directed acyclic graphs (DAGs)

More information

Cloud Computing 1. CSCI 4850/5850 High-Performance Computing Spring 2018

Cloud Computing 1. CSCI 4850/5850 High-Performance Computing Spring 2018 Cloud Computing 1 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning

More information

OVERVIEW...3. Why RePOS...3 INTEGRATION...3. Control...4. Reliability...4. Accessibility...5. Flexibly...5. Support...5. RePOS Components...

OVERVIEW...3. Why RePOS...3 INTEGRATION...3. Control...4. Reliability...4. Accessibility...5. Flexibly...5. Support...5. RePOS Components... 1 P a g e CONTENTS OVERVIEW...3 Why RePOS...3 INTEGRATION...3 Control...4 Reliability...4 Accessibility...5 Flexibly...5 Support...5 RePOS Components...6 Technical Requrirements...7 RePOS Interface...7

More information