A Cloud Storage Adaptable to Read-Intensive and Write-Intensive Workload
|
|
- Thomas Martin
- 5 years ago
- Views:
Transcription
1 DEIM Forum 2011 C MyCassandra, Cassandra MySQL, 41.4%, 49.4%.,, Abstract A Cloud Storage Adaptable to Read-Intensive and Write-Intensive Workload Shunsuke NAKAMURA and Kazuyuki SHUDO Dept. of Mathematical and Computing Sciences, Tokyo Institute of Technology Ookayama, Meguro, Tokyo, Japan {nakamur6,shudo}@is.titech.ac.jp We expect that a storage engine determines whether a cloud storage is read-optimized or write-optimized. It means that a single cloud storage can be both of them just by replacing its storage engine with another one. It is not necessary to use another cloud storage to adjust a balance of read and write performance. The expectation was confirmed by performance comparison of Bigtable-based storage engine of the original Cassandra and MySQL storage engine. Write latency of the former is 41.4% lower than the latter with a write-heavy workload, and read latency of the latter is 49.4% lower than the former with a read-heavy workload. Key words distributed database, cloud storage, performance trade-off 1. RDBMS NoSQL Key-Value Store. RDBMS,,.,., key-value (multi dimentional map), master/worker,, /.,,..,, /. Apache Cassandra, MyCassandra.
2 1 Cassandra, HBase Sherpa, sharded MySQL Diff Sequential Key lookup Update Diff Merge Single Read Bigtable MySQL 2., I/O. I/O, I/O,,.,,.,,. 1. Apache Cassandra [1] Apache HBase [2],,, Sherpa sharded MySQL(MySQL sharding ),. /.,,. Cassandra HBase Bigtable ( ) I/O,, I/O., MySQL Sherpa 1 I/O, I/O. RDBMS 1 MySQL,., 1 MyCassandra Storage Engine Interface. Apache Cassandra, MySQL, MyCassandra. MyCassandra, Apache Cassandra, My- Cassandra MyCassandra MyCassandra, Cassandra,.,,. 1 Cassandra MyCassandra. MyCassandra Cassandra Storage Engine Interface., MyCassandra. (, ) put get Cassandra, RDBMS key-value store.,. MySQL key value key-value., Redis key-value, key ColumnFamily prefix Apache Cassandra Apache Cassandra, Facebook, Apache Project Consistent Hashing Cassandra Amazon Dynamo [3] Consistent Hashing
3 2 YCSB Workload Read Update Record Selection App. Example Update-Only 0% 100% Zipfian Log Update-Heavy 50% 50% Zipfian Session Store Read-Heavy 95% 5% Zipfian Photo tagging Read-Only 100% 0% Zipfian Cache 2 Cassandra , KB 4 OS fc14.x86 64 CPU 2.40 GHz Xeon E Mem 32GB RAM Disk 1TB SATA HDD 2 JVM Java SE 6 Update 21 MYSQL alpha. Consistent Hashing,. Gossip Protocol,., ID N-1, Bigtable Cassandra Bigtable, Commit Log, MemTable, SSTable. Cassandra 2. Commit Log, MemTable. MemTable, SSTable.,,,.,, SSTable MySQL / InnoDB, 1 MySQL 6.0 My- Cassandra. MySQL JDBC API, MySQL InnoDB Redis Redis [4] Key Value keyvalue store.redis,,.,,. 3. MyCassandra, Cassandra Bigtable, MySQL, Redis 3. Yahoo! s Cloud Serving Benchmark(YCSB) [5] YCSB YCSB, Yahoo! Research.. YCSB,. YCSB,,,. 2, 4. Update-Only Update-Heavy, Read-Only, Read-Only.., Zipfian.Zipfian,,,. 3, ,000 /. Bigtable MySQL, Bigtable, MySQL 41.4%, MySQL
4 3..,. Cloudy [7],.,.. Amazon Dynamo [3], Amazon key-value store. Berkeley DB MySQL,.,. Dynamo, Bigtable 49.4% , Bigtable MySQL 5.32, MySQL Bigtable 2.35.,, Bigtable, MySQL.,. Redis,. 4. Anvil [6], dtable, /, SSD HDD, SSD. SSD HDD I/O,, HDD Bigtable MySQL. 5. 2,,,.,., 5,., Bigtable Redis, MySQL Redis,.,.,, Cassandra Gossip Protocol.,.,
5 5 MyCassandra Cluster., Cassandra,, Quorum. MyCassandra,. (proximity),,,.,,,,,,.,.,. [1] Avinash Lakshman and Prashant Malik. Cassandra - a decentralized structured storage system. In Proc. LADIS 09, [2] Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber. Bigtable: A distributed storage system for structured data. In Proc. OSDI 06, Vol. 7, pp , [3] Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner Vogels. Dynamo: Amazon s highly available key-value store. In Proc. SOSP 07, [4] Redis. Redis. March [5] B. F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, and Russell Sears. Benchmarking cloud serving systems with ycsb. In Proc. SOCC 10, [6] Mike Mammarella, Shant Hovsepian, and Eddie Kohler. Modular data storage with anvil. In Proc. SOSP 09, [7] Donald Kossmann, Tim Kraska, Simon Loesing, Stephan Merkli, Raman Mittal, and Flavio Pfaffhauser. Cloudy: A modular cloud storage system. In Proc. VLDB 10, 2010.
Cassandra- A Distributed Database
Cassandra- A Distributed Database Tulika Gupta Department of Information Technology Poornima Institute of Engineering and Technology Jaipur, Rajasthan, India Abstract- A relational database is a traditional
More informationL22: NoSQL. CS3200 Database design (sp18 s2) 4/5/2018 Several slides courtesy of Benny Kimelfeld
L22: NoSQL CS3200 Database design (sp18 s2) https://course.ccs.neu.edu/cs3200sp18s2/ 4/5/2018 Several slides courtesy of Benny Kimelfeld 2 Outline 3 Introduction Transaction Consistency 4 main data models
More informationNOSQL DATABASE PERFORMANCE BENCHMARKING - A CASE STUDY
STUDIA UNIV. BABEŞ BOLYAI, INFORMATICA, Volume LXIII, Number 1, 2018 DOI: 10.24193/subbi.2018.1.06 NOSQL DATABASE PERFORMANCE BENCHMARKING - A CASE STUDY CAMELIA-FLORINA ANDOR AND BAZIL PÂRV Abstract.
More informationDynamo: Amazon s Highly Available Key-value Store
Dynamo: Amazon s Highly Available Key-value Store Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall and
More informationBenchmarking Cloud Serving Systems with YCSB 詹剑锋 2012 年 6 月 27 日
Benchmarking Cloud Serving Systems with YCSB 詹剑锋 2012 年 6 月 27 日 Motivation There are many cloud DB and nosql systems out there PNUTS BigTable HBase, Hypertable, HTable Megastore Azure Cassandra Amazon
More informationIntroduction Data Model API Building Blocks SSTable Implementation Tablet Location Tablet Assingment Tablet Serving Compactions Refinements
Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber Google, Inc. M. Burak ÖZTÜRK 1 Introduction Data Model API Building
More informationCS555: Distributed Systems [Fall 2017] Dept. Of Computer Science, Colorado State University
CS 555: DISTRIBUTED SYSTEMS [REPLICATION & CONSISTENCY] Frequently asked questions from the previous class survey Shrideep Pallickara Computer Science Colorado State University L25.1 L25.2 Topics covered
More informationA Proposal of the Distributed Data Management System for Large-scale Sensor Data
1, 1,a) 2 2 2 3 4 2012 3 31, 2012 11 2 Home Energy Management System HEMS Building Energy Management System BEMS PUCC DBMS EMS A Proposal of the Distributed Data Management System for Large-scale Sensor
More informationBigtable. Presenter: Yijun Hou, Yixiao Peng
Bigtable Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber Google, Inc. OSDI 06 Presenter: Yijun Hou, Yixiao Peng
More informationCSE-E5430 Scalable Cloud Computing Lecture 10
CSE-E5430 Scalable Cloud Computing Lecture 10 Keijo Heljanko Department of Computer Science School of Science Aalto University keijo.heljanko@aalto.fi 23.11-2015 1/29 Exam Registering for the exam is obligatory,
More informationCSE-E5430 Scalable Cloud Computing Lecture 9
CSE-E5430 Scalable Cloud Computing Lecture 9 Keijo Heljanko Department of Computer Science School of Science Aalto University keijo.heljanko@aalto.fi 15.11-2015 1/24 BigTable Described in the paper: Fay
More informationDynamo: Amazon s Highly Available Key-value Store
Dynamo: Amazon s Highly Available Key-value Store Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall and
More informationCS555: Distributed Systems [Fall 2017] Dept. Of Computer Science, Colorado State University
CS 555: DISTRIBUTED SYSTEMS [DYNAMO & GOOGLE FILE SYSTEM] Frequently asked questions from the previous class survey What s the typical size of an inconsistency window in most production settings? Dynamo?
More informationCS655: Advanced Topics in Distributed Systems [Fall 2013] Dept. Of Computer Science, Colorado State University
CS 655: ADVANCED TOPICS IN DISTRIBUTED SYSTEMS Shrideep Pallickara Computer Science Colorado State University PROFILING HARD DISKS L4.1 L4.2 Characteristics of peripheral devices & their speed relative
More informationCassandra - A Decentralized Structured Storage System. Avinash Lakshman and Prashant Malik Facebook
Cassandra - A Decentralized Structured Storage System Avinash Lakshman and Prashant Malik Facebook Agenda Outline Data Model System Architecture Implementation Experiments Outline Extension of Bigtable
More informationEvaluating Cassandra as a Manager of Large File Sets
Evaluating Cassandra as a Manager of Large File Sets Leander Beernaert, Pedro Gomes, Miguel Matos, Ricardo Vilaça, Rui Oliveira High-Assurance Software Laboratory INESC TEC & Universidade do Minho Braga,
More informationΕΠΛ 602:Foundations of Internet Technologies. Cloud Computing
ΕΠΛ 602:Foundations of Internet Technologies Cloud Computing 1 Outline Bigtable(data component of cloud) Web search basedonch13of thewebdatabook 2 What is Cloud Computing? ACloudis an infrastructure, transparent
More informationReprise: Stability under churn (Tapestry) A Simple lookup Test. Churn (Optional Bamboo paper last time)
EECS 262a Advanced Topics in Computer Systems Lecture 22 Reprise: Stability under churn (Tapestry) P2P Storage: Dynamo November 20 th, 2013 John Kubiatowicz and Anthony D. Joseph Electrical Engineering
More informationNoSQL and Database as a Service
NoSQL and Database as a Service Prof. Dr. Marcel Graf TSM-ClComp-EN Cloud Computing (C) 2017 HEIG-VD Databases Historical overview Prehistory : Hierarchical or network databases 1980 Ascent of relational
More informationEnhancing the Query Performance of NoSQL Datastores using Caching Framework
Enhancing the Query Performance of NoSQL Datastores using Caching Framework Ruchi Nanda #1, Swati V. Chande *2, K.S. Sharma #3 #1,# 3 Department of CS & IT, The IIS University, Jaipur, India *2 Department
More informationDISTRIBUTED FILE SYSTEMS CARSTEN WEINHOLD
Department of Computer Science Institute of System Architecture, Operating Systems Group DISTRIBUTED FILE SYSTEMS CARSTEN WEINHOLD OUTLINE Classical distributed file systems NFS: Sun Network File System
More informationA Proxy-based Query Aggregation Method for Distributed Key-Value Stores
A Proxy-based Query Aggregation Method for Distributed Key-Value Stores Daichi Kawanami, Masanari Kamoshita, Ryota Kawashima and Hiroshi Matsuo Nagoya Institute of Technology, in Nagoya, Aichi, 466-8555,
More informationA Review to the Approach for Transformation of Data from MySQL to NoSQL
A Review to the Approach for Transformation of Data from MySQL to NoSQL Monika 1 and Ashok 2 1 M. Tech. Scholar, Department of Computer Science and Engineering, BITS College of Engineering, Bhiwani, Haryana
More informationEnabling Distributed Key-Value Stores with Low Latency-Impact Snapshot Support
Enabling Distributed Key-Value Stores with Low Latency-Impact Snapshot Support Jordà Polo, Yolanda Becerra, David Carrera, Jordi Torres and Eduard Ayguadé Barcelona Supercomputing Center (BSC) - Technical
More informationThe material in this lecture is taken from Dynamo: Amazon s Highly Available Key-value Store, by G. DeCandia, D. Hastorun, M. Jampani, G.
The material in this lecture is taken from Dynamo: Amazon s Highly Available Key-value Store, by G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall,
More informationDISTRIBUTED FILE SYSTEMS CARSTEN WEINHOLD
Department of Computer Science Institute of System Architecture, Operating Systems Group DISTRIBUTED FILE SYSTEMS CARSTEN WEINHOLD OUTLINE Classical distributed file systems NFS: Sun Network File System
More informationHOD, Assoc. Prof, Department of computer applications, Sri Venkateswara College of Engineering and Technology, Chittoor, Andhra Pradesh, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 4 ISSN : 2456-3307 Remote Sensing Application with Real-Time Big Data
More informationPresented by Nanditha Thinderu
Presented by Nanditha Thinderu Enterprise systems are highly distributed and heterogeneous which makes administration a complex task Application Performance Management tools developed to retrieve information
More informationDistributed Hash Tables Chord and Dynamo
Distributed Hash Tables Chord and Dynamo (Lecture 19, cs262a) Ion Stoica, UC Berkeley October 31, 2016 Today s Papers Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications, Ion Stoica,
More informationDepartment of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 17
Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases Lecture 17 Cloud Data Management VII (Column Stores and Intro to NewSQL) Demetris Zeinalipour http://www.cs.ucy.ac.cy/~dzeina/courses/epl646
More informationAn Efficient Distributed B-tree Index Method in Cloud Computing
Send Orders for Reprints to reprints@benthamscience.ae The Open Cybernetics & Systemics Journal, 214, 8, 32-38 32 Open Access An Efficient Distributed B-tree Index Method in Cloud Computing Huang Bin 1,*
More informationNoSQL BENCHMARKING AND TUNING. Nachiket Kate Santosh Kangane Ankit Lakhotia Persistent Systems Ltd. Pune, India
NoSQL BENCHMARKING AND TUNING Nachiket Kate Santosh Kangane Ankit Lakhotia Persistent Systems Ltd. Pune, India Today large variety of available NoSQL options has made it difficult for developers to choose
More informationAccelerating Big Data: Using SanDisk SSDs for Apache HBase Workloads
WHITE PAPER Accelerating Big Data: Using SanDisk SSDs for Apache HBase Workloads December 2014 Western Digital Technologies, Inc. 951 SanDisk Drive, Milpitas, CA 95035 www.sandisk.com Table of Contents
More informationFAQs Snapshots and locks Vector Clock
//08 CS5 Introduction to Big - FALL 08 W.B.0.0 CS5 Introduction to Big //08 CS5 Introduction to Big - FALL 08 W.B. FAQs Snapshots and locks Vector Clock PART. LARGE SCALE DATA STORAGE SYSTEMS NO SQL DATA
More informationAnalysis of HBase Read/Write
Analysis of HBase Read/Write Arvind Dwarakanath School of Informatics and Computing, Indiana University adwaraka@indiana.edu Vaibhav Nachankar School of Informatics and Computing, Indiana University vmnachan@indiana.edu
More informationCausal Consistency for Data Stores and Applications as They are
[DOI: 10.2197/ipsjjip.25.775] Regular Paper Causal Consistency for Data Stores and Applications as They are Kazuyuki Shudo 1,a) Takashi Yaguchi 1, 1 Received: November 3, 2016, Accepted: May 16, 2017 Abstract:
More informationINF 551: Overview of Data Informatics in Large Data Environments Section: 32405D Spring 2017 (4 units), 3:30 5:20 PM, MW, SOS B44
USC VITERBI SCHOOL OF ENGINEERING INFORMATICS PROGRAM INF 551: Overview of Data Informatics in Large Data Environments Section: 32405D Spring 2017 (4 units), 3:30 5:20 PM, MW, SOS B44 Syllabus Instructor:
More informationarxiv: v3 [cs.dc] 1 Oct 2015
Continuous Partial Quorums for Consistency-Latency Tuning in Distributed NoSQL Storage Systems [Please refer to the proceedings of SCDM 15 for the extended version of this manuscript.] Marlon McKenzie
More informationDesign & Implementation of Cloud Big table
Design & Implementation of Cloud Big table M.Swathi 1,A.Sujitha 2, G.Sai Sudha 3, T.Swathi 4 M.Swathi Assistant Professor in Department of CSE Sri indu College of Engineering &Technolohy,Sheriguda,Ibrahimptnam
More informationOn the Varieties of Clouds for Data Intensive Computing
On the Varieties of Clouds for Data Intensive Computing Robert L. Grossman University of Illinois at Chicago and Open Data Group Yunhong Gu University of Illinois at Chicago Abstract By a cloud we mean
More informationHyperDex. A Distributed, Searchable Key-Value Store. Robert Escriva. Department of Computer Science Cornell University
HyperDex A Distributed, Searchable Key-Value Store Robert Escriva Bernard Wong Emin Gün Sirer Department of Computer Science Cornell University School of Computer Science University of Waterloo ACM SIGCOMM
More informationScalable database management in cloud computing
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 70 (2015 ) 658 667 4 th International Conference on Eco-friendly Computing and Communication Systems Scalable database
More informationMDHIM: A Parallel Key/Value Framework for HPC
: A Parallel Key/Value Framework for HPC Hugh N. Greenberg 1 Los Alamos National Laboratory John Bent EMC Gary Grider Los Alamos National Laboratory Abstract The long-expected convergence of High Performance
More informationTwo Levels Hashing Function in Consistent Hashing Algorithm
1 Yihua Lan, 2Xiaopu Ma, 3Yong Zhang, 4Haozheng Ren, 5Chao Yin, 6Huaifei Hu School of Computer Engineering, Huaihai Institute of Technology, Lianyungang, China, lanhua_2000@sina.com 2 School of Computer
More informationYCSB++ benchmarking tool Performance debugging advanced features of scalable table stores
YCSB++ benchmarking tool Performance debugging advanced features of scalable table stores Swapnil Patil M. Polte, W. Tantisiriroj, K. Ren, L.Xiao, J. Lopez, G.Gibson, A. Fuchs *, B. Rinaldi * Carnegie
More informationMaking Sense of NoSQL Dan McCreary, Kelly-McCreary & Associates. Minnesota Web Design Community Meetup Monday, February 3rd, :00pm to 8:30pm
Making Sense of NoSQL Dan McCreary, Kelly-McCreary & Associates Minnesota Web Design Community Meetup Monday, February 3rd, 2014 6:00pm to 8:30pm Food for thought What percentage of database transactions
More informationActive Cloud DB: A Database-Agnostic HTTP API to Key-Value Datastores
Active Cloud DB: A Database-Agnostic HTTP API to Key-Value Datastores Chris Bunch Jonathan Kupferman Chandra Krintz Computer Science Department University of California, Santa Barbara April 26, 2010 UCSB
More informationStructured Big Data 1: Google Bigtable & HBase Shiow-yang Wu ( 吳秀陽 ) CSIE, NDHU, Taiwan, ROC
Structured Big Data 1: Google Bigtable & HBase Shiow-yang Wu ( 吳秀陽 ) CSIE, NDHU, Taiwan, ROC Lecture material is mostly home-grown, partly taken with permission and courtesy from Professor Shih-Wei Liao
More informationBigtable: A Distributed Storage System for Structured Data
Bigtable: A Distributed Storage System for Structured Data Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber ~Harshvardhan
More informationAccelerate Database Performance and Reduce Response Times in MongoDB Humongous Environments with the LSI Nytro MegaRAID Flash Accelerator Card
Accelerate Database Performance and Reduce Response Times in MongoDB Humongous Environments with the LSI Nytro MegaRAID Flash Accelerator Card The Rise of MongoDB Summary One of today s growing database
More informationNoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu
NoSQL Databases MongoDB vs Cassandra Kenny Huynh, Andre Chik, Kevin Vu Introduction - Relational database model - Concept developed in 1970 - Inefficient - NoSQL - Concept introduced in 1980 - Related
More informationarxiv: v1 [cs.db] 23 Aug 2010
ElasTraS: An Elastic Transactional Data Store in the Cloud Sudipto Das Divyakant Agrawal Amr El Abbadi Department of Computer Science, UC Santa Barbara, CA, USA {sudipto, agrawal, amr}@cs.ucsb.edu arxiv:1008.3751v1
More informationHarmony: Towards Automated Self-Adaptive Consistency in Cloud Storage
Harmony: Towards Automated Self-Adaptive Consistency in Cloud Storage Houssem-Eddine Chihoub, Shadi Ibrahim, Gabriel Antoniu INRIA Rennes-Bretagne Atlantique Rennes, France {houssem-eddine.chihoub, shadi.ibrahim,
More informationDistributed Systems [Fall 2012]
Distributed Systems [Fall 2012] Lec 20: Bigtable (cont ed) Slide acks: Mohsen Taheriyan (http://www-scf.usc.edu/~csci572/2011spring/presentations/taheriyan.pptx) 1 Chubby (Reminder) Lock service with a
More informationCaSSanDra: An SSD Boosted Key- Value Store
CaSSanDra: An SSD Boosted Key- Value Store Prashanth Menon, Tilmann Rabl, Mohammad Sadoghi (*), Hans- Arno Jacobsen * UNIVERSITY OF TORONTO!1 Outline ApplicaHon Performance Management Cassandra and SSDs
More informationCassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent
Tanton Jeppson CS 401R Lab 3 Cassandra, MongoDB, and HBase Introduction For my report I have chosen to take a deeper look at 3 NoSQL database systems: Cassandra, MongoDB, and HBase. I have chosen these
More informationExploring Cassandra and HBase with BigTable Model
Exploring Cassandra and HBase with BigTable Model Hemanth Gokavarapu hemagoka@indiana.edu (Guidance of Prof. Judy Qiu) Department of Computer Science Indiana University Bloomington Abstract Cassandra is
More informationMeasuring Elasticity for Cloud Databases
Measuring Elasticity for Cloud Databases Thibault Dory, Boris Mejías Peter Van Roy ICTEAM Institute Univ. catholique de Louvain dory.thibault@gmail.com, peter.vanroy@uclouvain.be, boris.mejias@uclouvain.be
More informationPebblesDB: Building Key-Value Stores using Fragmented Log Structured Merge Trees
PebblesDB: Building Key-Value Stores using Fragmented Log Structured Merge Trees Pandian Raju 1, Rohan Kadekodi 1, Vijay Chidambaram 1,2, Ittai Abraham 2 1 The University of Texas at Austin 2 VMware Research
More informationFine-tuning the Consistency-Latency Trade-off in Quorum-Replicated Distributed Storage Systems
Fine-tuning the Consistency-Latency Trade-off in Quorum-Replicated Distributed Storage Systems Marlon McKenzie Electrical and Computer Engineering University of Waterloo, Canada m2mckenzie@uwaterloo.ca
More informationLoad Balancing Technology Based On Consistent Hashing For Database Cluster Systems
National Conference on Information Technology and Computer Science (CITCS 2012) Load Balancing Technology Based On Consistent Hashing For Database Cluster Systems Zhenguo Xuan Dept. of Computer Science,
More informationScalable Storage: The drive for web-scale data management
Scalable Storage: The drive for web-scale data management Bryan Rosander University of Central Florida bryanrosander@gmail.com March 28, 2012 Abstract Data-intensive applications have become prevalent
More informationOn the Energy Proportionality of Distributed NoSQL Data Stores
On the Energy Proportionality of Distributed NoSQL Data Stores Balaji Subramaniam and Wu-chun Feng Department. of Computer Science, Virginia Tech {balaji, feng}@cs.vt.edu Abstract. The computing community
More informationBigTable A System for Distributed Structured Storage
BigTable A System for Distributed Structured Storage Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber Adapted
More informationEvaluating Auto Scalable Application on Cloud
Evaluating Auto Scalable Application on Cloud Takashi Okamoto Abstract Cloud computing enables dynamic scaling out of system resources, depending on workloads and data volume. In addition to the conventional
More informationMDHIM: A Parallel Key/Value Store Framework for HPC
MDHIM: A Parallel Key/Value Store Framework for HPC Hugh Greenberg 7/6/2015 LA-UR-15-25039 HPC Clusters Managed by a job scheduler (e.g., Slurm, Moab) Designed for running user jobs Difficult to run system
More informationCAP Theorem, BASE & DynamoDB
Indian Institute of Science Bangalore, India भ रत य व ज ञ न स स थ न ब गल र, भ रत DS256:Jan18 (3:1) Department of Computational and Data Sciences CAP Theorem, BASE & DynamoDB Yogesh Simmhan Yogesh Simmhan
More informationPoS(ISGC 2011 & OGF 31)075
1 GIS Center, Feng Chia University 100 Wen0Hwa Rd., Taichung, Taiwan E-mail: cool@gis.tw Tien-Yin Chou GIS Center, Feng Chia University 100 Wen0Hwa Rd., Taichung, Taiwan E-mail: jimmy@gis.tw Lan-Kun Chung
More informationLeveraging High-Performance In-Memory Key-Value Data Stores to Accelerate Data Intensive Tasks
Leveraging High-Performance In-Memory Key-Value Data Stores to Accelerate Data Intensive Tasks Nesrine Khouzami, Jordà Polo and David Carrera Technical Report UPC-DAC-RR-CAP-2015-19 Department of Computer
More informationYCSB+T: Bench-marking Web-Scale Transactional Databases
YCSB+T: Bench-marking Web-Scale Transactional Databases Varun Razdan 1, Rahul Jobanputra 2, Aman Modi 3, Shruti Dumbare 4 1234Student, Dept. Of CE, Sinhgad Institute of Technology, Maharashtra, India Abstract-
More informationCompSci 516 Database Systems
CompSci 516 Database Systems Lecture 20 NoSQL and Column Store Instructor: Sudeepa Roy Duke CS, Fall 2018 CompSci 516: Database Systems 1 Reading Material NOSQL: Scalable SQL and NoSQL Data Stores Rick
More informationColumn-Oriented Storage Optimization in Multi-Table Queries
Column-Oriented Storage Optimization in Multi-Table Queries Davud Mohammadpur 1*, Asma Zeid-Abadi 2 1 Faculty of Engineering, University of Zanjan, Zanjan, Iran.dmp@znu.ac.ir 2 Department of Computer,
More informationDistributed Indexing of Web Scale Datasets for the Cloud
Distributed Indexing of Web Scale Datasets for the Cloud Ioannis Konstantinou, Evangelos Angelou, Dimitrios Tsoumakos and Nectarios Koziris Computing Systems Laboratory School of Electrical and Computer
More informationRocksDB Embedded Key-Value Store for Flash and RAM
RocksDB Embedded Key-Value Store for Flash and RAM Dhruba Borthakur February 2018. Presented at Dropbox Dhruba Borthakur: Who Am I? University of Wisconsin Madison Alumni Developer of AFS: Andrew File
More informationHadoop : A Framework for Big Data Processing & Storage
Hadoop : A Framework for Big Data Processing & Storage Yojna Arora 1, Dr Dinesh Goyal 2 1 PhD Scholar, Department of Computer Science & Engineering Gyan Vihar School of Engineering & Technology Suresh
More informationA Transaction Model for Management for Replicated Data with Multiple Consistency Levels
A Transaction Model for Management for Replicated Data with Multiple Consistency Levels Anand Tripathi Department of Computer Science & Engineering University of Minnesota, Minneapolis, Minnesota, 55455
More informationA Simple Approach for Executing SQL on a NoSQL Datastore
A Simple Approach for Executing SQL on a NoSQL Datastore Ricardo Vilaça, Francisco Cruz, José Pereira, and Rui Oliveira HASLab - High-Assurance Software Laboratory INESC TEC and Universidade do Minho Braga,
More informationDefining Weakly Consistent Byzantine Fault-Tolerant Services
Defining Weakly Consistent Byzantine Fault-Tolerant Services Atul Singh, Pedro Fonseca, Petr Kuznetsov,, Rodrigo Rodrigues, Petros Maniatis MPI-SWS, Rice University, TU Berlin/DT Labs, Intel Research Berkeley
More informationDB unimo. Università degli Studi di Modena e Reggio Emilia 1
Slides partially taken from: Gautam Shroff s Web Intelligence and Big Data course on Corusera NoSQL Principles and Systems: Tecnologia delle Basi di Dati course for Computer Engineering Master students
More informationTools for Social Networking Infrastructures
Tools for Social Networking Infrastructures 1 Cassandra - a decentralised structured storage system Problem : Facebook Inbox Search hundreds of millions of users distributed infrastructure inbox changes
More informationImproving Logical Clocks in Riak with Dotted Version Vectors: A Case Study
Improving Logical Clocks in Riak with Dotted Version Vectors: A Case Study Ricardo Gonçalves Universidade do Minho, Braga, Portugal, tome@di.uminho.pt Abstract. Major web applications need the partition-tolerance
More informationData Management in the Cloud. Tim Kraska
Data Management in the Cloud Tim Kraska Montag, 22. Februar 2010 Systems Group/ETH Zurich MILK? [Anology from IM 2/09 / Daniel Abadi] 22.02.2010 Systems Group/ETH Zurich 2 Do you want milk? Buy a cow High
More informationCIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )
Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL
More informationCS 6343: CLOUD COMPUTING Term Project
CS 6343: CLOUD COMPUTING Term Project For all projects Each group will be assigned a cluster of machines Each group should install VMM on each platform and use the VMs to simulate more machines See VMM
More informationCS 223 Final Project CuckooRings: A Data Structure for Reducing Maximum Load in Consistent Hashing
CS 223 Final Project CuckooRings: A Data Structure for Reducing Maximum Load in Consistent Hashing Jonah Kallenbach and Ankit Gupta May 2015 1 Introduction Cuckoo hashing and consistent hashing are different
More informationYCSB++ Benchmarking Tool Performance Debugging Advanced Features of Scalable Table Stores
YCSB++ Benchmarking Tool Performance Debugging Advanced Features of Scalable Table Stores Swapnil Patil Milo Polte, Wittawat Tantisiriroj, Kai Ren, Lin Xiao, Julio Lopez, Garth Gibson, Adam Fuchs *, Billie
More informationLow-Latency Multi-Datacenter Databases using Replicated Commits
Low-Latency Multi-Datacenter Databases using Replicated Commits Hatem A. Mahmoud, Alexander Pucher, Faisal Nawab, Divyakant Agrawal, Amr El Abbadi Universoty of California Santa Barbara, CA, USA {hatem,pucher,nawab,agrawal,amr}@cs.ucsb.edu
More informationDesign and Implement of Bigdata Analysis Systems
Design and Implement of Bigdata Analysis Systems Jeong-Joon Kim *Department of Computer Science & Engineering, Korea Polytechnic University, Gyeonggi-do Siheung-si 15073, Korea. Abstract The development
More informationSOLVING LOAD REBALANCING FOR DISTRIBUTED FILE SYSTEM IN CLOUD
1 SHAIK SHAHEENA, 2 SD. AFZAL AHMAD, 3 DR.PRAVEEN SHAM 1 PG SCHOLAR,CSE(CN), QUBA ENGINEERING COLLEGE & TECHNOLOGY, NELLORE 2 ASSOCIATE PROFESSOR, CSE, QUBA ENGINEERING COLLEGE & TECHNOLOGY, NELLORE 3
More informationEfficient Map Reduce Model with Hadoop Framework for Data Processing
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 4, April 2015,
More informationManaging Object Versioning in Geo-Distributed Object Storage Systems
Managing Object Versioning in Geo-Distributed Object Storage Systems João Neto KTH joaon@kth.se Vianney Rancurel Vinh Tao Scality Scality and UPMC-LIP6 vianney.rancurel@scality.com vinh.tao@lip6.fr ABSTRACT
More informationDynamo: Amazon s Highly- Available Key- Value Store
Dynamo: Amazon s Highly- Available Key- Value Store Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan KakulapaD, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK EFFICIENT DATA PROCESSING IN PEER NETWORK USING CLOUD COMPUTING SHILPA VASANTRAO
More informationCS 655 Advanced Topics in Distributed Systems
Presented by : Walid Budgaga CS 655 Advanced Topics in Distributed Systems Computer Science Department Colorado State University 1 Outline Problem Solution Approaches Comparison Conclusion 2 Problem 3
More informationIdempotent Distributed Counters Using a Forgetful Bloom Filter
Idempotent Distributed Counters Using a Forgetful Bloom Filter Rajath Subramanyam, Indranil Gupta, Luke M Leslie, Wenting Wang Department of Computer Science University of Illinois at Urbana-Champaign,
More information10. Replication. Motivation
10. Replication Page 1 10. Replication Motivation Reliable and high-performance computation on a single instance of a data object is prone to failure. Replicate data to overcome single points of failure
More informationIntroduction to store data in Redis, a persistent and fast key-value database
AMICT 2010-2011. pp. 39 49 39 Introduction to store data in Redis, a persistent and fast key-value database Matti Paksula Department of Computer Science, University of Helsinki P.O.Box 68, FI-00014 University
More informationBenchmarking Cloud-based Data Management Systems
Benchmarking Cloud-based Data Management Systems Yingjie Shi, Xiaofeng Meng, Jing Zhao, Xiangmei Hu, Bingbing Liu and Haiping Wang School of Information, Renmin University of China Beijing, China, 1872
More informationBigTable: A System for Distributed Structured Storage
BigTable: A System for Distributed Structured Storage Jeff Dean Joint work with: Mike Burrows, Tushar Chandra, Fay Chang, Mike Epstein, Andrew Fikes, Sanjay Ghemawat, Robert Griesemer, Bob Gruber, Wilson
More informationNoSQL Performance Test
bankmark UG (haftungsbeschränkt) Bahnhofstraße 1 9432 Passau Germany www.bankmark.de info@bankmark.de T +49 851 25 49 49 F +49 851 25 49 499 NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB,
More information