15-319 / 15-619 Cloud Computing. Recitation 8, October 18, 2016
- Allan Young
Transcription
1 15-319 / 15-619 Cloud Computing, Recitation 8, October 18, 2016
2 Overview
- Administrative issues
  - Office Hours, Piazza guidelines
- Last week's reflection
  - Project 3.2, OLI Unit 3, Module 13, Quiz 6
- This week's schedule
  - Quiz 7 - Thursday, October 20th
  - Unit 4, Module 14
  - Project, October 23
- Team Project: Phase 1
3 Last Week: A Reflection
- Content, Unit 3 - Module 13:
  - Storage and Network Virtualization
  - Quiz 6 completed
- P3.2: You explored consistency models
  - Sharding and Replication
  - Multithreaded programming
  - Implemented Strong consistency model
  - Bonus Task: Eventual Consistency
4 This Week: Content
- UNIT 4: Cloud Storage
  - Module 14: Cloud Storage
    - Quiz 7 - Introduction to Cloud Storage: Thursday, October 20, 2016
  - Module 15: Case Studies: Distributed File Systems
    - Quiz 8: Distributed File Systems Checkpoint
  - Module 16: Case Studies: NoSQL Databases
  - Module 17: Case Studies: Cloud Object Storage
    - Quiz 9: NoSQL and Object Stores
5 Project 3.2 Feedback
- Please leave us feedback
6 Project 3 Weekly Modules
- P3.1: Files, SQL and NoSQL
  - Primer: Storage Benchmarking
- P3.2: Replication and Consistency models
  - Primer: Intro. to Java Multithreading
  - Primer: Thread-safe programming
  - Primer: Intro. to Consistency Models
- P3.3: Social network with heterogeneous backend storage
7 Distributed Databases
- In 2004, Amazon.com began to experience the limits of scale on a traditional web-scale system
- The response was a highly available key-value structured storage system called Dynamo (2007)
- Problem: Technique used as solution
  - Data Sharding: Consistent Hashing
  - Transient Fault Handling: Sloppy Quorum / Hinted Handoff
  - Permanent Failure Recovery: Anti-entropy using Merkle trees
  - Membership and Health Checks: Gossip protocols
- Used in S3, DynamoDB, Cassandra
- Article on DynamoDB - By Werner Vogels
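To make the "Data Sharding: Consistent Hashing" row concrete, here is a minimal sketch of a hash ring with virtual nodes. It is a toy illustration, not Dynamo's implementation: the class name `HashRing`, the node names, the MD5-based position function and the `vnodes` count are all assumptions chosen for the example.

```python
import bisect
import hashlib


def _position(label):
    """Map a string to a point on the ring (MD5 is an arbitrary choice here)."""
    return int(hashlib.md5(label.encode("utf-8")).hexdigest(), 16)


class HashRing:
    """Toy consistent-hash ring; each node is placed at `vnodes` points."""

    def __init__(self, nodes=(), vnodes=64):
        self._vnodes = vnodes
        self._points = []  # sorted list of (ring position, node name)
        for node in nodes:
            self.add(node)

    def add(self, node):
        for i in range(self._vnodes):
            self._points.append((_position(f"{node}#{i}"), node))
        self._points.sort()

    def remove(self, node):
        self._points = [(p, n) for p, n in self._points if n != node]

    def owner(self, key):
        """Return the node at the first ring point clockwise from the key."""
        if not self._points:
            raise ValueError("ring is empty")
        idx = bisect.bisect_right(self._points, (_position(key), ""))
        return self._points[idx % len(self._points)][1]


if __name__ == "__main__":
    ring = HashRing(["node-a", "node-b", "node-c"])
    keys = [f"user:{i}" for i in range(10)]
    before = {k: ring.owner(k) for k in keys}
    ring.remove("node-b")
    moved = [k for k in keys if ring.owner(k) != before[k]]
    print("keys that moved after removing node-b:", moved)
```

The point of the design is that removing a node only reassigns the keys that node owned; every other key keeps its owner, which is what makes resharding cheap compared with `hash(key) % N`.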
8 Distributed Databases
- In 2006, Google published details about their implementation of BigTable
- Designed as a sparse, distributed multidimensional sorted map
- HBase stores members of column families adjacent to each other on the file system
- Columnar data store
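The phrase "sparse, multidimensional sorted map" can be pictured as a map from (row key, column key, timestamp) to a value, kept in sorted key order. The sketch below is a single-process toy, not how BigTable or HBase is implemented; the class `SparseSortedMap`, its method names and the sample rows are assumptions for illustration. Column keys use the `family:qualifier` form.

```python
import bisect


class SparseSortedMap:
    """Toy (row, column, timestamp) -> value map kept in sorted key order."""

    def __init__(self):
        self._keys = []    # sorted (row, column, -timestamp); newest version first
        self._values = {}  # same key -> stored value; absent cells cost nothing

    def put(self, row, column, timestamp, value):
        key = (row, column, -timestamp)
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def get(self, row, column):
        """Return the newest version of one cell, or None if it was never set."""
        i = bisect.bisect_left(self._keys, (row, column, float("-inf")))
        if i < len(self._keys) and self._keys[i][:2] == (row, column):
            return self._values[self._keys[i]]
        return None

    def scan_family(self, row, family):
        """Yield (column, newest value) for every column of one family in a row."""
        prefix = family + ":"
        i = bisect.bisect_left(self._keys, (row, prefix, float("-inf")))
        last_column = None
        while i < len(self._keys):
            r, column, _ = self._keys[i]
            if r != row or not column.startswith(prefix):
                break
            if column != last_column:  # first hit per column is the newest
                yield column, self._values[self._keys[i]]
                last_column = column
            i += 1


if __name__ == "__main__":
    table = SparseSortedMap()
    table.put("user1", "info:name", 1, "Ann")
    table.put("user1", "info:name", 2, "Anna")
    table.put("user1", "follows:user2", 1, "1")
    print(table.get("user1", "info:name"))
    print(list(table.scan_family("user1", "info")))
```

Because keys sort by row and then by column, all columns of one family in a row sit next to each other, which mirrors the adjacency property the slide attributes to HBase column families.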
9 Project 3.3 Review
10 Project 3.3: Introduction
- Build a social network about movies
11 High Fanout and Multiple Rounds of Data Fetching
- A single Facebook page requires many data fetch operations
- Nishtala, R., Fugal, H., Grimm, S., Kwiatkowski, M., Lee, H., Li, H. C., ... & Venkataramani, V. (2013, April). Scaling Memcache at Facebook. In NSDI (Vol. 13).
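A rough sketch of what "high fanout and multiple rounds" looks like in code: round one fetches the list of followees, round two fans out one fetch per followee and runs those fetches concurrently. The in-memory dictionaries and the `fetch_*` functions are hypothetical stand-ins for real backend calls; none of these names come from the project write-up.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for three separate backend stores.
FOLLOWEES = {"alice": ["bob", "carol"]}
PROFILES = {"bob": {"name": "Bob"}, "carol": {"name": "Carol"}}
POSTS = {"bob": ["post-1"], "carol": ["post-2", "post-3"]}


def fetch_followees(user):
    return FOLLOWEES.get(user, [])


def fetch_profile(user):
    return PROFILES.get(user, {})


def fetch_posts(user):
    return POSTS.get(user, [])


def build_timeline(user, max_workers=8):
    # Round 1: a single fetch that determines what round 2 must fetch.
    followees = fetch_followees(user)
    # Round 2: fan out; both maps are submitted before either result is read,
    # so the profile and post fetches overlap.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        profile_results = pool.map(fetch_profile, followees)
        post_results = pool.map(fetch_posts, followees)
        profiles = list(profile_results)
        posts = list(post_results)
    return [
        {"user": f, "profile": p, "posts": ps}
        for f, p, ps in zip(followees, profiles, posts)
    ]


if __name__ == "__main__":
    for entry in build_timeline("alice"):
        print(entry)
```

The second round cannot start until the first finishes, so page latency is roughly the sum of the slowest fetch in each round; that dependency is why the number of rounds matters as much as the fanout.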
12 P3.3 Data Set
1. User Profiles
  - User Authentication System (such as a Single-Sign-On or SSO) - RDS MySQL
  - User Info / Profile - RDS MySQL
  - Action Log
  - Social Graph of the User: follower, followee, family etc. - HBase
2. User Activity System
  - All user generated media - MongoDB
3. Big Data Analytics System
  - Search System
  - Recommender System
  - User Behaviour Analysis
13 Project 3.3: Architecture
- Build a social network about movies
- (Architecture diagram showing: Front-end Server, Back-end Server, HBase, S3, MongoDB, MySQL (RDS))
14 MongoDB
- Document Database
  - Schema-less model
- Scalable
  - Automatically shards data among multiple servers
  - Does load-balancing
- Complex Queries
  - MapReduce style filter and aggregations
  - Geo-spatial queries
15 Project 3.3: Tasks
- Build a social network about movies
16 Project 3.3: Task 5
- Friend recommendation
17 Twitter Analytics Team Project
18 Twitter Data Analytics: Project
19 Team Project
- System Architecture
- Web server architectures
- Dealing with large scale real world tweet data
- HBase and MySQL optimization
20 Team Project
- Phase 1: Q1, Q2 (MySQL AND HBase)
- CONFIRM YOUR AWS ACCOUNT AND TEAM INFO
- Phase 2: Q1, Q2 & Q3 (MySQL AND HBase)
- Phase 3: Q1, Q2, Q3 & Q4 (MySQL OR HBase)
21 Team Project Time Table
- Phase 1
  - Q1: Start Monday 10/10/ :00:01 EST; Deadline Sunday 10/23/ :59:59 ET
  - Q2: Deadline Sunday 10/30/ :59:59 ET
  - Code and Report Due: Tuesday 11/01/ :59:59 ET
- Phase 2 (Q1, Q2, Q3)
  - Start Monday 10/31/ :00:01 ET; Deadline Sunday 11/13/ :59:59 ET
  - Code and Report Due: Tuesday 11/15/ :59:59 ET
- Phase 2 Live Test (HBase/MySQL) (Q1, Q2, Q3)
  - Start Sunday 11/13/ :00:01 ET; Deadline Sunday 11/13/ :59:59 ET
- Phase 3 (Q1, Q2, Q3, Q4)
  - Start Monday 11/14/ :00:01 ET; Deadline Sunday 12/04/ :59:59 ET
  - Code and Report Due: Tuesday 12/06/ :59:59 ET
- Phase 3 Live Test (Q1, Q2, Q3, Q4)
  - Start Sunday 12/04/ :00:01 ET; Deadline Sunday 12/04/ :59:59 ET
- Note: There will be a report due at the end of each phase, where you are expected to discuss optimizations
- WARNING: Check your AWS instance limits on the new account (should be > 10 instances)
22 Team Project Phase 1
- Two queries
  - Q1: Pure front end
  - Q2: ETL + back end + front end, do both MySQL (relational DBMS) and HBase (NoSQL)
- Grading
  - Submit on TPZ, you will get several numbers: Error Rate, Correctness and RPS
  - Higher RPS, higher correctness and a lower error rate mean a higher grade
  - Q1 is 25% of phase 1, Q2 MySQL is 25% of phase 1, Q2 HBase is 25% of phase 1, report is 25% of phase 1
23 Team Project, Phase 1, Q1
- Step 1: Compare different front-end frameworks
- Step 2: Deploy the front-end
- Step 3: Perform decryption of a secret message
- Pure front end, no database needed
- Need to consider scaling horizontally
24 Team Project, Phase 1, Q2
- Step 1: Extract tabular data from raw tweets
  - Input file: JSON Tweets (approx. 1 TB)
  - Consider using a MapReduce Job for ETL
  - ETL is expensive and there's the potential for errors, so plan carefully and test on smaller data sets
  - Start early, or you will have no time to optimize the backend
- Step 2: Load the data into HBase and MySQL (both!)
- Step 3: Deploy a web service for handling HTTP requests, which responds with data from the backend
- An optimized backend (MySQL and HBase) = higher throughput = more points
- Winner gets grades, fame (?), job (?)
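As a sketch of Step 1, the function below turns one JSON tweet per input line into a tab-separated row and silently drops lines it cannot parse. The chosen columns (`id_str`, `user.id_str`, `created_at`, `text`) are assumptions about what a query might need, not the schema required by the write-up, and the escaping scheme is one possible choice. A function like `etl` could be used as the body of a streaming-style mapper, but that wiring is left out here.

```python
import json


def escape(text):
    """Keep each tweet on one TSV line by escaping tabs and line breaks."""
    return (
        text.replace("\\", "\\\\")
        .replace("\t", "\\t")
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def extract_row(line):
    """Return (tweet_id, user_id, created_at, text) or None for unusable input."""
    line = line.strip()
    if not line:
        return None
    try:
        tweet = json.loads(line)
        return (
            str(tweet["id_str"]),
            str(tweet["user"]["id_str"]),
            str(tweet["created_at"]),
            escape(tweet["text"]),
        )
    except (json.JSONDecodeError, KeyError, TypeError, AttributeError):
        # Malformed JSON, missing fields, or fields of an unexpected type.
        return None


def etl(lines):
    """Yield one TSV line per well-formed tweet."""
    for line in lines:
        row = extract_row(line)
        if row is not None:
            yield "\t".join(row)


if __name__ == "__main__":
    sample = json.dumps({
        "id_str": "1",
        "created_at": "Tue Oct 18 00:00:00 +0000 2016",
        "text": "cloud\tcomputing",
        "user": {"id_str": "42"},
    })
    for out in etl([sample, "{not json"]):
        print(out)
```

Running this on a few thousand lines locally before launching a cluster is a cheap way to find corner cases such as missing fields or embedded tabs.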
25 Common Q2 issues
- Unicode
  - اﻟﺣوﺳﺑﺔ اﻟﺳﺣﺎﺑﯾﺔ
  - ब दल क य ट ग
  - 云计算
  - クラウドコンピューティング
  - ಕ ಪ
  - וואָלקן קאַמפּיוטינ ג
  - облачных вычислений
- Emojis
- Remember to eliminate short URLs
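One reason Unicode and emoji cause trouble is that a character count and a UTF-8 byte count are different numbers, and emoji need four bytes each. The helper below, with the hypothetical name `utf8_profile`, reports both. It is relevant to MySQL because the legacy `utf8` character set stores at most three bytes per character, while `utf8mb4` accepts four-byte characters; whether this affects your schema depends on the data you load.

```python
def utf8_profile(text):
    """Report code-point count, UTF-8 byte count and the widest character."""
    widest = max((len(ch.encode("utf-8")) for ch in text), default=0)
    return {
        "code_points": len(text),
        "utf8_bytes": len(text.encode("utf-8")),
        "max_bytes_per_char": widest,
    }


if __name__ == "__main__":
    samples = {
        "ascii": "cloud",
        "chinese": "\u4e91\u8ba1\u7b97",   # three CJK characters
        "emoji": "\U0001F600",             # one emoji code point
    }
    for name, text in samples.items():
        print(name, utf8_profile(text))
```

A value of 4 for `max_bytes_per_char` is a signal that the text will not fit in a three-byte-per-character column.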
26 Hints
- Read the write-up carefully (read more than once)
- You can test only if you have a front end
- ETL has many corner cases, can be time consuming and expensive
- Start early (from the first day), your backend will be meaningless if you have incorrect data
- The reference server and the reference ETL file are your friends
- Big data challenge will easily eat up your time and money if you are careless
- Think, calculate, & test before you launch an EMR cluster with 20 machines
27 Reminder
- Changes in Team Project writeup
  - Updated banned word list
- You have a total budget of $50 for Phase 1
- Your system should not cost more than $0.95 per hour, this includes (see write-ups for details):
  - EC2 on demand instance cost: even if you use spot instances, we will calculate your cost using the on-demand instance price
  - EBS cost
  - ELB cost
- Target: Q rps (for both MySQL and HBase)
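The hourly limit is easy to check with a few lines of arithmetic before launching anything. In the sketch below only the $0.95 limit and the $50 budget come from the slide; every price and instance count is a made-up placeholder, and all line items are assumed to be already converted to a per-hour figure. Look up the real on-demand prices for your instance types.

```python
from decimal import Decimal

HOURLY_LIMIT = Decimal("0.95")  # from the slide
PHASE_BUDGET = Decimal("50")    # from the slide


def hourly_cost(line_items):
    """Sum price-per-hour times count over (name, price, count) tuples."""
    return sum(
        (Decimal(price) * count for _, price, count in line_items),
        Decimal("0"),
    )


def check(line_items):
    cost = hourly_cost(line_items)
    return {
        "hourly_cost": cost,
        "within_limit": cost <= HOURLY_LIMIT,
        # Upper bound only: assumes the whole budget is spent at this rate
        # and nothing is spent on ETL or testing.
        "hours_at_this_rate": PHASE_BUDGET / cost if cost else None,
    }


if __name__ == "__main__":
    # Hypothetical prices, not real AWS rates.
    plan = [
        ("front-end instances (on-demand price)", "0.10", 4),
        ("database instances (on-demand price)", "0.20", 2),
        ("load balancer", "0.025", 1),
    ]
    print(check(plan))
```

Using `Decimal` rather than floats keeps the comparison against the limit exact when a plan lands right on $0.95.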
28 Start early! Team Project Q1 Also Due Sunday
29 Upcoming Deadlines
- Quiz 7: Unit 4 - Module 14 - Cloud Storage
  - Due: Thursday, 10/20/ :59PM Pittsburgh
- Project 3.3: Social Networking Timeline with Heterogeneous Backends
  - Due: 10/23/ :59PM Pittsburgh
- Team Project: Phase 1 - Query 1 (This Sunday, Oct 23!)
  - Due: 10/23/ :59PM Pittsburgh
- Team Project: Phase 1 - Query 2
  - Due: 10/30/ :59PM Pittsburgh
30 Q&A