Motivation Overview of NoSQL space Comparing technologies used Getting hands dirty tutorial section
|
|
- Veronica Stone
- 5 years ago
- Views:
Transcription
1 NOSQL 1
2 GOAL Motivation Overview of NoSQL space Comparing technologies used Getting hands dirty tutorial section 2
3 MOTIVATION Assume we have a product that becomes popular 3. 1
4 TYPICAL WEBSERVER ARCHITECTURE 3. 2
5 TYPICAL WEBSERVER ARCHITECTURE 3. 3
6 TYPICAL WEBSERVER ARCHITECTURE 3. 4
7 MOTIVATION We are storing more data now than ever before Connections between our data grows We might not know the structure from the beginning 3. 5
8 MOTIVATION Increasing use in industry Big Data Technologies include some of these Disclaimer: There are > 200 different NoSQL databases available, noone is an expert in all of them! Source:
9 MOTIVATION
10 Source:
11 HISTORY OF NOSQL MOVEMENT 4. 1
12 NOSQL TERM NoSQL was used by Carlo Strozzi in 1998 to name his lightweight, Strozzi NoSQL open-source relational database Did not expose SQL interface Still relational Not NoSQL as we know today 4. 2
13 NOSQL TERM Johan Oskarsson, a developer at Last.fm, reintroduced the term NoSQL in early 2009 He organized an event to discuss "open source distributed, non relational databases". Needed a short Twitter Hashtag - used #nosql 4. 3
14 TYPES OF NOSQL DATABASES 4. 4
15 NOSQL TERM The name attempted to label the emergence of an increasing number of non-relational, distributed data stores. Most of the early NoSQL systems did not attempt to provide ACID guarantees, contrary to the prevailing practice among relational database systems. Strozzi suggests, because the current NoSQL movement "departs from the relational model altogether, it should therefore have been called more appropriately 'NoREL' referring to 'No Relational'. 4. 5
16 NOSQL TERM The term NoSQL has absolutely natural origin and has no universally accepted de nition or scienti c institution behind. This title is rather characterized by the vector of development of IT away from relational databases. 4. 6
17 NOSQL TERM De nition of NoSQL: No de nition - but characteristics non-relational Open source Cluster friendly Spawned by 21st Century Website culture Schema-less Most have these characteristics 4. 7
18 NOTABLE MILESTONES Graph database Neo4j is started in Memcached is started in 2003 to power Livejournal. Memcached isn t really a database since it s memory-only but there is soon a version with le storage called memcachedb. CouchDB is started in 2005 a document database. The project moves to the Apache Foundation in Google BigTable is started in 2004, research paper is released in Research paper on Amazon Dynamo is released in
19 NOTABLE MILESTONES Document database MongoDB is started in 2007 as a part of a open source cloud computing stack and rst standalone release in Facebooks open sources the Cassandra project in 2008 Redis is persistent key-value store started in 2009 Riak Another dynamo-inspired database started in
20 NOSQL AND CONSISTENCY 5. 1
21 RDBMS ACID Single unit of information split across tables Don t wan t to write half of the data Must have atomic updates. 5. 2
22 NOSQL BASE Graph db ACID. Before BASE - we should understand CAP 5. 3
23 CAP THEOREM Brewer s CAP theorem states that, in a Distributed Computer System, we can guarantee only two of the following simultaneously: Consistency (all nodes see the same data at the same time) Availability (a guarantee that every request receives a response about whether it was successful or failed) Partition tolerance (the system continues to operate despite arbitrary message loss or failure of part of the system) 5. 4
24 CAP THEOREM 5. 5
25 CAP THEOREM CA data should be consistent between all nodes. As long as all nodes are online, users can read/write from any node and be sure that the data is the same on all nodes. CP data is consistent between all nodes and maintains partition tolerance by becoming unavailable when a node goes down. AP nodes remain online even if they can t communicate with each other and will re-sync data once the partition is resolved, but you aren t guaranteed that all nodes will have the same data (either during or after the partition) 5. 6
26 CAP THEOREM In distributed systems: Partition Tolerance is mandatory, You cannot not choose it. You can trade between consistency and availability. Really: trading Consistency and response times 5. 7
27 CAP AND RDBMS As we have ACID in RDBMS, we have chosen Consistency and Availability Hard to distribute Hard to scale! 5. 8
28 BASE A BASE system gives up on consistency so as to have greater Availability and Partition tolerance. De ned as following: 5. 9
29 CONSISTENCY Consistency: Logical or Replication Sharding data - each piece only exist on one node of cluster. Replication: Clusters have replica of data
30 CONSISTENCE VS AVAILABILITY It is a choice, that should be taken knowing the business model. Can be handled by domain choice 5. 11
31 CONSISTENCE VS AVAILABILITY Hotel Booking example: Can we allow consistency issues, providing high availability? Does hotel have spare rooms for emergencies if booking systems allows inconsistencies? 5. 12
32 CONSISTENCE VS AVAILABILITY Amazon Webshop: As long as you can continue shopping and put things into your shopping basked, all is well. For Amazon/DynamoDB: Make sure that the shopping basked is never down. Can live with consistency issues - by apologizing
33 REAL WORLD TRANSACTIONS 5. 14
34 REAL WORLD TRANSACTIONS 5. 15
35 REAL WORLD TRANSACTIONS 5. 16
36 REAL WORLD TRANSACTIONS 5. 17
37 REAL WORLD TRANSACTIONS 5. 18
38 NEED FOR TRANSACTIONS Aggregate-oriented databases don t need transactions as much. Transaction boundaries should guide in the modelling. Keep transaction within single aggregate. An update for a single document is transactional
39 MODELLING Order has line-items. But we think of it as one order! 5. 20
40 MODELLING In RDBMS we have to split this all over the db. In NoSQL, we keep it as one item Database know the aggregate boundaries (aggregate is the order). Aggregate ids can easier be split over clusters 5. 21
41 MODELLING Can also have issues: If we need information by a product, not by the order easy in RDBMS, harder in NoSQL Advantage if you always use aggregate (the order) - disadvantage if you query in many different ways. For this Store redundant so lookups can be ef cient. But remember updates must happen multiple places
42 NO SCHEMA Preaches how easy it is to have no schema, but in reality, this is not correct. You often assume implicit schema (price, quantity)
43 TYPES OF NOSQL DATABASES First a short introduction to each of the 4 broad chunks of NoSQL Databases 6. 1
44 KEY-VALUE STORES The main idea here is using a hash table where there is a unique key and a pointer to a particular item of data. The key-value model is the simplest and easiest to implement. But it is inef cient when you are only interested in querying or updating part of a value, among other disadvantages. DB does not know what the value is (image, boolean, string etc.) Think of HashMap 6. 2
45 DOCUMENT STORES The model is basically versioned documents that are collections of other key-value collections. The semi-structured documents are stored in formats like JSON. Document databases are essentially the next level of key-value, allowing nested values associated with each key. Document databases support querying more ef ciently. 6. 3
46 COLUMN BASED These were created to store and process very large amounts of data distributed over many machines. There are still keys but they point to multiple columns. The columns are arranged by column family. 6. 4
47 GRAPH DB Instead of tables of rows and columns and the rigid structure of SQL, a exible graph model is used which, again, can scale across multiple machines. Nodes and arches. Great for querying relationships Graph oriented queries that would be hard to write in SQL. Breaks data into even smaller parts 6. 5
48 KEY-VALUE AND DOCUMENT Fuzzy between key-value and document: Some key-value databases allows storage of metadata, and document dbs often have keys, that you can lookup by. In general: You take some data (big complex structure) and save it 6. 6
49 KEY-VALUE STORES 7. 1
50 KEY-VALUE STORES The most exible type of NoSQL database. In a key-value store, there is no schema and the value of the data is opaque. Values are identi ed and accessed via a key, and stored values can be numbers, strings, counters, JSON, XML, HTML, binaries, images, short videos, and more. It is the most exible NoSQL model because the application has complete control over what is stored in the value. 7. 2
51 KEY-VALUE STORES 7. 3
52 EXAMPLE TECHNOLOGIES 7. 4
53 TYPICAL BENEFITS Flexible data modeling No structure on data High performance no need to perform lock, join, union, or other operations when working with objects Think hashing for lookup 7. 5
54 TYPICAL BENEFITS Massive scalability Easy to add new node to cluster High availability Replicate data to n-nodes Operational simplicity As data is on multiple nodes, any failure should be easy to recover from 7. 6
55 TYPICAL USECASES Key-value stores handle size well and are good at processing a constant stream of read/write operations with low latency making them perfect for: 7. 7
56 EXAMPLE CODE DynamoDB dynamodb = new DynamoDB(client) Table table = dynamodb.gettable(tablename); GetItemSpec spec = new GetItemSpec().withPrimaryKey('id', 42) Item outcome = table.getitem(spec) 7. 8
57 DOCUMENT STORES 8. 1
58 DOCUMENT STORE The central concept of a document-oriented database is the notion of a document. While each document-oriented database implementation differs on the details of this de nition, in general, they all assume documents encapsulate and encode data (or information) in some standard format or encoding. Encodings in use include XML, YAML, JSON, and BSON, as well as binary forms like PDF and Microsoft Of ce documents (MS Word, Excel, and so on). Documents in a document store are roughly equivalent to the programming concept of an object. 8. 2
59 EXAMPLE TECHNOLOGIES 8. 3
60 KEYS Documents are addressed in the database via a unique key that represents that document. This key is a simple identi er (or ID), typically a string, a URI, or a path. 8. 4
61 EXAMPLE DOCUMENT 1 { } "FirstName": "Bob", "Address": "5 Oak St.", "Hobby": "sailing" 8. 5
62 EXAMPLE DOCUMENT 2 <contact> <firstname>bob</firstname> <lastname>smith</lastname> <phone type="cell">(123) </phone> <phone type="work">(890) </phone> <address> <type>home</type> <street1>123 Back St.</street1> <city>boys</city> <state>ar</state> <zip>32225</zip> <country>us</country> </address> </contact> 8. 6
63 MONGODB EXAMPLE INSERTION Send JSON to database { } "_id": ObjectID(), "name": "Jacob", "profession": "Nerd" 8. 7
64 MONGODB QUERYING (JAVA) MongoDatabase db = client().getdatabase(databasename) db.getcollection(collectionname).find(new Document("age", 42)) 8. 8
65 MONGODB QUERYING (JSON) Querying in a set { } Name: { $in: [ "Lugia", "Charmander" ] } 8. 9
66 MONGODB QUERYING (JSON) { Type1: "Fire", Legendary: "True" } Querying with AND 8. 10
67 MONGODB QUERYING (JSON) Querying with OR { $or: [{Type1: "Fire"}, {Legendary: "True"}] } 8. 11
68 SEARCH ENGINES Some search engines (aka information retrieval) systems like Elasticsearch provide enough of the core operations on documents to t the de nition of a document-oriented database
69 DATAMODELLING BASICS 9. 1
70 DESIGN TIP You should be thinking by default on how you re going to be reading the data you re about to store in the database. Statistically speaking, the average user is going to spend more time reading data from the database than writing it, so you ll want to make sure reading the data is as painless and uses as little resources as possible. READ > WRITE 9. 2
71 RUNNING EXAMPLE Lets imagine we shall make an event management app 9. 3
72 FIRST ATTEMPT { } users: { "ebrattjorge": { "first_name": "Jorge", "last_name": "Vergara", " ": "j@javebratt.com" } }, events: { "natlpizzaday": { "eventname": "National Pizza Day Celebration", "eventdate": "03/21/2017", } } 9. 4
73 ISSUES How do we relate the event attendees with the event? Let s say that you need to list everyone who went to the "National Pizza Day Celebration" 9. 5
74 IDEA events: { "natlpizzaday": { "eventname": "National Pizza Day Celebration", "eventdate": "03/21/2017", "guestlist": { "ebrattjorge": true } } } 9. 6
75 ISSUES What if we need to displaymore information on the users? How do we avoid that extra query? 9. 7
76 DE-NORMALIZATION You all know the Relational Background about normalization (avoid duplicates, structure tables, etc). NoSQL has a different ow, and it s DE-Normalization, where it is actually OK (in fact, a good practice) to duplicate data. How much data do I duplicate? 9. 8
77 DE-NORMALIZATION You have an event view, where you re seeing the information about that event, inside the view, you have a guest list, that shows just that, a list of the guests who attended the event. You re probably not going to need much information about the guests at that point, so instead of doing an extra query, you could do something like this: 9. 9
78 DE-NORMALIZATION (1) events: { "natlpizzaday": { "eventname": "National Pizza Day Celebration", "eventdate": "03/21/2017", "guestlist": { "ebrattjorge": { "first_name": "Jorge" "last_name": "Vergara" } } } } 9. 10
79 DE-NORMALIZATION (2) users: { "ebrattjorge": { "first_name": "Jorge", "last_name": "Vergara", " ": "j@javebratt.com", "events": { "natlpizzaday": { "name": "National Pizza Day", } } } } 9. 11
80 KEEPING YOUR DATA FLAT Imagine only needing the event name and pulling instead the name + a list of 5,000 people that attended to the event (and National Pizza Day? More like 5,000 million people) From SQL, imagine this as a "Many to Many" relationship, where you create an extra table to store the relationship
81 EVENTS EXAMPLE For example, let s go back to our event/user relationship, what if instead of storing the guestlist inside the event node we create a separate table for it? 9. 13
82 EVENTS EXAMPLE events: { "natlpizzaday": { "eventname": "National Pizza Day Celebration", "eventdate": "03/21/2017" } }, eventguestlist: { "natlpizzaday": { "ebrattjorge": { "first_name": "Jorge" "last_name": "Vergara" } } } 9. 14
83 EVENT DETAIL PAGE Do 2 lookups 1. The event detail 2. The event guestlist 9. 15
84 FINAL STRUCTURE events: { "natlpizzaday": { "eventname": "National Pizza Day Celebration", "eventdate": "03/21/2017" } }, eventguestlist: { "natlpizzaday": { "ebrattjorge": { "first_name": "Jorge" "last_name": "Vergara" } } }, users: { "ebrattjorge": { "first_name": "Jorge", "last_name": "Vergara", " ": "j@javebratt.com", } } 9. 16
85 SMALL ISSUE (OR NOT SO SMALL) What happens if you get a name wrong? What if you need to update someone s name? If you go into users/ebrattjorge/ and change " rst_name": "Andres" it won t affect anything other than that node, meaning you ll end up with something like this: 9. 17
86 SMALL ISSUE (OR NOT SO SMALL) eventguestlist: { "natlpizzaday": { "ebrattjorge": { "first_name": "Jorge" } } }, users: { "ebrattjorge": { "first_name": "Andres", } } 9. 18
87 QUESTIONS 10
Overview. * Some History. * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL. * NoSQL Taxonomy. *TowardsNewSQL
* Some History * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL * NoSQL Taxonomy * Towards NewSQL Overview * Some History * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL * NoSQL Taxonomy *TowardsNewSQL NoSQL
More informationIntroduction Aggregate data model Distribution Models Consistency Map-Reduce Types of NoSQL Databases
Introduction Aggregate data model Distribution Models Consistency Map-Reduce Types of NoSQL Databases Key-Value Document Column Family Graph John Edgar 2 Relational databases are the prevalent solution
More informationNOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY
NOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY WHAT IS NOSQL? Stands for No-SQL or Not Only SQL. Class of non-relational data storage systems E.g.
More informationCIB Session 12th NoSQL Databases Structures
CIB Session 12th NoSQL Databases Structures By: Shahab Safaee & Morteza Zahedi Software Engineering PhD Email: safaee.shx@gmail.com, morteza.zahedi.a@gmail.com cibtrc.ir cibtrc cibtrc 2 Agenda What is
More informationIntroduction to NoSQL Databases
Introduction to NoSQL Databases Roman Kern KTI, TU Graz 2017-10-16 Roman Kern (KTI, TU Graz) Dbase2 2017-10-16 1 / 31 Introduction Intro Why NoSQL? Roman Kern (KTI, TU Graz) Dbase2 2017-10-16 2 / 31 Introduction
More informationCISC 7610 Lecture 2b The beginnings of NoSQL
CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone
More informationIntroduction to NoSQL
Introduction to NoSQL Agenda History What is NoSQL Types of NoSQL The CAP theorem History - RDBMS Relational DataBase Management Systems were invented in the 1970s. E. F. Codd, "Relational Model of Data
More informationDistributed Data Store
Distributed Data Store Large-Scale Distributed le system Q: What if we have too much data to store in a single machine? Q: How can we create one big filesystem over a cluster of machines, whose data is
More informationA Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores
A Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores Nikhil Dasharath Karande 1 Department of CSE, Sanjay Ghodawat Institutes, Atigre nikhilkarande18@gmail.com Abstract- This paper
More information5/2/16. Announcements. NoSQL Motivation. The New Hipster: NoSQL. Serverless. What is the Problem? Database Systems CSE 414
Announcements Database Systems CSE 414 Lecture 16: NoSQL and JSon Current assignments: Homework 4 due tonight Web Quiz 6 due next Wednesday [There is no Web Quiz 5 Today s lecture: JSon The book covers
More informationChapter 24 NOSQL Databases and Big Data Storage Systems
Chapter 24 NOSQL Databases and Big Data Storage Systems - Large amounts of data such as social media, Web links, user profiles, marketing and sales, posts and tweets, road maps, spatial data, email - NOSQL
More informationDatabase Systems CSE 414
Database Systems CSE 414 Lecture 16: NoSQL and JSon CSE 414 - Spring 2016 1 Announcements Current assignments: Homework 4 due tonight Web Quiz 6 due next Wednesday [There is no Web Quiz 5] Today s lecture:
More information10/18/2017. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414
Announcements Database Systems CSE 414 Lecture 11: NoSQL & JSON (mostly not in textbook only Ch 11.1) HW5 will be posted on Friday and due on Nov. 14, 11pm [No Web Quiz 5] Today s lecture: NoSQL & JSON
More informationCSE 344 JULY 9 TH NOSQL
CSE 344 JULY 9 TH NOSQL ADMINISTRATIVE MINUTIAE HW3 due Wednesday tests released actual_time should have 0s not NULLs upload new data file or use UPDATE to change 0 ~> NULL Extra OOs on Mondays 5-7pm in
More informationNon-Relational Databases. Pelle Jakovits
Non-Relational Databases Pelle Jakovits 25 October 2017 Outline Background Relational model Database scaling The NoSQL Movement CAP Theorem Non-relational data models Key-value Document-oriented Column
More informationIntroduction to Big Data. NoSQL Databases. Instituto Politécnico de Tomar. Ricardo Campos
Instituto Politécnico de Tomar Introduction to Big Data NoSQL Databases Ricardo Campos Mestrado EI-IC Análise e Processamento de Grandes Volumes de Dados Tomar, Portugal, 2016 Part of the slides used in
More informationRelational databases
COSC 6397 Big Data Analytics NoSQL databases Edgar Gabriel Spring 2017 Relational databases Long lasting industry standard to store data persistently Key points concurrency control, transactions, standard
More informationGoal of the presentation is to give an introduction of NoSQL databases, why they are there.
1 Goal of the presentation is to give an introduction of NoSQL databases, why they are there. We want to present "Why?" first to explain the need of something like "NoSQL" and then in "What?" we go in
More informationCSE 530A. Non-Relational Databases. Washington University Fall 2013
CSE 530A Non-Relational Databases Washington University Fall 2013 NoSQL "NoSQL" was originally the name of a specific RDBMS project that did not use a SQL interface Was co-opted years later to refer to
More informationNoSQL Databases. Amir H. Payberah. Swedish Institute of Computer Science. April 10, 2014
NoSQL Databases Amir H. Payberah Swedish Institute of Computer Science amir@sics.se April 10, 2014 Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 1 / 67 Database and Database Management System
More informationNoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre
NoSQL systems: introduction and data models Riccardo Torlone Università Roma Tre Leveraging the NoSQL boom 2 Why NoSQL? In the last fourty years relational databases have been the default choice for serious
More informationNoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu
NoSQL Databases MongoDB vs Cassandra Kenny Huynh, Andre Chik, Kevin Vu Introduction - Relational database model - Concept developed in 1970 - Inefficient - NoSQL - Concept introduced in 1980 - Related
More informationIntroduction to Computer Science. William Hsu Department of Computer Science and Engineering National Taiwan Ocean University
Introduction to Computer Science William Hsu Department of Computer Science and Engineering National Taiwan Ocean University Chapter 9: Database Systems supplementary - nosql You can have data without
More informationrelational Key-value Graph Object Document
NoSQL Databases Earlier We have spent most of our time with the relational DB model so far. There are other models: Key-value: a hash table Graph: stores graph-like structures efficiently Object: good
More informationDatabases : Lecture 1 2: Beyond ACID/Relational databases Timothy G. Griffin Lent Term Apologies to Martin Fowler ( NoSQL Distilled )
Databases : Lecture 1 2: Beyond ACID/Relational databases Timothy G. Griffin Lent Term 2016 Rise of Web and cluster-based computing NoSQL Movement Relationships vs. Aggregates Key-value store XML or JSON
More informationCISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL
CISC 7610 Lecture 5 Distributed multimedia databases Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL Motivation YouTube receives 400 hours of video per minute That is 200M hours
More informationDatabase Architectures
Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL
More informationBig Data Analytics. Rasoul Karimi
Big Data Analytics Rasoul Karimi Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany Big Data Analytics Big Data Analytics 1 / 1 Outline
More informationDistributed Non-Relational Databases. Pelle Jakovits
Distributed Non-Relational Databases Pelle Jakovits Tartu, 7 December 2018 Outline Relational model NoSQL Movement Non-relational data models Key-value Document-oriented Column family Graph Non-relational
More informationCassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent
Tanton Jeppson CS 401R Lab 3 Cassandra, MongoDB, and HBase Introduction For my report I have chosen to take a deeper look at 3 NoSQL database systems: Cassandra, MongoDB, and HBase. I have chosen these
More information5/1/17. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414
Announcements Database Systems CSE 414 Lecture 15: NoSQL & JSON (mostly not in textbook only Ch 11.1) 1 Homework 4 due tomorrow night [No Web Quiz 5] Midterm grading hopefully finished tonight post online
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL
CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2015 Lecture 14 NoSQL References Scalable SQL and NoSQL Data Stores, Rick Cattell, SIGMOD Record, December 2010 (Vol. 39, No.
More informationCompSci 516 Database Systems
CompSci 516 Database Systems Lecture 20 NoSQL and Column Store Instructor: Sudeepa Roy Duke CS, Fall 2018 CompSci 516: Database Systems 1 Reading Material NOSQL: Scalable SQL and NoSQL Data Stores Rick
More informationTransactions and ACID
Transactions and ACID Kevin Swingler Contents Recap of ACID transactions in RDBMSs Transactions and ACID in MongoDB 1 Concurrency Databases are almost always accessed by multiple users concurrently A user
More informationNoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems
CompSci 516 Data Intensive Computing Systems Lecture 21 (optional) NoSQL systems Instructor: Sudeepa Roy Duke CS, Spring 2016 CompSci 516: Data Intensive Computing Systems 1 Key- Value Stores Duke CS,
More informationA Study of NoSQL Database
A Study of NoSQL Database International Journal of Engineering Research & Technology (IJERT) Biswajeet Sethi 1, Samaresh Mishra 2, Prasant ku. Patnaik 3 1,2,3 School of Computer Engineering, KIIT University
More informationFinal Exam Logistics. CS 133: Databases. Goals for Today. Some References Used. Final exam take-home. Same resources as midterm
Final Exam Logistics CS 133: Databases Fall 2018 Lec 25 12/06 NoSQL Final exam take-home Available: Friday December 14 th, 4:00pm in Olin Due: Monday December 17 th, 5:15pm Same resources as midterm Except
More information1
1 2 3 6 7 8 9 10 Storage & IO Benchmarking Primer Running sysbench and preparing data Use the prepare option to generate the data. Experiments Run sysbench with different storage systems and instance
More informationWhy NoSQL? Why Riak?
Why NoSQL? Why Riak? Justin Sheehy justin@basho.com 1 What's all of this NoSQL nonsense? Riak Voldemort HBase MongoDB Neo4j Cassandra CouchDB Membase Redis (and the list goes on...) 2 What went wrong with
More informationNoSQL Databases Analysis
NoSQL Databases Analysis Jeffrey Young Intro I chose to investigate Redis, MongoDB, and Neo4j. I chose Redis because I always read about Redis use and its extreme popularity yet I know little about it.
More informationAn Introduction to Big Data Formats
Introduction to Big Data Formats 1 An Introduction to Big Data Formats Understanding Avro, Parquet, and ORC WHITE PAPER Introduction to Big Data Formats 2 TABLE OF TABLE OF CONTENTS CONTENTS INTRODUCTION
More informationINFO-H415 Adanvanced Databases Documents store and cloudant
INFO-H413 Heuristic Optimization Implemenation Exercise 1 Dany S Efila ULB 000349507 Universite Libre de Bruxelles Avril 2017 INFO-H415 Adanvanced Databases Documents store and cloudant Dany S EFILA Michel
More informationArchitekturen für die Cloud
Architekturen für die Cloud Eberhard Wolff Architecture & Technology Manager adesso AG 08.06.11 What is Cloud? National Institute for Standards and Technology (NIST) Definition On-demand self-service >
More informationL22: NoSQL. CS3200 Database design (sp18 s2) 4/5/2018 Several slides courtesy of Benny Kimelfeld
L22: NoSQL CS3200 Database design (sp18 s2) https://course.ccs.neu.edu/cs3200sp18s2/ 4/5/2018 Several slides courtesy of Benny Kimelfeld 2 Outline 3 Introduction Transaction Consistency 4 main data models
More informationPresented by Sunnie S Chung CIS 612
By Yasin N. Silva, Arizona State University Presented by Sunnie S Chung CIS 612 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. See http://creativecommons.org/licenses/by-nc-sa/4.0/
More information/ Cloud Computing. Recitation 6 October 2 nd, 2018
15-319 / 15-619 Cloud Computing Recitation 6 October 2 nd, 2018 1 Overview Announcements for administrative issues Last week s reflection OLI unit 3 module 7, 8 and 9 Quiz 4 Project 2.3 This week s schedule
More informationJargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems
Jargons, Concepts, Scope and Systems Key Value Stores, Document Stores, Extensible Record Stores Overview of different scalable relational systems Examples of different Data stores Predictions, Comparisons
More informationClass Overview. Two Classes of Database Applications. NoSQL Motivation. RDBMS Review: Client-Server. RDBMS Review: Serverless
Introduction to Database Systems CSE 414 Lecture 12: NoSQL 1 Class Overview Unit 1: Intro Unit 2: Relational Data Models and Query Languages Unit 3: Non-relational data NoSQL Json SQL++ Unit 4: RDMBS internals
More informationHands-on immersion on Big Data tools
Hands-on immersion on Big Data tools NoSQL Databases Donato Summa THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION Summary : Definition Main features NoSQL DBs classification
More informationDatabase Solution in Cloud Computing
Database Solution in Cloud Computing CERC liji@cnic.cn Outline Cloud Computing Database Solution Our Experiences in Database Cloud Computing SaaS Software as a Service PaaS Platform as a Service IaaS Infrastructure
More informationDatabases : Lectures 11 and 12: Beyond ACID/Relational databases Timothy G. Griffin Lent Term 2013
Databases : Lectures 11 and 12: Beyond ACID/Relational databases Timothy G. Griffin Lent Term 2013 Rise of Web and cluster-based computing NoSQL Movement Relationships vs. Aggregates Key-value store XML
More informationCS 655 Advanced Topics in Distributed Systems
Presented by : Walid Budgaga CS 655 Advanced Topics in Distributed Systems Computer Science Department Colorado State University 1 Outline Problem Solution Approaches Comparison Conclusion 2 Problem 3
More informationMongoDB Schema Design
MongoDB Schema Design Demystifying document structures in MongoDB Jon Tobin @jontobs MongoDB Overview NoSQL Document Oriented DB Dynamic Schema HA/Sharding Built In Simple async replication setup Automated
More informationAnnouncements. Two Classes of Database Applications. Class Overview. NoSQL Motivation. RDBMS Review: Serverless
Introduction to Database Systems CSE 414 Lecture 11: NoSQL 1 HW 3 due Friday Announcements Upload data with DataGrip editor see message board Azure timeout for question 5: Try DataGrip or SQLite HW 2 Grades
More informationDistributed Databases: SQL vs NoSQL
Distributed Databases: SQL vs NoSQL Seda Unal, Yuchen Zheng April 23, 2017 1 Introduction Distributed databases have become increasingly popular in the era of big data because of their advantages over
More informationLecture Notes to Big Data Management and Analytics Winter Term 2017/2018 NoSQL Databases
Lecture Notes to Big Data Management and Analytics Winter Term 2017/2018 NoSQL Databases Matthias Schubert, Matthias Renz, Felix Borutta, Evgeniy Faerman, Christian Frey, Klaus Arthur Schmid, Daniyal Kazempour,
More informationCMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS
Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB s C. Faloutsos A. Pavlo Lecture#23: Distributed Database Systems (R&G ch. 22) Administrivia Final Exam Who: You What: R&G Chapters 15-22
More informationDEMYSTIFYING BIG DATA WITH RIAK USE CASES. Martin Schneider Basho Technologies!
DEMYSTIFYING BIG DATA WITH RIAK USE CASES Martin Schneider Basho Technologies! Agenda Defining Big Data in Regards to Riak A Series of Trade-Offs Use Cases Q & A About Basho & Riak Basho Technologies is
More informationA NoSQL Introduction for Relational Database Developers. Andrew Karcher Las Vegas SQL Saturday September 12th, 2015
A NoSQL Introduction for Relational Database Developers Andrew Karcher Las Vegas SQL Saturday September 12th, 2015 About Me http://www.andrewkarcher.com Twitter: @akarcher LinkedIn, Twitter Email: akarcher@gmail.com
More informationCS-580K/480K Advanced Topics in Cloud Computing. NoSQL Database
CS-580K/480K dvanced Topics in Cloud Computing NoSQL Database 1 1 Where are we? Cloud latforms 2 VM1 VM2 VM3 3 Operating System 4 1 2 3 Operating System 4 1 2 Virtualization Layer 3 Operating System 4
More informationStudy of NoSQL Database Along With Security Comparison
Study of NoSQL Database Along With Security Comparison Ankita A. Mall [1], Jwalant B. Baria [2] [1] Student, Computer Engineering Department, Government Engineering College, Modasa, Gujarat, India ank.fetr@gmail.com
More informationStoring data in databases
Storing data in databases The webinar will begin at 3pm You now have a menu in the top right corner of your screen. The red button with a white arrow allows you to expand and contract the webinar menu,
More informationAdvanced Data Management Technologies
ADMT 2017/18 Unit 15 J. Gamper 1/44 Advanced Data Management Technologies Unit 15 Introduction to NoSQL J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE ADMT 2017/18 Unit 15
More informationCOSC 416 NoSQL Databases. NoSQL Databases Overview. Dr. Ramon Lawrence University of British Columbia Okanagan
COSC 416 NoSQL Databases NoSQL Databases Overview Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca Databases Brought Back to Life!!! Image copyright: www.dragoart.com Image
More informationHigh Performance NoSQL with MongoDB
High Performance NoSQL with MongoDB History of NoSQL June 11th, 2009, San Francisco, USA Johan Oskarsson (from http://last.fm/) organized a meetup to discuss advances in data storage which were all using
More informationAdvanced Database Technologies NoSQL: Not only SQL
Advanced Database Technologies NoSQL: Not only SQL Christian Grün Database & Information Systems Group NoSQL Introduction 30, 40 years history of well-established database technology all in vain? Not at
More informationDatabase Evolution. DB NoSQL Linked Open Data. L. Vigliano
Database Evolution DB NoSQL Linked Open Data Requirements and features Large volumes of data..increasing No regular data structure to manage Relatively homogeneous elements among them (no correlation between
More informationTopics. History. Architecture. MongoDB, Mongoose - RDBMS - SQL. - NoSQL
Databases Topics History - RDBMS - SQL Architecture - SQL - NoSQL MongoDB, Mongoose Persistent Data Storage What features do we want in a persistent data storage system? We have been using text files to
More informationWhat is database? Types and Examples
What is database? Types and Examples Visit our site for more information: www.examplanning.com Facebook Page: https://www.facebook.com/examplanning10/ Twitter: https://twitter.com/examplanning10 TABLE
More informationGetting to know. by Michelle Darling August 2013
Getting to know by Michelle Darling mdarlingcmt@gmail.com August 2013 Agenda: What is Cassandra? Installation, CQL3 Data Modelling Summary Only 15 min to cover these, so please hold questions til the end,
More informationUnderstanding NoSQL Database Implementations
Understanding NoSQL Database Implementations Sadalage and Fowler, Chapters 7 11 Class 07: Understanding NoSQL Database Implementations 1 Foreword NoSQL is a broad and diverse collection of technologies.
More informationAccessing other data fdw, dblink, pglogical, plproxy,...
Accessing other data fdw, dblink, pglogical, plproxy,... Hannu Krosing, Quito 2017.12.01 1 Arctic Circle 2 Who am I Coming from Estonia PostgreSQL user since about 1990 (when it was just Postgres 4.2)
More informationMigrating Oracle Databases To Cassandra
BY UMAIR MANSOOB Why Cassandra Lower Cost of ownership makes it #1 choice for Big Data OLTP Applications. Unlike Oracle, Cassandra can store structured, semi-structured, and unstructured data. Cassandra
More informationPerspectives on NoSQL
Perspectives on NoSQL PGCon 2010 Gavin M. Roy What is NoSQL? NoSQL is a movement promoting a loosely defined class of nonrelational data stores that break with a long history of relational
More informationA Global In-memory Data System for MySQL Daniel Austin, PayPal Technical Staff
A Global In-memory Data System for MySQL Daniel Austin, PayPal Technical Staff Percona Live! MySQL Conference Santa Clara, April 12th, 2012 v1.3 Intro: Globalizing NDB Proposed Architecture What We Learned
More informationScaling Without Sharding. Baron Schwartz Percona Inc Surge 2010
Scaling Without Sharding Baron Schwartz Percona Inc Surge 2010 Web Scale!!!! http://www.xtranormal.com/watch/6995033/ A Sharding Thought Experiment 64 shards per proxy [1] 1 TB of data storage per node
More informationSQL, NoSQL, MongoDB. CSE-291 (Cloud Computing) Fall 2016 Gregory Kesden
SQL, NoSQL, MongoDB CSE-291 (Cloud Computing) Fall 2016 Gregory Kesden SQL Databases Really better called Relational Databases Key construct is the Relation, a.k.a. the table Rows represent records Columns
More informationNOSQL Databases: The Need of Enterprises
International Journal of Allied Practice, Research and Review Website: www.ijaprr.com (ISSN 2350-1294) NOSQL Databases: The Need of Enterprises Basit Maqbool Mattu M-Tech CSE Student. (4 th semester).
More informationAvancier Methods (AM) From logical model to physical database
Methods (AM) From logical model to physical database Data structures It is illegal to copy, share or show this document (or other document published at http://avancier.co.uk) without the written permission
More informationNoSQL Databases. Concept, Types & Use-cases.
NoSQL Databases Concept, Types & Use-cases 1of93 Hello World Alon Spiegel alon@brillix.co.il Mamram grad. Programmer since 1995 DBA since 1997 Co founder and CEO since 2007 Brillix VP products since 2014
More informationExtreme Computing. NoSQL.
Extreme Computing NoSQL PREVIOUSLY: BATCH Query most/all data Results Eventually NOW: ON DEMAND Single Data Points Latency Matters One problem, three ideas We want to keep track of mutable state in a scalable
More informationFinal Exam Review 2. Kathleen Durant CS 3200 Northeastern University Lecture 23
Final Exam Review 2 Kathleen Durant CS 3200 Northeastern University Lecture 23 QUERY EVALUATION PLAN Representation of a SQL Command SELECT {DISTINCT} FROM {WHERE
More informationScaling for Humongous amounts of data with MongoDB
Scaling for Humongous amounts of data with MongoDB Alvin Richards Technical Director, EMEA alvin@10gen.com @jonnyeight alvinonmongodb.com From here... http://bit.ly/ot71m4 ...to here... http://bit.ly/oxcsis
More informationRule 14 Use Databases Appropriately
Rule 14 Use Databases Appropriately Rule 14: What, When, How, and Why What: Use relational databases when you need ACID properties to maintain relationships between your data. For other data storage needs
More informationSources. P. J. Sadalage, M Fowler, NoSQL Distilled, Addison Wesley
Big Data and NoSQL Sources P. J. Sadalage, M Fowler, NoSQL Distilled, Addison Wesley Very short history of DBMSs The seventies: IMS end of the sixties, built for the Apollo program (today: Version 15)
More informationCSE 344 Final Review. August 16 th
CSE 344 Final Review August 16 th Final In class on Friday One sheet of notes, front and back cost formulas also provided Practice exam on web site Good luck! Primary Topics Parallel DBs parallel join
More informationColumn-Family Databases Cassandra and HBase
Column-Family Databases Cassandra and HBase Kevin Swingler Google Big Table Google invented BigTableto store the massive amounts of semi-structured data it was generating Basic model stores items indexed
More informationIntro To Big Data. John Urbanic Parallel Computing Scientist Pittsburgh Supercomputing Center. Copyright 2017
Intro To Big Data John Urbanic Parallel Computing Scientist Pittsburgh Supercomputing Center Copyright 2017 Big data is a broad term for data sets so large or complex that traditional data processing applications
More information{SDD} Applied NoSQL in.net. Software Design & Development. Michael
{SDD} 2015 Software Design & Development Applied NoSQL in.net Michael Kennedy @mkennedy http://blog.michaelckennedy.net Objectives Describe the changes in the world of data management Install MongoDB and
More informationNoSQL data stores and SOS: Uniform Access to Non-Relational Database Systems Paolo Atzeni Francesca Bugiotti Luca Rossi
NoSQL data stores and SOS: Uniform Access to Non-Relational Database Systems Paolo Atzeni Francesca Bugiotti Luca Rossi Outline Context Rela&onal DBMS NoSQL Data Stores NoSQL Timeline NoSQL Data Stores
More informationChallenges for Data Driven Systems
Challenges for Data Driven Systems Eiko Yoneki University of Cambridge Computer Laboratory Data Centric Systems and Networking Emergence of Big Data Shift of Communication Paradigm From end-to-end to data
More informationCPS352 Lecture - Other Databsse Models
Objectives: CPS352 Lecture - Other Databsse Models May 3, 2017 1. To elucidate reasons for a desire to move to models other than the relational model for some applications 2. To introduce OO extensions
More information10. Replication. Motivation
10. Replication Page 1 10. Replication Motivation Reliable and high-performance computation on a single instance of a data object is prone to failure. Replicate data to overcome single points of failure
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 6 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411 1 We
More informationApplication development with relational and non-relational databases
Application development with relational and non-relational databases Mario Lassnig European Organization for Nuclear Research (CERN) mario.lassnig@cern.ch About me Software Engineer Data Management for
More informationMongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM
MongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM About us Adamo Tonete MongoDB Support Engineer Agustín Gallego MySQL Support Engineer Agenda What are MongoDB and MySQL; NoSQL
More informationCassandra Design Patterns
Cassandra Design Patterns Sanjay Sharma Chapter No. 1 "An Overview of Architecture and Data Modeling in Cassandra" In this package, you will find: A Biography of the author of the book A preview chapter
More informationBig Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara
Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case
More informationSpotify. Scaling storage to million of users world wide. Jimmy Mårdell October 14, 2014
Cassandra @ Spotify Scaling storage to million of users world wide! Jimmy Mårdell October 14, 2014 2 About me Jimmy Mårdell Tech Product Owner in the Cassandra team 4 years at Spotify
More informationNoSQL Databases. CPS352: Database Systems. Simon Miner Gordon College Last Revised: 4/22/15
NoSQL Databases CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/22/15 Agenda Check-in NoSQL Databases Aggregate databases Key-value, document, and column family Graph databases Related
More information