NoSQL Databases. Amir H. Payberah. Swedish Institute of Computer Science. April 10, 2014

Similar documents
BigTable: A Distributed Storage System for Structured Data

Comparing SQL and NOSQL databases

COSC 6339 Big Data Analytics. NoSQL (II) HBase. Edgar Gabriel Fall HBase. Column-Oriented data store Distributed designed to serve large tables

NOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

Introduction to NoSQL Databases

Overview. * Some History. * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL. * NoSQL Taxonomy. *TowardsNewSQL

Introduction to Computer Science. William Hsu Department of Computer Science and Engineering National Taiwan Ocean University

Big Data Infrastructure CS 489/698 Big Data Infrastructure (Winter 2017)

Chapter 24 NOSQL Databases and Big Data Storage Systems

Cassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent

CA485 Ray Walshe NoSQL

Presented by Sunnie S Chung CIS 612

NoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems

Big Data Infrastructure CS 489/698 Big Data Infrastructure (Winter 2016)

Introduction Aggregate data model Distribution Models Consistency Map-Reduce Types of NoSQL Databases

Introduction to NoSQL

10 Million Smart Meter Data with Apache HBase

Distributed Data Store

CIB Session 12th NoSQL Databases Structures

ADVANCED HBASE. Architecture and Schema Design GeeCON, May Lars George Director EMEA Services

Extreme Computing. NoSQL.

Jargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems

Goal of the presentation is to give an introduction of NoSQL databases, why they are there.

CSE 530A. Non-Relational Databases. Washington University Fall 2013

Ghislain Fourny. Big Data 5. Column stores

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL

CompSci 516 Database Systems

Distributed Databases: SQL vs NoSQL

CSE-E5430 Scalable Cloud Computing Lecture 9

Typical size of data you deal with on a daily basis

Advances in Data Management - NoSQL, NewSQL and Big Data A.Poulovassilis

Database Evolution. DB NoSQL Linked Open Data. L. Vigliano

CISC 7610 Lecture 2b The beginnings of NoSQL

A BigData Tour HDFS, Ceph and MapReduce

Sources. P. J. Sadalage, M Fowler, NoSQL Distilled, Addison Wesley

CS November 2018

CS November 2017

CISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL

Advanced Database Technologies NoSQL: Not only SQL

Ghislain Fourny. Big Data 5. Wide column stores

Introduction to Big Data. NoSQL Databases. Instituto Politécnico de Tomar. Ricardo Campos

CS 655 Advanced Topics in Distributed Systems

Scaling Up HBase. Duen Horng (Polo) Chau Assistant Professor Associate Director, MS Analytics Georgia Tech. CSE6242 / CX4242: Data & Visual Analytics

Why NoSQL? Why Riak?

COSC 416 NoSQL Databases. NoSQL Databases Overview. Dr. Ramon Lawrence University of British Columbia Okanagan

CSE 444: Database Internals. Lectures 26 NoSQL: Extensible Record Stores

Data Informatics. Seon Ho Kim, Ph.D.

References. What is Bigtable? Bigtable Data Model. Outline. Key Features. CSE 444: Database Internals

Advanced Data Management Technologies

Challenges for Data Driven Systems

MongoDB An Overview. 21-Oct Socrates

Big Data Analytics. Rasoul Karimi

The NoSQL Ecosystem. Adam Marcus MIT CSAIL

HBase. Леонид Налчаджи

CS-580K/480K Advanced Topics in Cloud Computing. NoSQL Database

Modern Database Concepts

Distributed Data Analytics Partitioning

10. Replication. Motivation

BIG DATA TECHNOLOGIES: WHAT EVERY MANAGER NEEDS TO KNOW ANALYTICS AND FINANCIAL INNOVATION CONFERENCE JUNE 26-29,

Non-Relational Databases. Pelle Jakovits

HBase Installation and Configuration

HBASE INTERVIEW QUESTIONS

Motivation Overview of NoSQL space Comparing technologies used Getting hands dirty tutorial section

Shen PingCAP 2017

Final Exam Review 2. Kathleen Durant CS 3200 Northeastern University Lecture 23

Fattane Zarrinkalam کارگاه ساالنه آزمایشگاه فناوری وب

SQL, NoSQL, MongoDB. CSE-291 (Cloud Computing) Fall 2016 Gregory Kesden

Advances in Data Management - NoSQL, NewSQL and Big Data A.Poulovassilis

Data Informatics. Seon Ho Kim, Ph.D.

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

Relational databases

Data Management for Big Data Part 1

SCALABLE DATABASES. Sergio Bossa. From Relational Databases To Polyglot Persistence.

Using space-filling curves for multidimensional

SCALABLE CONSISTENCY AND TRANSACTION MODELS

HBase... And Lewis Carroll! Twi:er,

Exploring Cassandra and HBase with BigTable Model

Database Solution in Cloud Computing

Facebook. The Technology Behind Messages (and more ) Kannan Muthukkaruppan Software Engineer, Facebook. March 11, 2011

Study of NoSQL Database Along With Security Comparison

HBase Installation and Configuration

Integrity in Distributed Databases

DATABASE DESIGN II - 1DL400

NoSQL Concepts, Techniques & Systems Part 1. Valentina Ivanova IDA, Linköping University

CIT 668: System Architecture. Distributed Databases

Google big data techniques (2)

Perspectives on NoSQL

Introduction to BigData, Hadoop:-

Database Availability and Integrity in NoSQL. Fahri Firdausillah [M ]

Topics. History. Architecture. MongoDB, Mongoose - RDBMS - SQL. - NoSQL

Database Architectures

Bigtable. A Distributed Storage System for Structured Data. Presenter: Yunming Zhang Conglong Li. Saturday, September 21, 13

Distributed File Systems II

NoSQL systems: sharding, replication and consistency. Riccardo Torlone Università Roma Tre

Haridimos Kondylakis Computer Science Department, University of Crete

HyperDex. A Distributed, Searchable Key-Value Store. Robert Escriva. Department of Computer Science Cornell University

Bigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI Presented by Xiang Gao

Databases : Lecture 1 2: Beyond ACID/Relational databases Timothy G. Griffin Lent Term Apologies to Martin Fowler ( NoSQL Distilled )

Large-Scale Key-Value Stores Eventual Consistency Marco Serafini

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

Transcription:

NoSQL Databases Amir H. Payberah Swedish Institute of Computer Science amir@sics.se April 10, 2014 Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 1 / 67

Database and Database Management System Database: an organized collection of data. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 2 / 67

Database and Database Management System Database: an organized collection of data. Database Management System (DBMS): a software that interacts with users, other applications, and the database itself to capture and analyze data. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 2 / 67

History of Databases 1960s Navigational data model: hierarchical model (IMS) and network model (CODASYL). Disk-aware Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 3 / 67

History of Databases 1960s Navigational data model: hierarchical model (IMS) and network model (CODASYL). Disk-aware 1970s Relational data model: Edgar F. Codd paper. Logical data is disconnected from physical information storage. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 3 / 67

History of Databases 1960s Navigational data model: hierarchical model (IMS) and network model (CODASYL). Disk-aware 1970s Relational data model: Edgar F. Codd paper. Logical data is disconnected from physical information storage. 1980s Object databases: information is represented in the form of objects. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 3 / 67

History of Databases 1960s Navigational data model: hierarchical model (IMS) and network model (CODASYL). Disk-aware 1970s Relational data model: Edgar F. Codd paper. Logical data is disconnected from physical information storage. 1980s Object databases: information is represented in the form of objects. 2000s NoSQL databases: BASE instead of ACID. NewSQL databases: scalable performance of NoSQL + ACID. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 3 / 67

Relational Databases Management Systems (RDMBSs) RDMBSs: the dominant technology for storing structured data in web and business applications. SQL is good Rich language Easy to use and integrate Rich toolset Many vendors They promise: ACID Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 4 / 67

ACID Properties Atomicity All included statements in a transaction are either executed or the whole transaction is aborted without affecting the database. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 5 / 67

ACID Properties Atomicity All included statements in a transaction are either executed or the whole transaction is aborted without affecting the database. Consistency A database is in a consistent state before and after a transaction. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 5 / 67

ACID Properties Atomicity All included statements in a transaction are either executed or the whole transaction is aborted without affecting the database. Consistency A database is in a consistent state before and after a transaction. Isolation Transactions can not see uncommitted changes in the database. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 5 / 67

ACID Properties Atomicity All included statements in a transaction are either executed or the whole transaction is aborted without affecting the database. Consistency A database is in a consistent state before and after a transaction. Isolation Transactions can not see uncommitted changes in the database. Durability Changes are written to a disk before a database commits a transaction so that committed data cannot be lost through a power failure. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 5 / 67

RDBMS is Good... Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 6 / 67

RDBMS Challenges Web-based applications caused spikes. Internet-scale data size High read-write rates Frequent schema changes Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 7 / 67

Let s Scale RDBMSs RDBMS were not designed to be distributed. Possible solutions: Replication Sharding Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 8 / 67

Let s Scale RDBMSs - Replication Master/Slave architecture Scales read operations Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 9 / 67

Let s Scale RDBMSs - Sharding Dividing the database across many machines. It scales read and write operations. Cannot execute transactions across shards (partitions). Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 10 / 67

Scaling RDBMSs is Expensive and Inefficient [http://www.couchbase.com/sites/default/files/uploads/all/whitepapers/nosqlwhitepaper.pdf] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 11 / 67

Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 12 / 67

Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 13 / 67

NoSQL Avoidance of unneeded complexity High throughput Horizontal scalability and running on commodity hardware Compromising reliability for better performance Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 14 / 67

The NoSQL Movement It was first used in 1998 by Carlo Strozzi to name his relational database that did not expose the standard SQL interface. The term was picked up again in 2009 when a Last.fm developer, Johan Oskarsson, wanted to organize an event to discuss open source distributed databases. The name attempted to label the emergence of a growing number of non-relational, distributed data stores that often did not attempt to provide ACID. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 15 / 67

NoSQL Cost and Performance [http://www.couchbase.com/sites/default/files/uploads/all/whitepapers/nosqlwhitepaper.pdf] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 16 / 67

RDBMS vs. NoSQL [http://www.couchbase.com/sites/default/files/uploads/all/whitepapers/nosqlwhitepaper.pdf] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 17 / 67

Basic Concepts Data model Partitioning Consistency Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 18 / 67

Basic Concepts Data model Partitioning Consistency Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 18 / 67

NoSQL Data Models [http://highlyscalable.wordpress.com/2012/03/01/nosql-data-modeling-techniques] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 19 / 67

Key-Value Data Model Collection of key/value pairs. Ordered Key-Value: processing over key ranges. Dynamo, Scalaris, Voldemort, Riak,... Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 20 / 67

Column-Oriented Data Model Similar to a key/value store, but the value can have multiple attributes (Columns). Column: a set of data values of a particular type. Store and process data by column instead of row. BigTable, Hbase, Cassandra,... Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 21 / 67

Document Data Model Similar to a column-oriented store, but values can have complex documents, instead of fixed format. Flexible schema. XML, YAML, JSON, and BSON. CouchDB, MongoDB,... { } FirstName: "Bob", Address: "5 Oak St.", Hobby: "sailing" { FirstName: "Jonathan", Address: "15 Wanamassa Point Road", Children: [ {Name: "Michael", Age: 10}, {Name: "Jennifer", Age: 8}, ] } Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 22 / 67

Graph Data Model Uses graph structures with nodes, edges, and properties to represent and store data. Neo4J, InfoGrid,... [http://en.wikipedia.org/wiki/graph database] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 23 / 67

Basic Concepts Data model Partitioning Consistency Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 24 / 67

Partitioning If size of data exceeds the capacity of a single machine: partitioning Sharding data (horizontal partitioning). Consistent hashing is one form of automatic sharding. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 25 / 67

Consistent Hashing Hash both data and nodes using the same hash function in a same id space. partition = hash(d) mod n, d: data, n: number of nodes Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 26 / 67

Consistent Hashing Hash both data and nodes using the same hash function in a same id space. partition = hash(d) mod n, d: data, n: number of nodes hash("fatemeh") = 12 hash("ahmad") = 2 hash("seif") = 9 hash("jim") = 14 hash("sverker") = 4 Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 26 / 67

Load Imbalance (1/4) Consistent hashing may lead to imbalance. Node identifiers may not be balanced. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 27 / 67

Load Imbalance (2/4) Consistent hashing may lead to imbalance. Data identifiers may not be balanced. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 28 / 67

Load Imbalance (3/4) Consistent hashing may lead to imbalance. Hot spots. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 29 / 67

Load Imbalance (4/4) Consistent hashing may lead to imbalance. Heterogeneous nodes. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 30 / 67

Load Balancing via Virtual Nodes Each physical node picks multiple random identifiers. Each identifier represents a virtual node. Each node runs multiple virtual nodes. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 31 / 67

Replication To achieve high availability and durability, data should be replicates on multiple nodes. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 32 / 67

Basic Concepts Data model Partitioning Consistency Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 33 / 67

Consistency Strong consistency After an update completes, any subsequent access will return the updated value. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 34 / 67

Consistency Strong consistency After an update completes, any subsequent access will return the updated value. Eventual consistency Does not guarantee that subsequent accesses will return the updated value. Inconsistency window. If no new updates are made to the object, eventually all accesses will return the last updated value. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 34 / 67

Quorum Model N: the number of nodes to which a data item is replicated. R: the number of nodes a value has to be read from to be accepted. W: the number of nodes a new value has to be written to before the write operation is finished. To enforce strong consistency: R + W > N Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 35 / 67

Quorum Model N: the number of nodes to which a data item is replicated. R: the number of nodes a value has to be read from to be accepted. W: the number of nodes a new value has to be written to before the write operation is finished. To enforce strong consistency: R + W > N Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 35 / 67

Consistency vs. Availability The large-scale applications have to be reliable: availability + redundancy These properties are difficult to achieve with ACID properties. The BASE approach forfeits the ACID properties of consistency and isolation in favor of availability, graceful degradation, and performance. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 36 / 67

BASE Properties Basic Availability Possibilities of faults but not a fault of the whole system. Soft-state Copies of a data item may be inconsistent Eventually consistent Copies becomes consistent at some later time if there are no more updates to that data item Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 37 / 67

CAP Theorem Consistency Consistent state of data after the execution of an operation. Availability Clients can always read and write data. Partition Tolerance Continue the operation in the presence of network partitions. You can choose only two! Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 38 / 67

Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 39 / 67

Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 40 / 67

HBase Type of NoSQL database, based on Google Bigtable Column-oriented data store, built on top of HDFS CAP: strong consistency and partition tolerance Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 41 / 67

Columns-Oriented Databases In many analytical databases queries, few attributes are needed. Column values are stored contiguously on disk: reduces I/O. [Lars George, Hbase: The Definitive Guide, O Reilly, 2011] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 42 / 67

HBase Data Model (1/5) Table Distributed multi-dimensional sparse map Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 43 / 67

HBase Data Model (2/5) Rows Every read or write in a row is atomic. Rows sorted in lexicographical order. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 44 / 67

HBase Data Model (3/5) Column The basic unit of data access. Column families: group of (the same type) column keys. Column key naming: family:qualifier All columns in a column family are stored together in the same low level storage file (HFile). Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 45 / 67

HBase Data Model (4/5) Timestamp Each column value may contain multiple versions. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 46 / 67

HBase Data Model (5/5) Region: contiguous ranges of rows stored together. Tables are split by the system when they become too large. Auto-Sharding Each region is served by exactly one region server. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 47 / 67

Region and Region Server [Lars George, Hbase: The Definitive Guide, O Reilly, 2011] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 48 / 67

HBase Cell [Lars George, Hbase: The Definitive Guide, O Reilly, 2011] (Table, RowKey, Family, Column, Timestamp) Value Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 49 / 67

HBase Cluster [Tom White, Hadoop: The Definitive Guide, O Reilly, 2012] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 50 / 67

HBase Components [Lars George, Hbase: The Definitive Guide, O Reilly, 2011] Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 51 / 67

HBase Components - Region Server Responsible for all read and write requests for all regions they serve. Split regions that have exceeded the thresholds. Region servers are added or removed dynamically. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 52 / 67

HBase Components - Master Responsible for managing regions and their locations. Assigning regions to region servers (uses Zookeeper). Handling load balancing of regions across region servers. Doesn t actually store or read data. Clients communicate directly with region servers. Responsible for schema management and changes. Adding/removing tables and column families. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 53 / 67

HBase Components - Zookeeper A coordinator service: not part of HBase Master uses Zookeeper for region assignment. Ensures that there is only one master running. Stores the bootstrap location for region discovery: a registry for region servers Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 54 / 67

HBase Components - HFile The data is stored in HFiles. HFiles are immutable sequences of blocks and saved in HDFS. Block index is stored at the end of HFiles. Cannot remove key-values out of HFiles. Delete marker (tombstone marker) indicates the removed records. Hides the marked data from reading clients. Updating key/value pairs: picking the latest timestamp. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 55 / 67

HBase Components - WAL and memstore When data is added it is written to a log called Write Ahead Log (WAL) and is also stored in memory (memstore). When in-memory data exceeds maximum value it is flushed to an HFile. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 56 / 67

Compaction Flushing memstores to disk causes more and more HFiles to be created. HBase uses a compaction mechanism to merges the files into larger ones. Minor compaction: rewriting smaller files into fewer but larger ones. Major compaction: rewriting all files within a column family for a region into a single new one. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 57 / 67

Summary NoSQL data models: key-value, column-oriented, document-oriented, graph-based Sharding and consistent hashing ACID vs. BASE CAP (Consistency vs. Availability) HBase: column-oriented NoSQL database Master, Region server, HFile, memstore, AWL Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 58 / 67

HBase Installation and Shell Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 59 / 67

HBase Installation Three options Local (Standalone) Mode Pseudo-Distributed Mode Fully-Distributed Mode Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 60 / 67

Installation - Local Uses local filesystem (not HDFS). Runs HBase and Zookeeper in the same JVM. Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 61 / 67

Installation - Pseudo-Distributed (1/3) Requires HDFS. Mimics Fully-Distributed but runs on just one host. Good for testing, debugging and prototyping, not for production. Configuration files: hbase-env.sh hbase-site.xml Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 62 / 67

Installation - Pseudo-Distributed (2/3) Specify environment variables in hbase-env.sh export JAVA_HOME=/opt/jdk1.7.0_51 Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 63 / 67

Installation - Pseudo-Distributed (3/3) Starts an HBase Master process, a ZooKeeper server, and a Region- Server process. Configure in hbase-site.xml <property> <name>hbase.cluster.distributed</name> <value>true</value> </property> <property> <name>hbase.rootdir</name> <value>hdfs://localhost:8020/hbase</value> </property> Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 64 / 67

Start HBase and Test Start the HBase daemon. start-hbase.sh hbase shell Web-based management Master host: http://localhost:60010 Region host: http://localhost:60030 Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 65 / 67

HBase Shell status list create Blog, {NAME=> info }, {NAME=> content } # put table, row_id, family:column, value put Blog, Matt-001, info:title, Elephant put Blog, Matt-001, info:author, Matt put Blog, Matt-001, info:date, 2009.05.06 put Blog, Matt-001, content:post, Do elephants like monkeys? # get table, row_id get Blog, Matt-001 get Blog, Matt-001, {COLUMN=>[ info:author, content:post ]} scan Blog Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 66 / 67

Questions? Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 67 / 67