Developing in NoSQL with Couchbase

Size: px
Start display at page:

Download "Developing in NoSQL with Couchbase"

Transcription

1 Developing in NoSQL with Couchbase Raghavan Rags Srinivas Developer Advocate Simple. Fast. Elastic.

2 Speaker Introduction Architect and Evangelist working with developers Speaker at JavaOne, RSA conferences, Sun Tech Days, JUGs and other developer conferences Taught undergrad and grad courses Technology Evangelist at Sun Microsystems for 10+ years Still trying to understand and work more effectively on Java and distributed systems Couchbase Developer Advocate working with Java and Ruby developers Philosophy: Better to have an unanswered question than a unquestioned answer

3 Agenda > Introduction and Getting Started with the APIs > Demonstration: Topology changes and xdcr > Hadoop Sqoop connector and use cases > Secondary Indexing with Query and Views > Roadmap and conclusions

4 A BIT OF NOSQL Intro. Demos. Hadoop Connector Views Summary

5 NoSQL (from a Couchbase perspective) Short for Not Only SQL Not single class of database (i.e., NoSQL is not analogous to the RDMBS classification) Generally refers to databases that neither expose a SQL interface nor use the relational data model > Perhaps better thought of as non-relational databases Typically don t offer common RDBMS features for performance gains > Often schema-less > No key constraints > No multi-step transactions > No concept of a join Well suited for cloud deployments

6 CAP Theorem/Brewer s Conjecture The CAP theorem states that it is impossible for a distributed computer system to simultaneously provide all three of the following guarantees: Consistency means that each node/client always has the same view of the data, Availability means that all clients can always read and write, Partition tolerance means that the system works well across physical network partitions.

7 CAP Systems CA: Consistency and Availability > Consider a system that uses a two phase commit for distributed transactions > The commit is impeded if a network segment is unavailable CP: Consistency and Partition Tolerance > Consider a system that uses pessimistic locking for concurrency control > Node failures hinder availability AP: Availability and Partition Tolerance > Consider a system that always responds (e.g., DNS) > Reads might be dirty as data awaits propagation

8 NoSQL Taxonomies NoSQL Databases Come in Many Flavors* > XML (myxmldb, Tamino, Sedna) > Tabular (Hbase, Big Table) > Key/Value (Couchbase, Redis, Riak, Cassandra) > Object (db4o, JADE) > Graph (Trinity, neo4j, InfoGrid) > Document store (Couchbase Server 2.0, CouchDB, MongoDB) * These are loose taxonomies

9 Web Application Architecture and Performance Application Scales Out Just add more commodity web servers Web Servers Load Balancer RelaConal Database Database Scales Up Get a bigger, more complex server - Expensive and disrupcve sharding - Doesn t perform at Web Scale

10 Couchbase data layer scales like application logic tier Data layer now scales with linear cost and constant performance. Application Scales Out Just add more commodity web servers Web Servers Load Balancer Couchbase Servers Database Scales Out Just add more commodity data servers Horizontally scalable, schema- less, auto- sharding, high- performance at Web Scale Scaling out flattens the cost and performance curves.

11 WHAT IS COUCHBASE? Intro. Demos. Hadoop Connector Views Summary

12 Couchbase Server Overview Open source, Apache 2.0 licensed Database Commercial support provided by Couchbase Free community edition for developers Enterprise edition free for 2 nodes, support license required for larger clusters Administered via web console or RESTful API Available for Windows, Linux and Mac (development only) Written in C/C++ with some components in Erlang Both a distributed key/value store and a document oriented database > Implements Memcached API (version 1.8) > Implements most of CouchDB RESTful view API (version 2.0)

13 memcached Couchbase Membase CouchDB

14 Rapidly Growing Community 1,000s of community deployments More than 150 paying customers Over 3,500 nodes deployed in production by customers

15 Customers with Production Deployments

16 OMGPOP s DrawSomething Went Viral Pictionary Like Game #1 Paid & Free App in 84 Countries 35m Downloads

17 Couchbase Server 1.8 Architecture Memcapable Memcapable 2.0 Moxi Memcached Couchbase EP Engine Data Manager storage interface REST management API/Web UI Heartbeat Process monitor ConfiguraCon manager Global singleton supervisor Rebalance orchestrator Node health monitor vbucket state and replicacon manager Cluster Manager Persistence Layer hop on each node one per cluster Erlang/OTP HTTP 8091 Erlang port mapper 4369 Distributed Erlang

18 Component Architecture Memcapable Memcapable 2.0 Moxi Memcached Couchbase EP Engine storage interface REST Management / WebUI Heartbeat Process monitor Configuration Manager Global singleton supervisor Rebalance orchestrator Node health monitor vbucket state and replicacon manager Persistence Layer htt p on each node one per cluster Erlang/OTP HTTP 8091 Erlang port mapper 4369 Distributed Erlang

19 A Distributed Hash Table Document Store Application set(key, json) get(key) returns json Database Cluster {! DData id": brewery_legacy,! type : brewery,! "name" : "Legacy Brewing,! address": "525 Canal Street Reading", "updated": " :00:20", }!

20 Couchbase Distributed Key/Value Store Features All documents/items are stored and retrieved using standard hashtable operations (i.e, get, add, etc.) Keys are distributed evenly across the nodes of a cluster by way of vbuckets > A vbucket is a level of indirection between a key and value, and the node on which it is stored Constant time access to items via keys Single dimension hashtable with no secondary keys

21 Couchbase Document Store Features Items can be stored as JSON documents Non-JSON items are stored as binary attachments to CouchDB JSON documents Secondary, B-tree indexes are created by way of views > Views are accessed via RESTful API > Indexes are incrementally updated after access not insert > Caller specifies whether to accept stale data Views may be queried by key or range Indexes are stored across the cluster and accumulated from all nodes when requested

22 Workflow (including replication) User action results in the need to change the VALUE of KEY 1 4 SET request sent over network to master server 2 3 Application updates key s VALUE, performs SET operation Couchbase driver hashes KEY, identifies KEY s master server 5 Couchbase replicates KEY-VALUE pair, caches it in memory and stores it to disk

23 GETTING STARTED Intro. Demos. Hadoop Connector Views Summary

24 Installation of the Server for development Operating systems supported RedHat RPM Ubuntu dpkg Windows Mac OS GUI Installer Unattended installation Quickstart once installed (invokes the Admin Console)

25 COUCHBASE CLIENT INTERFACE Intro. Demos. Hadoop Connector Views Summary

26 Client Setup: Getting Cluster Configuration Couchbase Client {. Cluster Configuration over REST "bucketcapabilities": [ "touch", "sync", "couchapi" ], "bucketcapabilitiesver": "sync-1.0", "buckettype": couchbase", "name": "default", "nodelocator": "vbucket", "nodes": [ Server Node Server Node Server Node Server Node

27 Client at Runtime: Adding a node Couchbase Client New node Cluster Topology Update coming online Server Node Server Node Server Node Server Node Server Node

28 Opening a Connection (Java) Connect to the Cluster URI of any cluster node List<URI> uris = new LinkedList<URI>(); uris.add(uri.create(" try { client = new CouchbaseClient(uris, "default", ""); } catch (Exception e) { System.err.println("Error connecting to Couchbase: " + e.getmessage()); System.exit(0); }

29 PROTOCOL OVERVIEW Intro. Demos Hadoop Connector Views Summary

30 Store Operations Operation Description add() Adds new value if key does not exist or returns error. replace() Replaces existing value if specified key already exists. set() Sets new value for specified key. TTL allow for expiry time specified in seconds: Expiry Value Values < 30*24*60*60 Values > 30*24*60*60 Description Time in seconds to expiry Absolute time from epoch for expiry

31 Retrieve Operations Operation Description get() Get a value. getandtouch() Get a value and update the expiry time. getbulk() Get multiple values simultaneously, more efficient. gets() Get a value and the CAS value.

32 ASYNCHRONOUS OPERATIONS Intro. Advanced opertations Doc. Design Views Summary

33 Couchbase Java Client Asynchronous Functions Synchronous commands Force wait on application until return from server Asynchronous commands Allow background operation until response available Operations return Future object

34 It s Asynchronous Set Operation is asynchronous // Do an asynchronous set! OperationFuture<Boolean> setop =! client.set(key, EXP_TIME, VALUE);!! // Do something not related to set!! // Check to see if our set succeeded! // We block when we call the get()! if (setop.get().booleanvalue()) {! System.out.println("Set Succeeded");! } else {! System.err.println("Set failed: " +! setop.getstatus().getmessage());! }!

35 Distributed System Design: Concurrency Controls Compare and Swap Operations > Often referred to as CAS > Optimistic concurrency control > Available with many mutation operations, depending on client. Success Get with Lock > Often referred to as GETL > Pessimistic concurrency control > Locks have a short TTL > Locks released with CAS operations > Useful when working with object graphs Actor 1 Actor 2 Couchbase Server A CAS mismatch B C D F E

36 CAS Operation CAS Example! CASValue<Object> casv = client.gets(key);! Thread.sleep(random.nextInt(1000)); // a random workload!! // Wake up and do a set based on the previous CAS value! Future<CASResponse> setop =! client.asynccas(key, casv.getcas(), random.nextlong());! // Wait for the CAS Response! try {! if (setop.get().equals(casresponse.ok)) {! System.out.println("Set Succeeded");! } else {! System.err.println("Set failed: ");! }! }!!

37 CONCURRENCY Intro. Demos Hadoop Connector Views Summary

38 KEYS AND RELATIONSHIPS Intro. Demos Hadoop Connector Views Summary

39 Utilizing the strengths of the Platform Couchbase > Distributed Key, Value pairs > De-normalized Data > Fast access Session store Hadoop > Unstructured/semi-structured data > Analytics and frequent data analysis > Leverage the Hadoop Ecosystem (Pig, Hive, Hbase, etc.)

40 Use cases Ad Targeting > Decisions have to be made in milliseconds based on historical trends, location, user preferences, etc. Virtual Reality > Sub-second response time based on data available from number of different sources Gaming applications > Responsive applications that integrate with data in the back end

41 SQOOP

42 Introducing Sqoop Easily Import/Export Data into Hadoop Generate Datatypes for use in MapReduce Applications Integrate with Pig, Hive and Hbase Easily export Data from Hadoop HDFS Hive HBase Sqoop Pig Hadoop

43 COUCHBASE PLUGIN

44 Couchbase Plugin Based on the Couchbase Tap Interface Allows importing and exporting of entire database key mutations Couchbase 1. Data imported via Tap mechanism 3. Data exported back to Couchbase HDFS 2. Hadoop Processing

45 Couchbase Import and Export $ sqoop import -connect --table DUMP $ sqoop import -connect --table BACKFILL_5 $ sqoop export --connect --table DUMP export-dir DUMP For Imports, table must be: > DUMP: All keys currently in Couchbase > BACKFILL_n: All key mutations for n minutes Specified username maps to bucket > By default set to default bucket

46 Ad Targeting Application Couchbase > System of Record > Provides fast access to users and being able to record events Hadoop > Analytics > Generates a count of the # of clicks Couchbase 1. User/Event Data is imported to HDFS HDFS 2. Event data is consolidated with Map/Reduce 3. Consolidated data Is used to target ads.

47 Importing to Hadoop

48 Counting the clicks

49 Analyze using Pig

50 Counting the ads

51 Analyze and Consolidate

52 THE VIEW AND QUERY API Intro. Demos Hadoop Connector Views Summary

53 Couchbase Server 2.0 Identical feature set to Couchbase Server 1.8 > Cache-layer > High-performance > Cluster > Elastic > Core interface Adds > Document based storage (JSON) > Views with materialized indexes > Querying

54 Why use JSON? JSON (JavaScript Object Notation) > Lightweight Data-interchange format > Easy for humans to read and manipulate > Easy for machines to parse and generate (many parsers are available) > JSON is language independent (although it uses similar constructs)

55 What Are Views Views create perspectives on a collection of documents Views can cover a few different use cases Simple secondary indexes (the most common) Aggregation functions Example: count the number of North American Ales Organizing related data Use Map/Reduce Map defines the relationship between fields in documents and output table Reduce provides method for collating/summarizing Views materialized indexes Views are not create until accessed Data writes are fast (no index) Index updates all changes since last update

56 View Processing Extract fields from JSON documents and produce an index of the selected information

57 View Creation using Incremental Map/ Reduce map function creates a mapping between input data and output reduce function provides a summary (such as count, sum, etc.)

58 Map Func:ons Map outputs one or more rows of data Map outputs: > Document ID > View Key (user configurable) > View Value View Key Controls > SorCng > Querying Key + Value stored in the Index Maps provide query base

59 Map/Reduce Map Function A map function to get breweries by province function (doc) { if (doc.type == "brewery" && doc.province) { emit(doc.province, doc.name); } }

60 Map/Reduce Map Results The conceptual output of the map function is [ {"key":"connecticut", "value":"cottrell Brewing"}, {"key":"connecticut","value":"thomas Hooker Brewing Co."}, {"key":"massachusetts","value":"harpoon Brewing Company"}, {"key":"new York","value":"Brewery Ommegang"}, {"key":"vermont","value":"long Trail Brewing Company"} ]

61 Map/Reduce Reduce Function A reduce function to count breweries by province > Or simply use the build in _count function function (keys, values) { } return values.length; The conceptual input to the reduce function is: [ {"keys":["connecticut", "Connecticut"], "values": ["Cottrell Brewing", "Thomas Hooker Brewing Co."]}, {"keys":["massachusetts"],"values":["harpoon Brewing Co."]}, {"keys":["new York"],"values":["Brewery Ommegang"]}, {"keys":["vermont"],"values":["long Trail Brewing Company ]} ]

62 Map/Reduce Reduce Results The conceptual result of the reduce is {"key":"connecticut","value":2}, {"key":"massachusetts","value":1}, {"key":"new York","value":1}, {"key":"vermont","value":1}

63 THE QUERY API Intro. Demos Hadoop Connector Views Summary

64 Query APIs // map function! function (doc) {! if (doc.type == "beer") {! emit(doc._id, null);! }! }!! // Java code! Query query = new Query();!! query.setreduce(false);! query.setincludedocs(true);! query.setstale(stale.false);!!

65 THE VIEW API Intro. Demos Hadoop Connector Views Summary

66 View APIs with Java // map function! function (doc) {! if (doc.type == "beer") {! emit(doc._id, null);! }! }!! // Java code! View view = client.getview("beers", "beers");!! ViewResponse result = client.query(view, query);!! Iterator<ViewRow> itr = result.iterator();!! while (itr.hasnext()) {! row = itr.next();! doc = (String) row.getdocument();! // do something! }!!!

67 USING THE QUERY API FOR CALCULATION Intro. Advanced opertations Doc. Design Views Summary

68 Querying with Java Custom Reduce // map function! function(doc) {! if (doc.type == "beer") {! if(doc.abv) {! emit(doc.abv, 1);! }}}!! //reduce function! _count!! // Java code! View view = client.getview("beers", "beers_count_abv");! query.setgroup(true);!! while (itr.hasnext()) {! row = itr.next();! System.out.println(String.format("%s: %s,! row.getkey(), row.getvalue()));! }!

69 View Calculation Result 10: 2! 9.6: 5! 9.1: 1! 9: 3! 8.7: 2! 8.5: 1! 8: 2! 7.5: 4! 7: 4! 6.7: 1! 6.6: 1! 6.2: 1! 6: 2! 5.9: 1! 5.6: 1! 5.2: 1! 5: 1! 4.8: 2! 4.5: 1! 4: 2!

70 RESOURCES AND SUMMARY Intro. Demos Hadoop Connector Views Summary

71 Java Client Library Roadmap Server 1.8 Compatible > 1.0.1/2.8.0 > 1.0.2/2.8.0 (Memcache Node imbalance) > 1.0.x/2.8.0 (Replica Read) Server 2.0 Compatible > 1.1-dp/2.8.1 (Views) > 1.1-dp2/2.8.x (Observe) > 1.1/2.8.x (Final) Other Features > Spring Integration > Your feedback?!

72 Resources and Call For Action Couchbase Server Downloads > > Views > Developing with Client libraries > Couchbase Java Client Library wiki tips and tricks > +Library Open Beer Database > Couchbase Internals >

73 THANKS - Q&A Raghavan Rags Srinivas, Couchbase Inc. 73

Developing in NoSQL with Couchbase and Java. Raghavan N. Srinivas Couchbase 123

Developing in NoSQL with Couchbase and Java. Raghavan N. Srinivas Couchbase 123 Developing in NoSQL with Couchbase and Java Raghavan N. Srinivas Couchbase 123 Objective A technical overview To getting started on programming with Couchbase using Java To learn about the basic operations,

More information

Couchbase Server. Chris Anderson Chief

Couchbase Server. Chris Anderson Chief Couchbase Server Chris Anderson Chief Architect @jchris 1 Couchbase Server Simple = Fast Elas=c NoSQL Database Formerly known as Membase Server 2 Couchbase Server Features Built- in clustering All nodes

More information

Couchbase Architecture Couchbase Inc. 1

Couchbase Architecture Couchbase Inc. 1 Couchbase Architecture 2015 Couchbase Inc. 1 $whoami Laurent Doguin Couchbase Developer Advocate @ldoguin laurent.doguin@couchbase.com 2015 Couchbase Inc. 2 2 Big Data = Operational + Analytic (NoSQL +

More information

Jargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems

Jargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems Jargons, Concepts, Scope and Systems Key Value Stores, Document Stores, Extensible Record Stores Overview of different scalable relational systems Examples of different Data stores Predictions, Comparisons

More information

The NoSQL Landscape. Frank Weigel VP, Field Technical Opera;ons

The NoSQL Landscape. Frank Weigel VP, Field Technical Opera;ons The NoSQL Landscape Frank Weigel VP, Field Technical Opera;ons What we ll talk about Why RDBMS are not enough? What are the different NoSQL taxonomies? Which NoSQL is right for me? Macro Trends Driving

More information

CS Silvia Zuffi - Sunil Mallya. Slides credits: official membase meetings

CS Silvia Zuffi - Sunil Mallya. Slides credits: official membase meetings CS227 - Silvia Zuffi - Sunil Mallya Slides credits: official membase meetings Schedule Overview silvia History silvia Data Model silvia Architecture sunil Transaction support sunil Case studies silvia

More information

Big Data Hadoop Course Content

Big Data Hadoop Course Content Big Data Hadoop Course Content Topics covered in the training Introduction to Linux and Big Data Virtual Machine ( VM) Introduction/ Installation of VirtualBox and the Big Data VM Introduction to Linux

More information

Friday, April 26, 13

Friday, April 26, 13 Introduc)on to Map Reduce with Couchbase Tugdual Grall / @tgrall NoSQL Ma)ers 13 - Cologne - April 25th 2013 About Me Tugdual Tug Grall Couchbase exo Technical Evangelist CTO Oracle Developer/Product Manager

More information

Matt Ingenthron. Couchbase, Inc.

Matt Ingenthron. Couchbase, Inc. Matt Ingenthron Couchbase, Inc. 2 What is Membase? Before: Application scales linearly, data hits wall Application Scales Out Just add more commodity web servers Database Scales Up Get a bigger, more complex

More information

Non-Relational Databases. Pelle Jakovits

Non-Relational Databases. Pelle Jakovits Non-Relational Databases Pelle Jakovits 25 October 2017 Outline Background Relational model Database scaling The NoSQL Movement CAP Theorem Non-relational data models Key-value Document-oriented Column

More information

Big Data Analytics using Apache Hadoop and Spark with Scala

Big Data Analytics using Apache Hadoop and Spark with Scala Big Data Analytics using Apache Hadoop and Spark with Scala Training Highlights : 80% of the training is with Practical Demo (On Custom Cloudera and Ubuntu Machines) 20% Theory Portion will be important

More information

Introduction to NoSQL Databases

Introduction to NoSQL Databases Introduction to NoSQL Databases Roman Kern KTI, TU Graz 2017-10-16 Roman Kern (KTI, TU Graz) Dbase2 2017-10-16 1 / 31 Introduction Intro Why NoSQL? Roman Kern (KTI, TU Graz) Dbase2 2017-10-16 2 / 31 Introduction

More information

Hadoop An Overview. - Socrates CCDH

Hadoop An Overview. - Socrates CCDH Hadoop An Overview - Socrates CCDH What is Big Data? Volume Not Gigabyte. Terabyte, Petabyte, Exabyte, Zettabyte - Due to handheld gadgets,and HD format images and videos - In total data, 90% of them collected

More information

CIB Session 12th NoSQL Databases Structures

CIB Session 12th NoSQL Databases Structures CIB Session 12th NoSQL Databases Structures By: Shahab Safaee & Morteza Zahedi Software Engineering PhD Email: safaee.shx@gmail.com, morteza.zahedi.a@gmail.com cibtrc.ir cibtrc cibtrc 2 Agenda What is

More information

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Big Data Hadoop Developer Course Content Who is the target audience? Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Complete beginners who want to learn Big Data Hadoop Professionals

More information

NoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems

NoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems CompSci 516 Data Intensive Computing Systems Lecture 21 (optional) NoSQL systems Instructor: Sudeepa Roy Duke CS, Spring 2016 CompSci 516: Data Intensive Computing Systems 1 Key- Value Stores Duke CS,

More information

CISC 7610 Lecture 2b The beginnings of NoSQL

CISC 7610 Lecture 2b The beginnings of NoSQL CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone

More information

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2015 Lecture 14 NoSQL References Scalable SQL and NoSQL Data Stores, Rick Cattell, SIGMOD Record, December 2010 (Vol. 39, No.

More information

5/2/16. Announcements. NoSQL Motivation. The New Hipster: NoSQL. Serverless. What is the Problem? Database Systems CSE 414

5/2/16. Announcements. NoSQL Motivation. The New Hipster: NoSQL. Serverless. What is the Problem? Database Systems CSE 414 Announcements Database Systems CSE 414 Lecture 16: NoSQL and JSon Current assignments: Homework 4 due tonight Web Quiz 6 due next Wednesday [There is no Web Quiz 5 Today s lecture: JSon The book covers

More information

Introduction to BigData, Hadoop:-

Introduction to BigData, Hadoop:- Introduction to BigData, Hadoop:- Big Data Introduction: Hadoop Introduction What is Hadoop? Why Hadoop? Hadoop History. Different types of Components in Hadoop? HDFS, MapReduce, PIG, Hive, SQOOP, HBASE,

More information

Database Systems CSE 414

Database Systems CSE 414 Database Systems CSE 414 Lecture 16: NoSQL and JSon CSE 414 - Spring 2016 1 Announcements Current assignments: Homework 4 due tonight Web Quiz 6 due next Wednesday [There is no Web Quiz 5] Today s lecture:

More information

Goal of the presentation is to give an introduction of NoSQL databases, why they are there.

Goal of the presentation is to give an introduction of NoSQL databases, why they are there. 1 Goal of the presentation is to give an introduction of NoSQL databases, why they are there. We want to present "Why?" first to explain the need of something like "NoSQL" and then in "What?" we go in

More information

NOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

NOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY NOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY WHAT IS NOSQL? Stands for No-SQL or Not Only SQL. Class of non-relational data storage systems E.g.

More information

CSE 344 JULY 9 TH NOSQL

CSE 344 JULY 9 TH NOSQL CSE 344 JULY 9 TH NOSQL ADMINISTRATIVE MINUTIAE HW3 due Wednesday tests released actual_time should have 0s not NULLs upload new data file or use UPDATE to change 0 ~> NULL Extra OOs on Mondays 5-7pm in

More information

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info START DATE : TIMINGS : DURATION : TYPE OF BATCH : FEE : FACULTY NAME : LAB TIMINGS : PH NO: 9963799240, 040-40025423

More information

Innovatus Technologies

Innovatus Technologies HADOOP 2.X BIGDATA ANALYTICS 1. Java Overview of Java Classes and Objects Garbage Collection and Modifiers Inheritance, Aggregation, Polymorphism Command line argument Abstract class and Interfaces String

More information

Relational databases

Relational databases COSC 6397 Big Data Analytics NoSQL databases Edgar Gabriel Spring 2017 Relational databases Long lasting industry standard to store data persistently Key points concurrency control, transactions, standard

More information

Oracle NoSQL Database Enterprise Edition, Version 18.1

Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across

More information

CS-580K/480K Advanced Topics in Cloud Computing. NoSQL Database

CS-580K/480K Advanced Topics in Cloud Computing. NoSQL Database CS-580K/480K dvanced Topics in Cloud Computing NoSQL Database 1 1 Where are we? Cloud latforms 2 VM1 VM2 VM3 3 Operating System 4 1 2 3 Operating System 4 1 2 Virtualization Layer 3 Operating System 4

More information

Introduction to Big Data. NoSQL Databases. Instituto Politécnico de Tomar. Ricardo Campos

Introduction to Big Data. NoSQL Databases. Instituto Politécnico de Tomar. Ricardo Campos Instituto Politécnico de Tomar Introduction to Big Data NoSQL Databases Ricardo Campos Mestrado EI-IC Análise e Processamento de Grandes Volumes de Dados Tomar, Portugal, 2016 Part of the slides used in

More information

Big Data Architect.

Big Data Architect. Big Data Architect www.austech.edu.au WHAT IS BIG DATA ARCHITECT? A big data architecture is designed to handle the ingestion, processing, and analysis of data that is too large or complex for traditional

More information

Oracle NoSQL Database Enterprise Edition, Version 18.1

Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across

More information

DEMYSTIFYING BIG DATA WITH RIAK USE CASES. Martin Schneider Basho Technologies!

DEMYSTIFYING BIG DATA WITH RIAK USE CASES. Martin Schneider Basho Technologies! DEMYSTIFYING BIG DATA WITH RIAK USE CASES Martin Schneider Basho Technologies! Agenda Defining Big Data in Regards to Riak A Series of Trade-Offs Use Cases Q & A About Basho & Riak Basho Technologies is

More information

Hadoop Development Introduction

Hadoop Development Introduction Hadoop Development Introduction What is Bigdata? Evolution of Bigdata Types of Data and their Significance Need for Bigdata Analytics Why Bigdata with Hadoop? History of Hadoop Why Hadoop is in demand

More information

Intro to Couchbase Server for ColdFusion - Clustered NoSQL and Caching at its Finest

Intro to Couchbase Server for ColdFusion - Clustered NoSQL and Caching at its Finest Tweet Intro to Couchbase Server for ColdFusion - Clustered NoSQL and Caching at its Finest Brad Wood Jul 26, 2013 Today we are starting a new blogging series on how to leverage Couchbase NoSQL from ColdFusion

More information

Big Data Syllabus. Understanding big data and Hadoop. Limitations and Solutions of existing Data Analytics Architecture

Big Data Syllabus. Understanding big data and Hadoop. Limitations and Solutions of existing Data Analytics Architecture Big Data Syllabus Hadoop YARN Setup Programming in YARN framework j Understanding big data and Hadoop Big Data Limitations and Solutions of existing Data Analytics Architecture Hadoop Features Hadoop Ecosystem

More information

PROFESSIONAL. NoSQL. Shashank Tiwari WILEY. John Wiley & Sons, Inc.

PROFESSIONAL. NoSQL. Shashank Tiwari WILEY. John Wiley & Sons, Inc. PROFESSIONAL NoSQL Shashank Tiwari WILEY John Wiley & Sons, Inc. Examining CONTENTS INTRODUCTION xvil CHAPTER 1: NOSQL: WHAT IT IS AND WHY YOU NEED IT 3 Definition and Introduction 4 Context and a Bit

More information

Microsoft Big Data and Hadoop

Microsoft Big Data and Hadoop Microsoft Big Data and Hadoop Lara Rubbelke @sqlgal Cindy Gross @sqlcindy 2 The world of data is changing The 4Vs of Big Data http://nosql.mypopescu.com/post/9621746531/a-definition-of-big-data 3 Common

More information

10/18/2017. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414

10/18/2017. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414 Announcements Database Systems CSE 414 Lecture 11: NoSQL & JSON (mostly not in textbook only Ch 11.1) HW5 will be posted on Friday and due on Nov. 14, 11pm [No Web Quiz 5] Today s lecture: NoSQL & JSON

More information

Introduction to Hadoop. High Availability Scaling Advantages and Challenges. Introduction to Big Data

Introduction to Hadoop. High Availability Scaling Advantages and Challenges. Introduction to Big Data Introduction to Hadoop High Availability Scaling Advantages and Challenges Introduction to Big Data What is Big data Big Data opportunities Big Data Challenges Characteristics of Big data Introduction

More information

OpenEdge & CouchDB. Integrating the OpenEdge ABL with CouchDB. Don Beattie Software Architect Quicken Loans Inc.

OpenEdge & CouchDB. Integrating the OpenEdge ABL with CouchDB. Don Beattie Software Architect Quicken Loans Inc. OpenEdge & CouchDB Integrating the OpenEdge ABL with CouchDB Don Beattie Software Architect Quicken Loans Inc. Apache CouchDB has started. Time to relax. Intro The OpenEdge RDBMS is a great database that

More information

Realtime visitor analysis with Couchbase and Elasticsearch

Realtime visitor analysis with Couchbase and Elasticsearch Realtime visitor analysis with Couchbase and Elasticsearch Jeroen Reijn @jreijn #nosql13 About me Jeroen Reijn Software engineer Hippo @jreijn http://blog.jeroenreijn.com About Hippo Visitor Analysis OneHippo

More information

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect Big Data Big Data Analyst INTRODUCTION TO BIG DATA ANALYTICS ANALYTICS PROCESSING TECHNIQUES DATA TRANSFORMATION & BATCH PROCESSING REAL TIME (STREAM) DATA PROCESSING Big Data Engineer BIG DATA FOUNDATION

More information

Distributed Non-Relational Databases. Pelle Jakovits

Distributed Non-Relational Databases. Pelle Jakovits Distributed Non-Relational Databases Pelle Jakovits Tartu, 7 December 2018 Outline Relational model NoSQL Movement Non-relational data models Key-value Document-oriented Column family Graph Non-relational

More information

Sources. P. J. Sadalage, M Fowler, NoSQL Distilled, Addison Wesley

Sources. P. J. Sadalage, M Fowler, NoSQL Distilled, Addison Wesley Big Data and NoSQL Sources P. J. Sadalage, M Fowler, NoSQL Distilled, Addison Wesley Very short history of DBMSs The seventies: IMS end of the sixties, built for the Apollo program (today: Version 15)

More information

DATABASE DESIGN II - 1DL400

DATABASE DESIGN II - 1DL400 DATABASE DESIGN II - 1DL400 Fall 2016 A second course in database systems http://www.it.uu.se/research/group/udbl/kurser/dbii_ht16 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,

More information

1

1 1 2 3 6 7 8 9 10 Storage & IO Benchmarking Primer Running sysbench and preparing data Use the prepare option to generate the data. Experiments Run sysbench with different storage systems and instance

More information

Relational to NoSQL: Getting started from SQL Server. Shane Johnson Sr. Product Marketing Manager Couchbase

Relational to NoSQL: Getting started from SQL Server. Shane Johnson Sr. Product Marketing Manager Couchbase Relational to NoSQL: Getting started from SQL Server Shane Johnson Sr. Product Marketing Manager Couchbase Today s agenda Why NoSQL? Identifying the right application Modeling your data Accessing your

More information

This is a brief tutorial that explains how to make use of Sqoop in Hadoop ecosystem.

This is a brief tutorial that explains how to make use of Sqoop in Hadoop ecosystem. About the Tutorial Sqoop is a tool designed to transfer data between Hadoop and relational database servers. It is used to import data from relational databases such as MySQL, Oracle to Hadoop HDFS, and

More information

/ Cloud Computing. Recitation 6 October 2 nd, 2018

/ Cloud Computing. Recitation 6 October 2 nd, 2018 15-319 / 15-619 Cloud Computing Recitation 6 October 2 nd, 2018 1 Overview Announcements for administrative issues Last week s reflection OLI unit 3 module 7, 8 and 9 Quiz 4 Project 2.3 This week s schedule

More information

A Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores

A Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores A Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores Nikhil Dasharath Karande 1 Department of CSE, Sanjay Ghodawat Institutes, Atigre nikhilkarande18@gmail.com Abstract- This paper

More information

Understanding NoSQL Database Implementations

Understanding NoSQL Database Implementations Understanding NoSQL Database Implementations Sadalage and Fowler, Chapters 7 11 Class 07: Understanding NoSQL Database Implementations 1 Foreword NoSQL is a broad and diverse collection of technologies.

More information

CompSci 516 Database Systems

CompSci 516 Database Systems CompSci 516 Database Systems Lecture 20 NoSQL and Column Store Instructor: Sudeepa Roy Duke CS, Fall 2018 CompSci 516: Database Systems 1 Reading Material NOSQL: Scalable SQL and NoSQL Data Stores Rick

More information

Challenges for Data Driven Systems

Challenges for Data Driven Systems Challenges for Data Driven Systems Eiko Yoneki University of Cambridge Computer Laboratory Data Centric Systems and Networking Emergence of Big Data Shift of Communication Paradigm From end-to-end to data

More information

Data Informatics. Seon Ho Kim, Ph.D.

Data Informatics. Seon Ho Kim, Ph.D. Data Informatics Seon Ho Kim, Ph.D. seonkim@usc.edu HBase HBase is.. A distributed data store that can scale horizontally to 1,000s of commodity servers and petabytes of indexed storage. Designed to operate

More information

IBM Planning Analytics Workspace Local Distributed Soufiane Azizi. IBM Planning Analytics

IBM Planning Analytics Workspace Local Distributed Soufiane Azizi. IBM Planning Analytics IBM Planning Analytics Workspace Local Distributed Soufiane Azizi IBM Planning Analytics IBM Canada - Cognos Ottawa Lab. IBM Planning Analytics Agenda 1. Demo PAW High Availability on a Prebuilt Swarm

More information

Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017

Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017 Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017 About the Presentation Problems Existing Solutions Denis Magda

More information

Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers

Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers Oracle zsig Conference IBM LinuxONE and z System Servers Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers Sam Amsavelu Oracle on z Architect IBM Washington

More information

relational Relational to Riak Why Move From Relational to Riak? Introduction High Availability Riak At-a-Glance

relational Relational to Riak Why Move From Relational to Riak? Introduction High Availability Riak At-a-Glance WHITEPAPER Relational to Riak relational Introduction This whitepaper looks at why companies choose Riak over a relational database. We focus specifically on availability, scalability, and the / data model.

More information

Class Overview. Two Classes of Database Applications. NoSQL Motivation. RDBMS Review: Client-Server. RDBMS Review: Serverless

Class Overview. Two Classes of Database Applications. NoSQL Motivation. RDBMS Review: Client-Server. RDBMS Review: Serverless Introduction to Database Systems CSE 414 Lecture 12: NoSQL 1 Class Overview Unit 1: Intro Unit 2: Relational Data Models and Query Languages Unit 3: Non-relational data NoSQL Json SQL++ Unit 4: RDMBS internals

More information

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu NoSQL Databases MongoDB vs Cassandra Kenny Huynh, Andre Chik, Kevin Vu Introduction - Relational database model - Concept developed in 1970 - Inefficient - NoSQL - Concept introduced in 1980 - Related

More information

Study of NoSQL Database Along With Security Comparison

Study of NoSQL Database Along With Security Comparison Study of NoSQL Database Along With Security Comparison Ankita A. Mall [1], Jwalant B. Baria [2] [1] Student, Computer Engineering Department, Government Engineering College, Modasa, Gujarat, India ank.fetr@gmail.com

More information

Overview. * Some History. * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL. * NoSQL Taxonomy. *TowardsNewSQL

Overview. * Some History. * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL. * NoSQL Taxonomy. *TowardsNewSQL * Some History * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL * NoSQL Taxonomy * Towards NewSQL Overview * Some History * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL * NoSQL Taxonomy *TowardsNewSQL NoSQL

More information

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Raanan Dagan and Rohit Pujari September 25, 2017 Washington, DC Forward-Looking Statements During the course of this presentation, we may

More information

vbuckets: The Core Enabling Mechanism for Couchbase Server Data Distribution (aka Auto-Sharding )

vbuckets: The Core Enabling Mechanism for Couchbase Server Data Distribution (aka Auto-Sharding ) vbuckets: The Core Enabling Mechanism for Data Distribution (aka Auto-Sharding ) Table of Contents vbucket Defined 3 key-vbucket-server ping illustrated 4 vbuckets in a world of s 5 TCP ports Deployment

More information

5/1/17. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414

5/1/17. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414 Announcements Database Systems CSE 414 Lecture 15: NoSQL & JSON (mostly not in textbook only Ch 11.1) 1 Homework 4 due tomorrow night [No Web Quiz 5] Midterm grading hopefully finished tonight post online

More information

June 20, 2017 Revision NoSQL Database Architectural Comparison

June 20, 2017 Revision NoSQL Database Architectural Comparison June 20, 2017 Revision 0.07 NoSQL Database Architectural Comparison Table of Contents Executive Summary... 1 Introduction... 2 Cluster Topology... 4 Consistency Model... 6 Replication Strategy... 8 Failover

More information

NoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE. Nicolas Buchschacher - University of Geneva - ADASS 2018

NoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE. Nicolas Buchschacher - University of Geneva - ADASS 2018 NoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE DACE https://dace.unige.ch Data and Analysis Center for Exoplanets. Facility to store, exchange and analyse data

More information

Introduction to Computer Science. William Hsu Department of Computer Science and Engineering National Taiwan Ocean University

Introduction to Computer Science. William Hsu Department of Computer Science and Engineering National Taiwan Ocean University Introduction to Computer Science William Hsu Department of Computer Science and Engineering National Taiwan Ocean University Chapter 9: Database Systems supplementary - nosql You can have data without

More information

Presented by Sunnie S Chung CIS 612

Presented by Sunnie S Chung CIS 612 By Yasin N. Silva, Arizona State University Presented by Sunnie S Chung CIS 612 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. See http://creativecommons.org/licenses/by-nc-sa/4.0/

More information

Gain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved.

Gain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved. Gain Insights From Unstructured Data Using Pivotal HD 1 Traditional Enterprise Analytics Process 2 The Fundamental Paradigm Shift Internet age and exploding data growth Enterprises leverage new data sources

More information

MongoDB An Overview. 21-Oct Socrates

MongoDB An Overview. 21-Oct Socrates MongoDB An Overview 21-Oct-2016 Socrates Agenda What is NoSQL DB? Types of NoSQL DBs DBMS and MongoDB Comparison Why MongoDB? MongoDB Architecture Storage Engines Data Model Query Language Security Data

More information

Table of Index Hadoop for Developers Hibernate: Using Hibernate For Java Database Access HP FlexNetwork Fundamentals, Rev. 14.21 HP Navigating the Journey to Cloud, Rev. 15.11 HP OneView 1.20 Rev.15.21

More information

Distributed Systems 16. Distributed File Systems II

Distributed Systems 16. Distributed File Systems II Distributed Systems 16. Distributed File Systems II Paul Krzyzanowski pxk@cs.rutgers.edu 1 Review NFS RPC-based access AFS Long-term caching CODA Read/write replication & disconnected operation DFS AFS

More information

CSE 530A. Non-Relational Databases. Washington University Fall 2013

CSE 530A. Non-Relational Databases. Washington University Fall 2013 CSE 530A Non-Relational Databases Washington University Fall 2013 NoSQL "NoSQL" was originally the name of a specific RDBMS project that did not use a SQL interface Was co-opted years later to refer to

More information

Announcements. Two Classes of Database Applications. Class Overview. NoSQL Motivation. RDBMS Review: Serverless

Announcements. Two Classes of Database Applications. Class Overview. NoSQL Motivation. RDBMS Review: Serverless Introduction to Database Systems CSE 414 Lecture 11: NoSQL 1 HW 3 due Friday Announcements Upload data with DataGrip editor see message board Azure timeout for question 5: Try DataGrip or SQLite HW 2 Grades

More information

Hadoop & Big Data Analytics Complete Practical & Real-time Training

Hadoop & Big Data Analytics Complete Practical & Real-time Training An ISO Certified Training Institute A Unit of Sequelgate Innovative Technologies Pvt. Ltd. www.sqlschool.com Hadoop & Big Data Analytics Complete Practical & Real-time Training Mode : Instructor Led LIVE

More information

Oral Questions and Answers (DBMS LAB) Questions & Answers- DBMS

Oral Questions and Answers (DBMS LAB) Questions & Answers- DBMS Questions & Answers- DBMS https://career.guru99.com/top-50-database-interview-questions/ 1) Define Database. A prearranged collection of figures known as data is called database. 2) What is DBMS? Database

More information

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver Using the SDACK Architecture to Build a Big Data Product Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver Outline A Threat Analytic Big Data product The SDACK Architecture Akka Streams and data

More information

A NoSQL Introduction for Relational Database Developers. Andrew Karcher Las Vegas SQL Saturday September 12th, 2015

A NoSQL Introduction for Relational Database Developers. Andrew Karcher Las Vegas SQL Saturday September 12th, 2015 A NoSQL Introduction for Relational Database Developers Andrew Karcher Las Vegas SQL Saturday September 12th, 2015 About Me http://www.andrewkarcher.com Twitter: @akarcher LinkedIn, Twitter Email: akarcher@gmail.com

More information

MySQL High Availability

MySQL High Availability MySQL High Availability InnoDB Cluster and NDB Cluster Ted Wennmark ted.wennmark@oracle.com Copyright 2016, Oracle and/or its its affiliates. All All rights reserved. Safe Harbor Statement The following

More information

Course Content MongoDB

Course Content MongoDB Course Content MongoDB 1. Course introduction and mongodb Essentials (basics) 2. Introduction to NoSQL databases What is NoSQL? Why NoSQL? Difference Between RDBMS and NoSQL Databases Benefits of NoSQL

More information

@Pentaho #BigDataWebSeries

@Pentaho #BigDataWebSeries Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of

More information

OPEN SOURCE DB SYSTEMS TYPES OF DBMS

OPEN SOURCE DB SYSTEMS TYPES OF DBMS OPEN SOURCE DB SYSTEMS Anna Topol 1 TYPES OF DBMS Relational Key-Value Document-oriented Graph 2 DBMS SELECTION Multi-platform or platform-agnostic Offers persistent storage Fairly well known Actively

More information

Making Non-Distributed Databases, Distributed. Ioannis Papapanagiotou, PhD Shailesh Birari

Making Non-Distributed Databases, Distributed. Ioannis Papapanagiotou, PhD Shailesh Birari Making Non-Distributed Databases, Distributed Ioannis Papapanagiotou, PhD Shailesh Birari Dynomite Ecosystem Dynomite - Proxy layer Dyno - Client Dynomite-manager - Ecosystem orchestrator Dynomite-explorer

More information

The Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou

The Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou The Hadoop Ecosystem EECS 4415 Big Data Systems Tilemachos Pechlivanoglou tipech@eecs.yorku.ca A lot of tools designed to work with Hadoop 2 HDFS, MapReduce Hadoop Distributed File System Core Hadoop component

More information

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case

More information

Getting to know. by Michelle Darling August 2013

Getting to know. by Michelle Darling August 2013 Getting to know by Michelle Darling mdarlingcmt@gmail.com August 2013 Agenda: What is Cassandra? Installation, CQL3 Data Modelling Summary Only 15 min to cover these, so please hold questions til the end,

More information

Scaling for Humongous amounts of data with MongoDB

Scaling for Humongous amounts of data with MongoDB Scaling for Humongous amounts of data with MongoDB Alvin Richards Technical Director, EMEA alvin@10gen.com @jonnyeight alvinonmongodb.com From here... http://bit.ly/ot71m4 ...to here... http://bit.ly/oxcsis

More information

1 Big Data Hadoop. 1. Introduction About this Course About Big Data Course Logistics Introductions

1 Big Data Hadoop. 1. Introduction About this Course About Big Data Course Logistics Introductions Big Data Hadoop Architect Online Training (Big Data Hadoop + Apache Spark & Scala+ MongoDB Developer And Administrator + Apache Cassandra + Impala Training + Apache Kafka + Apache Storm) 1 Big Data Hadoop

More information

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University CPSC 426/526 Cloud Computing Ennan Zhai Computer Science Department Yale University Recall: Lec-7 In the lec-7, I talked about: - P2P vs Enterprise control - Firewall - NATs - Software defined network

More information

Putting together the platform: Riak, Redis, Solr and Spark. Bryan Hunt

Putting together the platform: Riak, Redis, Solr and Spark. Bryan Hunt Putting together the platform: Riak, Redis, Solr and Spark Bryan Hunt 1 $ whoami Bryan Hunt Client Services Engineer @binarytemple 2 Minimum viable product - the ideologically correct doctrine 1. Start

More information

NoSQL Databases. Amir H. Payberah. Swedish Institute of Computer Science. April 10, 2014

NoSQL Databases. Amir H. Payberah. Swedish Institute of Computer Science. April 10, 2014 NoSQL Databases Amir H. Payberah Swedish Institute of Computer Science amir@sics.se April 10, 2014 Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 1 / 67 Database and Database Management System

More information

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for

More information

Introduction to NoSQL

Introduction to NoSQL Introduction to NoSQL Agenda History What is NoSQL Types of NoSQL The CAP theorem History - RDBMS Relational DataBase Management Systems were invented in the 1970s. E. F. Codd, "Relational Model of Data

More information

Parallel Programming Principle and Practice. Lecture 10 Big Data Processing with MapReduce

Parallel Programming Principle and Practice. Lecture 10 Big Data Processing with MapReduce Parallel Programming Principle and Practice Lecture 10 Big Data Processing with MapReduce Outline MapReduce Programming Model MapReduce Examples Hadoop 2 Incredible Things That Happen Every Minute On The

More information

SQT03 Big Data and Hadoop with Azure HDInsight Andrew Brust. Senior Director, Technical Product Marketing and Evangelism

SQT03 Big Data and Hadoop with Azure HDInsight Andrew Brust. Senior Director, Technical Product Marketing and Evangelism Big Data and Hadoop with Azure HDInsight Andrew Brust Senior Director, Technical Product Marketing and Evangelism Datameer Level: Intermediate Meet Andrew Senior Director, Technical Product Marketing and

More information

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( ) Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL

More information

1Z Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions

1Z Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions 1Z0-449 Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions Table of Contents Introduction to 1Z0-449 Exam on Oracle Big Data 2017 Implementation Essentials... 2 Oracle 1Z0-449

More information

Databricks, an Introduction

Databricks, an Introduction Databricks, an Introduction Chuck Connell, Insight Digital Innovation Insight Presentation Speaker Bio Senior Data Architect at Insight Digital Innovation Focus on Azure big data services HDInsight/Hadoop,

More information