Developing in NoSQL with Couchbase
|
|
- Gwen Goodman
- 6 years ago
- Views:
Transcription
1 Developing in NoSQL with Couchbase Raghavan Rags Srinivas Developer Advocate Simple. Fast. Elastic.
2 Speaker Introduction Architect and Evangelist working with developers Speaker at JavaOne, RSA conferences, Sun Tech Days, JUGs and other developer conferences Taught undergrad and grad courses Technology Evangelist at Sun Microsystems for 10+ years Still trying to understand and work more effectively on Java and distributed systems Couchbase Developer Advocate working with Java and Ruby developers Philosophy: Better to have an unanswered question than a unquestioned answer
3 Agenda > Introduction and Getting Started with the APIs > Demonstration: Topology changes and xdcr > Hadoop Sqoop connector and use cases > Secondary Indexing with Query and Views > Roadmap and conclusions
4 A BIT OF NOSQL Intro. Demos. Hadoop Connector Views Summary
5 NoSQL (from a Couchbase perspective) Short for Not Only SQL Not single class of database (i.e., NoSQL is not analogous to the RDMBS classification) Generally refers to databases that neither expose a SQL interface nor use the relational data model > Perhaps better thought of as non-relational databases Typically don t offer common RDBMS features for performance gains > Often schema-less > No key constraints > No multi-step transactions > No concept of a join Well suited for cloud deployments
6 CAP Theorem/Brewer s Conjecture The CAP theorem states that it is impossible for a distributed computer system to simultaneously provide all three of the following guarantees: Consistency means that each node/client always has the same view of the data, Availability means that all clients can always read and write, Partition tolerance means that the system works well across physical network partitions.
7 CAP Systems CA: Consistency and Availability > Consider a system that uses a two phase commit for distributed transactions > The commit is impeded if a network segment is unavailable CP: Consistency and Partition Tolerance > Consider a system that uses pessimistic locking for concurrency control > Node failures hinder availability AP: Availability and Partition Tolerance > Consider a system that always responds (e.g., DNS) > Reads might be dirty as data awaits propagation
8 NoSQL Taxonomies NoSQL Databases Come in Many Flavors* > XML (myxmldb, Tamino, Sedna) > Tabular (Hbase, Big Table) > Key/Value (Couchbase, Redis, Riak, Cassandra) > Object (db4o, JADE) > Graph (Trinity, neo4j, InfoGrid) > Document store (Couchbase Server 2.0, CouchDB, MongoDB) * These are loose taxonomies
9 Web Application Architecture and Performance Application Scales Out Just add more commodity web servers Web Servers Load Balancer RelaConal Database Database Scales Up Get a bigger, more complex server - Expensive and disrupcve sharding - Doesn t perform at Web Scale
10 Couchbase data layer scales like application logic tier Data layer now scales with linear cost and constant performance. Application Scales Out Just add more commodity web servers Web Servers Load Balancer Couchbase Servers Database Scales Out Just add more commodity data servers Horizontally scalable, schema- less, auto- sharding, high- performance at Web Scale Scaling out flattens the cost and performance curves.
11 WHAT IS COUCHBASE? Intro. Demos. Hadoop Connector Views Summary
12 Couchbase Server Overview Open source, Apache 2.0 licensed Database Commercial support provided by Couchbase Free community edition for developers Enterprise edition free for 2 nodes, support license required for larger clusters Administered via web console or RESTful API Available for Windows, Linux and Mac (development only) Written in C/C++ with some components in Erlang Both a distributed key/value store and a document oriented database > Implements Memcached API (version 1.8) > Implements most of CouchDB RESTful view API (version 2.0)
13 memcached Couchbase Membase CouchDB
14 Rapidly Growing Community 1,000s of community deployments More than 150 paying customers Over 3,500 nodes deployed in production by customers
15 Customers with Production Deployments
16 OMGPOP s DrawSomething Went Viral Pictionary Like Game #1 Paid & Free App in 84 Countries 35m Downloads
17 Couchbase Server 1.8 Architecture Memcapable Memcapable 2.0 Moxi Memcached Couchbase EP Engine Data Manager storage interface REST management API/Web UI Heartbeat Process monitor ConfiguraCon manager Global singleton supervisor Rebalance orchestrator Node health monitor vbucket state and replicacon manager Cluster Manager Persistence Layer hop on each node one per cluster Erlang/OTP HTTP 8091 Erlang port mapper 4369 Distributed Erlang
18 Component Architecture Memcapable Memcapable 2.0 Moxi Memcached Couchbase EP Engine storage interface REST Management / WebUI Heartbeat Process monitor Configuration Manager Global singleton supervisor Rebalance orchestrator Node health monitor vbucket state and replicacon manager Persistence Layer htt p on each node one per cluster Erlang/OTP HTTP 8091 Erlang port mapper 4369 Distributed Erlang
19 A Distributed Hash Table Document Store Application set(key, json) get(key) returns json Database Cluster {! DData id": brewery_legacy,! type : brewery,! "name" : "Legacy Brewing,! address": "525 Canal Street Reading", "updated": " :00:20", }!
20 Couchbase Distributed Key/Value Store Features All documents/items are stored and retrieved using standard hashtable operations (i.e, get, add, etc.) Keys are distributed evenly across the nodes of a cluster by way of vbuckets > A vbucket is a level of indirection between a key and value, and the node on which it is stored Constant time access to items via keys Single dimension hashtable with no secondary keys
21 Couchbase Document Store Features Items can be stored as JSON documents Non-JSON items are stored as binary attachments to CouchDB JSON documents Secondary, B-tree indexes are created by way of views > Views are accessed via RESTful API > Indexes are incrementally updated after access not insert > Caller specifies whether to accept stale data Views may be queried by key or range Indexes are stored across the cluster and accumulated from all nodes when requested
22 Workflow (including replication) User action results in the need to change the VALUE of KEY 1 4 SET request sent over network to master server 2 3 Application updates key s VALUE, performs SET operation Couchbase driver hashes KEY, identifies KEY s master server 5 Couchbase replicates KEY-VALUE pair, caches it in memory and stores it to disk
23 GETTING STARTED Intro. Demos. Hadoop Connector Views Summary
24 Installation of the Server for development Operating systems supported RedHat RPM Ubuntu dpkg Windows Mac OS GUI Installer Unattended installation Quickstart once installed (invokes the Admin Console)
25 COUCHBASE CLIENT INTERFACE Intro. Demos. Hadoop Connector Views Summary
26 Client Setup: Getting Cluster Configuration Couchbase Client {. Cluster Configuration over REST "bucketcapabilities": [ "touch", "sync", "couchapi" ], "bucketcapabilitiesver": "sync-1.0", "buckettype": couchbase", "name": "default", "nodelocator": "vbucket", "nodes": [ Server Node Server Node Server Node Server Node
27 Client at Runtime: Adding a node Couchbase Client New node Cluster Topology Update coming online Server Node Server Node Server Node Server Node Server Node
28 Opening a Connection (Java) Connect to the Cluster URI of any cluster node List<URI> uris = new LinkedList<URI>(); uris.add(uri.create(" try { client = new CouchbaseClient(uris, "default", ""); } catch (Exception e) { System.err.println("Error connecting to Couchbase: " + e.getmessage()); System.exit(0); }
29 PROTOCOL OVERVIEW Intro. Demos Hadoop Connector Views Summary
30 Store Operations Operation Description add() Adds new value if key does not exist or returns error. replace() Replaces existing value if specified key already exists. set() Sets new value for specified key. TTL allow for expiry time specified in seconds: Expiry Value Values < 30*24*60*60 Values > 30*24*60*60 Description Time in seconds to expiry Absolute time from epoch for expiry
31 Retrieve Operations Operation Description get() Get a value. getandtouch() Get a value and update the expiry time. getbulk() Get multiple values simultaneously, more efficient. gets() Get a value and the CAS value.
32 ASYNCHRONOUS OPERATIONS Intro. Advanced opertations Doc. Design Views Summary
33 Couchbase Java Client Asynchronous Functions Synchronous commands Force wait on application until return from server Asynchronous commands Allow background operation until response available Operations return Future object
34 It s Asynchronous Set Operation is asynchronous // Do an asynchronous set! OperationFuture<Boolean> setop =! client.set(key, EXP_TIME, VALUE);!! // Do something not related to set!! // Check to see if our set succeeded! // We block when we call the get()! if (setop.get().booleanvalue()) {! System.out.println("Set Succeeded");! } else {! System.err.println("Set failed: " +! setop.getstatus().getmessage());! }!
35 Distributed System Design: Concurrency Controls Compare and Swap Operations > Often referred to as CAS > Optimistic concurrency control > Available with many mutation operations, depending on client. Success Get with Lock > Often referred to as GETL > Pessimistic concurrency control > Locks have a short TTL > Locks released with CAS operations > Useful when working with object graphs Actor 1 Actor 2 Couchbase Server A CAS mismatch B C D F E
36 CAS Operation CAS Example! CASValue<Object> casv = client.gets(key);! Thread.sleep(random.nextInt(1000)); // a random workload!! // Wake up and do a set based on the previous CAS value! Future<CASResponse> setop =! client.asynccas(key, casv.getcas(), random.nextlong());! // Wait for the CAS Response! try {! if (setop.get().equals(casresponse.ok)) {! System.out.println("Set Succeeded");! } else {! System.err.println("Set failed: ");! }! }!!
37 CONCURRENCY Intro. Demos Hadoop Connector Views Summary
38 KEYS AND RELATIONSHIPS Intro. Demos Hadoop Connector Views Summary
39 Utilizing the strengths of the Platform Couchbase > Distributed Key, Value pairs > De-normalized Data > Fast access Session store Hadoop > Unstructured/semi-structured data > Analytics and frequent data analysis > Leverage the Hadoop Ecosystem (Pig, Hive, Hbase, etc.)
40 Use cases Ad Targeting > Decisions have to be made in milliseconds based on historical trends, location, user preferences, etc. Virtual Reality > Sub-second response time based on data available from number of different sources Gaming applications > Responsive applications that integrate with data in the back end
41 SQOOP
42 Introducing Sqoop Easily Import/Export Data into Hadoop Generate Datatypes for use in MapReduce Applications Integrate with Pig, Hive and Hbase Easily export Data from Hadoop HDFS Hive HBase Sqoop Pig Hadoop
43 COUCHBASE PLUGIN
44 Couchbase Plugin Based on the Couchbase Tap Interface Allows importing and exporting of entire database key mutations Couchbase 1. Data imported via Tap mechanism 3. Data exported back to Couchbase HDFS 2. Hadoop Processing
45 Couchbase Import and Export $ sqoop import -connect --table DUMP $ sqoop import -connect --table BACKFILL_5 $ sqoop export --connect --table DUMP export-dir DUMP For Imports, table must be: > DUMP: All keys currently in Couchbase > BACKFILL_n: All key mutations for n minutes Specified username maps to bucket > By default set to default bucket
46 Ad Targeting Application Couchbase > System of Record > Provides fast access to users and being able to record events Hadoop > Analytics > Generates a count of the # of clicks Couchbase 1. User/Event Data is imported to HDFS HDFS 2. Event data is consolidated with Map/Reduce 3. Consolidated data Is used to target ads.
47 Importing to Hadoop
48 Counting the clicks
49 Analyze using Pig
50 Counting the ads
51 Analyze and Consolidate
52 THE VIEW AND QUERY API Intro. Demos Hadoop Connector Views Summary
53 Couchbase Server 2.0 Identical feature set to Couchbase Server 1.8 > Cache-layer > High-performance > Cluster > Elastic > Core interface Adds > Document based storage (JSON) > Views with materialized indexes > Querying
54 Why use JSON? JSON (JavaScript Object Notation) > Lightweight Data-interchange format > Easy for humans to read and manipulate > Easy for machines to parse and generate (many parsers are available) > JSON is language independent (although it uses similar constructs)
55 What Are Views Views create perspectives on a collection of documents Views can cover a few different use cases Simple secondary indexes (the most common) Aggregation functions Example: count the number of North American Ales Organizing related data Use Map/Reduce Map defines the relationship between fields in documents and output table Reduce provides method for collating/summarizing Views materialized indexes Views are not create until accessed Data writes are fast (no index) Index updates all changes since last update
56 View Processing Extract fields from JSON documents and produce an index of the selected information
57 View Creation using Incremental Map/ Reduce map function creates a mapping between input data and output reduce function provides a summary (such as count, sum, etc.)
58 Map Func:ons Map outputs one or more rows of data Map outputs: > Document ID > View Key (user configurable) > View Value View Key Controls > SorCng > Querying Key + Value stored in the Index Maps provide query base
59 Map/Reduce Map Function A map function to get breweries by province function (doc) { if (doc.type == "brewery" && doc.province) { emit(doc.province, doc.name); } }
60 Map/Reduce Map Results The conceptual output of the map function is [ {"key":"connecticut", "value":"cottrell Brewing"}, {"key":"connecticut","value":"thomas Hooker Brewing Co."}, {"key":"massachusetts","value":"harpoon Brewing Company"}, {"key":"new York","value":"Brewery Ommegang"}, {"key":"vermont","value":"long Trail Brewing Company"} ]
61 Map/Reduce Reduce Function A reduce function to count breweries by province > Or simply use the build in _count function function (keys, values) { } return values.length; The conceptual input to the reduce function is: [ {"keys":["connecticut", "Connecticut"], "values": ["Cottrell Brewing", "Thomas Hooker Brewing Co."]}, {"keys":["massachusetts"],"values":["harpoon Brewing Co."]}, {"keys":["new York"],"values":["Brewery Ommegang"]}, {"keys":["vermont"],"values":["long Trail Brewing Company ]} ]
62 Map/Reduce Reduce Results The conceptual result of the reduce is {"key":"connecticut","value":2}, {"key":"massachusetts","value":1}, {"key":"new York","value":1}, {"key":"vermont","value":1}
63 THE QUERY API Intro. Demos Hadoop Connector Views Summary
64 Query APIs // map function! function (doc) {! if (doc.type == "beer") {! emit(doc._id, null);! }! }!! // Java code! Query query = new Query();!! query.setreduce(false);! query.setincludedocs(true);! query.setstale(stale.false);!!
65 THE VIEW API Intro. Demos Hadoop Connector Views Summary
66 View APIs with Java // map function! function (doc) {! if (doc.type == "beer") {! emit(doc._id, null);! }! }!! // Java code! View view = client.getview("beers", "beers");!! ViewResponse result = client.query(view, query);!! Iterator<ViewRow> itr = result.iterator();!! while (itr.hasnext()) {! row = itr.next();! doc = (String) row.getdocument();! // do something! }!!!
67 USING THE QUERY API FOR CALCULATION Intro. Advanced opertations Doc. Design Views Summary
68 Querying with Java Custom Reduce // map function! function(doc) {! if (doc.type == "beer") {! if(doc.abv) {! emit(doc.abv, 1);! }}}!! //reduce function! _count!! // Java code! View view = client.getview("beers", "beers_count_abv");! query.setgroup(true);!! while (itr.hasnext()) {! row = itr.next();! System.out.println(String.format("%s: %s,! row.getkey(), row.getvalue()));! }!
69 View Calculation Result 10: 2! 9.6: 5! 9.1: 1! 9: 3! 8.7: 2! 8.5: 1! 8: 2! 7.5: 4! 7: 4! 6.7: 1! 6.6: 1! 6.2: 1! 6: 2! 5.9: 1! 5.6: 1! 5.2: 1! 5: 1! 4.8: 2! 4.5: 1! 4: 2!
70 RESOURCES AND SUMMARY Intro. Demos Hadoop Connector Views Summary
71 Java Client Library Roadmap Server 1.8 Compatible > 1.0.1/2.8.0 > 1.0.2/2.8.0 (Memcache Node imbalance) > 1.0.x/2.8.0 (Replica Read) Server 2.0 Compatible > 1.1-dp/2.8.1 (Views) > 1.1-dp2/2.8.x (Observe) > 1.1/2.8.x (Final) Other Features > Spring Integration > Your feedback?!
72 Resources and Call For Action Couchbase Server Downloads > > Views > Developing with Client libraries > Couchbase Java Client Library wiki tips and tricks > +Library Open Beer Database > Couchbase Internals >
73 THANKS - Q&A Raghavan Rags Srinivas, Couchbase Inc. 73
Developing in NoSQL with Couchbase and Java. Raghavan N. Srinivas Couchbase 123
Developing in NoSQL with Couchbase and Java Raghavan N. Srinivas Couchbase 123 Objective A technical overview To getting started on programming with Couchbase using Java To learn about the basic operations,
More informationCouchbase Server. Chris Anderson Chief
Couchbase Server Chris Anderson Chief Architect @jchris 1 Couchbase Server Simple = Fast Elas=c NoSQL Database Formerly known as Membase Server 2 Couchbase Server Features Built- in clustering All nodes
More informationCouchbase Architecture Couchbase Inc. 1
Couchbase Architecture 2015 Couchbase Inc. 1 $whoami Laurent Doguin Couchbase Developer Advocate @ldoguin laurent.doguin@couchbase.com 2015 Couchbase Inc. 2 2 Big Data = Operational + Analytic (NoSQL +
More informationJargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems
Jargons, Concepts, Scope and Systems Key Value Stores, Document Stores, Extensible Record Stores Overview of different scalable relational systems Examples of different Data stores Predictions, Comparisons
More informationThe NoSQL Landscape. Frank Weigel VP, Field Technical Opera;ons
The NoSQL Landscape Frank Weigel VP, Field Technical Opera;ons What we ll talk about Why RDBMS are not enough? What are the different NoSQL taxonomies? Which NoSQL is right for me? Macro Trends Driving
More informationCS Silvia Zuffi - Sunil Mallya. Slides credits: official membase meetings
CS227 - Silvia Zuffi - Sunil Mallya Slides credits: official membase meetings Schedule Overview silvia History silvia Data Model silvia Architecture sunil Transaction support sunil Case studies silvia
More informationBig Data Hadoop Course Content
Big Data Hadoop Course Content Topics covered in the training Introduction to Linux and Big Data Virtual Machine ( VM) Introduction/ Installation of VirtualBox and the Big Data VM Introduction to Linux
More informationFriday, April 26, 13
Introduc)on to Map Reduce with Couchbase Tugdual Grall / @tgrall NoSQL Ma)ers 13 - Cologne - April 25th 2013 About Me Tugdual Tug Grall Couchbase exo Technical Evangelist CTO Oracle Developer/Product Manager
More informationMatt Ingenthron. Couchbase, Inc.
Matt Ingenthron Couchbase, Inc. 2 What is Membase? Before: Application scales linearly, data hits wall Application Scales Out Just add more commodity web servers Database Scales Up Get a bigger, more complex
More informationNon-Relational Databases. Pelle Jakovits
Non-Relational Databases Pelle Jakovits 25 October 2017 Outline Background Relational model Database scaling The NoSQL Movement CAP Theorem Non-relational data models Key-value Document-oriented Column
More informationBig Data Analytics using Apache Hadoop and Spark with Scala
Big Data Analytics using Apache Hadoop and Spark with Scala Training Highlights : 80% of the training is with Practical Demo (On Custom Cloudera and Ubuntu Machines) 20% Theory Portion will be important
More informationIntroduction to NoSQL Databases
Introduction to NoSQL Databases Roman Kern KTI, TU Graz 2017-10-16 Roman Kern (KTI, TU Graz) Dbase2 2017-10-16 1 / 31 Introduction Intro Why NoSQL? Roman Kern (KTI, TU Graz) Dbase2 2017-10-16 2 / 31 Introduction
More informationHadoop An Overview. - Socrates CCDH
Hadoop An Overview - Socrates CCDH What is Big Data? Volume Not Gigabyte. Terabyte, Petabyte, Exabyte, Zettabyte - Due to handheld gadgets,and HD format images and videos - In total data, 90% of them collected
More informationCIB Session 12th NoSQL Databases Structures
CIB Session 12th NoSQL Databases Structures By: Shahab Safaee & Morteza Zahedi Software Engineering PhD Email: safaee.shx@gmail.com, morteza.zahedi.a@gmail.com cibtrc.ir cibtrc cibtrc 2 Agenda What is
More informationBig Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours
Big Data Hadoop Developer Course Content Who is the target audience? Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Complete beginners who want to learn Big Data Hadoop Professionals
More informationNoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems
CompSci 516 Data Intensive Computing Systems Lecture 21 (optional) NoSQL systems Instructor: Sudeepa Roy Duke CS, Spring 2016 CompSci 516: Data Intensive Computing Systems 1 Key- Value Stores Duke CS,
More informationCISC 7610 Lecture 2b The beginnings of NoSQL
CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL
CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2015 Lecture 14 NoSQL References Scalable SQL and NoSQL Data Stores, Rick Cattell, SIGMOD Record, December 2010 (Vol. 39, No.
More information5/2/16. Announcements. NoSQL Motivation. The New Hipster: NoSQL. Serverless. What is the Problem? Database Systems CSE 414
Announcements Database Systems CSE 414 Lecture 16: NoSQL and JSon Current assignments: Homework 4 due tonight Web Quiz 6 due next Wednesday [There is no Web Quiz 5 Today s lecture: JSon The book covers
More informationIntroduction to BigData, Hadoop:-
Introduction to BigData, Hadoop:- Big Data Introduction: Hadoop Introduction What is Hadoop? Why Hadoop? Hadoop History. Different types of Components in Hadoop? HDFS, MapReduce, PIG, Hive, SQOOP, HBASE,
More informationDatabase Systems CSE 414
Database Systems CSE 414 Lecture 16: NoSQL and JSon CSE 414 - Spring 2016 1 Announcements Current assignments: Homework 4 due tonight Web Quiz 6 due next Wednesday [There is no Web Quiz 5] Today s lecture:
More informationGoal of the presentation is to give an introduction of NoSQL databases, why they are there.
1 Goal of the presentation is to give an introduction of NoSQL databases, why they are there. We want to present "Why?" first to explain the need of something like "NoSQL" and then in "What?" we go in
More informationNOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY
NOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY WHAT IS NOSQL? Stands for No-SQL or Not Only SQL. Class of non-relational data storage systems E.g.
More informationCSE 344 JULY 9 TH NOSQL
CSE 344 JULY 9 TH NOSQL ADMINISTRATIVE MINUTIAE HW3 due Wednesday tests released actual_time should have 0s not NULLs upload new data file or use UPDATE to change 0 ~> NULL Extra OOs on Mondays 5-7pm in
More informationWe are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info
We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info START DATE : TIMINGS : DURATION : TYPE OF BATCH : FEE : FACULTY NAME : LAB TIMINGS : PH NO: 9963799240, 040-40025423
More informationInnovatus Technologies
HADOOP 2.X BIGDATA ANALYTICS 1. Java Overview of Java Classes and Objects Garbage Collection and Modifiers Inheritance, Aggregation, Polymorphism Command line argument Abstract class and Interfaces String
More informationRelational databases
COSC 6397 Big Data Analytics NoSQL databases Edgar Gabriel Spring 2017 Relational databases Long lasting industry standard to store data persistently Key points concurrency control, transactions, standard
More informationOracle NoSQL Database Enterprise Edition, Version 18.1
Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across
More informationCS-580K/480K Advanced Topics in Cloud Computing. NoSQL Database
CS-580K/480K dvanced Topics in Cloud Computing NoSQL Database 1 1 Where are we? Cloud latforms 2 VM1 VM2 VM3 3 Operating System 4 1 2 3 Operating System 4 1 2 Virtualization Layer 3 Operating System 4
More informationIntroduction to Big Data. NoSQL Databases. Instituto Politécnico de Tomar. Ricardo Campos
Instituto Politécnico de Tomar Introduction to Big Data NoSQL Databases Ricardo Campos Mestrado EI-IC Análise e Processamento de Grandes Volumes de Dados Tomar, Portugal, 2016 Part of the slides used in
More informationBig Data Architect.
Big Data Architect www.austech.edu.au WHAT IS BIG DATA ARCHITECT? A big data architecture is designed to handle the ingestion, processing, and analysis of data that is too large or complex for traditional
More informationOracle NoSQL Database Enterprise Edition, Version 18.1
Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across
More informationDEMYSTIFYING BIG DATA WITH RIAK USE CASES. Martin Schneider Basho Technologies!
DEMYSTIFYING BIG DATA WITH RIAK USE CASES Martin Schneider Basho Technologies! Agenda Defining Big Data in Regards to Riak A Series of Trade-Offs Use Cases Q & A About Basho & Riak Basho Technologies is
More informationHadoop Development Introduction
Hadoop Development Introduction What is Bigdata? Evolution of Bigdata Types of Data and their Significance Need for Bigdata Analytics Why Bigdata with Hadoop? History of Hadoop Why Hadoop is in demand
More informationIntro to Couchbase Server for ColdFusion - Clustered NoSQL and Caching at its Finest
Tweet Intro to Couchbase Server for ColdFusion - Clustered NoSQL and Caching at its Finest Brad Wood Jul 26, 2013 Today we are starting a new blogging series on how to leverage Couchbase NoSQL from ColdFusion
More informationBig Data Syllabus. Understanding big data and Hadoop. Limitations and Solutions of existing Data Analytics Architecture
Big Data Syllabus Hadoop YARN Setup Programming in YARN framework j Understanding big data and Hadoop Big Data Limitations and Solutions of existing Data Analytics Architecture Hadoop Features Hadoop Ecosystem
More informationPROFESSIONAL. NoSQL. Shashank Tiwari WILEY. John Wiley & Sons, Inc.
PROFESSIONAL NoSQL Shashank Tiwari WILEY John Wiley & Sons, Inc. Examining CONTENTS INTRODUCTION xvil CHAPTER 1: NOSQL: WHAT IT IS AND WHY YOU NEED IT 3 Definition and Introduction 4 Context and a Bit
More informationMicrosoft Big Data and Hadoop
Microsoft Big Data and Hadoop Lara Rubbelke @sqlgal Cindy Gross @sqlcindy 2 The world of data is changing The 4Vs of Big Data http://nosql.mypopescu.com/post/9621746531/a-definition-of-big-data 3 Common
More information10/18/2017. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414
Announcements Database Systems CSE 414 Lecture 11: NoSQL & JSON (mostly not in textbook only Ch 11.1) HW5 will be posted on Friday and due on Nov. 14, 11pm [No Web Quiz 5] Today s lecture: NoSQL & JSON
More informationIntroduction to Hadoop. High Availability Scaling Advantages and Challenges. Introduction to Big Data
Introduction to Hadoop High Availability Scaling Advantages and Challenges Introduction to Big Data What is Big data Big Data opportunities Big Data Challenges Characteristics of Big data Introduction
More informationOpenEdge & CouchDB. Integrating the OpenEdge ABL with CouchDB. Don Beattie Software Architect Quicken Loans Inc.
OpenEdge & CouchDB Integrating the OpenEdge ABL with CouchDB Don Beattie Software Architect Quicken Loans Inc. Apache CouchDB has started. Time to relax. Intro The OpenEdge RDBMS is a great database that
More informationRealtime visitor analysis with Couchbase and Elasticsearch
Realtime visitor analysis with Couchbase and Elasticsearch Jeroen Reijn @jreijn #nosql13 About me Jeroen Reijn Software engineer Hippo @jreijn http://blog.jeroenreijn.com About Hippo Visitor Analysis OneHippo
More informationBig Data. Big Data Analyst. Big Data Engineer. Big Data Architect
Big Data Big Data Analyst INTRODUCTION TO BIG DATA ANALYTICS ANALYTICS PROCESSING TECHNIQUES DATA TRANSFORMATION & BATCH PROCESSING REAL TIME (STREAM) DATA PROCESSING Big Data Engineer BIG DATA FOUNDATION
More informationDistributed Non-Relational Databases. Pelle Jakovits
Distributed Non-Relational Databases Pelle Jakovits Tartu, 7 December 2018 Outline Relational model NoSQL Movement Non-relational data models Key-value Document-oriented Column family Graph Non-relational
More informationSources. P. J. Sadalage, M Fowler, NoSQL Distilled, Addison Wesley
Big Data and NoSQL Sources P. J. Sadalage, M Fowler, NoSQL Distilled, Addison Wesley Very short history of DBMSs The seventies: IMS end of the sixties, built for the Apollo program (today: Version 15)
More informationDATABASE DESIGN II - 1DL400
DATABASE DESIGN II - 1DL400 Fall 2016 A second course in database systems http://www.it.uu.se/research/group/udbl/kurser/dbii_ht16 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More information1
1 2 3 6 7 8 9 10 Storage & IO Benchmarking Primer Running sysbench and preparing data Use the prepare option to generate the data. Experiments Run sysbench with different storage systems and instance
More informationRelational to NoSQL: Getting started from SQL Server. Shane Johnson Sr. Product Marketing Manager Couchbase
Relational to NoSQL: Getting started from SQL Server Shane Johnson Sr. Product Marketing Manager Couchbase Today s agenda Why NoSQL? Identifying the right application Modeling your data Accessing your
More informationThis is a brief tutorial that explains how to make use of Sqoop in Hadoop ecosystem.
About the Tutorial Sqoop is a tool designed to transfer data between Hadoop and relational database servers. It is used to import data from relational databases such as MySQL, Oracle to Hadoop HDFS, and
More information/ Cloud Computing. Recitation 6 October 2 nd, 2018
15-319 / 15-619 Cloud Computing Recitation 6 October 2 nd, 2018 1 Overview Announcements for administrative issues Last week s reflection OLI unit 3 module 7, 8 and 9 Quiz 4 Project 2.3 This week s schedule
More informationA Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores
A Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores Nikhil Dasharath Karande 1 Department of CSE, Sanjay Ghodawat Institutes, Atigre nikhilkarande18@gmail.com Abstract- This paper
More informationUnderstanding NoSQL Database Implementations
Understanding NoSQL Database Implementations Sadalage and Fowler, Chapters 7 11 Class 07: Understanding NoSQL Database Implementations 1 Foreword NoSQL is a broad and diverse collection of technologies.
More informationCompSci 516 Database Systems
CompSci 516 Database Systems Lecture 20 NoSQL and Column Store Instructor: Sudeepa Roy Duke CS, Fall 2018 CompSci 516: Database Systems 1 Reading Material NOSQL: Scalable SQL and NoSQL Data Stores Rick
More informationChallenges for Data Driven Systems
Challenges for Data Driven Systems Eiko Yoneki University of Cambridge Computer Laboratory Data Centric Systems and Networking Emergence of Big Data Shift of Communication Paradigm From end-to-end to data
More informationData Informatics. Seon Ho Kim, Ph.D.
Data Informatics Seon Ho Kim, Ph.D. seonkim@usc.edu HBase HBase is.. A distributed data store that can scale horizontally to 1,000s of commodity servers and petabytes of indexed storage. Designed to operate
More informationIBM Planning Analytics Workspace Local Distributed Soufiane Azizi. IBM Planning Analytics
IBM Planning Analytics Workspace Local Distributed Soufiane Azizi IBM Planning Analytics IBM Canada - Cognos Ottawa Lab. IBM Planning Analytics Agenda 1. Demo PAW High Availability on a Prebuilt Swarm
More informationAccelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017
Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017 About the Presentation Problems Existing Solutions Denis Magda
More informationIntegrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers
Oracle zsig Conference IBM LinuxONE and z System Servers Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers Sam Amsavelu Oracle on z Architect IBM Washington
More informationrelational Relational to Riak Why Move From Relational to Riak? Introduction High Availability Riak At-a-Glance
WHITEPAPER Relational to Riak relational Introduction This whitepaper looks at why companies choose Riak over a relational database. We focus specifically on availability, scalability, and the / data model.
More informationClass Overview. Two Classes of Database Applications. NoSQL Motivation. RDBMS Review: Client-Server. RDBMS Review: Serverless
Introduction to Database Systems CSE 414 Lecture 12: NoSQL 1 Class Overview Unit 1: Intro Unit 2: Relational Data Models and Query Languages Unit 3: Non-relational data NoSQL Json SQL++ Unit 4: RDMBS internals
More informationNoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu
NoSQL Databases MongoDB vs Cassandra Kenny Huynh, Andre Chik, Kevin Vu Introduction - Relational database model - Concept developed in 1970 - Inefficient - NoSQL - Concept introduced in 1980 - Related
More informationStudy of NoSQL Database Along With Security Comparison
Study of NoSQL Database Along With Security Comparison Ankita A. Mall [1], Jwalant B. Baria [2] [1] Student, Computer Engineering Department, Government Engineering College, Modasa, Gujarat, India ank.fetr@gmail.com
More informationOverview. * Some History. * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL. * NoSQL Taxonomy. *TowardsNewSQL
* Some History * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL * NoSQL Taxonomy * Towards NewSQL Overview * Some History * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL * NoSQL Taxonomy *TowardsNewSQL NoSQL
More informationBring Context To Your Machine Data With Hadoop, RDBMS & Splunk
Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Raanan Dagan and Rohit Pujari September 25, 2017 Washington, DC Forward-Looking Statements During the course of this presentation, we may
More informationvbuckets: The Core Enabling Mechanism for Couchbase Server Data Distribution (aka Auto-Sharding )
vbuckets: The Core Enabling Mechanism for Data Distribution (aka Auto-Sharding ) Table of Contents vbucket Defined 3 key-vbucket-server ping illustrated 4 vbuckets in a world of s 5 TCP ports Deployment
More information5/1/17. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414
Announcements Database Systems CSE 414 Lecture 15: NoSQL & JSON (mostly not in textbook only Ch 11.1) 1 Homework 4 due tomorrow night [No Web Quiz 5] Midterm grading hopefully finished tonight post online
More informationJune 20, 2017 Revision NoSQL Database Architectural Comparison
June 20, 2017 Revision 0.07 NoSQL Database Architectural Comparison Table of Contents Executive Summary... 1 Introduction... 2 Cluster Topology... 4 Consistency Model... 6 Replication Strategy... 8 Failover
More informationNoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE. Nicolas Buchschacher - University of Geneva - ADASS 2018
NoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE DACE https://dace.unige.ch Data and Analysis Center for Exoplanets. Facility to store, exchange and analyse data
More informationIntroduction to Computer Science. William Hsu Department of Computer Science and Engineering National Taiwan Ocean University
Introduction to Computer Science William Hsu Department of Computer Science and Engineering National Taiwan Ocean University Chapter 9: Database Systems supplementary - nosql You can have data without
More informationPresented by Sunnie S Chung CIS 612
By Yasin N. Silva, Arizona State University Presented by Sunnie S Chung CIS 612 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. See http://creativecommons.org/licenses/by-nc-sa/4.0/
More informationGain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved.
Gain Insights From Unstructured Data Using Pivotal HD 1 Traditional Enterprise Analytics Process 2 The Fundamental Paradigm Shift Internet age and exploding data growth Enterprises leverage new data sources
More informationMongoDB An Overview. 21-Oct Socrates
MongoDB An Overview 21-Oct-2016 Socrates Agenda What is NoSQL DB? Types of NoSQL DBs DBMS and MongoDB Comparison Why MongoDB? MongoDB Architecture Storage Engines Data Model Query Language Security Data
More informationTable of Index Hadoop for Developers Hibernate: Using Hibernate For Java Database Access HP FlexNetwork Fundamentals, Rev. 14.21 HP Navigating the Journey to Cloud, Rev. 15.11 HP OneView 1.20 Rev.15.21
More informationDistributed Systems 16. Distributed File Systems II
Distributed Systems 16. Distributed File Systems II Paul Krzyzanowski pxk@cs.rutgers.edu 1 Review NFS RPC-based access AFS Long-term caching CODA Read/write replication & disconnected operation DFS AFS
More informationCSE 530A. Non-Relational Databases. Washington University Fall 2013
CSE 530A Non-Relational Databases Washington University Fall 2013 NoSQL "NoSQL" was originally the name of a specific RDBMS project that did not use a SQL interface Was co-opted years later to refer to
More informationAnnouncements. Two Classes of Database Applications. Class Overview. NoSQL Motivation. RDBMS Review: Serverless
Introduction to Database Systems CSE 414 Lecture 11: NoSQL 1 HW 3 due Friday Announcements Upload data with DataGrip editor see message board Azure timeout for question 5: Try DataGrip or SQLite HW 2 Grades
More informationHadoop & Big Data Analytics Complete Practical & Real-time Training
An ISO Certified Training Institute A Unit of Sequelgate Innovative Technologies Pvt. Ltd. www.sqlschool.com Hadoop & Big Data Analytics Complete Practical & Real-time Training Mode : Instructor Led LIVE
More informationOral Questions and Answers (DBMS LAB) Questions & Answers- DBMS
Questions & Answers- DBMS https://career.guru99.com/top-50-database-interview-questions/ 1) Define Database. A prearranged collection of figures known as data is called database. 2) What is DBMS? Database
More informationUsing the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver
Using the SDACK Architecture to Build a Big Data Product Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver Outline A Threat Analytic Big Data product The SDACK Architecture Akka Streams and data
More informationA NoSQL Introduction for Relational Database Developers. Andrew Karcher Las Vegas SQL Saturday September 12th, 2015
A NoSQL Introduction for Relational Database Developers Andrew Karcher Las Vegas SQL Saturday September 12th, 2015 About Me http://www.andrewkarcher.com Twitter: @akarcher LinkedIn, Twitter Email: akarcher@gmail.com
More informationMySQL High Availability
MySQL High Availability InnoDB Cluster and NDB Cluster Ted Wennmark ted.wennmark@oracle.com Copyright 2016, Oracle and/or its its affiliates. All All rights reserved. Safe Harbor Statement The following
More informationCourse Content MongoDB
Course Content MongoDB 1. Course introduction and mongodb Essentials (basics) 2. Introduction to NoSQL databases What is NoSQL? Why NoSQL? Difference Between RDBMS and NoSQL Databases Benefits of NoSQL
More information@Pentaho #BigDataWebSeries
Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of
More informationOPEN SOURCE DB SYSTEMS TYPES OF DBMS
OPEN SOURCE DB SYSTEMS Anna Topol 1 TYPES OF DBMS Relational Key-Value Document-oriented Graph 2 DBMS SELECTION Multi-platform or platform-agnostic Offers persistent storage Fairly well known Actively
More informationMaking Non-Distributed Databases, Distributed. Ioannis Papapanagiotou, PhD Shailesh Birari
Making Non-Distributed Databases, Distributed Ioannis Papapanagiotou, PhD Shailesh Birari Dynomite Ecosystem Dynomite - Proxy layer Dyno - Client Dynomite-manager - Ecosystem orchestrator Dynomite-explorer
More informationThe Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou
The Hadoop Ecosystem EECS 4415 Big Data Systems Tilemachos Pechlivanoglou tipech@eecs.yorku.ca A lot of tools designed to work with Hadoop 2 HDFS, MapReduce Hadoop Distributed File System Core Hadoop component
More informationBig Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara
Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case
More informationGetting to know. by Michelle Darling August 2013
Getting to know by Michelle Darling mdarlingcmt@gmail.com August 2013 Agenda: What is Cassandra? Installation, CQL3 Data Modelling Summary Only 15 min to cover these, so please hold questions til the end,
More informationScaling for Humongous amounts of data with MongoDB
Scaling for Humongous amounts of data with MongoDB Alvin Richards Technical Director, EMEA alvin@10gen.com @jonnyeight alvinonmongodb.com From here... http://bit.ly/ot71m4 ...to here... http://bit.ly/oxcsis
More information1 Big Data Hadoop. 1. Introduction About this Course About Big Data Course Logistics Introductions
Big Data Hadoop Architect Online Training (Big Data Hadoop + Apache Spark & Scala+ MongoDB Developer And Administrator + Apache Cassandra + Impala Training + Apache Kafka + Apache Storm) 1 Big Data Hadoop
More informationCPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University
CPSC 426/526 Cloud Computing Ennan Zhai Computer Science Department Yale University Recall: Lec-7 In the lec-7, I talked about: - P2P vs Enterprise control - Firewall - NATs - Software defined network
More informationPutting together the platform: Riak, Redis, Solr and Spark. Bryan Hunt
Putting together the platform: Riak, Redis, Solr and Spark Bryan Hunt 1 $ whoami Bryan Hunt Client Services Engineer @binarytemple 2 Minimum viable product - the ideologically correct doctrine 1. Start
More informationNoSQL Databases. Amir H. Payberah. Swedish Institute of Computer Science. April 10, 2014
NoSQL Databases Amir H. Payberah Swedish Institute of Computer Science amir@sics.se April 10, 2014 Amir H. Payberah (SICS) NoSQL Databases April 10, 2014 1 / 67 Database and Database Management System
More information<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure
MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for
More informationIntroduction to NoSQL
Introduction to NoSQL Agenda History What is NoSQL Types of NoSQL The CAP theorem History - RDBMS Relational DataBase Management Systems were invented in the 1970s. E. F. Codd, "Relational Model of Data
More informationParallel Programming Principle and Practice. Lecture 10 Big Data Processing with MapReduce
Parallel Programming Principle and Practice Lecture 10 Big Data Processing with MapReduce Outline MapReduce Programming Model MapReduce Examples Hadoop 2 Incredible Things That Happen Every Minute On The
More informationSQT03 Big Data and Hadoop with Azure HDInsight Andrew Brust. Senior Director, Technical Product Marketing and Evangelism
Big Data and Hadoop with Azure HDInsight Andrew Brust Senior Director, Technical Product Marketing and Evangelism Datameer Level: Intermediate Meet Andrew Senior Director, Technical Product Marketing and
More informationCIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )
Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL
More information1Z Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions
1Z0-449 Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions Table of Contents Introduction to 1Z0-449 Exam on Oracle Big Data 2017 Implementation Essentials... 2 Oracle 1Z0-449
More informationDatabricks, an Introduction
Databricks, an Introduction Chuck Connell, Insight Digital Innovation Insight Presentation Speaker Bio Senior Data Architect at Insight Digital Innovation Focus on Azure big data services HDInsight/Hadoop,
More information