Triple R Riak, Redis and RabbitMQ at XING
|
|
- Gwendoline Wheeler
- 5 years ago
- Views:
Transcription
1 Triple R Riak, Redis and RabbitMQ at XING Dr. Stefan Kaes, Sebastian Röbke NoSQL matters Cologne, April 27, 2013
2 ActivityStream Intro
3
4
5
6 3 Types of Feeds
7 News Feed
8 Me Feed
9 Company Feed
10 Activity Creation
11
12 ActivityStream POST /activitystream/activities
13
14 ActivityStream events.participation.changed
15 Events App events.event.created events.participation.changed... Groups App groups.article.created groups.member.joined... User App users.profile.updated users.contact.created... etc ActivityStream
16 Old Approach
17 Activity INSERT INTO `activities`... Comment INSERT INTO `comments`... several hundred millions Like INSERT INTO `likes`...
18 Efficient Activity Creation
19 But... slow reads
20
21 Groups App Companies App User App GET group memberships Settings GET privacy settings GET followed companies etc.... GET contacts ActivityStream activities? likes comments
22 Groups App Companies App User App GET group memberships Settings GET privacy settings GET followed companies etc.... GET contacts ActivityStream activities likes comments
23 Activities immediately visible
24 Consistency
25 SQL Databases are well understood
26 Database Master is Single Point of Failure
27 Sharding
28 Unsatisfactory Read Performance
29 New Approach
30 Materialized Feeds
31 Activity create User App GET contacts... etc. ActivityStream Storage activities news feeds me feeds company feeds likes comments?
32 Requirements
33 Better Read Performance
34 Activities created by me must be visible to myself immediately
35 Activities created by others should appear within a reasonable time frame in my stream
36 Storage layer must tolerate high read and write loads
37 Storage layer must provide easy capacity scaling
38 Low maintenance
39 Option 1: Do it yourself SQL database design
40 Option 2: Off the shelf NoSQL database
41 We chose
42 We tend to view it as a highly available distributed hash table
43 Eventual consistency/conflict resolution is the hard part
44 Bounded size feeds are easy
45 Unbounded feeds are much harder
46 Object Model
47 JSON Documents
48 Activities
49 Activities
50 Activities
51 Activities 2P-Set
52 JSON Documents
53 Feeds
54 Feeds bounded list of chunk references
55 Feeds chunk sequence number
56 Feeds youngest activity ref
57 Feeds oldest activity ref
58 Feeds size of referenced chunk
59 JSON Documents
60 FeedChunk
61 FeedChunk 2P-Set
62 The Migration
63 Incremental rollout
64 Part 1: From old to new
65 Let s start simple!
66 Replicating some data
67 Old ActivityStream activity.created activity.deleted comment.created comment.deleted like.created like.deleted data migration processors New ActivityStream
68 Measuring performance
69 Old ActivityStream newsfeed.viewed mefeed.viewed companyfeed.viewed data migration processors shadow query processors New ActivityStream
70
71 Part 2: From new to old
72 Old ActivityStream POST /activitystream/activities DELETE /activitystream/activities/{id} POST /activitystream/activities/{activity_id}/comments DELETE /activitystream/activities/{activity_id}/comments/{id} PUT /activitystream/activities/{activity_id}/likes/{user_id} DELETE /activitystream/activities/{activity_id}/likes/{user_id} Beta User B Beta User A Beta User C New ActivityStream
73 Old ActivityStream POST /activitystream/activities DELETE /activitystream/activities/{id} POST /activitystream/activities/{activity_id}/comments DELETE /activitystream/activities/{activity_id}/comments/{id} PUT /activitystream/activities/{activity_id}/likes/{user_id} DELETE /activitystream/activities/{activity_id}/likes/{user_id} Beta User A Beta User B Beta User C activity id New ActivityStream
74 Part 3: What about the old data?
75 Bulk Data Migration: Failed Version 1
76 Bulk Data Migration: Failed Version 1 1. Reset data in the new system 2. Query the old system REST API for the feeds 3. Store them in the new system 4. Switch to the new system
77 This was naive
78 The old system was way too slow to return the millions of feeds in their full length
79 Bulk Data Migration: Failed Version 2
80 Bulk Data Migration: Failed Version 2 1. Reset data in the new system 2. Read all activities based on a dump of the old system 3. Publish created messages to RabbitMQ for each activity/comment/like 4. Let the new system build its data structures 5. Switch to the new system
81 This was naive
82 You can t replay the history of 2.5 years this way
83 Bulk Data Migration: Successful Version
84 Bulk Data Migration: Successful Version 1. Reset data in the new system 2. Obtain data dump from old system 3. Extract data from the dumps and compute a representation of the feeds in Redis with a massive amount of batch processors 4. Use this data to build up the structures in the new system 5. Switch to the new system
85 It worked!
86 But...
87 A lot of additional code
88 Fragile, manual steps involved
89 A lot of technology: RabbitMQ, Riak, Redis, Varnish
90 One run took 5 days
91 But it worked!
92 Current Status
93 New system is live for all users since 12/12/12
94 Old and new system were kept in sync till April 2013
95 In case of serious trouble, we could have switched to the old system within seconds
96 Performance goals have been met
97 Performance goals have been met Old system New system happy t < 0.1s 0.17% 62.01% satisfied t < 0.5s 41.36% 99.71% tolerating 0.5s t < 2s 58.20% 0.28% frustrated t 2s 0.44% 0.00% Apdex Score
98 Production setup 10 Riak Servers as Primary Cluster 10 Riak Servers as Backup Cluster (Multi-DC Replication) SSDs, Raid 0 and a proper Linux file I/O scheduler setting (noop) Bitcask storage backend 4 REST API Servers 4 Background Worker Servers Monitoring using Ganglia and Logjam (App Performance)
99 Lessons learned
100 Eventual consistency sounds easy, but is hard to implement correctly in practice There s a steep learning curve at the beginning High update rates and large objects don t go together well, if your storage system offers just get, put and delete operations Achieving high performance requires careful thought about data structures, algorithms and access patterns Building a new system from scratch is lot easier than migrating an existing system
101 Protobuffs API faster than HTTP Use the best performing JSON library you can find (Ruby: Oj gem) Avoid a full-blown ORM for Riak if you care about performance (Ruby: Ripple gem)
102 At one point we saturated the Gigabit network cards on the Riak cluster This lead to compressing all data before storing it on the cluster and breaking news feeds into chunks
103 Thank you for your attention! Dr. Stefan Kaes Sebastian Röbke We re hiring: The professional network
Redis to the Rescue? O Reilly MySQL Conference
Redis to the Rescue? O Reilly MySQL Conference 2011-04-13 Who? Tim Lossen / @tlossen Berlin, Germany backend developer at wooga Redis Intro Case 1: Monster World Case 2: Happy Hospital Discussion Redis
More informationIntroduction Storage Processing Monitoring Review. Scaling at Showyou. Operations. September 26, 2011
Scaling at Showyou Operations September 26, 2011 I m Kyle Kingsbury Handle aphyr Code http://github.com/aphyr Email kyle@remixation.com Focus Backend, API, ops What the hell is Showyou? Nontrivial complexity
More informationO Reilly RailsConf,
O Reilly RailsConf, 2011-05- 18 Who is that guy? Jesper Richter- Reichhelm / @jrirei Berlin, Germany Head of Engineering @ wooga Wooga does social games Wooga has dedicated game teams Cooming soon PHP
More informationScaling. Marty Weiner Grayskull, Eternia. Yashh Nelapati Gotham City
Scaling Marty Weiner Grayskull, Eternia Yashh Nelapati Gotham City Pinterest is... An online pinboard to organize and share what inspires you. Relationships Marty Weiner Grayskull, Eternia Yashh Nelapati
More informationPNUTS: Yahoo! s Hosted Data Serving Platform. Reading Review by: Alex Degtiar (adegtiar) /30/2013
PNUTS: Yahoo! s Hosted Data Serving Platform Reading Review by: Alex Degtiar (adegtiar) 15-799 9/30/2013 What is PNUTS? Yahoo s NoSQL database Motivated by web applications Massively parallel Geographically
More informationSCALABLE DATABASES. Sergio Bossa. From Relational Databases To Polyglot Persistence.
SCALABLE DATABASES From Relational Databases To Polyglot Persistence Sergio Bossa sergio.bossa@gmail.com http://twitter.com/sbtourist About Me Software architect and engineer Gioco Digitale (online gambling
More informationCourse Content MongoDB
Course Content MongoDB 1. Course introduction and mongodb Essentials (basics) 2. Introduction to NoSQL databases What is NoSQL? Why NoSQL? Difference Between RDBMS and NoSQL Databases Benefits of NoSQL
More informationThe course modules of MongoDB developer and administrator online certification training:
The course modules of MongoDB developer and administrator online certification training: 1 An Overview of the Course Introduction to the course Table of Contents Course Objectives Course Overview Value
More informationBuffering to Redis for Efficient Real-Time Processing. Percona Live, April 24, 2018
Buffering to Redis for Efficient Real-Time Processing Percona Live, April 24, 2018 Presenting Today Jon Hyman CTO & Co-Founder Braze (Formerly Appboy) @jon_hyman Mobile is at the vanguard of a new wave
More informationCouchbase Architecture Couchbase Inc. 1
Couchbase Architecture 2015 Couchbase Inc. 1 $whoami Laurent Doguin Couchbase Developer Advocate @ldoguin laurent.doguin@couchbase.com 2015 Couchbase Inc. 2 2 Big Data = Operational + Analytic (NoSQL +
More informationAlwaysOn Availability Groups: Backups, Restores, and CHECKDB
AlwaysOn Availability Groups: Backups, Restores, and CHECKDB www.brentozar.com sp_blitz sp_blitzfirst email newsletter videos SQL Critical Care 2016 Brent Ozar Unlimited. All rights reserved. 1 What I
More informationDesign Patterns for Large- Scale Data Management. Robert Hodges OSCON 2013
Design Patterns for Large- Scale Data Management Robert Hodges OSCON 2013 The Start-Up Dilemma 1. You are releasing Online Storefront V 1.0 2. It could be a complete bust 3. But it could be *really* big
More informationArchitecture and Design of MySQL Powered Applications. Peter Zaitsev CEO, Percona Highload Moscow, Russia 31 Oct 2014
Architecture and Design of MySQL Powered Applications Peter Zaitsev CEO, Percona Highload++ 2014 Moscow, Russia 31 Oct 2014 About Percona 2 Open Source Software for MySQL Ecosystem Percona Server Percona
More informationArchitekturen für die Cloud
Architekturen für die Cloud Eberhard Wolff Architecture & Technology Manager adesso AG 08.06.11 What is Cloud? National Institute for Standards and Technology (NIST) Definition On-demand self-service >
More informationMongoDB Schema Design for. David Murphy MongoDB Practice Manager - Percona
MongoDB Schema Design for the Click "Dynamic to edit Master Schema" title World style David Murphy MongoDB Practice Manager - Percona Who is this Person and What Does He Know? Former MongoDB Master Former
More informationChapter 24 NOSQL Databases and Big Data Storage Systems
Chapter 24 NOSQL Databases and Big Data Storage Systems - Large amounts of data such as social media, Web links, user profiles, marketing and sales, posts and tweets, road maps, spatial data, email - NOSQL
More informationScaling. Yashh Nelapati Gotham City. Marty Weiner Krypton. Friday, July 27, 12
Scaling Marty Weiner Krypton Yashh Nelapati Gotham City Pinterest is... An online pinboard to organize and share what inspires you. Relationships Marty Weiner Grayskull, Eternia Relationships Marty
More informationPutting together the platform: Riak, Redis, Solr and Spark. Bryan Hunt
Putting together the platform: Riak, Redis, Solr and Spark Bryan Hunt 1 $ whoami Bryan Hunt Client Services Engineer @binarytemple 2 Minimum viable product - the ideologically correct doctrine 1. Start
More informationTomasz Szumlak WFiIS AGH 23/10/2017, Kraków
Python in the Enterprise Django Intro Tomasz Szumlak WFiIS AGH 23/10/2017, Kraków Going beyond Django is a Web framework very popular! It is not the only one, and cannot do wonders There are many others:
More informationCIB Session 12th NoSQL Databases Structures
CIB Session 12th NoSQL Databases Structures By: Shahab Safaee & Morteza Zahedi Software Engineering PhD Email: safaee.shx@gmail.com, morteza.zahedi.a@gmail.com cibtrc.ir cibtrc cibtrc 2 Agenda What is
More informationApp Engine: Datastore Introduction
App Engine: Datastore Introduction Part 1 Another very useful course: https://www.udacity.com/course/developing-scalableapps-in-java--ud859 1 Topics cover in this lesson What is Datastore? Datastore and
More informationMap-Reduce. John Hughes
Map-Reduce John Hughes The Problem 850TB in 2006 The Solution? Thousands of commodity computers networked together 1,000 computers 850GB each How to make them work together? Early Days Hundreds of ad-hoc
More informationEventually Consistent HTTP with Statebox and Riak
Eventually Consistent HTTP with Statebox and Riak Author: Bob Ippolito (@etrepum) Date: November 2011 Venue: QCon San Francisco 2011 1/62 Introduction This talk isn't really about web. It's about how we
More informationDistributed Systems. 05r. Case study: Google Cluster Architecture. Paul Krzyzanowski. Rutgers University. Fall 2016
Distributed Systems 05r. Case study: Google Cluster Architecture Paul Krzyzanowski Rutgers University Fall 2016 1 A note about relevancy This describes the Google search cluster architecture in the mid
More informationGoogle File System. Arun Sundaram Operating Systems
Arun Sundaram Operating Systems 1 Assumptions GFS built with commodity hardware GFS stores a modest number of large files A few million files, each typically 100MB or larger (Multi-GB files are common)
More informationDiscover GraphQL with Python, Graphene and Odoo. FOSDEM Stéphane Bidoul Version 1.0.4
Discover GraphQL with Python, Graphene and Odoo FOSDEM 2019-02-03 Stéphane Bidoul Version 1.0.4 2 / 47 A short story Why this talk 3 / 47 /me in a nutshell @sbidoul CTO of (https://acsone.eu)
More informationOutline. Spanner Mo/va/on. Tom Anderson
Spanner Mo/va/on Tom Anderson Outline Last week: Chubby: coordina/on service BigTable: scalable storage of structured data GFS: large- scale storage for bulk data Today/Friday: Lessons from GFS/BigTable
More informationDEMYSTIFYING BIG DATA WITH RIAK USE CASES. Martin Schneider Basho Technologies!
DEMYSTIFYING BIG DATA WITH RIAK USE CASES Martin Schneider Basho Technologies! Agenda Defining Big Data in Regards to Riak A Series of Trade-Offs Use Cases Q & A About Basho & Riak Basho Technologies is
More informationIntroduction to Hadoop. High Availability Scaling Advantages and Challenges. Introduction to Big Data
Introduction to Hadoop High Availability Scaling Advantages and Challenges Introduction to Big Data What is Big data Big Data opportunities Big Data Challenges Characteristics of Big data Introduction
More informationTALK 1: CONVINCE YOUR BOSS: CHOOSE THE "RIGHT" DATABASE. Prof. Dr. Stefan Edlich Beuth University of Technology Berlin (App.Sc.)
TALK 1: CONVINCE YOUR BOSS: CHOOSE THE "RIGHT" DATABASE Prof. Dr. Stefan Edlich Beuth University of Technology Berlin (App.Sc.) nosqlfrankfurt.de nosql powerdays 2 years of NoSQL Consulting http://nosql-database.org
More informationIntroduction to NoSQL Databases
Introduction to NoSQL Databases Roman Kern KTI, TU Graz 2017-10-16 Roman Kern (KTI, TU Graz) Dbase2 2017-10-16 1 / 31 Introduction Intro Why NoSQL? Roman Kern (KTI, TU Graz) Dbase2 2017-10-16 2 / 31 Introduction
More informationDIVING IN: INSIDE THE DATA CENTER
1 DIVING IN: INSIDE THE DATA CENTER Anwar Alhenshiri Data centers 2 Once traffic reaches a data center it tunnels in First passes through a filter that blocks attacks Next, a router that directs it to
More informationTime-Series Data in MongoDB on a Budget. Peter Schwaller Senior Director Server Engineering, Percona Santa Clara, California April 23th 25th, 2018
Time-Series Data in MongoDB on a Budget Peter Schwaller Senior Director Server Engineering, Percona Santa Clara, California April 23th 25th, 2018 TIME SERIES DATA in MongoDB on a Budget Click to add text
More informationRavenDB & document stores
université libre de bruxelles INFO-H415 - Advanced Databases RavenDB & document stores Authors: Yasin Arslan Jacky Trinh Professor: Esteban Zimányi Contents 1 Introduction 3 1.1 Présentation...................................
More informationRealtime visitor analysis with Couchbase and Elasticsearch
Realtime visitor analysis with Couchbase and Elasticsearch Jeroen Reijn @jreijn #nosql13 About me Jeroen Reijn Software engineer Hippo @jreijn http://blog.jeroenreijn.com About Hippo Visitor Analysis OneHippo
More informationHBase Solutions at Facebook
HBase Solutions at Facebook Nicolas Spiegelberg Software Engineer, Facebook QCon Hangzhou, October 28 th, 2012 Outline HBase Overview Single Tenant: Messages Selection Criteria Multi-tenant Solutions
More informationFrom the event loop to the distributed system. Martyn 3rd November, 2011
From the event loop to the distributed system Martyn Loughran martyn@pusher.com @mloughran 3rd November, 2011 From the event loop to the distributed system From the event loop to the distributed system
More informationMongoDB. David Murphy MongoDB Practice Manager, Percona
MongoDB Click Replication to edit Master and Sharding title style David Murphy MongoDB Practice Manager, Percona Who is this Person and What Does He Know? Former MongoDB Master Former Lead DBA for ObjectRocket,
More informationThe Google File System
The Google File System Sanjay Ghemawat, Howard Gobioff and Shun Tak Leung Google* Shivesh Kumar Sharma fl4164@wayne.edu Fall 2015 004395771 Overview Google file system is a scalable distributed file system
More informationWindows Servers In Microsoft Azure
$6/Month Windows Servers In Microsoft Azure What I m Going Over 1. How inexpensive servers in Microsoft Azure are 2. How I get Windows servers for $6/month 3. Why Azure hosted servers are way better 4.
More informationPerspectives on NoSQL
Perspectives on NoSQL PGCon 2010 Gavin M. Roy What is NoSQL? NoSQL is a movement promoting a loosely defined class of nonrelational data stores that break with a long history of relational
More information+ + a journey to zero-downtime
+ + a journey to zero-downtime James Halsall Technical Team Lead @ Inviqa Christian Dornhoff Head of IT (cross-channel) @ toom A bit about toom baumarkt One of Germany s biggest DIY companies Annual turnover
More informationElasticSearch in Production
ElasticSearch in Production lessons learned Anne Veling, ApacheCon EU, November 6, 2012 agenda! Introduction! ElasticSearch! Udini! Upcoming Tool! Lessons Learned introduction! Anne Veling, @anneveling!
More informationMongoDB - a No SQL Database What you need to know as an Oracle DBA
MongoDB - a No SQL Database What you need to know as an Oracle DBA David Burnham Aims of this Presentation To introduce NoSQL database technology specifically using MongoDB as an example To enable the
More informationA Global In-memory Data System for MySQL Daniel Austin, PayPal Technical Staff
A Global In-memory Data System for MySQL Daniel Austin, PayPal Technical Staff Percona Live! MySQL Conference Santa Clara, April 12th, 2012 v1.3 Intro: Globalizing NDB Proposed Architecture What We Learned
More informationSpotify. Scaling storage to million of users world wide. Jimmy Mårdell October 14, 2014
Cassandra @ Spotify Scaling storage to million of users world wide! Jimmy Mårdell October 14, 2014 2 About me Jimmy Mårdell Tech Product Owner in the Cassandra team 4 years at Spotify
More informationA Brief Introduction to Key-Value Stores
A Brief Introduction to Key-Value Stores Jakob Blomer ALICE Offline Week, CERN July 1st 2015 1 / 26 1 A Little Bit on NoSQL, ACID, BASE, and the CAP Theorem 2 Three Examples: Riak, ZooKeeper, RAMCloud
More informationBuilding High Performance Apps using NoSQL. Swami Sivasubramanian General Manager, AWS NoSQL
Building High Performance Apps using NoSQL Swami Sivasubramanian General Manager, AWS NoSQL Building high performance apps There is a lot to building high performance apps Scalability Performance at high
More informationTechnology Overview ScaleArc. All Rights Reserved.
2014 ScaleArc. All Rights Reserved. Contents Contents...1 ScaleArc Overview...1 Who ScaleArc Helps...2 Historical Database Challenges...3 Use Cases and Projects...5 Sample ScaleArc Customers...5 Summary
More informationRelational databases
COSC 6397 Big Data Analytics NoSQL databases Edgar Gabriel Spring 2017 Relational databases Long lasting industry standard to store data persistently Key points concurrency control, transactions, standard
More informationMap-Reduce (PFP Lecture 12) John Hughes
Map-Reduce (PFP Lecture 12) John Hughes The Problem 850TB in 2006 The Solution? Thousands of commodity computers networked together 1,000 computers 850GB each How to make them work together? Early Days
More informationA NoSQL Introduction for Relational Database Developers. Andrew Karcher Las Vegas SQL Saturday September 12th, 2015
A NoSQL Introduction for Relational Database Developers Andrew Karcher Las Vegas SQL Saturday September 12th, 2015 About Me http://www.andrewkarcher.com Twitter: @akarcher LinkedIn, Twitter Email: akarcher@gmail.com
More informationPrivacy and Security in Online Social Networks Department of Computer Science and Engineering Indian Institute of Technology, Madras
Privacy and Security in Online Social Networks Department of Computer Science and Engineering Indian Institute of Technology, Madras Lecture 08 Tutorial 2, Part 2, Facebook API (Refer Slide Time: 00:12)
More informationMicrosoft vision for a new era
Microsoft vision for a new era United platform for the modern service provider MICROSOFT AZURE CUSTOMER DATACENTER CONSISTENT PLATFORM SERVICE PROVIDER Enterprise-grade Global reach, scale, and security
More informationSourceForge. Mark Ramm
MongoDB @ SourceForge Mark Ramm We had a problem six weeks the other sourceforge over 90% of traffic Design goals Improve Usability (more data, more dynamic pages) Improve Performance Improve Reliability
More informationThe Care and Feeding of a MySQL Database for Linux Adminstrators. Dave Stokes MySQL Community Manager
The Care and Feeding of a MySQL Database for Linux Adminstrators Dave Stokes MySQL Community Manager David.Stokes@Oracle.com Simple Introduction This is a general introduction to running a MySQL database
More informationA Distributed System Case Study: Apache Kafka. High throughput messaging for diverse consumers
A Distributed System Case Study: Apache Kafka High throughput messaging for diverse consumers As always, this is not a tutorial Some of the concepts may no longer be part of the current system or implemented
More informationAutomated Testing of Tableau Dashboards
Kinesis Technical Whitepapers April 2018 Kinesis CI Automated Testing of Tableau Dashboards Abstract Companies make business critical decisions every day, based on data from their business intelligence
More informationHDFS Architecture. Gregory Kesden, CSE-291 (Storage Systems) Fall 2017
HDFS Architecture Gregory Kesden, CSE-291 (Storage Systems) Fall 2017 Based Upon: http://hadoop.apache.org/docs/r3.0.0-alpha1/hadoopproject-dist/hadoop-hdfs/hdfsdesign.html Assumptions At scale, hardware
More informationVoldemort. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation
Voldemort Smruti R. Sarangi Department of Computer Science Indian Institute of Technology New Delhi, India Smruti R. Sarangi Leader Election 1/29 Outline 1 2 3 Smruti R. Sarangi Leader Election 2/29 Data
More informationConsistency Without Transactions Global Family Tree
Consistency Without Transactions Global Family Tree NoSQL Matters Cologne Spring 2014 2014 by Intellectual Reserve, Inc. All rights reserved. 1 Contents Introduction to FamilySearch Family Tree Motivation
More informationStreaming Log Analytics with Kafka
Streaming Log Analytics with Kafka Kresten Krab Thorup, Humio CTO Log Everything, Answer Anything, In Real-Time. Why this talk? Humio is a Log Analytics system Designed to run on-prem High volume, real
More informationIntroduction to Database Services
Introduction to Database Services Shaun Pearce AWS Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Today s agenda Why managed database services? A non-relational
More informationMigrating massive monitoring to Bigtable without downtime. Martin Parm, Infrastructure Engineer for Monitoring
Migrating massive monitoring to Bigtable without downtime Martin Parm, Infrastructure Engineer for Monitoring This is a big deal. -- Nicholas Harteau/VP, Engineering & Infrastructure https://news.spotify.com/dk/2016/02/23/announcing-spotify-infrastructures-googley-future/
More informationNOSQL DATABASES OCTOBER 20, A comparison between the MongoDB, Cassandra, and Redis databases ANDREW HYTE
NOSQL DATABASES A comparison between the MongoDB, Cassandra, and Redis databases OCTOBER 20, 2015 ANDREW HYTE Contents Introduction... 2 MongoDB... 2 History... 2 Data Model... 2 Physical Storage... 3
More informationThe NoSQL movement. CouchDB as an example
The NoSQL movement CouchDB as an example About me sleepnova - I'm a freelancer Interests: emerging technology, digital art web, embedded system, javascript, programming language Some of my works: Chrome
More informationWorking with Velti NoSQL Roadshow Basel
Working with Velti Our robust technology has been used by major broadcasters and media clients for over 7 years Voting, Polling and Real-time Interactivity through second screen solutions Incremental revenue
More informationAccount Activity Migration guide & set up
Account Activity Migration guide & set up Agenda 1 2 3 4 5 What is the Account Activity (AAAPI)? User Streams & Site Streams overview What s different & what s changing? How to migrate to AAAPI? Questions?
More informationDistributed Systems. 29. Distributed Caching Paul Krzyzanowski. Rutgers University. Fall 2014
Distributed Systems 29. Distributed Caching Paul Krzyzanowski Rutgers University Fall 2014 December 5, 2014 2013 Paul Krzyzanowski 1 Caching Purpose of a cache Temporary storage to increase data access
More informationChapter 3 `How a Storage Policy Works
Chapter 3 `How a Storage Policy Works 32 - How a Storage Policy Works A Storage Policy defines the lifecycle management rules for all protected data. In its most basic form, a storage policy can be thought
More informationCS5412: DIVING IN: INSIDE THE DATA CENTER
1 CS5412: DIVING IN: INSIDE THE DATA CENTER Lecture V Ken Birman We ve seen one cloud service 2 Inside a cloud, Dynamo is an example of a service used to make sure that cloud-hosted applications can scale
More informationCA485 Ray Walshe Google File System
Google File System Overview Google File System is scalable, distributed file system on inexpensive commodity hardware that provides: Fault Tolerance File system runs on hundreds or thousands of storage
More informationRESPONSIVE WEB DESIGN IN 24 HOURS, SAMS TEACH YOURSELF BY JENNIFER KYRNIN
RESPONSIVE WEB DESIGN IN 24 HOURS, SAMS TEACH YOURSELF BY JENNIFER KYRNIN DOWNLOAD EBOOK : RESPONSIVE WEB DESIGN IN 24 HOURS, SAMS TEACH Click link bellow and free register to download ebook: RESPONSIVE
More informationPostgres-XC PG session #3. Michael PAQUIER Paris, 2012/02/02
Postgres-XC PG session #3 Michael PAQUIER Paris, 2012/02/02 Agenda Self-introduction Highlights of Postgres-XC Core architecture overview Performance High-availability Release status 2 Self-introduction
More informationDATABASE DESIGN II - 1DL400
DATABASE DESIGN II - 1DL400 Fall 2016 A second course in database systems http://www.it.uu.se/research/group/udbl/kurser/dbii_ht16 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationCISC 7610 Lecture 2b The beginnings of NoSQL
CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone
More informationMySQL Performance Improvements
Taking Advantage of MySQL Performance Improvements Baron Schwartz, Percona Inc. Introduction About Me (Baron Schwartz) Author of High Performance MySQL 2 nd Edition Creator of Maatkit, innotop, and so
More informationMySQL. The Right Database for GIS Sometimes
MySQL The Right Database for GIS Sometimes Who am I? Web/GIS Software Engineer with Cimbura.com BS in IT, MGIS Michael Moore I like making and using tools (digital or physical) GIS Web Services I m most
More informationThe Google File System
October 13, 2010 Based on: S. Ghemawat, H. Gobioff, and S.-T. Leung: The Google file system, in Proceedings ACM SOSP 2003, Lake George, NY, USA, October 2003. 1 Assumptions Interface Architecture Single
More informationFLORIDA DEPARTMENT OF TRANSPORTATION PRODUCTION BIG DATA PLATFORM
FLORIDA DEPARTMENT OF TRANSPORTATION PRODUCTION BIG DATA PLATFORM RECOMMENDATION AND JUSTIFACTION Executive Summary: VHB has been tasked by the Florida Department of Transportation District Five to design
More informationData Analytics with HPC. Data Streaming
Data Analytics with HPC Data Streaming Reusing this material This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 4.0 International License. http://creativecommons.org/licenses/by-nc-sa/4.0/deed.en_us
More informationIntegrate with confidence
Integrate with confidence Testing and verifying API integrations Andrew Spinks aspinks@dius.com.au @andrew_spinks Andrew Spinks aspinks@dius.com.au @andrew_spinks Pact? http://pact.io Fat clients The
More informationDatabase Architectures
Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL
More informationPostgres-XC PostgreSQL Conference Michael PAQUIER Tokyo, 2012/02/24
Postgres-XC PostgreSQL Conference 2012 Michael PAQUIER Tokyo, 2012/02/24 Agenda Self-introduction Highlights of Postgres-XC Core architecture overview Performance High-availability Release status Copyright
More informationDatabase Availability and Integrity in NoSQL. Fahri Firdausillah [M ]
Database Availability and Integrity in NoSQL Fahri Firdausillah [M031010012] What is NoSQL Stands for Not Only SQL Mostly addressing some of the points: nonrelational, distributed, horizontal scalable,
More informationFixing Twitter.... and Finding your own Fail Whale. John Adams Twitter Operations
Fixing Twitter... and Finding your own Fail Whale John Adams Twitter Operations Operations Small team, growing rapidly. What do we do? Software Performance (back-end) Availability Capacity
More informationAzure Development Course
Azure Development Course About This Course This section provides a brief description of the course, audience, suggested prerequisites, and course objectives. COURSE DESCRIPTION This course is intended
More informationJargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems
Jargons, Concepts, Scope and Systems Key Value Stores, Document Stores, Extensible Record Stores Overview of different scalable relational systems Examples of different Data stores Predictions, Comparisons
More informationManaging IoT and Time Series Data with Amazon ElastiCache for Redis
Managing IoT and Time Series Data with ElastiCache for Redis Darin Briskman, ElastiCache Developer Outreach Michael Labib, Specialist Solutions Architect 2016, Web Services, Inc. or its Affiliates. All
More informationMarkLogic as a Real-Time Data Hub
MarkLogic as a Real-Time Data Hub Prepared by Mike Bowers 5/9/2018 version 1.3 About the Author Michael Bowers Principal Engineer Data Architect Using NoSQL professionally for 10 years Author Pro CSS and
More informationUsing Redis As a Time Series Database
WHITE PAPER Using Redis As a Time Series Database Dr.Josiah Carlson, Author of Redis in Action CONTENTS Executive Summary 2 Use Cases 2 Advanced Analysis Using a Sorted Set with Hashes 2 Event Analysis
More informationHow Enova Financial Uses Postgres. Jim Nasby, Lead Database Architect
How Enova Financial Uses Postgres Jim Nasby, Lead Database Architect Who are we? Some history Migration Where are we today? (The cheerleading section) Cool stuff Q&A Overview 2 Who are we? Who are we?
More informationDupScout DUPLICATE FILES FINDER
DupScout DUPLICATE FILES FINDER User Manual Version 10.3 Dec 2017 www.dupscout.com info@flexense.com 1 1 Product Overview...3 2 DupScout Product Versions...7 3 Using Desktop Product Versions...8 3.1 Product
More informationGFS: The Google File System. Dr. Yingwu Zhu
GFS: The Google File System Dr. Yingwu Zhu Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one big CPU More storage, CPU required than one PC can
More informationChanging Requirements for Distributed File Systems in Cloud Storage
Changing Requirements for Distributed File Systems in Cloud Storage Wesley Leggette Cleversafe Presentation Agenda r About Cleversafe r Scalability, our core driver r Object storage as basis for filesystem
More informationRiak. Distributed, replicated, highly available
INTRO TO RIAK Riak Overview Riak Distributed Riak Distributed, replicated, highly available Riak Distributed, highly available, eventually consistent Riak Distributed, highly available, eventually consistent,
More informationROCHE MOBILE APP FOR ONE OF THE BIGGEST PHARMACEUTICAL COMPANY VERIFIED REVIEW 5.0 / 5.0 CASE STUDY
ROCHE MOBILE APP FOR ONE OF THE BIGGEST PHARMACEUTICAL COMPANY VERIFIED REVIEW 5.0 / 5.0 CASE STUDY PROJECT SUMMARY Roche Mobile App is a medtech app for all oncologist in Poland, made for one of the biggest
More informationScalable Streaming Analytics
Scalable Streaming Analytics KARTHIK RAMASAMY @karthikz TALK OUTLINE BEGIN I! II ( III b Overview Storm Overview Storm Internals IV Z V K Heron Operational Experiences END WHAT IS ANALYTICS? according
More informationInnoDB Scalability Limits. Peter Zaitsev, Vadim Tkachenko Percona Inc MySQL Users Conference 2008 April 14-17, 2008
InnoDB Scalability Limits Peter Zaitsev, Vadim Tkachenko Percona Inc MySQL Users Conference 2008 April 14-17, 2008 -2- Who are the Speakers? Founders of Percona Inc MySQL Performance and Scaling consulting
More informationwhitepaper Using Redis As a Time Series Database: Why and How
whitepaper Using Redis As a Time Series Database: Why and How Author: Dr.Josiah Carlson, Author of Redis in Action Table of Contents Executive Summary 2 A Note on Race Conditions and Transactions 2 Use
More information