SCALE OUT AND CONQUER: ARCHITECTURAL DECISIONS BEHIND DISTRIBUTED IN-MEMORY SYSTEMS VLADIMIR OZEROV YAKOV ZHDANOV
|
|
- Tracy Stafford
- 5 years ago
- Views:
Transcription
1 SCALE OUT AND CONQUER: ARCHITECTURAL DECISIONS BEHIND DISTRIBUTED IN-MEMORY SYSTEMS VLADIMIR OZEROV YAKOV ZHDANOV
2 WHO? Yakov Zhdanov: - GridGain s Product Development VP - With GridGain since Apache Ignite committer and PMC - Passion for performance & scalability - Finding ways to make product better - St. Petersburg, Russia
3 WHY IN-MEMORY?
4 PLAN 1. Data partitioning and affinity functions examples
5 PLAN 1. Data partitioning and affinity functions examples 2. Data affinity colocation
6 PLAN 1. Data partitioning and affinity functions examples 2. Data affinity colocation 3. Synchronization in distributed systems
7 PLAN Data partitioning and affinity functions examples Data affinity colocation Synchronization in distributed systems Multithreading: local architecture
8 PLAN Data partitioning and affinity functions examples Data affinity colocation Synchronization in distributed systems Multithreading: local architecture
9 WHERE? On which node of the cluster does the key reside?
10 AFFINITY Partitio n Nod e
11 AFFINITY
12 AFFINITY Ke y Partitio n Nod e
13 AFFINITY
14 NAIVE AFFINITY
15 NAIVE AFFINITY
16 NAIVE AFFINITY
17 NAIVE AFFINITY
18 NAIVE AFFINITY
19 NAIVE AFFINITY
20 NAIVE AFFINITY Problem: partition to node mapping depends on nodes count. NODE = F (PARTITION, NODES_COUNT);
21 AFFINITY: BETTER ALGORITHMS Consistent hashing [1] Rendezvous hashing (or highest random weight - HRW) [2] [1] [2]
22 RENDEZVOUS AFFINITY WEIGHT = W(PARTITION, NODE);
23 RENDEZVOUS AFFINITY
24 RENDEZVOUS AFFINITY
25 RENDEZVOUS AFFINITY
26 RENDEZVOUS AFFINITY
27 RENDEZVOUS AFFINITY
28 RENDEZVOUS AFFINITY: EVEN DISTRIBUTION?
29 PLAN Data partitioning and affinity functions examples Data affinity colocation Synchronization in distributed systems Multithreading: local architecture
30 TRANSACTIONS: NO COLOCATION 1: class Customer { 2: long id; 3: City city; 4: }
31 TRANSACTIONS: NO COLOCATION
32 TRANSACTIONS: NO COLOCATION 2 (2 nodes)
33 TRANSACTIONS: NO COLOCATION 2 (2 nodes) 2 (primary + backup)
34 TRANSACTIONS: NO COLOCATION 2 (2 nodes) 2 (primary + backup) 2 (two-phase commit)
35 TRANSACTIONS: NO COLOCATION (2 nodes) (primary + backup) (two-phase commit) (request-response)
36 TRANSACTIONS: NO COLOCATION (2 nodes) (primary + backup) (two-phase commit) (request-response) 16 Messages
37 TRANSACTIONS: NO COLOCATION
38 TRANSACTIONS: WITH COLOCATION 1: class Customer { 2: long id; 3: 5: City city; 6: }
39 TRANSACTIONS: WITH COLOCATION
40 TRANSACTIONS: WITH COLOCATION 1 (1 node)
41 TRANSACTIONS: WITH COLOCATION 1 (1 node) 2 (primary + backup)
42 TRANSACTIONS: WITH COLOCATION 1 (1 node) 2 (primary + backup) (one-phase commit)
43 TRANSACTIONS: WITH COLOCATION 1 (1 node) 2 (primary + backup) (one-phase commit) 1 (request-response)
44 TRANSACTIONS: WITH COLOCATION 1 (1 node) 2 (primary + backup) (one-phase commit) 1 (request-response) 4 Messages
45 TRANSACTIONS: COLOCATION VS NO COLOCATION 4 Messages VS 16 Messages
46 SQL Let s run a query on our data
47 SQL No colocation: FULL SCAN
48 SQL No colocation: FULL SCAN
49 SQL No colocation: FULL SCAN 1/3x Latency
50 SQL No colocation: FULL SCAN 1/3x Latency 3x Capacity
51 SQL
52 SQL 1 node
53 SQL 1 node N nodes
54 SQL What about complexity? log 1_000_000 20
55 SQL What about complexity? log 1_000_000 vs log 333_333 log 333_333 log 333_
56 SQL: INDEXED
57 SQL No colocation: INDEXED QUERY Same latency! Same capacity!
58 SQL: INDEX AND COLOCATION Colocation: INDEXED QUERY
59 SQL Colocation: INDEXED QUERY
60 SQL: INDEX AND COLOCATION Colocation: INDEXED QUERY Same latency But 3x capacity!
61 SQL: EVEN DISTRIBUTION WITH COLOCATION?
62 SQL: JOINS IN DISTRIBUTED ENVIRONMENT
63 SQL: JOINS WITH COLOCATION
64 SQL: JOINS WITH REPLICATION
65 PLAN Data partitioning and affinity functions examples Data affinity colocation Synchronization in distributed systems Multithreading: local architecture
66 SYNCHRONIZATION: LOCAL COUNTER 1: AtomicLong ctr; 2: 3: long getnext() { 4: return ctr.incrementandget(); 5: }
67 SYNCHRONIZATION: LOCAL (RE-INVENTING A BICYCLE) 1: 2: 3: 4: 5: 6: 7: 8: 9: 10: 11: 12: 13: AtomicLong ctr; ThreadLocal<Long> localctr; long getnext() { long res = localctr.get(); if (res % 1000 == 0) res = ctr.getandadd(1000); localctr.set(++res); return res; }
68 SYNCHRONIZATION: LOCAL
69 SYNCHRONIZATION: DISTRIBUTED
70 SYNCHRONIZATION: COUNTER IN THE CLUSTER Local implementation: millions ops/sec Distributed implementation: thousands ops/sec
71 SYNCHRONIZATION: COUNTER IN THE CLUSTER Proper requirements: Unique Monotonously growing 8 bytes
72 SYNCHRONIZATION: COUNTER IN THE CLUSTER Requirements: unique, monotonous, 8 bytes.
73 SYNCHRONIZATION: COUNTER IN THE CLUSTER Requirements: unique, monotonous, 8 bytes.
74 SYNCHRONIZATION: COUNTER IN THE CLUSTER Requirements: unique, monotonous, 8 bytes.
75 SYNCHRONIZATION: COUNTER IN THE CLUSTER Requirements: unique, monotonous, 8 bytes. See also: org.apache.ignite.lang.igniteuuid
76 SYNCHRONIZATION AS FRICTION FOR A CAR
77 SYNCHRONIZATION: DATA TO CODE
78 SYNCHRONIZATION: DATA TO CODE 1: Account acc = cache.get(acckey); 3: 3: acc.add(100); 4: 5: cache.put(acckey, acc);
79 SYNCHRONIZATION: DATA TO CODE 1: Account acc = cache.get(acckey); 3: 3: acc.add(100); 4: 5: cache.put(acckey, acc);
80 SYNCHRONIZATION: CODE TO DATA
81 SYNCHRONIZATION: CODE TO DATA 1: cache.invoke(acckey, (entry) -> { 1: Account acc = entry.getvalue(); 3: 3: acc.add(100); 4: 5: entry.setvalue(acc); 6: });
82 SYNCHRONIZATION: CODE TO DATA 1: cache.invoke(acckey, (entry) -> { 1: Account acc = entry.getvalue(); 3: 3: acc.add(100); 4: 5: entry.setvalue(acc); 6: });
83 SYNCHRONIZATION: DATA TO CODE What if we have a bug?!
84 SYNCHRONIZATION: CODE TO DATA What if we have a bug?!
85 SYNCHRONIZATION: CODE TO DATA What if we have a bug?! `
86 PLAN Data partitioning and affinity functions examples Data affinity colocation Synchronization in distributed systems Multithreading: local architecture
87 LOCAL TASKS DISTRIBUTION
88 LOCAL TASKS DISTRIBUTION
89 LOCAL TASKS DISTRIBUTION
90 LOCAL TASKS DISTRIBUTION
91 LOCAL TASKS DISTRIBUTION
92 LOCAL TASKS DISTRIBUTION: THREAD PER PARTITION
93 LOCAL TASKS DISTRIBUTION: THREAD PER PARTITION
94 LOCAL TASKS DISTRIBUTION: THREAD PER PARTITION
95 LOCAL TASKS DISTRIBUTION: THREAD PER PARTITION
96 LESSONS LEARNED 1) Data partitioning: balance and stability
97 LESSONS LEARNED 1) Data partitioning: balance and stability 2) Colocation: balance and efficiency
98 LESSONS LEARNED 1) Data partitioning: balance and stability 2) Colocation: balance and efficiency 3) Data model: should be adopted accordingly
99 LESSONS LEARNED 1) 2) 3) 4) Data partitioning: balance and stability Colocation: balance and efficiency Data model: should be adopted accordingly Synchronization: delicate and only when really needed
100 LESSONS LEARNED 1) Data partitioning: balance and stability 2) Colocation: balance and efficiency 3) Data model: should be adopted accordingly 4) Synchronization: delicate and only when really needed 5) Thread per partition: can improve simple operations, but also may slow down complex ones
101 CONTACTS
102 QUESTIONS? ANY QUESTIONS?
GridGain and Apache Ignite In-Memory Performance with Durability of Disk
GridGain and Apache Ignite In-Memory Performance with Durability of Disk Dmitriy Setrakyan Apache Ignite PMC GridGain Founder & CPO http://ignite.apache.org #apacheignite Agenda What is GridGain and Ignite
More informationAssignment No: Create a College database and apply different queries on it. 2. Implement GUI for SQL queries and display result of the query
Assignment No: 1 GUI Implementation for SQL queries Learning Outcomes: At the end of this assignment students will be able to To create a simple table Write queries for the manipulation of the table Design
More informationB.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2
Introduction :- Today single CPU based architecture is not capable enough for the modern database that are required to handle more demanding and complex requirements of the users, for example, high performance,
More informationApache Ignite and Apache Spark Where Fast Data Meets the IoT
Apache Ignite and Apache Spark Where Fast Data Meets the IoT Denis Magda GridGain Product Manager Apache Ignite PMC http://ignite.apache.org #apacheignite #denismagda Agenda IoT Demands to Software IoT
More informationMySQL Database Scalability
MySQL Database Scalability Nextcloud Conference 2016 TU Berlin Oli Sennhauser Senior MySQL Consultant at FromDual GmbH oli.sennhauser@fromdual.com 1 / 14 About FromDual GmbH Support Consulting remote-dba
More informationHIGH AVAILABILITY AND DISASTER RECOVERY FOR IMDG VLADIMIR KOMAROV, MIKHAIL GORELOV SBERBANK OF RUSSIA
HIGH AVAILABILITY AND DISASTER RECOVERY FOR IMDG VLADIMIR KOMAROV, MIKHAIL GORELOV SBERBANK OF RUSSIA 1 ABOUT SPEAKERS Vladimir Komarov Enterprise IT Architect vikomarov@sberbank.ru in Sberbank since 2010.
More informationAccelerate MySQL for Demanding OLAP and OLTP Use Case with Apache Ignite December 7, 2016
Accelerate MySQL for Demanding OLAP and OLTP Use Case with Apache Ignite December 7, 2016 Nikita Ivanov CTO and Co-Founder GridGain Systems Peter Zaitsev CEO and Co-Founder Percona About the Presentation
More informationApache Ignite Best Practices: Cluster Topology Management and Data Replication
Apache Ignite Best Practices: Cluster Topology Management and Data Replication Ivan Rakov Apache Ignite Committer Senior Software Engineer in GridGain Systems Too many data? Distributed Systems to the
More informationIn-Memory Computing Essentials
In-Memory Computing Essentials for Architects and Developers: Part 1 Denis Magda Ignite PMC Chair GridGain Director of Product Management Agenda Apache Ignite Overview Clustering and Deployment Distributed
More informationConceptual Modeling on Tencent s Distributed Database Systems. Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc.
Conceptual Modeling on Tencent s Distributed Database Systems Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc. Outline Introduction System overview of TDSQL Conceptual Modeling on TDSQL Applications Conclusion
More informationScalability of web applications
Scalability of web applications CSCI 470: Web Science Keith Vertanen Copyright 2014 Scalability questions Overview What's important in order to build scalable web sites? High availability vs. load balancing
More information<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store
Oracle NoSQL Database A Distributed Key-Value Store Charles Lamb The following is intended to outline our general product direction. It is intended for information purposes only,
More informationWhat is the Future of PostgreSQL?
What is the Future of PostgreSQL? Robert Haas 2013 EDB All rights reserved. 1 PostgreSQL Popularity By The Numbers Date Rating Increase vs. Prior Year % Increase January 2016 282.401 +27.913 +11% January
More informationB2W Software Resource Requirements & Recommendations
B2W Software Resource Requirements & Recommendations v2019.2, December 21, 2018 Introduction... 2 Resource Determination Factors... 2 Environment Configuration Options... 3 Considerations... 3 Estimate...
More informationMySQL Architecture Design Patterns for Performance, Scalability, and Availability
MySQL Architecture Design Patterns for Performance, Scalability, and Availability Brian Miezejewski Principal Manager Consulting Alexander Rubin Principal Consultant Agenda HA and
More informationProject Genesis. Cafepress.com Product Catalog Hundreds of Millions of Products Millions of new products every week Accelerating growth
Scaling with HiveDB Project Genesis Cafepress.com Product Catalog Hundreds of Millions of Products Millions of new products every week Accelerating growth Enter Jeremy and HiveDB Our Requirements OLTP
More informationHDFS Federation. Sanjay Radia Founder and Hortonworks. Page 1
HDFS Federation Sanjay Radia Founder and Architect @ Hortonworks Page 1 About Me Apache Hadoop Committer and Member of Hadoop PMC Architect of core-hadoop @ Yahoo - Focusing on HDFS, MapReduce scheduler,
More informationMySQL Cluster Student Guide
MySQL Cluster Student Guide D62018GC11 Edition 1.1 November 2012 D79677 Technical Contributor and Reviewer Mat Keep Editors Aju Kumar Daniel Milne Graphic Designer Seema Bopaiah Publishers Sujatha Nagendra
More informationArchitecture of a Real-Time Operational DBMS
Architecture of a Real-Time Operational DBMS Srini V. Srinivasan Founder, Chief Development Officer Aerospike CMG India Keynote Thane December 3, 2016 [ CMGI Keynote, Thane, India. 2016 Aerospike Inc.
More informationCIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )
Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL
More informationBuilding a Scalable Architecture for Web Apps - Part I (Lessons Directi)
Intelligent People. Uncommon Ideas. Building a Scalable Architecture for Web Apps - Part I (Lessons Learned @ Directi) By Bhavin Turakhia CEO, Directi (http://www.directi.com http://wiki.directi.com http://careers.directi.com)
More informationCeph: A Scalable, High-Performance Distributed File System
Ceph: A Scalable, High-Performance Distributed File System S. A. Weil, S. A. Brandt, E. L. Miller, D. D. E. Long Presented by Philip Snowberger Department of Computer Science and Engineering University
More informationAzure-persistence MARTIN MUDRA
Azure-persistence MARTIN MUDRA Storage service access Blobs Queues Tables Storage service Horizontally scalable Zone Redundancy Accounts Based on Uri Pricing Calculator Azure table storage Storage Account
More informationVOLTDB + HP VERTICA. page
VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics
More informationHeckaton. SQL Server's Memory Optimized OLTP Engine
Heckaton SQL Server's Memory Optimized OLTP Engine Agenda Introduction to Hekaton Design Consideration High Level Architecture Storage and Indexing Query Processing Transaction Management Transaction Durability
More informationThe Care and Feeding of a MySQL Database for Linux Adminstrators. Dave Stokes MySQL Community Manager
The Care and Feeding of a MySQL Database for Linux Adminstrators Dave Stokes MySQL Community Manager David.Stokes@Oracle.com Simple Introduction This is a general introduction to running a MySQL database
More informationApache Ignite TM - In- Memory Data Fabric Fast Data Meets Open Source
Apache Ignite TM - In- Memory Data Fabric Fast Data Meets Open Source DMITRIY SETRAKYAN Founder, PPMC https://ignite.apache.org @apacheignite @dsetrakyan Agenda About In- Memory Computing Apache Ignite
More informationDistributed Data Store
Distributed Data Store Large-Scale Distributed le system Q: What if we have too much data to store in a single machine? Q: How can we create one big filesystem over a cluster of machines, whose data is
More informationNew Oracle NoSQL Database APIs that Speed Insertion and Retrieval
New Oracle NoSQL Database APIs that Speed Insertion and Retrieval O R A C L E W H I T E P A P E R F E B R U A R Y 2 0 1 6 1 NEW ORACLE NoSQL DATABASE APIs that SPEED INSERTION AND RETRIEVAL Introduction
More informationDelegates must have a working knowledge of MariaDB or MySQL Database Administration.
MariaDB Performance & Tuning SA-MARDBAPT MariaDB Performance & Tuning Course Overview This MariaDB Performance & Tuning course is designed for Database Administrators who wish to monitor and tune the performance
More informationHuge market -- essentially all high performance databases work this way
11/5/2017 Lecture 16 -- Parallel & Distributed Databases Parallel/distributed databases: goal provide exactly the same API (SQL) and abstractions (relational tables), but partition data across a bunch
More information<Insert Picture Here> MySQL Cluster What are we working on
MySQL Cluster What are we working on Mario Beck Principal Consultant The following is intended to outline our general product direction. It is intended for information purposes only,
More informationScaling Without Sharding. Baron Schwartz Percona Inc Surge 2010
Scaling Without Sharding Baron Schwartz Percona Inc Surge 2010 Web Scale!!!! http://www.xtranormal.com/watch/6995033/ A Sharding Thought Experiment 64 shards per proxy [1] 1 TB of data storage per node
More informationDistributed ACID Transac2ons in Apache Ignite
Distributed ACID Transac2ons in Apache Ignite Akmal Chaudhri GridGain hbp://ignite.apache.org #apacheignite My Background Pre-2000 Developer Academic (City University) Consultant Technical Architect Post-2000
More informationCISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL
CISC 7610 Lecture 5 Distributed multimedia databases Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL Motivation YouTube receives 400 hours of video per minute That is 200M hours
More informationMySQL Cluster Web Scalability, % Availability. Andrew
MySQL Cluster Web Scalability, 99.999% Availability Andrew Morgan @andrewmorgan www.clusterdb.com Safe Harbour Statement The following is intended to outline our general product direction. It is intended
More informationBuilding Better. SQL Server Databases
Building Better SQL Server Databases Who is this guy? Eric Cobb SQL Server Database Administrator MCSE: Data Platform MCSE: Data Management and Analytics 1999-2013: Webmaster, Programmer, Developer 2014+:
More informationTIBCO StreamBase 10 Distributed Computing and High Availability. November 2017
TIBCO StreamBase 10 Distributed Computing and High Availability November 2017 Distributed Computing Distributed Computing location transparent objects and method invocation allowing transparent horizontal
More informationAccelerating Real-Time Big Data. Breaking the limitations of captive NVMe storage
Accelerating Real-Time Big Data Breaking the limitations of captive NVMe storage 18M IOPs in 2u Agenda Everything related to storage is changing! The 3rd Platform NVM Express architected for solid state
More informationIntroduction to Hadoop. Owen O Malley Yahoo!, Grid Team
Introduction to Hadoop Owen O Malley Yahoo!, Grid Team owen@yahoo-inc.com Who Am I? Yahoo! Architect on Hadoop Map/Reduce Design, review, and implement features in Hadoop Working on Hadoop full time since
More informationSQL Server 2014: In-Memory OLTP for Database Administrators
SQL Server 2014: In-Memory OLTP for Database Administrators Presenter: Sunil Agarwal Moderator: Angela Henry Session Objectives And Takeaways Session Objective(s): Understand the SQL Server 2014 In-Memory
More information7. Query Processing and Optimization
7. Query Processing and Optimization Processing a Query 103 Indexing for Performance Simple (individual) index B + -tree index Matching index scan vs nonmatching index scan Unique index one entry and one
More informationComparitive Study of Advanced Database Replication Strategies
Comparitive Study of Advanced Database Replication Strategies A. Pramod Kumar 1 M.Tech, B.SATEESH 2 M.Tech ACE Engineering College, JNTUH 1 CBIT, (OsmaniaUniversity) 2 Abstract In this paper we are comparing
More informationPerformance Monitoring Always On Availability Groups. Anthony E. Nocentino
Performance Monitoring Always On Availability Groups Anthony E. Nocentino aen@centinosystems.com Anthony E. Nocentino Consultant and Trainer Founder and President of Centino Systems Specialize in system
More informationIT Best Practices Audit TCS offers a wide range of IT Best Practices Audit content covering 15 subjects and over 2200 topics, including:
IT Best Practices Audit TCS offers a wide range of IT Best Practices Audit content covering 15 subjects and over 2200 topics, including: 1. IT Cost Containment 84 topics 2. Cloud Computing Readiness 225
More information2017 GridGain Systems, Inc. In-Memory Performance Durability of Disk
In-Memory Performance Durability of Disk Ignite the Fire in your SQL App Akmal B. Chaudhri Technology Evangelist GridGain Systems Agenda SQL Capabilities Connectivity Data Definition Language Data Manipulation
More informationApache Tomcat. Tomcat Clustering: Part 3 Clustering. Mark Thomas, 15 April Pivotal Software, Inc. All rights reserved.
2 Apache Tomcat Tomcat Clustering: Part 3 Clustering Mark Thomas, 15 April 2015 Introduction Apache Tomcat committer since December 2003 markt@apache.org Tomcat 8 release manager Member of the Servlet,
More informationCloud Programming on Java EE Platforms. mgr inż. Piotr Nowak
Cloud Programming on Java EE Platforms mgr inż. Piotr Nowak Distributed data caching environment Hadoop Apache Ignite "2 Cache what is cache? how it is used? "3 Cache - hardware buffer temporary storage
More informationMassive Scalability With InterSystems IRIS Data Platform
Massive Scalability With InterSystems IRIS Data Platform Introduction Faced with the enormous and ever-growing amounts of data being generated in the world today, software architects need to pay special
More information<Insert Picture Here>
Caching Schemes & Accessing Data Lesson 2 Objectives After completing this lesson, you should be able to: Describe the different caching schemes that Coherence
More informationEnterprise Open Source Databases
Enterprise Open Source Databases WHITE PAPER MariaDB vs. Oracle MySQL vs. EnterpriseDB MariaDB TX Born of the community. Raised in the enterprise. MariaDB TX, with a history of proven enterprise reliability
More information<Insert Picture Here> Oracle Application Cache Solution: Coherence
Oracle Application Cache Solution: Coherence 黃開印 Kevin Huang Principal Sales Consultant Outline Oracle Data Grid Solution for Application Caching Use Cases Coherence Features Summary
More informationPerformance Monitoring AlwaysOn Availability Groups. Anthony E. Nocentino
Performance Monitoring AlwaysOn Availability Groups Anthony E. Nocentino aen@centinosystems.com Anthony E. Nocentino Consultant and Trainer Founder and President of Centino Systems Specialize in system
More informationLecture 4 Naming. Prof. Wilson Rivera. University of Puerto Rico at Mayaguez Electrical and Computer Engineering Department
Lecture 4 Naming Prof. Wilson Rivera University of Puerto Rico at Mayaguez Electrical and Computer Engineering Department Outline Names, identifiers, addresses Flat naming Structured naming Attribute based
More informationdavidklee.net heraflux.com linkedin.com/in/davidaklee
@kleegeek davidklee.net heraflux.com linkedin.com/in/davidaklee Specialties / Focus Areas / Passions: Performance Tuning & Troubleshooting Virtualization Cloud Enablement Infrastructure Architecture Health
More information10. Replication. Motivation
10. Replication Page 1 10. Replication Motivation Reliable and high-performance computation on a single instance of a data object is prone to failure. Replicate data to overcome single points of failure
More informationHigh-Performance Distributed DBMS for Analytics
1 High-Performance Distributed DBMS for Analytics 2 About me Developer, hardware engineering background Head of Analytic Products Department in Yandex jkee@yandex-team.ru 3 About Yandex One of the largest
More informationSomething to think about. Problems. Purpose. Vocabulary. Query Evaluation Techniques for large DB. Part 1. Fact:
Query Evaluation Techniques for large DB Part 1 Fact: While data base management systems are standard tools in business data processing they are slowly being introduced to all the other emerging data base
More informationShen PingCAP 2017
Shen Li @ PingCAP About me Shen Li ( 申砾 ) Tech Lead of TiDB, VP of Engineering Netease / 360 / PingCAP Infrastructure software engineer WHY DO WE NEED A NEW DATABASE? Brief History Standalone RDBMS NoSQL
More informationInfrastructure Tuning
Infrastructure Tuning For SQL Server Performance SQL PASS Performance Virtual Chapter 2014.07.24 About David Klee @kleegeek davidklee.net gplus.to/kleegeek linked.com/a/davidaklee Specialties / Focus Areas
More informationRelational to NoSQL: Getting started from SQL Server. Shane Johnson Sr. Product Marketing Manager Couchbase
Relational to NoSQL: Getting started from SQL Server Shane Johnson Sr. Product Marketing Manager Couchbase Today s agenda Why NoSQL? Identifying the right application Modeling your data Accessing your
More informationPerformance Monitoring AlwaysOn Availability Groups. Anthony E. Nocentino
Performance Monitoring AlwaysOn Availability Groups Anthony E. Nocentino aen@centinosystems.com TUGA IT 2017 LISBON, PORTUGAL THANK YOU TO OUR SPONSORS Anthony E. Nocentino Consultant and Trainer Founder
More informationFLAT DATACENTER STORAGE CHANDNI MODI (FN8692)
FLAT DATACENTER STORAGE CHANDNI MODI (FN8692) OUTLINE Flat datacenter storage Deterministic data placement in fds Metadata properties of fds Per-blob metadata in fds Dynamic Work Allocation in fds Replication
More information2017 GridGain Systems, Inc. In-Memory Performance Durability of Disk
In-Memory Performance Durability of Disk Meeting the Challenges of Fast Data in Healthcare with In-Memory Technologies Akmal Chaudhri Technology Evangelist GridGain Agenda Introduction Fast Data in Healthcare
More informationData Informatics. Seon Ho Kim, Ph.D.
Data Informatics Seon Ho Kim, Ph.D. seonkim@usc.edu HBase HBase is.. A distributed data store that can scale horizontally to 1,000s of commodity servers and petabytes of indexed storage. Designed to operate
More informationDistributed KIDS Labs 1
Distributed Databases @ KIDS Labs 1 Distributed Database System A distributed database system consists of loosely coupled sites that share no physical component Appears to user as a single system Database
More informationCSE 544: Principles of Database Systems
CSE 544: Principles of Database Systems Anatomy of a DBMS, Parallel Databases 1 Announcements Lecture on Thursday, May 2nd: Moved to 9am-10:30am, CSE 403 Paper reviews: Anatomy paper was due yesterday;
More informationApp Engine: Datastore Introduction
App Engine: Datastore Introduction Part 1 Another very useful course: https://www.udacity.com/course/developing-scalableapps-in-java--ud859 1 Topics cover in this lesson What is Datastore? Datastore and
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL
CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2015 Lecture 14 NoSQL References Scalable SQL and NoSQL Data Stores, Rick Cattell, SIGMOD Record, December 2010 (Vol. 39, No.
More informationBuilding Better. SQL Server Databases
Building Better SQL Server Databases Who is this guy? Eric Cobb Started in IT in 1999 as a "webmaster Developer for 14 years Microsoft Certified Solutions Expert (MCSE) Data Platform Data Management and
More informationCISC 7610 Lecture 2b The beginnings of NoSQL
CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone
More informationIndexing Large-Scale Data
Indexing Large-Scale Data Serge Abiteboul Ioana Manolescu Philippe Rigaux Marie-Christine Rousset Pierre Senellart Web Data Management and Distribution http://webdam.inria.fr/textbook November 16, 2010
More informationIn-Memory Performance Durability of Disk GridGain Systems, Inc.
In-Memory Performance Durability of Disk Apache Ignite In-Memory Hammer for Your Data Science Toolkit Denis Magda Ignite PMC Chair GridGain Director of Product Management Agenda Apache Ignite Overview
More informationEddies: Continuously Adaptive Query Processing. Jae Kyu Chun Feb. 17, 2003
Eddies: Continuously Adaptive Query Processing Jae Kyu Chun Feb. 17, 2003 Query in Large Scale System Hardware and Workload Complexity heterogeneous hardware mix unpredictable hardware performance Data
More informationA Practical Scalable Distributed B-Tree
A Practical Scalable Distributed B-Tree CS 848 Paper Presentation Marcos K. Aguilera, Wojciech Golab, Mehul A. Shah PVLDB 08 March 8, 2010 Presenter: Evguenia (Elmi) Eflov Presentation Outline 1 Background
More informationCSE 544 Principles of Database Management Systems
CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 5 - DBMS Architecture and Indexing 1 Announcements HW1 is due next Thursday How is it going? Projects: Proposals are due
More informationCorda Performance To infinity and beyond! James Carlyle Chief Engineer, R3 7 March 2018
Corda Performance To infinity and beyond! James Carlyle Chief Engineer, R3 7 March 2018 Performance Matters! Why does it matter? Ultimate capacity of network Efficiency and costs of infrastructure But.
More informationMySQL Replication. Rick Golba and Stephane Combaudon April 15, 2015
MySQL Replication Rick Golba and Stephane Combaudon April 15, 2015 Agenda What is, and what is not, MySQL Replication Replication Use Cases Types of replication Replication lag Replication errors Replication
More informationDatabase Architectures
Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 11/15/12 Agenda Check-in Centralized and Client-Server Models Parallelism Distributed Databases Homework 6 Check-in
More informationColumn-Family Databases Cassandra and HBase
Column-Family Databases Cassandra and HBase Kevin Swingler Google Big Table Google invented BigTableto store the massive amounts of semi-structured data it was generating Basic model stores items indexed
More informationNo Compromises. Distributed Transactions with Consistency, Availability, Performance
No Compromises Distributed Transactions with Consistency, Availability, Performance Aleksandar Dragojevic, Dushyanth Narayanan, Edmund B. Nightingale, Matthew Renzelmann, Alex Shamis, Anirudh Badam, Miguel
More informationBigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI Presented by Xiang Gao
Bigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI 2006 Presented by Xiang Gao 2014-11-05 Outline Motivation Data Model APIs Building Blocks Implementation Refinement
More informationCockroachDB on DC/OS. Ben Darnell, CTO, Cockroach Labs
CockroachDB on DC/OS Ben Darnell, CTO, Cockroach Labs Agenda A cloud-native database CockroachDB on DC/OS Why CockroachDB Demo! Cloud-Native Database What is Cloud-Native? Horizontally scalable Individual
More informationAccelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017
Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017 About the Presentation Problems Existing Solutions Denis Magda
More informationHBase Solutions at Facebook
HBase Solutions at Facebook Nicolas Spiegelberg Software Engineer, Facebook QCon Hangzhou, October 28 th, 2012 Outline HBase Overview Single Tenant: Messages Selection Criteria Multi-tenant Solutions
More informationChord: A Scalable Peer-to-peer Lookup Service For Internet Applications
Chord: A Scalable Peer-to-peer Lookup Service For Internet Applications Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan Presented by Jibin Yuan ION STOICA Professor of CS
More informationA Fast and High Throughput SQL Query System for Big Data
A Fast and High Throughput SQL Query System for Big Data Feng Zhu, Jie Liu, and Lijie Xu Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, China 100190
More informationIntroduction to MySQL NDB Cluster. Yves Trudeau Ph. D. Percona Live DC/January 2012
Introduction to MySQL NDB Cluster Yves Trudeau Ph. D. Percona Live DC/January 2012 Agenda What is NDB Cluster? How MySQL uses NDB Cluster Good use cases Bad use cases Example of tuning What is NDB cluster?
More informationHorizontal or vertical scalability? Horizontal scaling is challenging. Today. Scaling Out Key-Value Storage
Horizontal or vertical scalability? Scaling Out Key-Value Storage COS 418: Distributed Systems Lecture 8 Kyle Jamieson Vertical Scaling Horizontal Scaling [Selected content adapted from M. Freedman, B.
More informationGreenplum Architecture Class Outline
Greenplum Architecture Class Outline Introduction to the Greenplum Architecture What is Parallel Processing? The Basics of a Single Computer Data in Memory is Fast as Lightning Parallel Processing Of Data
More informationChapter 18: Database System Architectures.! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems!
Chapter 18: Database System Architectures! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Network Types 18.1 Centralized Systems! Run on a single computer system and
More informationDatabase Management and Tuning
Database Management and Tuning Index Tuning Johann Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Unit 4 Acknowledgements: The slides are provided by Nikolaus Augsten and have
More informationOutline. Database Management and Tuning. What is an Index? Key of an Index. Index Tuning. Johann Gamper. Unit 4
Outline Database Management and Tuning Johann Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Unit 4 1 2 Conclusion Acknowledgements: The slides are provided by Nikolaus Augsten
More informationMySQL High Availability. Michael Messina Senior Managing Consultant, Rolta-AdvizeX /
MySQL High Availability Michael Messina Senior Managing Consultant, Rolta-AdvizeX mmessina@advizex.com / mike.messina@rolta.com Introduction Michael Messina Senior Managing Consultant Rolta-AdvizeX, Working
More informationIntroducing Oracle R Enterprise 1.4 -
Hello, and welcome to this online, self-paced lesson entitled Introducing Oracle R Enterprise. This session is part of an eight-lesson tutorial series on Oracle R Enterprise. My name is Brian Pottle. I
More informationAlwaysOn Availability Groups: Backups, Restores, and CHECKDB
AlwaysOn Availability Groups: Backups, Restores, and CHECKDB www.brentozar.com sp_blitz sp_blitzfirst email newsletter videos SQL Critical Care 2016 Brent Ozar Unlimited. All rights reserved. 1 What I
More informationColumnstore in real life
Columnstore in real life Enrique Catalá Bañuls Computer Engineer Microsoft Data Platform MVP Mentor at SolidQ Tuning and HA ecatala@solidq.com @enriquecatala Agenda What is real-time operational analytics
More informationPart 12 殷亚凤. Homepage: Room 301, Building of Computer Science and Technology
Part 12 殷亚凤 Email: yafeng@nju.edu.cn Homepage: http://cs.nju.edu.cn/yafeng/ Room 301, Building of Computer Science and Technology Distributed Web-based systems The WWW is a huge client-server system with
More informationIntroduction to MySQL Cluster: Architecture and Use
Introduction to MySQL Cluster: Architecture and Use Arjen Lentz, MySQL AB (arjen@mysql.com) (Based on an original paper by Stewart Smith, MySQL AB) An overview of the MySQL Cluster architecture, what's
More informationWhy Choose Percona Server For MySQL? Tyler Duzan
Why Choose Percona Server For MySQL? Tyler Duzan Product Manager Who Am I? My name is Tyler Duzan Formerly an operations engineer for more than 12 years focused on security and automation Now a Product
More information