COPYRIGHT 13 June 2017 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

1 Building and Operating High Performance MarkLogic Apps
James Clippinger, VP, Strategic Accounts, MarkLogic
Erin Miller, Manager, Performance Engineering, MarkLogic

2 Who Are We?
James Clippinger, VP, Strategic Accounts, MarkLogic
Erin Miller, Manager, Performance Engineering, MarkLogic

3 The Problem: Debugging Performance in Prod

4 Performance Success
Development: understand MarkLogic Server and how it works
Agile dev, agile performance testing
Firefights: avoid when possible (see the bullet points above). Use a methodology when necessary (i.e., don't take the humidifier apart without the manual)

5 Understanding MarkLogic Server

6 Ingest: What Happens?
Document loaded into in-memory stand
Transaction written to journal
Checkpoint
Merge

7 Ingest: Resource Costs
CPU: merges operate as background I/O; show up as nice % CPU
CPU: index creation/updates (ingest + merges)
CPU: transforms
Memory: increases as content is loaded (tree, termlist, triples, reverse)
- Preload mapped data: all memory-mapped file data pages loaded at end of merge
Network: NAS impact; mlcp fastload avoids network hops but can only be used for new content (not updates) and when forest topology won't change. How big is your cluster?

8 Ingest: Understanding Merges and IOPS
Merge: multiple stands combined, reclaims disk space, optimizes indexes
Runs automatically based on the merge algorithm, or on demand
Storage: need total indexed forest size + 64 GB/forest (8.0-1 and lower) or 96 GB/forest (9.0 and greater)
- DA/HA changes everything
- DB replication. What if replication pauses? All unshipped changes are retained on the master. 2x to 4x indexed data size
- DB restores: 2x indexed data + 64 or 96 GB per forest
IOPS: spiky I/O when the in-memory stand is flushed to disk
- 512 KB sequential writes, journal writes <= 512 KB
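The storage rule of thumb on this slide is simple arithmetic, so it can be sketched as a small calculator. This is only an illustration of the slide's numbers: the function name and parameters are hypothetical, and it ignores the replication and DA/HA cases the slide says change everything.

```python
def required_storage_gb(indexed_forest_gb, forests, major_version, restore=False):
    """Estimate disk needed (GB) using the slide's rule of thumb.

    indexed_forest_gb: total indexed forest size across all forests
    forests:           number of forests
    major_version:     MarkLogic major version (8, 9, ...)
    restore:           True to size for a database restore (2x indexed data)
    """
    # 64 GB/forest for 8.0-1 and lower, 96 GB/forest for 9.0 and greater.
    per_forest_overhead_gb = 96 if major_version >= 9 else 64
    # A restore needs room for two copies of the indexed data.
    data_multiplier = 2 if restore else 1
    return data_multiplier * indexed_forest_gb + per_forest_overhead_gb * forests


# Hypothetical cluster: 600 GB of indexed data spread over 6 forests.
steady_state_v9 = required_storage_gb(600, 6, major_version=9)
restore_v9 = required_storage_gb(600, 6, major_version=9, restore=True)
steady_state_v8 = required_storage_gb(600, 6, major_version=8)
```

For the hypothetical cluster, the per-forest allowance alone adds 576 GB on version 9, which is why forest count matters as much as data size when planning disks.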

9 Understanding Queries: What Happens Where

10 Index Resolution and Filtering
1. Narrow document set with indexes (initial result set)
2. Filter results to remove false positives (filtered results)
Example search: "Quick brown fox"
Source data: "The clown spotted the quick brown monkey and the slow brown fox in the quick clown car"
More indexes enabled = fewer results
COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
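The two steps on this slide can be sketched as a toy model. This is not how MarkLogic implements its indexes; it assumes only a plain word index, so a phrase search is resolved by intersecting per-word lists and then filtered by opening each candidate. The URIs and the second document are made up for illustration.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_word_index(docs):
    """Map each word to the set of document URIs containing it."""
    index = defaultdict(set)
    for uri, text in docs.items():
        for word in tokenize(text):
            index[word].add(uri)
    return index


def resolve_from_index(index, phrase):
    """Step 1: intersect the per-word lists. May include false positives."""
    term_lists = [index.get(word, set()) for word in tokenize(phrase)]
    return set.intersection(*term_lists) if term_lists else set()


def filter_candidates(docs, candidates, phrase):
    """Step 2: open each candidate and confirm the words are adjacent."""
    wanted = tokenize(phrase)
    n = len(wanted)
    confirmed = set()
    for uri in candidates:
        words = tokenize(docs[uri])
        if any(words[i:i + n] == wanted for i in range(len(words) - n + 1)):
            confirmed.add(uri)
    return confirmed


docs = {
    # Source data from the slide.
    "/doc/clown.txt": "The clown spotted the quick brown monkey and the "
                      "slow brown fox in the quick clown car",
    # Hypothetical second document that really contains the phrase.
    "/doc/fox.txt": "The quick brown fox jumped over the lazy dog",
}
index = build_word_index(docs)
initial = resolve_from_index(index, "Quick brown fox")
final = filter_candidates(docs, initial, "Quick brown fox")
```

The slide's source sentence contains all three words but never the phrase, so it survives step 1 and is removed in step 2. A richer index (for example, one that records word positions) would exclude it during step 1, which is the sense of "more indexes enabled = fewer results".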

11 Queries: From 100 Docs to 100M Docs
One clue that you're filtering: long-running queries (...)

12 Queries: Filtered Searches
Remember: context is everything
First: tune indexes to reduce false positives
- Resolve queries from the indexes
- Use filtering only if you expect to have false positives and your application can't tolerate false positives

13 Queries: Understanding Distributed Caches

14 Queries: Caches and Their Contents
List cache
Compressed tree cache
Expanded tree cache
Triples and Triples value cache
Not caches, but still in memory: memory-mapped files
- Range indexes
- Lexicons

15 Queries: Making Best Use of Your Caches
Cache sizes optimized at install time
- What if I add memory?
- Increase cache sizes
- What if I separate into D- and E-nodes?
- Create two groups with two different cache sizes
- E-nodes should have more ETC, less CTC and List Cache
- D-nodes should have more CTC and List Cache, less ETC
- But don't forget about reindexing

16 Queries: Case Study
Use case: I need to update a config file once a minute based upon the content in the db
Config file keeps track of different document types:
- News
- Tweets
- Stock Prices
- Articles

17 Queries: Case Study: Attempt One

18 Queries: Case Study: Attempt One
Use the Query Console built-in profiler, or grow your own (prof:report)

19 Queries: Case Study: Remediation at Scale
Oh yeah, filtered searches!

20 Queries: Case Study: Remediation, Step Two
Now the count is taking a really long time. So let's get rid of that:

21 Queries: Case Study: Remediation at Scale with Load
Now it works great in dev, even with 100M docs. But with load, performance is horrible

22 Queries: Case Study: Remediation at Scale
What could possibly be wrong?
- Use meters:

23 Queries: Case Study: Remediation at Scale
But I'm only doing one document update, my one small config doc.
- But you're in the context of an update transaction, so EVERY doc referenced in the transaction will be locked.
- Do you have other cts:search statements in your module?
- To fix this, do the update in a separate transaction.
- Be intentional about update transactions
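The effect described on this slide can be illustrated with a toy lock counter. This is a simplified model written for this transcript, not MarkLogic's lock manager: it assumes that a read-only query transaction takes no locks, while an update transaction locks every document it reads or writes. The class, URIs, and document count are all hypothetical.

```python
class ToyTransaction:
    """Toy lock accounting. NOT MarkLogic's lock manager, only an illustration."""

    def __init__(self, kind):
        if kind not in ("query", "update"):
            raise ValueError("kind must be 'query' or 'update'")
        self.kind = kind
        self.read_locks = set()
        self.write_locks = set()

    def read(self, uri):
        # Assumption: query transactions read lock-free; update transactions
        # lock every document they reference.
        if self.kind == "update":
            self.read_locks.add(uri)

    def write(self, uri):
        if self.kind != "update":
            raise RuntimeError("writes need an update transaction")
        self.write_locks.add(uri)

    def lock_count(self):
        return len(self.read_locks | self.write_locks)


content_uris = [f"/content/{i}.xml" for i in range(1000)]  # hypothetical URIs
config_uri = "/config/doc-types.xml"                        # hypothetical URI

# One module that searches and updates in the same update transaction.
combined = ToyTransaction("update")
for uri in content_uris:
    combined.read(uri)
combined.write(config_uri)

# The fix from the slide: search in a query transaction, then do the single
# config update in a separate update transaction.
reader = ToyTransaction("query")
for uri in content_uris:
    reader.read(uri)
writer = ToyTransaction("update")
writer.write(config_uri)
```

In this model the combined transaction holds 1,001 locks to change one small document, while the split version holds one. Under concurrent load, those extra locks are what other requests end up waiting on.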

24 Queries: Case Study: Remediation at Scale
What other techniques might we use to solve this problem?
- Element in doc with content-type (e.g., <content-type>tweet</content-type>)
- But you still need to find those docs at runtime (remember list cache, CTC, ETC?)
- Collections
- No need to have custom code that creates collection info on ingest; just use the mlcp option -output_collections
- Get benefits of range indexes
- Envelope pattern and canonical format (see Data Hub presentation)
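The collections idea can be sketched as a toy membership index for the case study's per-type counts. This is an illustration only, not MarkLogic's collection lexicon: it assumes each document is tagged with its collections at load time, so counts per document type come from the index without opening any documents. The class, collection names, and URIs are hypothetical.

```python
from collections import defaultdict


class ToyCollectionIndex:
    """Toy collection membership index, maintained at ingest time."""

    def __init__(self):
        self._members = defaultdict(set)

    def ingest(self, uri, collections):
        # Record membership when the document is loaded.
        for name in collections:
            self._members[name].add(uri)

    def count(self, name):
        # Answered from the index alone; no document is read.
        return len(self._members.get(name, ()))


idx = ToyCollectionIndex()
idx.ingest("/tweets/1.json", ["tweets"])
idx.ingest("/tweets/2.json", ["tweets"])
idx.ingest("/news/1.xml", ["news"])
idx.ingest("/articles/1.xml", ["articles"])

doc_types = ("news", "tweets", "stock-prices", "articles")
summary = {name: idx.count(name) for name in doc_types}
```

The once-a-minute config update then only needs the `summary` counts, which keeps the work proportional to the number of document types rather than the number of documents.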

25 SQL and Semantics

26 SQL and Semantics
Memory, Memory, Memory!
Analytics workloads
Separate E- and D-nodes
Use at least 64 GB RAM E-nodes for high-performance applications

27 SQL and Semantics
Debug Trace Events
Turn on diagnostic profiling and then:
- SPARQL Cost Analysis
- SPARQL Execution
XDMP:MEM_CANCELED
- Your query was using too much memory; try to tune to return fewer triples/rows

28 System Resources: Wrap Up
Use Meters to understand:
- Cache hits/misses, document locks, CPU, disk, memory utilization
Be intentional about:
- Transformations
- Updates
- Filtered Results
Separate analytical workloads (Semantics, SQL) with E- and D-nodes

29 Themes in Recent Performance Issues

30 THEME: Monitoring In Context
Meters is your best friend, but it can't live your life for you
Know what your system should be doing
Know if it's reasonable to expect certain performance

31 Monitoring In Context
Resources (most problematic to least)
- Disk Bandwidth
- RAM Capacity
- CPU Capacity
- Disk Capacity
- Network Bandwidth
OS bugs
MarkLogic bugs

32 THEME: Speed Costs Money
Consider the expense of code choices
Every index in production should have demonstrable business value

33 THEME: Load Testing
Best way to reduce performance risk
Real load against production (or equivalent) hardware
Push load until bottleneck found

34 Recent Performance Cases

35 Case: Intermittent Slowness
System was not meeting SLAs for a few minutes at start of each hour
Meters showed CPU maxed with iowait during slowness

36 Case: General Slow Queries
Queries running slower than expected
Disk bandwidth normal
Low user CPU, some iowait
No swapping

37 Case: Unusual CPU Activity
Very high system CPU utilization and low user CPU utilization under load
Only seen on larger AWS instances

38 Case: High Memory Consumption
RAM utilization very high, eventually leading to swapping
Caches configured to default values

39 Case: Everything's Okay Until...
High-volume production system running great, meeting all SLAs
DBA runs report in Query Console after overnight data load
System response time spikes for a few minutes while report is running

40 Questions?


More information

Map-Reduce. Marco Mura 2010 March, 31th

Map-Reduce. Marco Mura 2010 March, 31th Map-Reduce Marco Mura (mura@di.unipi.it) 2010 March, 31th This paper is a note from the 2009-2010 course Strumenti di programmazione per sistemi paralleli e distribuiti and it s based by the lessons of

More information

Tips and Tricks on Successful Upgrade to 11gR2

Tips and Tricks on Successful Upgrade to 11gR2 Tips and Tricks on Successful Upgrade to 11gR2 Project Approval Get buy in from CIO and other groups Justify the need End of premier support Extended Support Cost To be current Benefits of new release

More information

[MS10987A]: Performance Tuning and Optimizing SQL Databases

[MS10987A]: Performance Tuning and Optimizing SQL Databases [MS10987A]: Performance Tuning and Optimizing SQL Databases Length : 4 Days Audience(s) : IT Professionals Level : 300 Technology : Microsoft SQL Server Delivery Method : Instructor-led (Classroom) Course

More information

Tuning PostgreSQL for performance

Tuning PostgreSQL for performance 1 sur 5 03/02/2006 12:42 Tuning PostgreSQL for performance Shridhar Daithankar, Josh Berkus July 3, 2003 Copyright 2003 Shridhar Daithankar and Josh Berkus. Authorized for re-distribution only under the

More information

IBM Tivoli Storage Manager for Windows Version Installation Guide IBM

IBM Tivoli Storage Manager for Windows Version Installation Guide IBM IBM Tivoli Storage Manager for Windows Version 7.1.8 Installation Guide IBM IBM Tivoli Storage Manager for Windows Version 7.1.8 Installation Guide IBM Note: Before you use this information and the product

More information

Distributed Systems. Lec 10: Distributed File Systems GFS. Slide acks: Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung

Distributed Systems. Lec 10: Distributed File Systems GFS. Slide acks: Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Distributed Systems Lec 10: Distributed File Systems GFS Slide acks: Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung 1 Distributed File Systems NFS AFS GFS Some themes in these classes: Workload-oriented

More information

SharePoint 2010 Technical Case Study: Microsoft SharePoint Server 2010 Enterprise Intranet Collaboration Environment

SharePoint 2010 Technical Case Study: Microsoft SharePoint Server 2010 Enterprise Intranet Collaboration Environment SharePoint 2010 Technical Case Study: Microsoft SharePoint Server 2010 Enterprise Intranet Collaboration Environment This document is provided as-is. Information and views expressed in this document, including

More information

ArcGIS Enterprise: Performance and Scalability Best Practices. Darren Baird, PE, Esri

ArcGIS Enterprise: Performance and Scalability Best Practices. Darren Baird, PE, Esri ArcGIS Enterprise: Performance and Scalability Best Practices Darren Baird, PE, Esri dbaird@esri.com What is ArcGIS Enterprise What s Included with ArcGIS Enterprise ArcGIS Server the core web services

More information

Database Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu

Database Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu Database Architecture 2 & Storage Instructor: Matei Zaharia cs245.stanford.edu Summary from Last Time System R mostly matched the architecture of a modern RDBMS» SQL» Many storage & access methods» Cost-based

More information

Course Outline. Performance Tuning and Optimizing SQL Databases Course 10987B: 4 days Instructor Led

Course Outline. Performance Tuning and Optimizing SQL Databases Course 10987B: 4 days Instructor Led Performance Tuning and Optimizing SQL Databases Course 10987B: 4 days Instructor Led About this course This four-day instructor-led course provides students who manage and maintain SQL Server databases

More information

Copyright 2018, Oracle and/or its affiliates. All rights reserved.

Copyright 2018, Oracle and/or its affiliates. All rights reserved. Beyond SQL Tuning: Insider's Guide to Maximizing SQL Performance Monday, Oct 22 10:30 a.m. - 11:15 a.m. Marriott Marquis (Golden Gate Level) - Golden Gate A Ashish Agrawal Group Product Manager Oracle

More information

OPS-23: OpenEdge Performance Basics

OPS-23: OpenEdge Performance Basics OPS-23: OpenEdge Performance Basics White Star Software adam@wss.com Agenda Goals of performance tuning Operating system setup OpenEdge setup Setting OpenEdge parameters Tuning APWs OpenEdge utilities

More information

MarkLogic Technology Briefing

MarkLogic Technology Briefing MarkLogic Technology Briefing Edd Patterson CTO/VP Systems Engineering, Americas Slide 1 Agenda Introductions About MarkLogic MarkLogic Server Deep Dive Slide 2 MarkLogic Overview Company Highlights Headquartered

More information

CS3600 SYSTEMS AND NETWORKS

CS3600 SYSTEMS AND NETWORKS CS3600 SYSTEMS AND NETWORKS NORTHEASTERN UNIVERSITY Lecture 11: File System Implementation Prof. Alan Mislove (amislove@ccs.neu.edu) File-System Structure File structure Logical storage unit Collection

More information

Putting together the platform: Riak, Redis, Solr and Spark. Bryan Hunt

Putting together the platform: Riak, Redis, Solr and Spark. Bryan Hunt Putting together the platform: Riak, Redis, Solr and Spark Bryan Hunt 1 $ whoami Bryan Hunt Client Services Engineer @binarytemple 2 Minimum viable product - the ideologically correct doctrine 1. Start

More information

The Data Protection Rule

The Data Protection Rule The 3-2-1 Data Protection Rule Presenter: Pankaj Jaiswal, Systems Engineer Agenda Why? The 3-2-1 Rule Layers and possibilities Q&A More and better IT More services applications and services faster Business

More information

IT Best Practices Audit TCS offers a wide range of IT Best Practices Audit content covering 15 subjects and over 2200 topics, including:

IT Best Practices Audit TCS offers a wide range of IT Best Practices Audit content covering 15 subjects and over 2200 topics, including: IT Best Practices Audit TCS offers a wide range of IT Best Practices Audit content covering 15 subjects and over 2200 topics, including: 1. IT Cost Containment 84 topics 2. Cloud Computing Readiness 225

More information

<Insert Picture Here> New MySQL Enterprise Backup 4.1: Better Very Large Database Backup & Recovery and More!

<Insert Picture Here> New MySQL Enterprise Backup 4.1: Better Very Large Database Backup & Recovery and More! New MySQL Enterprise Backup 4.1: Better Very Large Database Backup & Recovery and More! Mike Frank MySQL Product Management - Director The following is intended to outline our general

More information

GFS: The Google File System

GFS: The Google File System GFS: The Google File System Brad Karp UCL Computer Science CS GZ03 / M030 24 th October 2014 Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one

More information

Get ready to be what s next.

Get ready to be what s next. Get ready to be what s next. Jared Shockley http://jaredontech.com Senior Service Engineer Prior Experience @jshoq Primary Experience Areas Agenda What is Microsoft Azure? Provider-hosted Apps Hosting

More information

OracleMan Consulting

OracleMan Consulting Introduction to AWR and Tuning Some New Things in 11g Earl Shaffer CTO/Oracle Practice Manager OracleManConsulting@Gmail.com OracleMan Consulting OMC - Who are we? Oracle DBA on-site and remote services

More information

ECE Engineering Robust Server Software. Spring 2018

ECE Engineering Robust Server Software. Spring 2018 ECE590-02 Engineering Robust Server Software Spring 2018 Business Continuity: Disaster Recovery Tyler Bletsch Duke University Includes material adapted from the course Information Storage and Management

More information

Step by Step Guide to Upgrade SCCM 2012 R2 Environment to SCCM R2 SP1 and R2 SP1 CU2

Step by Step Guide to Upgrade SCCM 2012 R2 Environment to SCCM R2 SP1 and R2 SP1 CU2 Step by Step Guide to Upgrade SCCM 2012 R2 Environment to SCCM R2 SP1 and R2 SP1 CU2 In This guide we will perform upgrade on our existing SCCM 2012 R2 hierarchy and upgrade it to SCCM R2 SP1 and then

More information

SQL Server 2016 Query Store

SQL Server 2016 Query Store SQL Server 2016 Query Store Sergio Govoni Twitter: @segovoni Blog: http://sqlblog.com/blogs/sergio_govoni Sergio Govoni SQL Server MVP sqlblog.com/blogs/sergio_govoni @segovoni ugiss.org manning.com/delaney

More information

CS5460: Operating Systems Lecture 20: File System Reliability

CS5460: Operating Systems Lecture 20: File System Reliability CS5460: Operating Systems Lecture 20: File System Reliability File System Optimizations Modern Historic Technique Disk buffer cache Aggregated disk I/O Prefetching Disk head scheduling Disk interleaving

More information

Public Cloud Leverage For IT/Business Alignment Business Goals Agility to speed time to market, adapt to market demands Elasticity to meet demand whil

Public Cloud Leverage For IT/Business Alignment Business Goals Agility to speed time to market, adapt to market demands Elasticity to meet demand whil LHC2386BU True Costs Savings Modeling and Costing A Migration to VMware Cloud on AWS Chris Grossmeier chrisg@cloudphysics.com John Blumenthal john@cloudphysics.com #VMworld Public Cloud Leverage For IT/Business

More information

Windows Servers In Microsoft Azure

Windows Servers In Microsoft Azure $6/Month Windows Servers In Microsoft Azure What I m Going Over 1. How inexpensive servers in Microsoft Azure are 2. How I get Windows servers for $6/month 3. Why Azure hosted servers are way better 4.

More information

Distributed Data Infrastructures, Fall 2017, Chapter 2. Jussi Kangasharju

Distributed Data Infrastructures, Fall 2017, Chapter 2. Jussi Kangasharju Distributed Data Infrastructures, Fall 2017, Chapter 2 Jussi Kangasharju Chapter Outline Warehouse-scale computing overview Workloads and software infrastructure Failures and repairs Note: Term Warehouse-scale

More information

SoftNAS Cloud Performance Evaluation on Microsoft Azure

SoftNAS Cloud Performance Evaluation on Microsoft Azure SoftNAS Cloud Performance Evaluation on Microsoft Azure November 30, 2016 Contents SoftNAS Cloud Overview... 3 Introduction... 3 Executive Summary... 4 Key Findings for Azure:... 5 Test Methodology...

More information

SharePoint 2010 Technical Case Study: Microsoft SharePoint Server 2010 Social Environment

SharePoint 2010 Technical Case Study: Microsoft SharePoint Server 2010 Social Environment SharePoint 2010 Technical Case Study: Microsoft SharePoint Server 2010 Social Environment This document is provided as-is. Information and views expressed in this document, including URL and other Internet

More information

Diagnostics in Testing and Performance Engineering

Diagnostics in Testing and Performance Engineering Diagnostics in Testing and Performance Engineering This document talks about importance of diagnostics in application testing and performance engineering space. Here are some of the diagnostics best practices

More information