Francesco Cesarini. Concurrency + Distribution = Availability + Scalability.

1 Concurrency + Distribution = Availability + Scalability Francesco Cesarini francesco@erlang-solutions.com

2 Chapter 13 Ch 13: Node Architecture

3 Chapter 13
Ch 13: Distributed Architectures
Ch 14: Systems That Never Stop
Ch 15: Scaling Out
Ch 16: Monitoring and Preemptive Support

4 Concurrency The world is concurrent. People speak to each other through message passing. Things fail.

5 Mutable State Immutable State

6 Problem 1 with mutable state: Your program crashes whilst executing in the critical section.

7 Problem 2 with mutable state: Where do you locate your state? (Seattle, London)

8 Problem 3 with mutable state: What happens if your network connectivity fails? (Seattle, London)

9 Problem 1 with mutable state: Your program crashes whilst executing in the critical section. (London) Your state does not get corrupted.

10 Problem 2 with mutable state: Where do you locate your state? (Seattle, London) You do not locate state, you copy it.

11 Problem 3 with mutable state: What happens if your network connectivity fails? (Seattle, London) Make sure your business logic and databases handle network splits!

12 Distributed Architectures A node is the smallest executable standalone unit consisting of a running instance of the Erlang runtime system.

13 Node Types (diagram labels: Clients, Server, Front-end Node, Business Node, Database, Service Node)

14 Fully Meshed

15 Dynamo

16 Dynamo
Key Range: hash(sessionid1), hash(sessionid2)
Vnodes: Vnode 1, Vnode 2, Vnode 3, Vnode 4, ... Vnode 32
Erlang Nodes:
Erlang Node 1: vnodes 1, 5, 9, 13, 17, 21, 25, 29
Erlang Node 2: vnodes 2, 6, 10, 14, 18, 22, 26, 30
Erlang Node 3: vnodes 3, 7, 11, 15, 19, 23, 27, 31
Erlang Node 4: vnodes 4, 8, 12, 16, 20, 24, 28, 32
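The mapping on this slide can be sketched in a few lines. This is a minimal illustration, not how any particular Dynamo-style store does it: the use of SHA-1, the modulo step, and the function names are all assumptions made for the example. Only the 32 vnodes, the four Erlang nodes, and the round-robin ownership pattern come from the slide.

```python
import hashlib

NUM_VNODES = 32  # vnodes in the key range, as on the slide
NUM_NODES = 4    # Erlang nodes sharing those vnodes


def vnode_for(key: str) -> int:
    """Hash the key and map it to a vnode numbered 1..NUM_VNODES.

    SHA-1 plus modulo is an assumption for this sketch; real systems
    usually split the hash space into contiguous ranges instead.
    """
    digest = hashlib.sha1(key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % NUM_VNODES + 1


def node_for_vnode(vnode: int) -> int:
    """Vnodes are dealt round-robin: node 1 owns 1, 5, 9, ...; node 2 owns 2, 6, 10, ..."""
    return (vnode - 1) % NUM_NODES + 1


def vnodes_owned_by(node: int) -> list:
    """List the vnodes a given Erlang node is responsible for."""
    return [v for v in range(1, NUM_VNODES + 1) if node_for_vnode(v) == node]


def locate(key: str) -> tuple:
    """Return (vnode, node) for a key such as a session id."""
    vnode = vnode_for(key)
    return vnode, node_for_vnode(vnode)
```

Because the key is hashed rather than looked up, any node can work out where a session lives without asking a central directory.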

17 Service Dynamo Service Service Service Service

18 Service Service Dynamo Service Service Service Service Service Service

19 Service Bus Service Bus Service Service Service

20 Peer to Peer

21 Peer to Peer (diagram labels: Front-End, Front-End, Front-End, Front-End)

22 Clients Networking DMZ Server Business

23 Clients Networking IMAP Server DMZ Business Database

24 STEPS EVOLVING AROUND DISTRIBUTION
1. Split up your system's functionality into manageable, stand-alone nodes.
2. Choose a distributed architectural pattern.
3. Choose the network protocols your nodes, node families, and clusters will use when communicating with each other.
4. Define your node interfaces, state and data model.

25 Systems That Never Stop You need at least two computers to make a fault tolerant system.

26 Fault Tolerance (diagram labels: Client, Request, Error, Error/Timeout)

27 Resilience (diagram labels: Client, Request1, Error, Request2, Reply)

28 Reliability (diagram labels: Client, Load Balancer, Request, Reply)
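One way to read the load-balancer slide is that a failed node is hidden from the client because the request is passed to another node. The sketch below illustrates that idea only; the `LoadBalancer` class, the `NodeDown` exception, and the two toy backends are hypothetical names invented for this example, and forwarding a request to a second node is only safe when running it twice does no harm.

```python
class NodeDown(Exception):
    """Raised by a backend that cannot serve the request."""


class LoadBalancer:
    """Hypothetical round-robin balancer that fails over to the next backend."""

    def __init__(self, backends):
        self.backends = list(backends)  # callables that take a request
        self.next_index = 0

    def handle(self, request):
        """Try each backend at most once, starting at the round-robin position."""
        for attempt in range(len(self.backends)):
            index = (self.next_index + attempt) % len(self.backends)
            try:
                reply = self.backends[index](request)
            except NodeDown:
                continue  # hide the failure from the client, try the next node
            self.next_index = (index + 1) % len(self.backends)
            return reply
        raise NodeDown("no backend available")


def healthy(request):
    """Toy backend that always answers."""
    return ("reply", request)


def crashed(request):
    """Toy backend that is always down."""
    raise NodeDown("node is down")
```

With `LoadBalancer([crashed, healthy])`, the client still gets a reply; with only crashed backends, the error reaches the client.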

29 Sharing Data When you have at least two computers to make a fault tolerant system, you need to share state and data.

30 Share Nothing (diagram labels: Client1, Client2, Load Balancer, Login, Request, Unknown session, Session1, Session2)

31 Share Something (diagram labels: Client, Load Balancer, Buy book, Buy train set, Buy, Session1, book, train set)

32 Share Everything (diagram labels: Client, Load Balancer, Buy book, Buy train set, Remove book, Buy, Remove, Session1, book, train set)
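The three sharing strategies differ in how many nodes hold a copy of each session. The toy model below is a sketch under that assumption: the `Cluster` class, its `copies` parameter, and the node names are hypothetical, and replication is shown as a plain in-memory write rather than a real network protocol.

```python
class Cluster:
    """Toy cluster where `copies` nodes hold each session.

    copies == 1                -> share nothing
    1 < copies < cluster size  -> share something
    copies == cluster size     -> share everything
    """

    def __init__(self, node_names, copies):
        self.names = list(node_names)
        self.copies = copies
        self.stores = {name: {} for name in self.names}
        self.alive = set(self.names)

    def login(self, session_id, home):
        """Create the session on its home node and on the next copies-1 nodes."""
        start = self.names.index(home)
        for offset in range(self.copies):
            replica = self.names[(start + offset) % len(self.names)]
            self.stores[replica][session_id] = []

    def crash(self, name):
        """A crashed node takes its in-memory sessions with it."""
        self.alive.discard(name)
        self.stores[name] = {}

    def buy(self, session_id, item):
        """Apply the purchase on every live node that still knows the session."""
        holders = [n for n in self.names
                   if n in self.alive and session_id in self.stores[n]]
        if not holders:
            return "unknown session"
        for name in holders:
            self.stores[name][session_id].append(item)
        return "ok"
```

With `copies=1`, crashing the home node turns the next request into "unknown session"; with more copies the session survives, at the cost of writing every update to more nodes.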

33 Network Partitions Session1 Session1 Session1 Session1 book train set train set book
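When both sides of a partition keep accepting updates to the same session, the copies diverge and have to be reconciled once the network heals. The sketch below shows one possible rule, a set union per session. It is an assumption chosen for illustration, not the approach the talk prescribes, and `merge_sessions` is a hypothetical name.

```python
def merge_sessions(side_a, side_b):
    """Merge two session stores (session id -> set of items) after a partition heals.

    Union keeps every item that either side saw. The known weakness is
    that an item removed on one side reappears if the other side still has it.
    """
    merged = {}
    for session_id in side_a.keys() | side_b.keys():
        items_a = side_a.get(session_id, set())
        items_b = side_b.get(session_id, set())
        merged[session_id] = items_a | items_b
    return merged
```

For example, merging `{"Session1": {"book"}}` with `{"Session1": {"train set"}}` gives a Session1 that holds both items, whichever side the merge starts from.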

34 Retry Strategy (diagram labels: Client, Request, Reply, {duplicate, Reply})
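The `{duplicate, Reply}` label suggests a server that recognises a retried request and answers with the reply it already produced instead of running the request again. A minimal sketch of that behaviour follows; the request id, the `Server` class, and the `done:` reply format are assumptions made for the example and do not appear on the slide.

```python
class Server:
    """Hypothetical server that remembers replies by request id."""

    def __init__(self):
        self.replies = {}  # request id -> reply already sent
        self.executed = 0  # how many times the side effect really ran

    def handle(self, request_id, payload):
        if request_id in self.replies:
            # A retry of a request we already executed: do not run it again.
            return ("duplicate", self.replies[request_id])
        self.executed += 1
        reply = "done:" + payload
        self.replies[request_id] = reply
        return reply
```

A client that times out and resends request 1 gets `("duplicate", reply)` back, and the purchase is applied only once.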

35 Trade-offs
Recovery Strategy: exactly once, at least once, at most once
Sharing Data: share everything, share something, share nothing
Axis labels: Consistency, Reliability, Availability

36 STEPS EVOLVING AROUND AVAILABILITY, CONSISTENCY & RELIABILITY
For every interface function in your nodes, you need to pick a retry strategy.
For all your data and state, pick your sharing strategy across node families, clusters and types, taking into consideration the needs of your retry strategy.
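Picking a retry strategy per interface function comes down to what the caller does when no reply arrives. The sketch below contrasts at most once with at least once on the client side. The `Timeout` exception, the `FlakyNode` test double, and the `max_attempts` default are hypothetical; exactly once is not shown because it generally also needs the receiver to detect duplicates.

```python
class Timeout(Exception):
    """No reply arrived; the request may or may not have been executed."""


def at_most_once(send, request):
    """Send once and never retry: the request runs zero or one times."""
    try:
        return send(request)
    except Timeout:
        return None


def at_least_once(send, request, max_attempts=3):
    """Retry until a reply arrives: the request may run more than once."""
    for _ in range(max_attempts):
        try:
            return send(request)
        except Timeout:
            continue
    raise Timeout("no reply after %d attempts" % max_attempts)


class FlakyNode:
    """Hypothetical node that executes every request but loses the first `lost` replies."""

    def __init__(self, lost):
        self.lost = lost
        self.executions = 0

    def send(self, request):
        self.executions += 1
        if self.lost > 0:
            self.lost -= 1
            raise Timeout("reply lost")
        return ("ok", request)
```

Against a node that loses two replies, `at_least_once` returns a reply but the request has executed three times, which is exactly the property the sharing strategy has to tolerate.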

37 Scaling Out Distribute for scale and replicate for availability.

38 Scaling Vertically Client Client Client Client Server

39 Scaling Horizontally web server web server web server web server web server web server web server web server Node Node Node

40 Trade-offs
Recovery Strategy: exactly once, at least once, at most once
Sharing Data: share everything, share something, share nothing
Axis labels: Consistency, Availability, Scalability

41 Capacity Planning

43 - CAPACITY PLANNING - Capacity planning is the design phase which guarantees that your system can withstand the load it was built to handle and, with time, scale to handle increased demand.
No single point of failure
Cluster blueprint for scalability
Load Regulation
Back Pressure
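Back pressure can be as simple as refusing new work once a queue is full, so that overload is reported to the caller instead of building up inside the node. The sketch below shows that idea only; `BoundedQueue`, its capacity, and the "accepted"/"overloaded" return values are hypothetical names for this example.

```python
from collections import deque


class BoundedQueue:
    """Hypothetical request queue that pushes back once it is full."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.items = deque()

    def offer(self, request):
        """Accept the request if there is room, otherwise tell the caller to back off."""
        if len(self.items) >= self.capacity:
            return "overloaded"
        self.items.append(request)
        return "accepted"

    def take(self):
        """Remove the oldest queued request, or return None if the queue is empty."""
        return self.items.popleft() if self.items else None
```

A caller that receives "overloaded" can slow down, retry later, or shed the request; which of these is right depends on the retry strategy chosen for that interface.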

44 Monitoring and Preemptive Support With the right tools and approach, the five nines once reserved for Telecom systems are now easily attainable in whatever other vertical for which you might be developing software.
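To put "five nines" in concrete terms: 99.999% availability leaves about 5.26 minutes of downtime per year. The short calculation below works this out; the function names are made up for the example, and a 365-day year is assumed.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes, ignoring leap years


def nines(count):
    """Availability with `count` nines, e.g. nines(5) is 0.99999."""
    return 1.0 - 10.0 ** -count


def downtime_minutes_per_year(availability):
    """Minutes per year the system may be unavailable at this availability."""
    return (1.0 - availability) * MINUTES_PER_YEAR
```

By the same arithmetic, three nines allows roughly 525.6 minutes (under nine hours) a year, which shows how much each extra nine tightens the budget.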

45 Heterogeneous multi-core hardware is here to stay Different cores doing different things CPUs, GPUs, FPGA Parallella Board Dual core ARM processor + FPGA 1GB RAM + MicroSD Card 16 or 64 core Epiphany co-processor Gigabit Ethernet 2x USB ports + HDMI port

47 The Fastest Computer in the World! Tianhe-2 Chinese National University of Defence Technology petaflops/s (November 2013) 16,000 Nodes, each with 2 Ivy Bridge multicores and 3 Xeon Phis 3,120,000 x86 cores in total

51
1. Split up your system's functionality into manageable, stand-alone nodes.
2. Decide what distributed architectural pattern you are going to use.
3. Decide what network protocols your nodes, node families and clusters will use when communicating with each other.
4. Define your node interfaces, state and data model.
5. For every interface function in your nodes, you need to pick a retry strategy.

52
6. For all your data and state, pick your sharing strategy across node families, clusters and types, taking into consideration the needs of your retry strategy.
7. Reiterate through steps 1, 2, 3, 4, 5 & 6 until you have the trade-offs which suit your specification.
8. Design your cluster blueprint, looking at node ratios for scaling up and down.
9. Identify where to apply backpressure and load regulation.
10. Define your O&M approach, defining system and business alarms, logs and metrics.

53 THANK YOU! Any questions? Discount Code: authd 50% off the Early Release 40% off the printed copy

2015 Erlang Solutions Ltd

2015 Erlang Solutions Ltd Thinking in a Highly Concurrent, Mostly-functional Language GOTO Berlin Berlin, December 4 th 2015 Francesco Cesarini Founder & Technical Director @francescoc francesco@erlang-solutions.com Thinking in

More information

Thinking in a Highly Concurrent, Mostly-functional Language

Thinking in a Highly Concurrent, Mostly-functional Language Thinking in a Highly Concurrent, Mostly-functional Language Build Stuff Vilnius, November 19th 2014 Francesco Cesarini Founder & Technical Director @francescoc francesco@erlang-solutions.com Thinking in

More information

FRANCESCO CESARINI. presents ERLANG/OTP. Francesco Cesarini Erlang

FRANCESCO CESARINI. presents ERLANG/OTP. Francesco Cesarini Erlang FRANCESCO CESARINI presents Francesco Cesarini Erlang Solutions ERLANG/OTP @FrancescoC francesco@erlang-solutions.com www.erlang-solutions.com WHAT IS SCALABILITY? WHAT IS (MASSIVE) CONCURRENCY? WHAT

More information

It also performs many parallelization operations like, data loading and query processing.

It also performs many parallelization operations like, data loading and query processing. Introduction to Parallel Databases Companies need to handle huge amount of data with high data transfer rate. The client server and centralized system is not much efficient. The need to improve the efficiency

More information

Message Passing. Advanced Operating Systems Tutorial 7

Message Passing. Advanced Operating Systems Tutorial 7 Message Passing Advanced Operating Systems Tutorial 7 Tutorial Outline Review of Lectured Material Discussion: Erlang and message passing 2 Review of Lectured Material Message passing systems Limitations

More information

A Closer Look at the Epiphany IV 28nm 64 core Coprocessor. Andreas Olofsson PEGPUM 2013

A Closer Look at the Epiphany IV 28nm 64 core Coprocessor. Andreas Olofsson PEGPUM 2013 A Closer Look at the Epiphany IV 28nm 64 core Coprocessor Andreas Olofsson PEGPUM 2013 1 Adapteva Achieves 3 World Firsts 1. First processor company to reach 50 GFLOPS/W 3. First semiconductor company

More information

An Introduction to Erlang

An Introduction to Erlang Erlang Solutions Ltd An Introduction to Erlang From behind the trenches GOTO Amsterdam Amsterdam, May 25 th 2012 Francesco Cesarini Founder, Technical Director @FrancescoC francesco@erlang-solutions.com

More information

IntegratedArchitecture

IntegratedArchitecture IntegratedArchitecture x PERFORMANCE ARCHITECTURE A HIGH Smart. Productive. Secure. Automation system with intelligence that just works. Simple integrated development resulting in improved real-time operations

More information

Distributed Systems. Thoai Nam Faculty of Computer Science and Engineering HCMC University of Technology

Distributed Systems. Thoai Nam Faculty of Computer Science and Engineering HCMC University of Technology Distributed Systems Thoai Nam Faculty of Computer Science and Engineering HCMC University of Technology Chapter 1: Introduction Distributed Systems Hardware & software Transparency Scalability Distributed

More information

An Introduction to Erlang

An Introduction to Erlang Erlang Solutions Ltd An Introduction to Erlang From behind the trenches GOTO Copenhagen May 13 th, 2011 Francesco Cesarini Founder, Technical Director @FrancescoC francesco@erlang-solutions.com So Here

More information

Introduction to Distributed Data Systems

Introduction to Distributed Data Systems Introduction to Distributed Data Systems Serge Abiteboul Ioana Manolescu Philippe Rigaux Marie-Christine Rousset Pierre Senellart Web Data Management and Distribution http://webdam.inria.fr/textbook January

More information

Distributed Systems LEEC (2006/07 2º Sem.)

Distributed Systems LEEC (2006/07 2º Sem.) Distributed Systems LEEC (2006/07 2º Sem.) Introduction João Paulo Carvalho Universidade Técnica de Lisboa / Instituto Superior Técnico Outline Definition of a Distributed System Goals Connecting Users

More information

HPC Architectures. Types of resource currently in use

HPC Architectures. Types of resource currently in use HPC Architectures Types of resource currently in use Reusing this material This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 4.0 International License. http://creativecommons.org/licenses/by-nc-sa/4.0/deed.en_us

More information

Mini USB 3.0 Dual Display Dock

Mini USB 3.0 Dual Display Dock Mini USB 3.0 Dual Display Dock VGA & HDMI Dual Display / Gigabit Ethernet / USB 3.0 Hub QUICK INSTALLATION GUIDE - 1 - INTRODUCTION Thanks for purchasing the Mini USB 3.0 Dual Display Dock. USB Type C

More information

Cassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent

Cassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent Tanton Jeppson CS 401R Lab 3 Cassandra, MongoDB, and HBase Introduction For my report I have chosen to take a deeper look at 3 NoSQL database systems: Cassandra, MongoDB, and HBase. I have chosen these

More information

Distributed Systems 24. Fault Tolerance

Distributed Systems 24. Fault Tolerance Distributed Systems 24. Fault Tolerance Paul Krzyzanowski pxk@cs.rutgers.edu 1 Faults Deviation from expected behavior Due to a variety of factors: Hardware failure Software bugs Operator errors Network

More information

The Actor Model applied to the Raspberry Pi and the Embedded Domain. Omer

The Actor Model applied to the Raspberry Pi and the Embedded Domain. Omer The Actor Model applied to the Raspberry Pi and the Embedded Domain Omer Kilic @OmerK omer@erlang-solutions.com Agenda Current state of Embedded Systems Overview of the Actor Model Erlang Embedded Project

More information

OVERVIEW OF DIFFERENT APPLICATION SERVER MODELS

OVERVIEW OF DIFFERENT APPLICATION SERVER MODELS OVERVIEW OF DIFFERENT APPLICATION SERVER MODELS Before you start Objectives: learn what is application server, what is thin-client and what is fat-client, and about different types of application server

More information

Cloud Computing CS

Cloud Computing CS Cloud Computing CS 15-319 Distributed File Systems and Cloud Storage Part I Lecture 12, Feb 22, 2012 Majd F. Sakr, Mohammad Hammoud and Suhail Rehman 1 Today Last two sessions Pregel, Dryad and GraphLab

More information

BUILDING A SCALABLE MOBILE GAME BACKEND IN ELIXIR. Petri Kero CTO / Ministry of Games

BUILDING A SCALABLE MOBILE GAME BACKEND IN ELIXIR. Petri Kero CTO / Ministry of Games BUILDING A SCALABLE MOBILE GAME BACKEND IN ELIXIR Petri Kero CTO / Ministry of Games MOBILE GAME BACKEND CHALLENGES Lots of concurrent users Complex interactions between players Persistent world with frequent

More information

There s STILL plenty of room at the bottom! Andreas Olofsson

There s STILL plenty of room at the bottom! Andreas Olofsson There s STILL plenty of room at the bottom! Andreas Olofsson 1 Richard Feynman s Lecture (1959) There's Plenty of Room at the Bottom An Invitation to Enter a New Field of Physics Why cannot we write the

More information

Assessing performance in HP LeftHand SANs

Assessing performance in HP LeftHand SANs Assessing performance in HP LeftHand SANs HP LeftHand Starter, Virtualization, and Multi-Site SANs deliver reliable, scalable, and predictable performance White paper Introduction... 2 The advantages of

More information

The Google File System

The Google File System The Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google SOSP 03, October 19 22, 2003, New York, USA Hyeon-Gyu Lee, and Yeong-Jae Woo Memory & Storage Architecture Lab. School

More information

An Oracle White Paper May Oracle VM 3: Overview of Disaster Recovery Solutions

An Oracle White Paper May Oracle VM 3: Overview of Disaster Recovery Solutions An Oracle White Paper May 2014 Oracle VM 3: Overview of Disaster Recovery Solutions Contents Introduction... 1 Overview of DR Solutions with Oracle VM... 2 Choose your DR solution path... 2 Continuous

More information

Distributed System Chapter 16 Issues in ch 17, ch 18

Distributed System Chapter 16 Issues in ch 17, ch 18 Distributed System Chapter 16 Issues in ch 17, ch 18 1 Chapter 16: Distributed System Structures! Motivation! Types of Network-Based Operating Systems! Network Structure! Network Topology! Communication

More information

Microsoft SQL Server on Stratus ftserver Systems

Microsoft SQL Server on Stratus ftserver Systems W H I T E P A P E R Microsoft SQL Server on Stratus ftserver Systems Security, scalability and reliability at its best Uptime that approaches six nines Significant cost savings for your business Only from

More information

Overview of Tianhe-2

Overview of Tianhe-2 Overview of Tianhe-2 (MilkyWay-2) Supercomputer Yutong Lu School of Computer Science, National University of Defense Technology; State Key Laboratory of High Performance Computing, China ytlu@nudt.edu.cn

More information

ERLANG EVOLVES FOR MULTI-CORE AND CLOUD ENVIRONMENTS

ERLANG EVOLVES FOR MULTI-CORE AND CLOUD ENVIRONMENTS ERLANG EVOLVES FOR MULTI-CORE AND CLOUD ENVIRONMENTS Torben Hoffmann Erlang Solutions Ltd. @LeHoff http://musings-of-an-erlang-priest.blogspot.dk/ 1 Agenda Erlang fundamentals Challenges 2 2 Warning 1:

More information

High Availability Using Fault Tolerance in the SAN. Wendy Betts, IBM Mark Fleming, IBM

High Availability Using Fault Tolerance in the SAN. Wendy Betts, IBM Mark Fleming, IBM High Availability Using Fault Tolerance in the SAN Wendy Betts, IBM Mark Fleming, IBM SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individual

More information

Large-scale cluster management at Google with Borg

Large-scale cluster management at Google with Borg Large-scale cluster management at Google with Borg Abhishek Verma, Luis Pedrosa, Madhukar Korupolu, David Oppenheimer, Eric Tune, John Wilkes Google Inc. Slides heavily derived from John Wilkes s presentation

More information

Cisco Universal Small Cell 8050 Enterprise Management System

Cisco Universal Small Cell 8050 Enterprise Management System Data Sheet Cisco Universal Small Cell 8050 Enterprise Management System The Cisco Universal Small Cell 8050 Enterprise Management System (USC 8050 EMS) is part of the Cisco Universal Small Cell Solution,

More information

Distributed Systems. Overview. Distributed Systems September A distributed system is a piece of software that ensures that:

Distributed Systems. Overview. Distributed Systems September A distributed system is a piece of software that ensures that: Distributed Systems Overview Distributed Systems September 2002 1 Distributed System: Definition A distributed system is a piece of software that ensures that: A collection of independent computers that

More information

Patriot Hardware and Systems Software Requirements

Patriot Hardware and Systems Software Requirements Patriot Hardware and Systems Software Requirements Patriot is designed and written for Microsoft Windows in native C# and.net. As a result, it is a stable and consistent Windows application. Patriot is

More information

Chapter 1: Distributed Information Systems

Chapter 1: Distributed Information Systems Chapter 1: Distributed Information Systems Contents - Chapter 1 Design of an information system Layers and tiers Bottom up design Top down design Architecture of an information system One tier Two tier

More information

FYS Data acquisition & control. Introduction. Spring 2018 Lecture #1. Reading: RWI (Real World Instrumentation) Chapter 1.

FYS Data acquisition & control. Introduction. Spring 2018 Lecture #1. Reading: RWI (Real World Instrumentation) Chapter 1. FYS3240-4240 Data acquisition & control Introduction Spring 2018 Lecture #1 Reading: RWI (Real World Instrumentation) Chapter 1. Bekkeng 14.01.2018 Topics Instrumentation: Data acquisition and control

More information

Client Server & Distributed System. A Basic Introduction

Client Server & Distributed System. A Basic Introduction Client Server & Distributed System A Basic Introduction 1 Client Server Architecture A network architecture in which each computer or process on the network is either a client or a server. Source: http://webopedia.lycos.com

More information

Taurus Super-S Combo

Taurus Super-S Combo Dual-Bay Storage Enclosure for 3.5 Serial ATA Hard Drives User Manual May 11, 2009 - v1.1 EN Table of Contents CHAPTER 1 - INTRODUCTION 1 CHAPTER 3 - SYSTEM SET UP 7 ICON KEY 1 THE TAURUS SUPER-S COMBO

More information

Chapter 2 Parallel Hardware

Chapter 2 Parallel Hardware Chapter 2 Parallel Hardware Part I. Preliminaries Chapter 1. What Is Parallel Computing? Chapter 2. Parallel Hardware Chapter 3. Parallel Software Chapter 4. Parallel Applications Chapter 5. Supercomputers

More information

HP Supporting the HP ProLiant Storage Server Product Family.

HP Supporting the HP ProLiant Storage Server Product Family. HP HP0-698 Supporting the HP ProLiant Storage Server Product Family https://killexams.com/pass4sure/exam-detail/hp0-698 QUESTION: 1 What does Volume Shadow Copy provide?. A. backup to disks B. LUN duplication

More information

Open Compute Stack (OpenCS) Overview. D.D. Nikolić Updated: 20 August 2018 DAE Tools Project,

Open Compute Stack (OpenCS) Overview. D.D. Nikolić Updated: 20 August 2018 DAE Tools Project, Open Compute Stack (OpenCS) Overview D.D. Nikolić Updated: 20 August 2018 DAE Tools Project, http://www.daetools.com/opencs What is OpenCS? A framework for: Platform-independent model specification 1.

More information

Erlang. Joe Armstrong.

Erlang. Joe Armstrong. Erlang Joe Armstrong joe.armstrong@ericsson.com 1 Who is Joe? Inventor of Erlang, UBF, Open Floppy Grid Chief designer of OTP Founder of the company Bluetail Currently Software Architect Ericsson Current

More information

Database Architectures

Database Architectures B0B36DBS, BD6B36DBS: Database Systems h p://www.ksi.m.cuni.cz/~svoboda/courses/172-b0b36dbs/ Lecture 11 Database Architectures Authors: Tomáš Skopal, Irena Holubová Lecturer: Mar n Svoboda, mar n.svoboda@fel.cvut.cz

More information

Gustavo Alonso, ETH Zürich. Web services: Concepts, Architectures and Applications - Chapter 1 2

Gustavo Alonso, ETH Zürich. Web services: Concepts, Architectures and Applications - Chapter 1 2 Chapter 1: Distributed Information Systems Gustavo Alonso Computer Science Department Swiss Federal Institute of Technology (ETHZ) alonso@inf.ethz.ch http://www.iks.inf.ethz.ch/ Contents - Chapter 1 Design

More information

Wind River. All Rights Reserved.

Wind River. All Rights Reserved. 1 Using Simulation to Develop and Maintain a System of Connected Devices Didier Poirot Simics Technical Account Manager THE CHALLENGES OF DEVELOPING CONNECTED ELECTRONIC SYSTEMS 3 Mobile Networks Update

More information

Monitor System Status

Monitor System Status View Cluster Nodes Status, on page 1 View Hardware Status, on page 1 View Network Status, on page 2 View Installed Software, on page 2 View System Status, on page 2 View IP Preferences, on page 3 View

More information

Telecom Open Rack Concept

Telecom Open Rack Concept Accelerating Communication Services TM Telecom Open Rack Concept OCP Telco WG May 23, 2016 MT 02.04.15 02.07.151 Concept & Philosophy Leverage current good ideas from Open Compute Project as a base model

More information

A QoS-Aware Middleware for Fault Tolerant Web Services

A QoS-Aware Middleware for Fault Tolerant Web Services A QoS-Aware Middleware for Fault Tolerant Web Services Zibin Zheng and Michael R. Lyu Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China ISSRE 2008, Seattle,

More information

Flat Datacenter Storage. Edmund B. Nightingale, Jeremy Elson, et al. 6.S897

Flat Datacenter Storage. Edmund B. Nightingale, Jeremy Elson, et al. 6.S897 Flat Datacenter Storage Edmund B. Nightingale, Jeremy Elson, et al. 6.S897 Motivation Imagine a world with flat data storage Simple, Centralized, and easy to program Unfortunately, datacenter networks

More information

Parallella: A $99 Open Hardware Parallel Computing Platform

Parallella: A $99 Open Hardware Parallel Computing Platform Inventing the Future of Computing Parallella: A $99 Open Hardware Parallel Computing Platform Andreas Olofsson andreas@adapteva.com IPDPS May 22th, Cambridge, MA Adapteva Achieves 3 World Firsts 1. First

More information

Google File System. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google fall DIP Heerak lim, Donghun Koo

Google File System. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google fall DIP Heerak lim, Donghun Koo Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google 2017 fall DIP Heerak lim, Donghun Koo 1 Agenda Introduction Design overview Systems interactions Master operation Fault tolerance

More information

An Introduction to Erlang

An Introduction to Erlang Erlang Solutions Ltd An Introduction to Erlang From behind the trenches Erlang Factory Lite Zurich, April 23 rd 2012 Francesco Cesarini Founder, Technical Director @FrancescoC francesco@erlang-solutions.com

More information

Slurm Workload Manager Overview SC15

Slurm Workload Manager Overview SC15 Slurm Workload Manager Overview SC15 Alejandro Sanchez alex@schedmd.com Slurm Workload Manager Overview Originally intended as simple resource manager, but has evolved into sophisticated batch scheduler

More information

Distributed Systems 23. Fault Tolerance

Distributed Systems 23. Fault Tolerance Distributed Systems 23. Fault Tolerance Paul Krzyzanowski pxk@cs.rutgers.edu 4/20/2011 1 Faults Deviation from expected behavior Due to a variety of factors: Hardware failure Software bugs Operator errors

More information

arm MULTICORE PLATFORMS FOR ADVANCED APPLICATIONS Product Longevity

arm MULTICORE PLATFORMS FOR ADVANCED APPLICATIONS Product Longevity The new i.mx B processor series builds on 15+ years of cutting-edge technology, robust software, and broad ecosystem support to continue the NXP tradition of helping customers find the right balance between

More information

LBRN - HPC systems : CCT, LSU

LBRN - HPC systems : CCT, LSU LBRN - HPC systems : CCT, LSU HPC systems @ CCT & LSU LSU HPC Philip SuperMike-II SuperMIC LONI HPC Eric Qeenbee2 CCT HPC Delta LSU HPC Philip 3 Compute 32 Compute Two 2.93 GHz Quad Core Nehalem Xeon 64-bit

More information

Interconnect Your Future

Interconnect Your Future Interconnect Your Future Gilad Shainer 2nd Annual MVAPICH User Group (MUG) Meeting, August 2014 Complete High-Performance Scalable Interconnect Infrastructure Comprehensive End-to-End Software Accelerators

More information

SuccessMaker Learning Management System System Requirements v1.0

SuccessMaker Learning Management System System Requirements v1.0 SuccessMaker Learning Management System System Requirements v1.0 Released July 2008 Copyright @ 2008 Pearson Education, Inc. or its affiliates. All rights reserved. SuccessMaker is a registered trademark,

More information

Scott Meder Senior Regional Sales Manager

Scott Meder Senior Regional Sales Manager www.raima.com Scott Meder Senior Regional Sales Manager scott.meder@raima.com Short Introduction to Raima What is Data Management What are your requirements? How do I make the right decision? - Architecture

More information

GFS: The Google File System

GFS: The Google File System GFS: The Google File System Brad Karp UCL Computer Science CS GZ03 / M030 24 th October 2014 Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one

More information

Google is Really Different.

Google is Really Different. COMP 790-088 -- Distributed File Systems Google File System 7 Google is Really Different. Huge Datacenters in 5+ Worldwide Locations Datacenters house multiple server clusters Coming soon to Lenior, NC

More information

Indexing Large-Scale Data

Indexing Large-Scale Data Indexing Large-Scale Data Serge Abiteboul Ioana Manolescu Philippe Rigaux Marie-Christine Rousset Pierre Senellart Web Data Management and Distribution http://webdam.inria.fr/textbook November 16, 2010

More information

Module 15: Network Structures

Module 15: Network Structures Module 15: Network Structures Background Topology Network Types Communication Communication Protocol Robustness Design Strategies 15.1 A Distributed System 15.2 Motivation Resource sharing sharing and

More information

Installing VMware vsphere 5.1 Components

Installing VMware vsphere 5.1 Components Installing VMware vsphere 5.1 Components Module 14 You Are Here Course Introduction Introduction to Virtualization Creating Virtual Machines VMware vcenter Server Configuring and Managing Virtual Networks

More information

Reducing Time-to-Market with i.mx6-based Qseven Modules

Reducing Time-to-Market with i.mx6-based Qseven Modules Reducing Time-to-Market with i.mx6-based Qseven Modules congatec Facts The preferred global vendor for innovative embedded solutions to enable competitive advantages for our customers. Founded December

More information

Distributed Shared Memory

Distributed Shared Memory Distributed Shared Memory EECS 498 Farnam Jahanian University of Michigan Reading List Supplemental Handout: pp. 312-313, 333-353 from Tanenbaum Dist. OS text (dist. in class) What DSM? Concept & Design

More information

Module 16: Distributed System Structures

Module 16: Distributed System Structures Chapter 16: Distributed System Structures Module 16: Distributed System Structures Motivation Types of Network-Based Operating Systems Network Structure Network Topology Communication Structure Communication

More information

WOMBATOAM OPERATIONS & MAINTENANCE FOR ERLANG & ELIXIR SYSTEMS

WOMBATOAM OPERATIONS & MAINTENANCE FOR ERLANG & ELIXIR SYSTEMS version 3.0.0 3.0.0 IMPROVEMENTS OVER 2.0.0 MONITORING ++ Extensive dashboard redesign with a new, more intuitive user interface using GridStack. ++ Improved Mnesia netsplit service to detect and fix partitions

More information

WOMBATOAM OPERATIONS & MAINTENANCE FOR ERLANG & ELIXIR SYSTEMS

WOMBATOAM OPERATIONS & MAINTENANCE FOR ERLANG & ELIXIR SYSTEMS version 3.0.0 3.0.0 IMPROVEMENTS OVER 2.0.0 MONITORING ++ Extensive dashboard redesign with a new, more intuitive user interface using GridStack. ++ Improved Mnesia netsplit service to detect and fix partitions

More information

Graph Database and Analytics in a GPU- Accelerated Cloud Offering
