Distributed Systems Principles and Paradigms

Similar documents
Distributed Systems Principles and Paradigms. Chapter 01: Introduction

Distributed Systems Principles and Paradigms. Chapter 01: Introduction. Contents. Distributed System: Definition.

CA464 Distributed Programming

Distributed Systems. Overview. Distributed Systems September A distributed system is a piece of software that ensures that:

Introduction. Distributed Systems IT332

DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN. Chapter 1. Introduction

DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN. Chapter 1. Introduction

Distributed Systems. Edited by. Ghada Ahmed, PhD. Fall (3rd Edition) Maarten van Steen and Tanenbaum

Distributed Systems. Chapter 1: Introduction

Advanced Distributed Systems

CSE 5306 Distributed Systems. Course Introduction

Chapter 1: Introduction 1/29

Outline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems

Distributed Systems Principles and Paradigms

Introduction to Distributed Systems (DS)

Introduction to Distributed Systems. INF5040/9040 Autumn 2018 Lecturer: Eli Gjørven (ifi/uio)

Distributed System: Definition

Introduction Distributed Systems

Introduction to Distributed Systems (DS)

Chapter 1 Introduction

Distributed Information Processing

Concepts of Distributed Systems 2006/2007

DISTRIBUTED SYSTEMS. Second Edition. Andrew S. Tanenbaum Maarten Van Steen. Vrije Universiteit Amsterdam, 7'he Netherlands PEARSON.

Chapter 1: Distributed Systems: What is a distributed system? Fall 2013

What is a distributed system?

Introduction to Distributed Systems

Designing Issues For Distributed Computing System: An Empirical View

TDP3471 Distributed and Parallel Computing

Introduction to Distributed Systems

Distributed Systems Principles and Paradigms

Distributed Operating Systems Spring Prashant Shenoy UMass Computer Science.

Distributed Systems LEEC (2006/07 2º Sem.)

Distributed Operating Systems Fall Prashant Shenoy UMass Computer Science. CS677: Distributed OS

Distributed Systems. Lecture 4 Othon Michail COMP 212 1/27

Announcements. me your survey: See the Announcements page. Today. Reading. Take a break around 10:15am. Ack: Some figures are from Coulouris

Distributed Systems. Characteristics of Distributed Systems. Lecture Notes 1 Basic Concepts. Operating Systems. Anand Tripathi

Distributed Systems. Characteristics of Distributed Systems. Characteristics of Distributed Systems. Goals in Distributed System Designs

Distributed Systems Principles and Paradigms

Distributed Systems. Prof. Dr. Schahram Dustdar Distributed Systems Group Vienna University of Technology. dsg.tuwien.ac.

Introduction to Distributed Systems

Support for resource sharing Openness Concurrency Scalability Fault tolerance (reliability) Transparence. CS550: Advanced Operating Systems 2

It also performs many parallelization operations like, data loading and query processing.

Distributed Systems Principles and Paradigms. Chapter 12: Distributed Web-Based Systems

Client Server & Distributed System. A Basic Introduction

Introduction to Distributed Systems

02 - Distributed Systems

Chapter 2 System Models

Distributed and Operating Systems Spring Prashant Shenoy UMass Computer Science.

02 - Distributed Systems

A Location Service for Worldwide Distributed Objects

CS655: Advanced Topics in Distributed Systems [Fall 2013] Dept. Of Computer Science, Colorado State University

Introduction to Distributed Systems

Introduction. Chapter 1

Ian Foster, CS554: Data-Intensive Computing

Distributed Systems. Prof. Dr. Schahram Dustdar Distributed Systems Group Vienna University of Technology. dsg.tuwien.ac.

Chapter 8 Fault Tolerance

殷亚凤. Processes. Distributed Systems [3]

Multiprocessors 2007/2008

Chapter 8 Fault Tolerance

2.1 What are distributed systems? What are systems? Different kind of systems How to distribute systems? 2.2 Communication concepts

Advanced Operating Systems

Lecture 1: January 22

Outline. Distributed Computing Systems. The Rise of Distributed Systems. Depiction of a Distributed System 4/15/2014

Distributed Systems COMP 212. Lecture 1 Othon Michail

Remote Procedure Call. Tom Anderson

Chapter 20: Database System Architectures

Replication and Consistency. Fall 2010 Jussi Kangasharju

Verteilte Systeme (Distributed Systems)

Lecture 1: January 23

Introduction to Distributed Systems

Distributed Systems Principles and Paradigms. Chapter 07: Consistency & Replication

Assignment 5. Georgia Koloniari

System Models for Distributed Systems

Distributed Systems COMP 212. Lecture 1 Othon Michail

Lecture 6 Consistency and Replication

Chapter 17: Distributed Systems (DS)

Replication in Distributed Systems

Cycle Sharing Systems

Crossbar switch. Chapter 2: Concepts and Architectures. Traditional Computer Architecture. Computer System Architectures. Flynn Architectures (2)

Distributed Systems COMP 212. Lecture 15 Othon Michail

Introduction to Distributed Systems

Distributed Computing. Santa Clara University 2016

Chapter 1: Distributed Information Systems

Chapter 18: Database System Architectures.! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems!

Outline. INF3190:Distributed Systems - Examples. Last week: Definitions Transparencies Challenges&pitfalls Architecturalstyles

Distributed Systems. Chapter 02

Chapter 2 Distributed Information Systems Architecture

Distributed File Systems. CS432: Distributed Systems Spring 2017

THE IMPACT OF E-COMMERCE ON DEVELOPING A COURSE IN OPERATING SYSTEMS: AN INTERPRETIVE STUDY

Lecture 23 Database System Architectures

Lecture 9: MIMD Architectures

Communication. Distributed Systems Santa Clara University 2016

Module 1 - Distributed System Architectures & Models

CS4513 Distributed Computer Systems

Datacenter replication solution with quasardb

Chapter Outline. Chapter 2 Distributed Information Systems Architecture. Distributed transactions (quick refresh) Layers of an information system

Advanced Memory Management

Distributed systems. Lecture 6: distributed transactions, elections, consensus and replication. Malte Schwarzkopf

A NEW DISTRIBUTED COMPOSITE OBJECT MODEL FOR COLLABORATIVE COMPUTING

Today: Fault Tolerance. Replica Management

Transcription:

Distributed Systems Principles and Paradigms Chapter 01 (version September 5, 2007) Maarten van Steen Vrije Universiteit Amsterdam, Faculty of Science Dept. Mathematics and Computer Science Room R4.20. Tel: (020) 598 7784 E-mail:steen@cs.vu.nl, URL: www.cs.vu.nl/ steen/ 01 Introduction 02 Architectures 03 Processes 04 Communication 05 Naming 06 Synchronization 07 Consistency and Replication 08 Fault Tolerance 09 Security 10 Distributed Object-Based Systems 11 Distributed File Systems 12 Distributed Web-Based Systems 13 Distributed Coordination-Based Systems 00 1 / Distributed System: Definition A distributed system is a piece of software that ensures that: a collection of independent computers appears to its users as a single coherent system Two aspects: (1) independent computers and (2) single system middleware. Computer 1 Computer 2 Computer 3 Computer 4 Appl. A Application B Appl. C Distributed system layer (middleware) Local OS 1 Local OS 2 Local OS 3 Local OS 4 Network 01 1 Introduction/1.1 Definition

Goals of Distributed Systems Making resources available Distribution transparency Openness Scalability 01 2 Introduction/1.2 Goals Distribution Transparency Transparency Access Location Migration Relocation Replication Concurrency Failure Description Hides differences in data representation and invocation mechanisms Hides where an object resides Hides from an object the ability of a system to change that object s location Hides from a client the ability of a system to change the location of an object to which the client is bound Hides the fact that an object or its state may be replicated and that replicas reside at different locations Hides the coordination of activities between objects to achieve consistency at a higher level Hides failure and possible recovery of objects Note: Distribution transparency may be set as a goal, but achieving it is a different story. 01 3 Introduction/1.2 Goals

Degree of Transparency Observation: Aiming at full distribution transparency may be too much: Users may be located in different continents; distribution is apparent and not something you want to hide Completely hiding failures of networks and nodes is (theoretically and practically) impossible You cannot distinguish a slow computer from a failing one You can never be sure that a server actually performed an operation before a crash Full transparency will cost performance, exposing distribution of the system Keeping Web caches exactly up-to-date with the master copy Immediately flushing write operations to disk for fault tolerance 01 4 Introduction/1.2 Goals Openness of Distributed Systems Open distributed system: Be able to interact with services from other open systems, irrespective of the underlying environment: Systems should conform to well-defined interfaces Systems should support portability of s Systems should easily interoperate Achieving openness: At least make the distributed system independent from heterogeneity of the underlying environment: Hardware Platforms Languages 01 5 Introduction/1.2 Goals

Policies versus Mechanisms Implementing openness: Requires support for different policies specified by s and users: What level of consistency do we require for clientcached data? Which operations do we allow downloaded code to perform? Which QoS requirements do we adjust in the face of varying bandwidth? What level of secrecy do we require for communication? Implementing openness: Ideally, a distributed system provides only mechanisms: Allow (dynamic) setting of caching policies, preferably per cachable item Support different levels of trust for mobile code Provide adjustable QoS parameters per data stream Offer different encryption algorithms 01 6 Introduction/1.2 Goals Scale in Distributed Systems Observation: Many developers of modern distributed system easily use the adjective scalable without making clear why their system actually scales. Scalability: At least three components: Number of users and/or processes (size scalability) Maximum distance between nodes (geographical scalability) Number of administrative domains (administrative scalability) Most systems account only, to a certain extent, for size scalability. The (non)solution: powerful servers. Today, the challenge lies in geographical and administrative scalability. 01 7 Introduction/1.2 Goals

Techniques for Scaling Hide communication latencies: Avoid waiting for responses; do something else: Make use of asynchronous communication Have separate handler for incoming response Problem: not every fits this model Distribution: Partition data and computations across multiple machines: Move computations to clients (Java applets) Decentralized naming services (DNS) Decentralized information systems (WWW) Replication/caching: Make copies of data available at different machines: Replicated file servers and databases Mirrored Web sites Web caches (in browsers and proxies) File caching (at server and client) 01 8 Introduction/1.2 Goals Scaling The Problem Observation: Applying scaling techniques is easy, except for one thing: Having multiple copies (cached or replicated), leads to inconsistencies: modifying one copy makes that copy different from the rest. Always keeping copies consistent and in a general way requires global synchronization on each modification. Global synchronization precludes large-scale solutions. Observation: If we can tolerate inconsistencies, we may reduce the need for global synchronization. Observation: Tolerating inconsistencies is dependent. 01 9 Introduction/1.2 Goals

Developing Distributed Systems: Pitfalls Observation: Many distributed systems are needlessly complex caused by mistakes that required patching later on. There are many false assumptions: The network is reliable The network is secure The network is homogeneous The topology does not change Latency is zero Bandwidth is infinite Transport cost is zero There is one administrator 01 10 Introduction/1.2 Goals Types of Distributed Systems Distributed Computing Systems Distributed Information Systems Distributed Pervasive Systems 01 11 Introduction/1.3 Types of Distributed Systems

Distributed Computing Systems (1/2) Observation: Many distributed systems are configured for High-Performance Computing: Cluster Computing: Essentially a group of high-end systems connected through a LAN: Homogeneous: same OS, near-identical hardware Single managing node Master node Compute node Compute node Compute node Management Parallel libs Component of parallel Component of parallel Component of parallel Local OS Local OS Local OS Local OS Remote access network Standard network High-speed network 01 12 Introduction/1.3 Types of Distributed Systems Distributed Computing Systems (2/2) Grid Computing: The next step: lots of nodes from everywhere: Heterogeneous Dispersed across several organizations Can easily span a wide-area network Note: To allow for collaborations, grids generally use virtual organizations. In essence, this is a grouping of users (or better: their IDs) that will allow for authorization on resource allocation. 01 13 Introduction/1.3 Types of Distributed Systems

Distributed Information Systems Observation: The vast amount of distributed systems in use today are forms of traditional information systems, that now integrate legacy systems. Example: Transaction processing systems. BEGIN TRANSACTION(server, transaction); READ(transaction, file-1, data); WRITE(transaction, file-2, data); newdata := MODIFIED(data); IF WRONG(newData) THEN ABORT TRANSACTION(transaction); ELSE WRITE(transaction, file-2, newdata); END TRANSACTION(transaction); END IF; Essential: All READ and WRITE operations are executed, i.e. their effects are made permanent at the execution of END TRANSACTION. Observation: Transactions form an atomic operation. 01 14 Introduction/1.3 Types of Distributed Systems Distributed Information Systems: Transactions Model: A transaction is a collection of operations on the state of an object (database, object composition, etc.) that satisfies the following properties (ACID): Atomicity: All operations either succeed, or all of them fail. When the transaction fails, the state of the object will remain unaffected by the transaction. Consistency: A transaction establishes a valid state transition. This does not exclude the possibility of invalid, intermediate states during the transaction s execution. Isolation: Concurrent transactions do not interfere with each other. It appears to each transaction T that other transactions occur either before T, or after T, but never both. Durability: After the execution of a transaction, its effects are made permanent: changes to the state survive failures. 01 15 Introduction/1.3 Types of Distributed Systems

Transaction Processing Monitor Observation: In many cases, the data involved in a transaction is distributed across several servers. A TP Monitor is responsible for coordinating the execution of a transaction: Reply Server Transaction Client Requests Reply TP monitor Request Request Reply Request Server Reply Server 01 16 Introduction/1.3 Types of Distributed Systems Distributed Information Systems: Enterprise Application Integration Problem: A TP monitor works fine for database s, but in many cases, the apps needed to be separated from the databases they were acting on. Instead, what was needed were facilities for direct communicationm between s: Client Client Communication middleware Server-side Server-side Server-side Remote Procedure Call (RPC) Message-Oriented Middleware (MOM) 01 17 Introduction/1.3 Types of Distributed Systems

Distributed Pervasive Systems Observation: There is a next-generation of distributed systems emerging in which the nodes are small, mobile, and often embedded as part of a larger system. Some requirements: Contextual change: The system is part of an environment in which changes should be immediately accounted for. Ad hoc composition: Each node may be used in a very different ways by different users. Requires ease-of-configuration. Sharing is the default: Nodes come and go, providing sharable services and information. Calls again for simplicity. Observation: Pervasiveness and distribution transparency may not always form a good match. 01 18 Introduction/1.3 Types of Distributed Systems Pervasive Systems: Examples Home Systems: Should be completely self-organizing: There should be no system administrator Provide a personal space for each of its users Simplest solution: a centralized home box? Electronic health systems: Devices are physically close to a person: Where and how should monitored data be stored? How can we prevent loss of crucial data? What is needed to generate and propagate alerts? How can security be enforced? How can physicians provide online feedback? Tilt sensor ECG sensor PDA Motion sensors Transmitter GPRS/UMTS External storage body-area network (a) body-area network (b) 01 19 Introduction/1.3 Types of Distributed Systems

Sensor networks Characteristics: The nodes to which sensors are attached are: Many (10s-1000s) Simple (i.e., hardly any memory, CPU power, or communication facilities) Often battery-powered (or even battery-less) Sensor networks as distributed systems: consider them from a database perspective: Operator's site Sensor network Sensor data is sent directly to operator (a) Operator's site Each sensor can process and store data Query Sensor network Sensors send only answers (b) 01 20 Introduction/1.3 Types of Distributed Systems