Distributed Systems are Everywhere!" CS162 Operating Systems and Systems Programming Lecture 22 Client-Server" Client-Server" Message Passing"

Similar documents
Distributed Systems are Everywhere!" CS162 Operating Systems and Systems Programming Lecture 22 Client-Server" Client-Server" Message Passing"

EECS 122: Introduction to Computer Networks DNS and WWW. Internet Names & Addresses

CS 194: Distributed Systems WWW and Web Services

EE 122: HyperText Transfer Protocol (HTTP)

Page 1. Goals for Today" Remote Procedure Call" Raw messaging is a bit too low-level for programming! RPC Details"

0 0& Basic Background. Now let s get into how things really work!

CS 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring Lecture 22: Remote Procedure Call (RPC)

DISTRIBUTED COMPUTER SYSTEMS

CHAPTER - 4 REMOTE COMMUNICATION

CS 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring Lecture 21: Network Protocols (and 2 Phase Commit)

Today CSCI Communication. Communication in Distributed Systems. Communication in Distributed Systems. Remote Procedure Calls (RPC)

MODELS OF DISTRIBUTED SYSTEMS

Content Delivery on the Web: HTTP and CDNs

The World Wide Web. EE 122: Intro to Communication Networks. Fall 2006 (MW 4-5:30 in Donner 155) Vern Paxson TAs: Dilip Antony Joseph and Sukun Kim

Announcements. The World Wide Web. Retrieving From the Server. Goals of Today s Lecture. The World Wide Web. POP3 Protocol

MODELS OF DISTRIBUTED SYSTEMS

Internet Content Distribution

CS603: Distributed Systems

Remote Procedure Call

Remote Procedure Calls

Remote Invocation. Today. Next time. l Overlay networks and P2P. l Request-reply, RPC, RMI

416 Distributed Systems. RPC Day 2 Jan 12, 2018

CSCI-1680 WWW Rodrigo Fonseca

Communication. Distributed Systems Santa Clara University 2016

416 Distributed Systems. RPC Day 2 Jan 11, 2017

Application Protocols and HTTP

CprE Fault Tolerance. Dr. Yong Guan. Department of Electrical and Computer Engineering & Information Assurance Center Iowa State University

COMMUNICATION IN DISTRIBUTED SYSTEMS

Networked Systems and Services, Fall 2018 Chapter 4. Jussi Kangasharju Markku Kojo Lea Kutvonen

Computer Networks. Wenzhong Li. Nanjing University

Announcements. The World Wide Web. Goals of Today s Lecture. The World Wide Web HTML. Web Components

Remote Procedure Calls CS 707

Lecture 8: February 17

CS454/654 Midterm Exam Fall 2004

Remote Procedure Call. Tom Anderson

Outline. EEC-681/781 Distributed Computing Systems. The OSI Network Architecture. Inter-Process Communications (IPC) Lecture 4

CSCI-1680 WWW Rodrigo Fonseca

CS162 Operating Systems and Systems Programming Lecture 24. Network Communication Abstractions / Distributed Programming

Remote Invocation. Today. Next time. l Indirect communication. l Request-reply, RPC, RMI

Page 1. Goals for Today" Remote Procedure Call" Distributed Systems Message Passing" CS162 Operating Systems and Systems Programming Lecture 23

Internet Technology. 06. Exam 1 Review Paul Krzyzanowski. Rutgers University. Spring 2016

CS162 Operating Systems and Systems Programming Lecture 24. Distributed File Systems. Review: Distributed Applications

CS162 Operating Systems and Systems Programming Lecture 23

CSci Introduction to Distributed Systems. Communication: RPC

Fault Tolerance Part I. CS403/534 Distributed Systems Erkay Savas Sabanci University

Introduc)on to Computer Networks

Chapter 8 Fault Tolerance

Lecture 8: February 19

Distributed Systems Theory 4. Remote Procedure Call. October 17, 2008

CS162 Operating Systems and Systems Programming Lecture 23. Network Communication Abstractions / Distributed Programming

RMI & RPC. CS 475, Spring 2019 Concurrent & Distributed Systems

Internet Technology 3/2/2016

INTERNET ENGINEERING. HTTP Protocol. Sadegh Aliakbary

CS162 Operating Systems and Systems Programming Lecture 23. Network Communication Abstractions / Distributed Programming

Module 3 - Distributed Objects & Remote Invocation

Module 3 Accessing Remote Resources CS655! 3-1!

Remote Procedure Call (RPC) and Transparency

Application Level Protocols

CCNA Exploration Network Fundamentals. Chapter 03 Application Functionality and Protocols

CCNA Exploration1 Chapter 3: Application Layer Functionality and Protocols

RMI: Design & Implementation

Application Layer: The Web and HTTP Sec 2.2 Prof Lina Battestilli Fall 2017

COP 5611 Operating Systems Spring Dan C. Marinescu Office: HEC 439 B Office hours: M-Wd 2:00-3:00 PM

Assignment #2. Csci4211 Spring Due on March 6th, Notes: There are five questions in this assignment. Each question has 10 points.

Overview Content Delivery Computer Networking Lecture 15: The Web Peter Steenkiste. Fall 2016

CMPE 151: Network Administration. Servers

Network. CS Advanced Operating Systems Structures and Implementation Lecture 13. File Systems (Con t) RAID/Journaling/VFS.

REMOTE PROCEDURE CALLS EE324

CS 268: Internet Architecture & E2E Arguments. Today s Agenda. Scott Shenker and Ion Stoica (Fall, 2010) Design goals.

Distributed Objects and Remote Invocation. Programming Models for Distributed Applications

Communication. Overview

Distributed Systems COMP 212. Lecture 8 Othon Michail

Advanced Distributed Systems

Distributed Systems Exam 1 Review Paul Krzyzanowski. Rutgers University. Fall 2016

Fault Tolerance. Distributed Systems. September 2002

Distributed Systems Exam 1 Review. Paul Krzyzanowski. Rutgers University. Fall 2016

SAI/ST course Distributed Systems

Communication in Distributed Systems

Web, HTTP, Caching, CDNs

WWW, REST, and Web Services

Distributed Systems 8. Remote Procedure Calls

Networked Applications: Sockets. Goals of Todayʼs Lecture. End System: Computer on the ʻNet. Client-server paradigm End systems Clients and servers

02 - Distributed Systems

WEB TECHNOLOGIES CHAPTER 1

Distributed systems. Lecture 6: distributed transactions, elections, consensus and replication. Malte Schwarzkopf

Distributed Systems. Lecture 06 Remote Procedure Calls Thursday, September 13 th, 2018

Network Protocols. Sarah Diesburg Operating Systems CS 3430

May Gerd Liefländer System Architecture Group Universität Karlsruhe (TH), Systemarchitektur

Application Layer Introduction; HTTP; FTP

Notes beforehand... For more details: See the (online) presentation program.

Today: Fault Tolerance. Replica Management

Distributed Information Processing

EEC-682/782 Computer Networks I

Today: Distributed Objects. Distributed Objects

Chapter 10: Application Layer CCENT Routing and Switching Introduction to Networks v6.0

Communication. Distributed Systems IT332

Structured communication (Remote invocation)

CMSC 332 Computer Networking Web and FTP

Page 1. Goals for Today" Discussion" Example: Reliable File Transfer" CS162 Operating Systems and Systems Programming Lecture 11

1/10/16. RPC and Clocks. Tom Anderson. Last Time. Synchroniza>on RPC. Lab 1 RPC

Transcription:

CS162 Operating Systems and Systems Programming Lecture 22 Client- April 18, 2011! Ion Stoica! http://inst.eecs.berkeley.edu/~cs162! Distributed Systems are Everywhere!" We need (want?) to share physical devices (e.g., printers) and information (e.g., files)! Many applications are distributed in nature (e.g., ATM machines, airline reservations)! Many large problems can be solved by decomposing smaller problems that run in parallel (e.g., MapReduce, SETI@home)! Next three lectures: go over three distributed system models! Client-server! Peer-to-peer! Cloud(cluster) computing! Lec 22.2! Client- One or more clients interacting with one or more servers providing a service, e.g.,! Web! E-mail, chat! Printer! Airline reservation! On-line shopping! Store/streaming video, audio, and/or photos!! In this lecture! End-to-end message communication! Remote Procedure Calls! Two phase commit transactions! World Wide Web! Lec 22.3! Message Passing" Process A (e.g., client) sends a packet to process B (e.g., server)! Proc. A (port 10) Internet 16.25.31.10 128.15.11.12 Proc. B (port 7) Lec 22.4! Page 1

Proc. A (port 10) data Message Passing Details" Proc. A (port 10) Internet Proc. B (port 7) 16.25.31.10 128.15.11.12 Proc. B (port 7) Transport Network Datalink data data 10 7 10 7 16.25.31.10 128.15.11.12 data data 10 7 10 7 16.25.31.10 128.15.11.12 Transport Network Datalink Physical Physical 16.25.31.10 Internet 128.15.11.12 data From Message Passing to Remote Procedure Call" Raw messaging is a bit too low-level for programming! Another option: Remote Procedure Call (RPC)! Looks like a local procedure call on client! Translated automatically into a procedure call on remote machine (server) Implementation:! Uses request/response message passing under the covers! Lec 22.5! Lec 22.6! Example: Local Procedure Call" Remote Procedure Call" Machine" Process" n = sum(4, 7);" sum(i, j)" int i, j;" {" return (i+j);" }" Transparently invoke a procedure (services) implemented in a different address space either on the same machine or a different machine! Services can be run wherever itʼs most appropriate! Access to local and remote services looks the same! Challenges:! Argument (parameter) passing, potentially across different architectures! Discover where the service is located! Handle failures transparently" Lec 22.7! Lec 22.8! Page 2

RPC: Argument Passing" Client and server use stubs to glue pieces together! Client-side stub is responsible for marshalling arguments and unmarshalling the return values! Server-side stub is responsible for unmarshalling arguments and marshalling the return values! Marshalling involves (depending on system) converting values to a canonical form, serializing objects, copying arguments passed by reference, etc.! Needs to account for cross-language and cross-platform issues! Technique: compiler generated stubs! Input: interface definition language (IDL)!» Contains, among other things, types of arguments/return! Output: stub code in the appropriate source language! Machine A" Machine B" Client" (caller)" (callee)" RPC Information Flow" call" return" return" call" bundle" args" Client" Stub" unbundle" ret vals" bundle" ret vals" Stub" unbundle" args" send" receive" send" receive" Packet" Handler" Network" Network" Packet" Handler" Lec 22.9! Lec 22.10! Example: Remote Procedure Call" Client and Server Stubs" Stubs " Principle of RPC between a client and server program.! Client" Process" Process" n = sum(4, 7);" message" sum" 4" 7" message" sum" 4" 7" sum(i, j)" int i, j;" {" return (i+j);" }" OS" OS" Lec 22.11! Lec 22.12! Page 3

Encoding" Server and client may encode arguments differently, e.g.,! Big-endian: store from most-to-least significant byte! Little-endian: store from least-to-most significant byte! Parameter Specification and Stub Generation" a) A procedure! b) The corresponding message.! a) Original message on x86 (e.g., little endian)! b) The message after receipt on the SPARC (e.g., big endian)! c) The message after being inverted. (The little numbers in boxes indicate the address of each byte)! Lec 22.13! Lec 22.14! Service Discovery: RPC Binding" How does client know which machine to send RPC?! Need to translate name of remote service into network endpoint (e.g., host:port)! Binding: the process of converting a user-visible name into a network endpoint!» Static: fixed at compile time!» Dynamic: performed at runtime! Example of RPC Binding" Distributed Computing Environment (DCE) framework! DCE daemon:! Allow local services to record their services locally! Resolve service name to local end-point (i.e., port)! Directory machine: resolve service name to DCE daemon (host:portʼ) on machine running the service! Dynamic Binding! Most RPC systems use dynamic binding via name service! Why dynamic binding?!» Access control: check who is permitted to access service!» Fail-over: If server fails, use a different one! Lec 22.15! Lec 22.16! Page 4

RPC Semantics in the Presence of Failures" The client is unable to locate the server! The request message from the client to server is lost! The reply message from the server is lost! The server crashes after receiving a request! Client is Unable to Locate Causes: server down, different version of server binary,! Fixes! Return (-1) to indicate failure (in Unix use errno to indicate failure type)!» What if (-1) is a legal return value?! Use exceptions!» Transparency is lost! The client crashes after sending a request! Lec 22.17! Lec 22.18! Lost Request Message" Lost Reply Message" Easiest to deal with! Just retransmit the message!! If multiple message are lost then! client is unable to locate server error! Far more difficult to deal with: client doesnʼt know what happened at server! Did server execute the procedure or not?! Possible fixes! Retransmit the request!» Only works if operation is idempotent: itʼs fine to execute it twice! What if operation not idempotent?!» Assign unique sequence numbers to every request! Lec 22.19! Lec 22.20! Page 5

Server Crashes" Client Crashes" Three cases! Crash after execution! Crash before execution! Crash during the execution! Three possible semantics! At least once semantics!» Client keeps trying until it gets a reply! At most once semantics!» Client gives up on failure! Exactly once semantics!» Can this be correctly implemented?! Letʼs the server computation orphan" Orphans can! Waste CPU cycles! Lock files! Client reboots and it gets the old reply immediately! Lec 22.21! Lec 22.22! Client Crashes: Possible Solutions" Extermination:! Client keeps a log, reads it when reboots, and kills the orphan! Disadvantage: high overhead to maintain the log! Reincarnation:! Divide times in epochs! Client broadcasts epoch when reboots! Upon hearing a new epoch servers kills the orphans! Disadvantage: doesnʼt solve problem when network partitioned! Expiration:! Each RPC is given a lease T to finish computation! If it does not, it needs to ask for another lease! If client reboots after T sec all orphans are gone! Problem: what is a good value of T?! RPC Semantics: Discussion" The original goal: provide the same semantics as a local call! Impossible to achieve in a distributed system! Dealing with remote failures fundamentally affects transparency! Ideal interface: balance the easy of use with making visible the errors to users! Lec 22.23! Lec 22.24! Page 6

Asynchronous RPC (1)" Asynchronous RPC (2)" A client and server interacting through two asynchronous RPCs! 2-12 a) The interconnection between client and server in a traditional RPC! b) The interaction using asynchronous RPC! Lec 22.25! Lec 22.26! Decisions in the Presence of Failures" How does the client know for sure whether a server has performed an operation on the behalf of the client or not?! General solution: two phase commit:! Ensure two or more parties come at the same same decision even if one or more machines fail! Only fail stop failures; cannot handle arbitrary failures (e.g., malicious nodes or communication networks). Why?! Two-Phase Commit Protocol! One node plays the role of coordinator! Phase 1, coordinator sends out a request to commit!» each participant responds with yes or no! Phase 2!» If everyone says yes, coordinator sends out a commit!» If someone says no, coordinator sends out an abort! Two-Phase Commit Details" Each participant uses a local, persistent, corrupt-free log to keep track of whether a commit has happened! If a machine crashes, when it wakes up it first checks its log to recover state of world at time of crash! Log can be used to complete this process such that all machines either commit or donʼt commit! Timeouts can be used to retry if coordinator doesnʼt hear from all participants! Lec 22.27! Lec 22.28! Page 7

Phase 1! Phase 2! Two-Phase Commit Example" Simple Example: A Client, B Bank! A wants to make a deposit and then wants to make sure that the bank has indeed updated the balance! A plays the coordinator role in this case! A! Begin transaction! Decision to Commit! End commit! Update balance,! Query to commit! Agree to Commit! Commit! Committed! B! Update balance! Ready to commit! Commit! Phase 1! Two-Phase Commit Example (contʼd)" Begin transaction! Timeout! A! B! Update balance,! Query to commit! Update balance,! Query to commit! What if B crashes at beginning?! Wakes up, does nothing; A will timeout and retry! Lec 22.29! Lec 22.30! Phase 1! Phase 2! Two-Phase Commit Example (contʼd)" Begin transaction! Decision to Commit! Abort! A! B! Update balance,! Query to commit! Agree to Commit! Abort! Update balance! Ready to commit! Abort! Phase 1! Phase 2! Two-Phase Commit Example (contʼd)" Begin transaction! Decision to Commit! End commit! Timeout! A! Update balance,! Query to commit! Agree to Commit! Commit! Commit! Committed! B! Update balance! Ready to commit! Commit! What if A crashes at beginning of phase 2?! Wakes up, sees that there is a transaction in progress; sends Abort to B! What if B crashes at beginning of phase 2?! B comes back up, looks at log; when A (re)sends it Commit message, it will say, oh, ok, commit! Lec 22.31! Lec 22.32! Page 8

Two-Phase Commit Gotchas" Undesirable feature of Two-Phase Commit: blocking! One machine can be stalled until another site recovers, e.g.,!» Coordinator crashes everyone needs to wait for it to come back;! A blocked site holds resources (locks on updated items, pages pinned in memory, etc) until learns fate of update! Alternatives such as Three Phase Commit donʼt have this blocking problem! Vannevar Bush (1890-1974) " The Web History (I)" 1945: Vannevar Bush, Memex:! "a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility" (See http://www.iath.virginia.edu/elab/hfl0051.html)" Lec 22.33! 4/18! Memex" Ion Stoica CS162 UCB Spring 2011! Lec 22.34! The Web History (II)" The Web History (III)" Ted Nelson 1967, Ted Nelson, Xanadu:! A world-wide publishing network that would allow information to be stored not as separate files but as connected literature! Owners of documents would be automatically paid via electronic means for the virtual copying of their documents! Coined the term Hypertext! Tim Berners-Lee" World Wide Web (WWW): a distributed database of pages linked through Hypertext Transport Protocol (HTTP)! First HTTP implementation - 1990!» Tim Berners-Lee at CERN! HTTP/0.9 1991!» Simple GET command for the Web! HTTP/1.0 1992!» Client/Server information, simple caching! HTTP/1.1-1996! Lec 22.35! Lec 22.36! Page 9

The Web" Core components:! Servers: store files and execute remote commands! Browsers: retrieve and display pages! Uniform Resource Locators (URLs): way to refer to pages! A protocol to transfer information between clients and servers! HTTP! Uniform Record Locator (URL)" protocol://host-name:port/directory-path/resource Extend the idea of hierarchical namespaces to include anything in a file system! ftp://www.cs.berkeley.edu/~istoica/pubs.html! Extend to program executions as well! http://us.f413.mail.yahoo.com/ym/showletter?box=%40b %40Bulk&MsgId=2604_1744106_29699_1123_1261_0_28917_3552_ 1289957100&Search=&Nhead=f&YY=31454&order=down&sort=date &pos=0&view=a&head=b! Server side processing can be incorporated in the name! Lec 22.37! Lec 22.38! Hyper Text Transfer Protocol (HTTP)" Big Picture" Client-server architecture! Synchronous request/reply protocol! Runs over TCP, Port 80! Stateless! Client" Establish" connection" Client " request" TCP Syn" TCP syn + ack " TCP ack + HTTP GET" Request" response" Close" connection" Lec 22.39! Lec 22.40! Page 10

Hyper Text Transfer Protocol Commands" GET transfer resource from given URL! HEAD GET resource metadata (headers) only! PUT store/modify resource under given URL! 1x informational! 2x success! 3x redirection! 4x client error in request! Response Codes" 5x server error; canʼt satisfy the request! DELETE remove resource! POST provide input for a process identified by the given URL (usually used to post CGI parameters)! Lec 22.41! Lec 22.42! Client Request" Server Response" Steps to get the resource: http://www.eecs.berkeley.edu/index.html! 1. Use DNS to obtain the IP address of www.eecs.berkeley.edu! 2. Send to an HTTP request:! GET /index.html HTTP/1.0 HTTP/1.0 200 OK Content-Type: text/html Content-Length: 1234 Last-Modified: Mon, 19 Nov 2001 15:31:20 GMT <HTML> <HEAD> <TITLE>EECS Home Page</TITLE> </HEAD> </BODY> </HTML> Lec 22.43! Lec 22.44! Page 11

Client" HTTP/1.0 Example" HTTP/1.0 Performance" Create a new TCP connection for each resource! Large number of embedded objects in a web page! Many short lived connections! Request image 1" Transfer image 1" Request image 2" Transfer image 2" TCP transfer! Too slow for small object! It takes time to ramp-up (i.e., exit slow-start phase)! Finish display" page " Request text" Transfer text" Connections may be set up in parallel (5 is default in most browsers)! Lec 22.45! Lec 22.46! HTTP/1.0 Caching Support" A modifier to the GET request:! If-modified-since return a not modified response if resource was not modified since specified time! A response header:! Expires specify to the client for how long it is safe to cache the resource! A request directive:! No-cache ignore all caches and get resource directly from server! These features can be best taken advantage of with HTTP proxies! Locality of reference increases if many clients share a proxy! HTTP/1.1 (1996)" Performance:! Persistent connections! Pipelined requests/responses!! Efficient caching support! Network Cache assumed more explicitly in the design! Gives more control to the server on how it wants data cached! Support for virtual hosting! Allows to run multiple web servers on the same machine! Lec 22.47! Lec 22.48! Page 12

Persistent Connections" Pipelined Requests/Responses" Allow multiple transfers over one connection! Avoid multiple TCP connection setups! Avoid multiple TCP slow starts (i.e., TCP ramp ups)! Buffer requests and responses to reduce the number of packets! Multiple requests can be contained in one TCP segment! Note: order of responses has to be maintained! Client" Request 1" Request 2" Request 3" Transfer 1" Transfer 2" Transfer 3" Lec 22.49! Lec 22.50! Caching and Replication" Base-line " Problem: You are a web content provider! How do you handle millions of web clients?! How do you ensure that all clients experience good performance?! How do you maintain availability in the presence of server and network failures?! Many clients transfer same information! Generate unnecessary server and network load! Clients experience unnecessary latency! Solutions:! Add more servers at different locations If you are CNN this might work!! Caching! Content Distribution Networks (Replication)! Clients" ISP-1" Backbone ISP" ISP-2" Lec 22.51! Lec 22.52! Page 13

Reverse Caches" Cache documents close to server decrease server load! Typically done by content providers! Forward Proxies" Cache documents close to clients reduce network traffic and decrease latency! Typically done by ISPs or corporate LANs! Reverse caches" Reverse caches" Backbone ISP" Backbone ISP" ISP-1" ISP-2" Forward caches" ISP-1" ISP-2" Clients" Clients" Lec 22.53! Lec 22.54! Content Distribution Networks (CDNs)" Integrate forward and reverse caching functionalities into one overlay network (usually) administrated by one entity! Example: Akamai! Documents are cached both! As a result of clientsʼ requests (pull)! Pushed in the expectation of a high access rate! Beside caching do processing, e.g.,! Handle dynamic web pages! Transcoding!!!!! Example: Akamai" Akamai creates new domain names for each client content provider.! e.g., a128.g.akamai.net! The CDNʼs DNS servers are authoritative for the new domains! The client content provider modifies its content so that embedded URLs reference the new domains.! Akamaize content, e.g.: http://www.cnn.com/image-of-theday.gif becomes http://a128.g.akamai.net/image-of-the-day.gif.! Lec 22.55! 56 Lec 22.56! Page 14

Example: Akamai" Summary" DNS server for nhc.noaa.gov get http://www.nhc.noaa.gov www.nhc.noaa.gov Akamaizes its content. lookup a128.g.akamai.net local DNS server akamai.net DNS servers Akamaized response object has inline URLs for secondary content at a128.g.akamai.net and other Akamaimanaged DNS names. a Akamai servers store/ cache secondary content for Akamaized services. b c Remote Procedure Call (RPC): Call procedure (service) on remote machine! Provides same interface as local procedure call! Automatic packing and unpacking of arguments without user programming (in stub)! Two-phase commit: distributed decision making! First, make sure everyone guarantees that everyone is ready to commit! Next, ask everyone to commit! Hypertext Transport Protocol: request-response! Use DNS to locate server! HTTP 1.1 vs. 1.0: added support for persistent connections and pipeline to improve performance! Caching: key to increase scalability! Lec 22.57! Lec 22.58! Page 15