CONTENT-DISTRIBUTION NETWORKS

Similar documents
CSE 124: CONTENT-DISTRIBUTION NETWORKS. George Porter December 4, 2017

11/13/2018 CACHING, CONTENT-DISTRIBUTION NETWORKS, AND OVERLAY NETWORKS ATTRIBUTION

Server Selection Mechanism. Server Selection Policy. Content Distribution Network. Content Distribution Networks. Proactive content replication

Content Distribu-on Networks (CDNs)

Mul$media Networking. #9 CDN Solu$ons Semester Ganjil 2012 PTIIK Universitas Brawijaya

CSE 486/586 Distributed Systems

Distributed Systems. 21. Content Delivery Networks (CDN) Paul Krzyzanowski. Rutgers University. Fall 2018

CS November 2018

CS November 2017

Web, HTTP, Caching, CDNs

416 Distributed Systems. March 23, 2018 CDNs

Content Delivery on the Web: HTTP and CDNs

Introduction to Computer Networks

Content Distribution. Today. l Challenges of content delivery l Content distribution networks l CDN through an example

TAIL LATENCY AND PERFORMANCE AT SCALE

CSE 124: TAIL LATENCY AND PERFORMANCE AT SCALE. George Porter November 27, 2017

Files/News/Software Distribution on Demand. Replicated Internet Sites. Means of Content Distribution. Computer Networks 11/9/2009

CSE 124: LAMPORT AND VECTOR CLOCKS. George Porter October 30, 2017

How Akamai delivers your packets - the insight. Christian Kaufmann SwiNOG #21 11th Nov 2010

Drafting Behind Akamai (Travelocity-Based Detouring)

CSE 124: QUANTIFYING PERFORMANCE AT SCALE AND COURSE REVIEW. George Porter December 6, 2017

IP ADDRESSES, NAMING, AND DNS

CS 43: Computer Networks. 14: DHTs and CDNs October 3, 2018

CSE 123b Communications Software

Today s class. CSE 123b Communications Software. Telnet. Network File System (NFS) Quick descriptions of some other sample applications

Overview Computer Networking Lecture 16: Delivering Content: Peer to Peer and CDNs Peter Steenkiste

CS 43: Computer Networks BitTorrent & Content Distribution. Kevin Webb Swarthmore College September 28, 2017

BGP Case Studies. ISP Workshops

John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante

Outline Computer Networking. HTTP Basics (Review) How to Mark End of Message? (Review)

TIME ATTRIBUTION 11/4/2018. George Porter Nov 6 and 8, 2018

HTTP and Web Content Delivery

CSE 124: TIME SYNCHRONIZATION, CRISTIAN S ALGORITHM, BERKELEY ALGORITHM, NTP. George Porter October 27, 2017

PEER-TO-PEER NETWORKS, DHTS, AND CHORD

SaaS Providers. ThousandEyes for. Summary

PRIMARY-BACKUP REPLICATION

ThousandEyes for. Application Delivery White Paper

Naming and Content Distribution

CSC2231: DNS with DHTs

THE DATACENTER AS A COMPUTER AND COURSE REVIEW

From Internet Data Centers to Data Centers in the Cloud

EECS 122: Introduction to Computer Networks DNS and WWW. Internet Names & Addresses

What is the relationship between a domain name (e.g., youtube.com) and an IP address?

Internet Anycast: Performance, Problems and Potential

COMP6218: Content Caches. Prof Leslie Carr

Democratizing Content Publication with Coral

Democratizing Content Publication with Coral

Chapter 2 Application Layer

Peering with Akamai. CaribNOG November 2015 Belize City, Belize

OpenCache. A Platform for Efficient Video Delivery. Matthew Broadbent. 1 st Year PhD Student

The Design and Implementation of a Next Generation Name Service for the Internet (CoDoNS) Presented By: Kamalakar Kambhatla

TWO-PHASE COMMIT ATTRIBUTION 5/11/2018. George Porter May 9 and 11, 2018

CSEN 503 Introduction to Communication Networks

List of measurements in rural area

CSEN 404 Introduction to Networks. Mervat AbuElkheir Mohamed Abdelrazik. ** Slides are attributed to J. F. Kurose

Scaling Internet TV Content Delivery ALEX GUTARIN DIRECTOR OF ENGINEERING, NETFLIX

Peer-to-peer computing research a fad?

Computer Networks. Wenzhong Li. Nanjing University

Venugopal Ramasubramanian Emin Gün Sirer SIGCOMM 04

The World Wide Web. EE 122: Intro to Communication Networks. Fall 2006 (MW 4-5:30 in Donner 155) Vern Paxson TAs: Dilip Antony Joseph and Sukun Kim

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. Caching, Content Distribution and Load Balancing

THE AUTHORITATIVE GUIDE TO DNS TERMINOLOGY

CE693: Adv. Computer Networking

Finding a needle in Haystack: Facebook's photo storage

Last Class: Consistency Models. Today: Implementation Issues

Sharding & CDNs. CS 475, Spring 2018 Concurrent & Distributed Systems

An Introduction to Overlay Networks PlanetLab: A Virtual Overlay Network Testbed

Scaling Without Sharding. Baron Schwartz Percona Inc Surge 2010

A Tale of Three CDNs

Internet Services & Protocols

CSE 124: OPTIONS, SIGNALS, TIMEOUTS, AND CONCURRENCY. George Porter Oct 13, 2017

Content Distribution Networks

Distributed Systems Principles and Paradigms. Chapter 12: Distributed Web-Based Systems

Announcements. The World Wide Web. Retrieving From the Server. Goals of Today s Lecture. The World Wide Web. POP3 Protocol

Partitioning the Internet

Authors: Rupa Krishnan, Harsha V. Madhyastha, Sridhar Srinivasan, Sushant Jain, Arvind Krishnamurthy, Thomas Anderson, Jie Gao

Domain Name System.

CompSci 356: Computer Network Architectures Lecture 21: Overlay Networks Chap 9.4. Xiaowei Yang

Today: World Wide Web! Traditional Web-Based Systems!

Control and Adapt Mechanisms Drive Successful Video Delivery Within Global Enterprise Networks

DNS and Modern Network Services. Amin Vahdat CSE 123b April 27, 2006

Service Provider Multihoming

LatLong: Diagnosing Wide-Area Latency Changes for CDNs

VirtuLocity VLNCloud Software Acceleration Service Virtualized acceleration wherever and whenever you need it

End-user mapping: Next-Generation Request Routing for Content Delivery

CSC 401 Data and Computer Communications Networks

Finding a Needle in a Haystack. Facebook s Photo Storage Jack Hartner

Ossification of the Internet

Factors Affecting Performance of Web Flows in Cellular Networks

Outline Applications. Central Server Hierarchical Peer-to-peer. 31-Jan-02 Ubiquitous Computing 1

ELEC / COMP 177 Fall Some slides from Kurose and Ross, Computer Networking, 5 th Edition

Web caches (proxy server) Applications (part 3) Applications (part 3) Caching example (1) More about Web caching

Cisco Videoscape Distribution Suite Service Broker

An Overlay Architecture for End-to-End Internet Service Availability

DMAP : Global Name Resolution Services Through Direct Mapping

Optimal Cache Allocation for Content-Centric Networking

CSE 124: IP ADDRESSES, NAMING, AND DNS. George Porter Oct 4, 2017

Overlay and P2P Networks. Applications. Prof. Sasu Tarkoma and

CSE 124: THE DATACENTER AS A COMPUTER. George Porter November 20 and 22, 2017

Multipath Transport, Resource Pooling, and implications for Routing

Transcription:

CONTENT-DISTRIBUTION NETWORKS George Porter June 1, 2018 ATTRIBUTION These slides are released under an Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) Creative Commons license These slides incorporate material from: Kyle Jamieson, Princeton University (also under a CC BY-NC-SA 3.0 Creative Commons license) 1

ANNOUNCEMENTS OUTLINE 1. Web caching 2. Content-distribution networks Featuring Overlay networks 2

WEB CACHING Many clients transfer the same information Generates redundant server and network load Also, clients may experience high latency Origin server ISP-1 Backbone ISP ISP-2 Clients WHY WEB CACHING? Motivation for placing content closer to client: User gets better response time Content providers get happier users Network gets reduced load Why does caching work? Exploits locality of reference How well does caching work? Very well, up to a limit Large overlap in content But many unique requests 3

CACHING WITH REVERSE PROXIES Cache data close to origin server decrease server load Typically done by content providers Client thinks it is talking to the origin server (the server with content) Does not work for dynamic content Reverse proxies Origin server Backbone ISP ISP-1 ISP-2 Clients CACHING WITH FORWARD PROXIES Cache close to clients less network traffic, less latency Typically done by ISPs or corporate LANs Client configured to send requests to forward proxy Reduces traffic on ISP-1 s access link, origin server, and backbone ISP Reverse proxies Origin server Backbone ISP Forward proxies ISP-1 ISP-2 Clients 4

CACHING & LOAD-BALANCING: OUTSTANDING PROBLEMS Problem ca. 2002: How to reliably deliver large amounts of content to users worldwide? Popular event: Flash crowds overwhelm (replicated) web server, access link, or back-end database infrastructure More rich content: audio, video, photos Web caching: Diversity causes low cache hit rates (25 40%) OUTLINE 1. Web caching 2. Content-distribution networks Featuring Overlay networks 5

CONTENT DISTRIBUTION NETWORKS Proactive content replication Content provider (e.g. CNN) pushes content out from its own origin server CDN replicates the content On many servers spread throughout the Internet Updating the replicas Updates pushed to replicas when the content changes CDN server in S. America Origin server in N. America CDN distribution node CDN server in Europe CDN server in Asia REPLICA SELECTION: GOALS Live server For availability Lowest load To balance load across the servers Closest Nearest geographically, or in round-trip time Best performance Throughput, latency, reliability Requires continuous monitoring of liveness, load, and performance 6

AKAMAI STATISTICS Distributed servers Servers: ~100,000 Networks: ~1,000 Countries: ~70 Many customers Apple, BBC, FOX, GM IBM, MTV, NASA, NBC, NFL, NPR, Puma, Red Bull, Rutgers, SAP, Client requests 20+M per second Half in the top 45 networks 20% of all Web traffic worldwide HOW AKAMAI USES DNS cnn.com (content provider) DNS TLD server GET index. cache.cnn.com/foo.jpg html 1 2 global regional End user Nearby 7

HOW AKAMAI USES DNS cnn.com (content provider) DNS TLD server 1 2 DNS lookup cache.cnn.com 3 4 ALIAS: g.akamai.net global regional End user Nearby HOW AKAMAI USES DNS cnn.com (content provider) DNS TLD server DNS lookup g.akamai.net 1 2 3 5 4 6 ALIAS a73.g.akamai.net global regional End user Nearby 8

HOW AKAMAI USES DNS cnn.com (content provider) DNS TLD server 1 2 3 5 4 6 7 global regional 8 End user Address 1.2.3.4 Nearby HOW AKAMAI USES DNS cnn.com (content provider) DNS TLD server 1 2 3 5 4 6 7 8 global regional End user GET /foo.jpg Host: cache.cnn.com 9 Nearby 9

HOW AKAMAI USES DNS cnn.com (content provider) GET foo.jpg DNS TLD server 1 2 11 12 5 3 4 6 7 8 global regional End user GET /foo.jpg Host: cache.cnn.com 9 Nearby HOW AKAMAI USES DNS cnn.com (content provider) DNS TLD server 1 2 11 12 5 3 4 6 7 8 global regional End user 9 10 Nearby 10

HOW AKAMAI WORKS: CACHE HIT cnn.com (content provider) DNS TLD server 1 2 3 4 global regional End user 5 6 Nearby MAPPING SYSTEM Equivalence classes of IP addresses IP addresses experiencing similar performance Quantify how well they connect to each other Collect and combine measurements Ping, traceroute, BGP routes, server logs e.g., over 100 TB of logs per days Network latency, loss, throughput, and connectivity 11

ROUTING CLIENT REQUESTS WITH THE MAP Map each IP class to a preferred server Based on performance, health, etc. Updated roughly every minute Short, 60-sec DNS TTLs in regional DNS accomplish this Map client request to a server in the Load balancer selects a specific server e.g., to maximize the cache hit rate ADAPTING TO FAILURES Failing hard drive on a server Suspends after finishing in progress requests Failed server Another server takes over for the IP address Low-level map updated quickly (load balancer) Failed, or network path High-level map updated quickly (ping/traceroute) 12

TAKE-AWAY POINTS: CDNS Content distribution is hard Many, diverse, changing objects Clients distributed all over the world Moving content to the client is key Reduces latency, improves throughput, reliability Content distribution solutions evolved: Load balancing, reactive caching, to Proactive content distribution networks OUTLINE 1. Web caching 2. Content-distribution networks Featuring Overlay networks 13

ABSTRACTIONS AND OVERLAYS Overlay Network Physical Network OVERLAYS FOR ROUTING: WHY? Triangle inequality doesn t hold in networks! 14

FORWARDING TRAFFIC THROUGH TUNNELING OVERLAY NETWORKS FOR ROUTING SF UK SD AUS Underlying network Internet connectivity (IP Routing) 15

OVERLAY NETWORKS SF UK SD AUS Potential overlay connectivity SF as root OVERLAY NETWORKS SF 50 UK 10 150 250 100 SD 300 AUS Determine edge weights E.g., bandwidth, latency 16

OVERLAY NETWORKS SF 50 UK 10 100 SD AUS Build overlay connectivity An application-layer distribution tree 17