P2P content distribution

Similar documents
BitTorrent and CoolStreaming

P2P content distribution Jukka K. Nurminen

Peer-to-peer. T Applications and Services in Internet, Fall Jukka K. Nurminen. 1 V1-Filename.ppt / / Jukka K.

Peer-to-peer & Energy Consumption

P2P Applications. Reti di Elaboratori Corso di Laurea in Informatica Università degli Studi di Roma La Sapienza Canale A-L Prof.ssa Chiara Petrioli

P2P Applications. Reti di Elaboratori Corso di Laurea in Informatica Università degli Studi di Roma La Sapienza

Extreme Computing. BitTorrent and incentive-based overlay networks.

Peer to Peer Systems and Probabilistic Protocols

internet technologies and standards

P2P Networks - General

Octoshape. Commercial hosting not cable to home, founded 2003

CS 3516: Advanced Computer Networks

Peer-to-Peer Applications Reading: 9.4

Lecture 17: Peer-to-Peer System and BitTorrent

Lesson 9 Applications of DHT: Bittorrent Mainline DHT, the KAD network

BitTorrent. Masood Khosroshahy. July Tech. Report. Copyright 2009 Masood Khosroshahy, All rights reserved.

Scalability of the BitTorrent P2P Application

Peer-to-Peer (P2P) Architectures

Peer-to-Peer Streaming Systems. Behzad Akbari

Content Distribution and BitTorrent [Based on slides by Cosmin Arad]

Content Search. Unstructured P2P. Jukka K. Nurminen

CompSci 356: Computer Network Architectures Lecture 21: Overlay Networks Chap 9.4. Xiaowei Yang

CSE 486/586 Distributed Systems Peer-to-Peer Architectures

Introduction to P P Networks

Mobile Peer-to-Peer Business Models T Network Services Business Models. Mikko Heikkinen

A Measurement Study of Attacks on BitTorrent Seeds

Lecture 8: Application Layer P2P Applications and DHTs

Live P2P Streaming with Scalable Video Coding and Network Coding

EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Overlay Networks: Motivations

Azureus Plugin for Facebook Integration

Heterogeneous Gossip. Davide Frey Rachid Guerraoui Anne-Marie Kermarrec Boris Koldehofe Maxime Monod Martin Mogensen Vivien Quéma

EE 122: Peer-to-Peer Networks

Distributed Systems. peer-to-peer Johan Montelius ID2201. Distributed Systems ID2201

Distributed Data Management. Christoph Lofi Institut für Informationssysteme Technische Universität Braunschweig

Broadcast Routing. Chapter 5 Multicast and P2P. In-network Duplication. Spanning Tree

Small Is Not Always Beautiful

Overlay Networks: Motivations. EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Motivations (cont d) Goals.

CS5412: TORRENTS AND TIT-FOR-TAT

Taming the Flood: How I Learned to Stop Worrying and Love the Swarm Yogesh Vedpathak Cleversafe

Peer-to-peer systems and overlay networks

Department of Computer Science Institute for System Architecture, Chair for Computer Networks. File Sharing

Last Lecture SMTP. SUNY at Buffalo; CSE 489/589 Modern Networking Concepts; Fall 2010; Instructor: Hung Q. Ngo 1

Overview Computer Networking Lecture 16: Delivering Content: Peer to Peer and CDNs Peter Steenkiste

Page 1. How Did it Start?" Model" Main Challenge" CS162 Operating Systems and Systems Programming Lecture 24. Peer-to-Peer Networks"

Application-Layer Protocols Peer-to-Peer Systems, Media Streaming & Content Delivery Networks

Peer-to-Peer Systems. Internet Computing Workshop Tom Chothia

SECURED SOCIAL TUBE FOR VIDEO SHARING IN OSN SYSTEM

Spotify Behind the Scenes

CSCI-1680 P2P Rodrigo Fonseca

Content Overlays (continued) Nick Feamster CS 7260 March 26, 2007

P2P Content Distribution

Merging the best of HTTP and P2P

Push-Pull Two-layer Super-Peer based P2P Live Media Streaming

Modeling Leechers Attack in BitTorrent

Game Theory. Presented by Hakim Weatherspoon

Second Generation P2P Live Streaming

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Peer-to-Peer Internet Applications: A Review

Towards Low-Redundancy Push-Pull P2P Live Streaming

Peer-to-Peer Networks 12 Fast Download

Advanced Internet Architectures

Peer-to-Peer Systems. Network Science: Introduction. P2P History: P2P History: 1999 today

Content Search. Unstructured P2P

A HYBRID PUSH-PULL OVERLAY NETWORK FOR PEER-TO-PEER VIDEO STREAMING

CS 43: Computer Networks BitTorrent & Content Distribution. Kevin Webb Swarthmore College September 28, 2017

Cooperative End-to-end content distribution. Márk Jelasity

Chunk Scheduling Strategies In Peer to Peer System-A Review

CS 3516: Computer Networks

Introduction to Peer-to-Peer Systems

improving the performance and robustness of P2P live streaming with Contracts

The Scalability of Swarming Peer-to-Peer Content Delivery

Do incentives build robustness in BitTorrent?

CS555: Distributed Systems [Fall 2017] Dept. Of Computer Science, Colorado State University

arxiv:cs.ni/ v1 21 Nov 2006

MASTER DEGREE COMPUTER SCIENCE COMPUTER SCIENCE AND NETWORKING. Peer to Peer Systems LAURA RICCI 2/5/2011

Peer-to-Peer Secure Update for Heterogeneous Edge Devices

An Empirical Study of Flash Crowd Dynamics in a P2P-based Live Video Streaming System

P2P and Handheld Devices Jukka K. Nurminen

DISTRIBUTED SYSTEMS CSCI 4963/ /4/2015

A DHT-Aided Chunk-Driven Overlay for Scalable and Efficient Peer-to-Peer Live Streaming

416 Distributed Systems. Mar 3, Peer-to-Peer Part 2

BitTorrent. Internet Technologies and Applications

Application Layer: P2P File Distribution

Peer Assisted Content Distribution over Router Assisted Overlay Multicast

CS555: Distributed Systems [Fall 2017] Dept. Of Computer Science, Colorado State University

BitRiver: Final Report

Dynamic Load-Balanced Multicast for Data-Intensive Applications on Clouds 1

Enhancing Downloading Time By Using Content Distribution Algorithm

Topology Optimization in Hybrid Tree/Mesh-based Peer-to-Peer Streaming System

BiToS: Enhancing BitTorrent for Supporting Streaming Applications

Peer to Peer Networks

Adaptive Server Allocation for Peer-assisted VoD

Peer-to-Peer Media Streaming

12/5/16. Peer to Peer Systems. Peer-to-peer - definitions. Client-Server vs. Peer-to-peer. P2P use case file sharing. Topics

Anatomy of a P2P Content Distribution System with Network Coding

Collaborative Multi-Source Scheme for Multimedia Content Distribution

A Case Study of Large Scale P2P Video Multicast

Inside the New Coolstreaming: Principles, Measurements and Performance Implications

Value of YouTube to the music industry Paper V Direct value to the industry

Overlay and P2P Networks. Introduction and unstructured networks. Prof. Sasu Tarkoma

Transcription:

P2P content distribution T-110.7100 Applications and Services in Internet, Fall 2010 Jukka K. Nurminen 1 V1-Filename.ppt / yyyy-mm-dd / Initials

Steps of content sharing Share content Find content Transfer content 2 V1-Filename.ppt /2008-10-22 / Jukka K. Nurminen

Azureus BitTorrent client 3 2008-10-22 / Jukka K. Nurminen

BitTorrent content downloading Efficient content distribution Bram Cohen, 2001 Key idea: you can receive faster than what your peer is able to send Peer serving multiple users Asynchronous connections E2E speed of Internet File divided into pieces, recipient receives pieces from multiple peers Each recipient supplies pieces of the data to newer recipients 4 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

BitTorrent, 2002 Tracker 2. Tracker provides a list of peers which have all or pieces of content 1. Get Torrent file 4. Distribute pieces to others 3. Get pieces of content from peers Most popular P2P technoloy Targets efficient content download Upload capacity is the bottleneck Sharers < Loaders Unsymmetric link speeds Especially for large files (video) The more popular content, the better it works Open protocol, many open source clients, commercial services

BitTorrent components Maintaining information about which peers have the content available Normal website hosting of metadata files (torrent-files) tracker website seed leech Peer with entire file 6 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen Peer that is still downloading (has only parts of the file)

BitTorrent joining a torrent Adapted from Nikitas Liogkas, Robert Nelson, Eddie Kohler, Lixia Zhang, Exploiting BitTorrent For Fun, University of California, Los Angeles new leecher metadata file 1 website 2 join peer list 3 tracker data request 4 seed/leecher 1. obtain the metadata file (.torrent -file) 2. contact the tracker 3. obtain a peer list (contains seeds & leechers) 4. contact peers from that list for data 7 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

BitTorrent exchanging data leecher B leecher A I have! seed leecher C Download sub-pieces in parallel Verify pieces using hashes Advertise received pieces to the entire peer list Look for the rarest pieces 8 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

Spotify music streaming hybrid technology, 2008 Spotify server 3. Send pieces of content 2a. Send part (beginning) of song for play 2b. Send list of peers who have are likely to have the song 1. Ask for song 4. Distribute content pieces further

BitTorrent Summary Benefits reduced cost and burden on any given individual source much higher redundancy greater resistance to abuse or "flash crowds less dependence on the original distributor Disadvantages Slow start and finish downloads take time to rise to full speed because peer connections take time to establish Special end game algorithms Full content has to be downloaded before playing can start (in most cases) Central tracker can be a bottleneck Distributed trackers based on DHT Applications Legal video distribution (e.g. BitTorrent, Vuze) Illegal video distribution (e.g. PirateBay) Distribution of patches (e.g. Wow, Linux distros) 10 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

P2P live streaming TV over the Internet 11 V1-Filename.ppt / yyyy-mm-dd / Initials

PPLive, TVU, PPLive is a P2P television network software that famous all over the world. It has the largest number of users, the most extensive coverage in internet. PPLive 100 million downloads of its P2P streaming video client 24 million users per month access to 900 or so live TV channels 200 individual advertisers this year alone 12 2008-10-22 / Jukka K. Nurminen Source: iresearch, August, 2008

Traditional stream delivery models Server Widely used, simple and easy Free Internet radios, YouTube, Liveleak.com, Google video, Allows using standard clients (browser) Limited server output capacity / stream quality; expensive to scale Server grid Content delivery network Expensive to scale IP multicast / LAN multicast The ideal model proposed for 20+ years Not available in large scale Internet Technical + non-technical constraints Perhaps possible in local environments 13 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

P2P streaming ( peercasting ) Each receiver of the stream forwards it to other receivers Promises No servers required Infinite scalability Challenges Churn: peers constantly join and leave the network Limited peer capabilities: asymmetric data connections Limited peer visibility: NAT, firewall Optimal use of network resources 14 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

Multicast tree (ca. 2002) First practical approach End-System Multicast II Open source solutions (peercast, freecast) Over 20 well-known variants Peers form a tree topology Own tree for each data stream Forward stream down the tree Works in practice Scales 10 100 1000? users Problems Large output bandwidth required Tree optimization Tree repair due to churn Less than half of peers can contribute source 15 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

Data-driven overlay (ca. 2004) The mainstream practical approach Active area for current research Coolstreaming (2004), Chainsaw (2005), GridMedia (2006), PRIME (2006), HotStreaming (2007) BitTorrent for streams Chunk stream in small pieces Distribute pieces in a swarm Works well in practice Most large-scale solutions Coolstreaming, PPLive, Roxbeam, Sopcast Scales to 10k 100k 1M? source 16 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

Basic data-driven overlay approach Coolstreaming/DONet (2004), Chainsaw (2005) Topology creation: gossiping protocol (SCAMP) Peers maintain random partial view of the network Peers select random partners No centralized tracker Swarming: sliding buffer of pieces Reports pieces it has to its partners Partners request for pieces they don t have Design problems Whom to select as partner? When and from whom to request a piece? Overhead vs. latency? Report Request Send 17 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

Main challenges of data-driven approach Open research questions Based on real-life experiences with Coolstreaming and 80k users Affect negatively to end-user experience Dealing with flash crowd How to cope if number of users increases from 1k to 100k in 10 minutes? We don t have infrastructure to support new users Joining takes a long time > 25% of new users must re-try joining Dealing with 50% of users that don t contribute Due to asymmetric connection, firewall, NAT, Where to get the missing output capacity? 18 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen

Hybrid technology The best known technology for commercial large-scale streaming Streaming to 100k 1M users Proposed practical solution to problems of data-driven overlay A combination of P2P and server grid Use P2P distribution in stable conditions Use powerful servers to fill in missing output capacity Servers support newcomers Servers support users behind asymmetric connections 19 V1-Filename.ppt / 2008-11-12 / Jukka K. Nurminen