CS 268: DHTs. Page 1. How Did it Start? Model. Main Challenge. Napster. Other Challenges
|
|
- Emil Shaw
- 5 years ago
- Views:
Transcription
1 How Did it Start? CS : DHTs A killer application: Naptser - Free music over the Internet Key idea: share the content, storage and bandwidth of individual (home) users Scott Shenker and Ion Stoica April, Internet Model Main Challenge Each user stores a subset of files Each user has access (can download) files from all users in the system Find where a particular file is stored F E D E? A B C Other Challenges Napster Scale: up to hundred of thousands or millions of machines Dynamicity: machines can come and go any time Assume a centralized index system that maps files (songs) to machines that are alive How to find a file (song) - Query the index system return a machine that stores the required file Ideally this is the closest/least-loaded machine - ftp the file Advantages: - Simplicity, easy to implement sophisticated search engines on top of the index system Disadvantages: - Robustness, scalability (?) Page
2 Napster: Example Gnutella m F E? E m A E? m m E m A m B m C m D m E m F m B D m C m Distribute file location Idea: flood the request Hot to find a file: - Send request to all neighbors - Neighbors recursively multicast the request - Eventually a machine that has the file receives the request, and it sends back the answer Advantages: - Totally decentralized, highly robust Disadvantages: - Not scalable; the entire network can be swamped with request (to alleviate this problem, each request has a TTL) Gnutella Ad-hoc topology Queries are flooded for bounded number of hops No guarantees on recall Distributed Hash Tables (DHTs) Abstraction: a distributed hash-table data structure - insert(id, item); - item = query(id); (or lookup(id);) - Note: item can be anything: a data object, document, file, pointer to a file Proposals - CAN, Chord, Kademlia, Pastry, Tapestry, etc 9 DHT Design Goals Make sure that an item (file) identified is always found Scales to hundreds of thousands of nodes Handles rapid arrival and failure of nodes Structured Networks Distributed Hash Tables (DHTs) Hash table interface: put(key,item), get(key) O(log n) hops Guarantees on recall (K,I ) put(k,i ) get (K ) Page
3 Content Addressable Network (CAN) CAN Example: Two Dimensional Space Associate to each node and item a unique id in an d- dimensional Cartesian space on a d-torus Properties - Routing table size O(d) - Guarantees that a file is found in at most d*n /d steps, where n is the total number of nodes Space divided between nodes All nodes cover the entire space Each node covers either a square or a rectangular area of ratios : or : Example: - Node n:(, ) first node that joins cover the entire space n CAN Example: Two Dimensional Space CAN Example: Two Dimensional Space Node n:(, ) joins space is divided between n and n Node n:(, ) joins space is divided between n and n n n n n n CAN Example: Two Dimensional Space CAN Example: Two Dimensional Space Nodes n:(, ) and n:(,) join n n n Nodes: n:(, ); n:(,); n:(, ); n:(,);n:(,) Items: f:(,); f:(,); f:(,); f:(,); n n n f n n n f f n f Page
4 Q Q CAN Example: Two Dimensional Space CAN: Query Example Each item is stored by the node who owns its mapping in the space f n f n n n f n f Each node knows its neighbors in the d-space Forward query to the neighbor that is closest to the query id Example: assume n queries f Can route around some failures f n f n n n f n f 9 CAN: Node Joining CAN: Node Joining 9: *!#"%$'& (!) (*+ -,!./ ; < = ; >? <! % (!! -( CAN: Node Joining CAN: Node Joining 9: * ; < = ; < = ; >? A-B > C D< E-D>GF'HJI K A IL?M EN!>O< B EP; >? <RQ S A ET U M DVQ!W EYX>Z;Z<[M ;G\Z]ZU ^%_`; < = ;Z>J? <a> =b;zec> ; <G\] U ^ Page
5 Node departure Chord Node explicitly hands over its zone and the associated (key,value) database to one of its neighbors Incase of network failure this is handled by a take-over algorithm Problem : take over mechanism does not provide regeneration of data Solution: every node has a backup of its neighbours Associate to each node and item a unique id in an unidimensional space.. m - Goals - Scales to hundreds of thousands of nodes - Handles rapid arrival and failure of nodes Key design decision - Decouple correctness from efficiency Properties - Routing table size O(log(N)), where N is the total number of nodes - Guarantees that a file is found in O(log(N)) steps Identifier to Node Mapping Example Lookup Node maps [,] Node maps [9,] Node maps [, ] Node maps [9, ] Each node maintains its successor Route packet (ID, data) to the node responsible for ID using successor pointers lookup() Each node maintains a pointer to its successor node= Joining Operation Joining Operation Each node A periodically sends a stabilize() message to its successor B Upon receiving a stabilize() message node B - returns its predecessor B =pred(b) to A by sending a notify(b ) message Upon receiving notify(b ) from B, - if B is between A and B, A updates its successor to B - A doesn t do anything, otherwise Node with id= joins the ring Node needs to know at least one node already in the system - Assume known node is succ=nil pred=nil succ= pred= succ= pred= 9 Page
6 Joining Operation Joining Operation Node : send join() to node Node : returns node Node updates its successor to succ= succ=nil pred=nil succ= pred= join() Node : send stabilize() to node Node : - update predecessor to - send notify() back succ= pred=nil notify(pred=) succ= pred= pred= stabilize() succ= pred= succ= pred= Joining Operation (cont d) Joining Operation (cont d) Node sends a stabilize message to its successor, node Node reply with a notify message Node updates its successor to succ= stabilize() pred=nil notify(pred=) succ= pred= Node sends a stabilize message to its new successor, node Node sets its predecessor to node succ= pred= pred=nil succ= pred= Stabilize() succ= succ= pred= succ= pred= Joining Operation (cont d) Achieving Efficiency: finger tables This completes the joining operation! succ= pred= succ= pred= Finger Table at i ft[i] ( + ) mod = Say m= i m ith entry at peer with id n is first peer with id >= n + (mod ) Page
7 Achieving Robustness CAN/Chord Optimizations To improve robustness each node maintains the k (> ) immediate successors instead of only one successor In the notify() message, node A can send its k- successors to its predecessor B Upon receiving notify() message, B can update its successor list by concatenating the successor list received from A with A itself Reduce latency - Chose finger that reduces expected time to reach destination - Chose the closest node from range [N+ i-,n+ i ) as successor Accommodate heterogeneous systems - Multiple virtual nodes per physical node Conclusions Two Other Papers Distributed Hash Tables are a key component of scalable and robust overlay networks CAN: O(d) state, O(d*n/d) distance Chord: O(log n) state, O(log n) distance Both can achieve stretch < Simplicity is key Services built on top of distributed hash tables - persistent storage (OpenDHT, Oceanstore) - pp file storage, i (chord) - multicast (CAN, Tapestry) Boon Loo Tau et al, Enhancing PP File-Sharing with an Internet-Scale Query Processor, VLDB Krishna Gummadi et al, The Impact of DHT Routing Geometry on Resilience and Proximity, SIGCOMM 9 Goal Keyword Search using DHTs PP Search - What is the best design? Target Environment - Typical PP file-sharing environments. - Items stored in PP network, queried by keywords - Replicas of items follow a long-tailed distribution Popular items at head of distribution Rare items at tail of distribution Inverted Lists hashed by keyword (term) in the DHT %& %. * +,-. + -* %'& % )( * +,- ( + -/.*! " #$ #$ %& % /* +,-/ + -* Page
8 Solution Space & Proposal Gnutella Measurements ;^_ V \]' %'&('(')+*-,'././ %..9,&:;9< = >?!"$# U V W X ZY$ Q[W; \]' (/.A.'9,&:;9 ` Quality of Searches: - Recall (% of all relevant items retrieved) - Distinct Recall (% of all relevant distinct items retrieved) - Response Time (Latency) to st result ` Computed Query Recall - Each query issued simultaneously from ultrapeers - Union of results from ultrapeers Union-of- is our approximation of perfect answer BC DFE'&.G/H&:;./ I'H:JCK.L:(F.C:M'./G(GEN:C DC O C I'P+,I'9Q:/,'RQ:,/.') (/9Q:J/,'RQ:J,/.)+I'. : (G/ S9 T Result Size CDF ab cdef g hfi j k l cbmcn m o n p qr g hfi j vz ww{- }~ y ~w w{z }w t~ $ Query Latency vg $ yt z ˆy ƒ } u; yt zˆy satufvwxty Š_Œ Ž Œ L _ 'Œ_ Š+ 'ŒF Œ LŠ+š _ LœFŒ _ž+ž Ÿ ŒF _ž _ $Œ Ž; ŒF Hybrid Search Challenges Hybrid = Best of both worlds vz w w{ y { ˆw u ¼½ ujxg ¾w¾Ãt z u ˆ À]y'Á ' ' ± -²'³ µ ³ µ ³ ³'¹ ² º;¹» ÄÅÆ -²'³ µ Ç ª$«¼½ ujxg $ ¾uJ [ ˆ Ày'Á ³A ³¹ ² º;¹ È Identifying Rare Items - Query Results Size (QRS) Publish items from previous queries that return few results - Term Frequency Statistics Single Term (TF) Term Pairs (TPF) - Sample items on neighboring nodes (SAM) È Network Churn (IPTPS ) - Use Ultrapeers as DHT nodes - Avoid publishing items from short-lived nodes Page
9 ; ; Identifying Rare Items ³ ² ' ¹'³ L³_ º ³ º;³ ' ' _ ³' ³! '³º G " ³G ² ' #$ %'&( ) ¹'³ L³ Ä* + ¹ + ³ º² '¹-,¹L ³º;³ _¹-,G ³ ²' ¹'³' /. ¹'³ _³'¹ ³º ³'³ #'$%'&( ), ' Results Improved Response Time: - PIER (DHT) returns first result in seconds seconds in aggregate including seconds timeout - Gnutella (Flood) queries returns first result in seconds - seconds (%) reduction in latency Improved Recall Analysis: - % reduction in #queries with empty results Using the naïve QRS rare-item selection scheme - Opportunity to do far better Recall: % potential reduction based on Union-of- Approaches: larger-scale deployments and using better rare-item schemes. 9 Two Other Papers Motivation 9 Boon Loo Tau et al, Enhancing PP File-Sharing with an Internet-Scale Query Processor, VLDB : Krishna Gummadi et al, The Impact of DHT Routing Geometry on Resilience and Proximity, SIGCOMM New DHTs constantly proposed - CAN, Chord, Pastry, Tapestry, Plaxton, Viceroy, Kademlia, Skipnet, Symphony, Koorde, Apocrypha, Land, ORDI ; Each is extensively analyzed but in isolation Each DHT has many algorithmic details making it difficult to compare Goals: a) Separate fundamental design choices from algorithmic details b) Understand their affect reliability and efficiency Our approach: Component-based analysis Three aspects of a DHT design 9 Break DHT design into independent components 9 Analyze impact of each component choice separately - compare with black-box analysis: benchmark each DHT implementation rankings of existing DHTs vs. hints on better designs 9 Two types of components - Routing-level : neighbor & route selection - System-level : caching, replication, querying policy etc. ) Geometry: a graph structure that inspires a DHT design, with its exciting properties - Tree, Hypercube, Ring, Butterfly, Debruijn ) Distance function: captures a geometric structure - d(id, id) for any two node identifiers ) Algorithm: rules for selecting neighbors and routes using the distance function Page 9
10 Chord DHT has Ring Geometry Chord Distance function captures Ring d(, ) = 9 Nodes are points on a clock-wise Ring 9 d(id, id) = length of clock-wise arc between ids = (id id) mod N Chord Neighbor and Route selection Algorithms One Geometry, Many Algorithms d(, ) = 9 Algorithm : exact rules for selecting neighbors, routes - Chord, CAN, PRR, Tapestry, Pastry etc. d(, ) = - inspired by geometric structures like Ring, Hyper-cube, Tree 9 Geometry : an algorithm s underlying structure d(, ) = 9 Neighbor selection: i th neighbor at i distance - Distance function is the formal representation of Geometry - Chord, Symphony => Ring - many algorithms can have same geometry Why is Geometry important? 9 Route selection: pick neighbor closest to destination Insight: Geometry => Flexibility => Performance Route selection flexibility allowed by Ring Geometry 9 Geometry captures flexibility in selecting algorithms 9 Flexibility is important for routing performance - Flexibility in selecting routes leads to shorter, reliable paths - Flexibility in selecting neighbors leads to shorter paths 9 Chord algorithm picks neighbor closest to destination 9 A different algorithm picks the best of alternate paths 9 Page
11 Neighbor selection flexibility allowed by Ring Geometry Geometries we compare Geometry Algorithm Ring Chord, Symphony Hypercube CAN Tree Plaxton 9 Chord algorithm picks i th neighbor at i distance 9 A different algorithm picks i th neighbor from [ i, i+ ) Hybrid = Tree + Ring XOR d(id, id) = id XOR id Tapestry, Pastry Kademlia Metrics for flexibilities Flexibility in neighbor selection (FNS) for Tree 9 FNS: Flexibility in Neighbor Selection = number of node choices for a neighbor 9 FRS: Flexibility in Route Selection = avg. number of next-hop choices for all destinations h = h = h = 9 Constraints for neighbors and routes - select neighbors to have paths of O(logN) - select routes so that each hop is closer to destination 9 logn neighbors in sub-trees of varying heights 9 FNS = i- for i th neighbor of a node Flexibility in route selection (FRS) for Hypercube Summary of our flexibility analysis d(, ) = Flexibility Ordering of Geometries d(, ) = Neighbors (FNS) Hypercube << Tree, XOR, Ring, Hybrid (logn) ( i- ) 9 Routing to next hop fixes one bit 9 FRS =Avg. (#bits destination differs in)=logn/ d(, ) = d(, ) = Routes Tree << XOR, Hybrid < Hypercube < Ring (FRS) () (logn/) (logn/) (logn) How relevant is flexibility for DHT routing performance? Page
12 Analysis of Static Resilience Two aspects of robust routing Dynamic Recovery : how quickly routing state is recovered after failures Static Resilience : how well the network routes before recovery finishes - captures how quickly recovery algorithms need to work - depends on FRS Evaluation: Fail a fraction of nodes, without recovering any state Metric: % Paths Failed % Failed Paths Does flexibility affect Static Resilience? Tree 9 % Failed Nodes Tree << XOR XOR Hybrid Ring Hypercube Hybrid < Hypercube < Ring Flexibility in Route Selection matters for Static Resilience Analysis of Overlay Path Latency Which is more effective, FNS or FRS? Goal: Minimize end-to-end overlay path latency FNS + FRS Ring - not just the number of hops Both FNS and FRS can reduce latency - Tree has FNS, Hypercube has FRS, Ring & XOR have both CDF FNS Ring FRS Ring Plain Ring Evaluation: Using Internet latency distributions (see paper) Latency (msec) Plain << FRS << FNS FNS+FRS Neighbor Selection is much better than Route Selection 9 Does Geometry affect performance of FNS or FRS? Summary of results CDF FNS Ring FNS XOR FRS Ring FRS Hypercube FRS matters for Static Resilience - Ring has the best resilience Both FNS and FRS reduce Overlay Path Latency But, FNS is far more important than FRS - Ring, Hybrid, Tree and XOR have high FNS Latency (msec) No, performance of FNS/FRS is independent of Geometry A Geometry s support for neighbor selection is crucial Page
13 Conclusions Routing Geometry is a fundamental design choice - Geometry determines flexibility - Flexibility improves resilience and proximity Ring has the greatest flexibility Page
CIS 700/005 Networking Meets Databases
Announcements CIS / Networking Meets Databases Boon Thau Loo Spring Lecture Paper summaries due at noon today. Office hours: Wed - pm ( Levine) Project proposal: due Feb. Student presenter: rd Jan: A Scalable
More informationGoals. EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Solution. Overlay Networks: Motivations.
Goals CS : Introduction to Computer Networks Overlay Networks and PP Networks Ion Stoica Computer Science Division Department of lectrical ngineering and Computer Sciences University of California, Berkeley
More informationThe Impact of DHT Routing Geometry on Resilience and Proximity. Acknowledgement. Motivation
The Impact of DHT Routing Geometry on Resilience and Proximity Presented by Noorullah Moghul Krishna Gummadi, Ramakrishna Gummadi, Sylvia Ratnasamy, Steve Gribble, Scott Shenker, Ion Stoica Acknowledgement
More informationMotivation. The Impact of DHT Routing Geometry on Resilience and Proximity. Different components of analysis. Approach:Component-based analysis
The Impact of DHT Routing Geometry on Resilience and Proximity Presented by Karthik Lakshminarayanan at P2P Systems class (Slides liberally borrowed from Krishna s SIGCOMM talk) Krishna Gummadi, Ramakrishna
More informationEE 122: Peer-to-Peer Networks
EE 122: Peer-to-Peer Networks Ion Stoica (and Brighten Godfrey) TAs: Lucian Popa, David Zats and Ganesh Ananthanarayanan http://inst.eecs.berkeley.edu/~ee122/ (Materials with thanks to Vern Paxson, Jennifer
More informationPage 1. How Did it Start?" Model" Main Challenge" CS162 Operating Systems and Systems Programming Lecture 24. Peer-to-Peer Networks"
How Did it Start?" CS162 Operating Systems and Systems Programming Lecture 24 Peer-to-Peer Networks" A killer application: Napster (1999) Free music over the Internet Key idea: share the storage and bandwidth
More informationEE 122: Peer-to-Peer (P2P) Networks. Ion Stoica November 27, 2002
EE 122: Peer-to-Peer (P2P) Networks Ion Stoica November 27, 22 How Did it Start? A killer application: Naptser - Free music over the Internet Key idea: share the storage and bandwidth of individual (home)
More informationEECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Overlay Networks: Motivations
EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks Ion Stoica Computer Science Division Department of Electrical Engineering and Computer Sciences University of California, Berkeley
More informationOverlay Networks: Motivations. EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Motivations (cont d) Goals.
Overlay Networks: Motivations CS : Introduction to Computer Networks Overlay Networks and PP Networks Ion Stoica Computer Science Division Department of lectrical ngineering and Computer Sciences University
More informationMain Challenge. Other Challenges. How Did it Start? Napster. Model. EE 122: Peer-to-Peer Networks. Find where a particular file is stored
Main hallenge ind where a particular file is stored : Peer-to-Peer Networks Ion Stoica (and righten Godfrey) Ts: Lucian Popa, avid Zats and Ganesh nanthanarayanan http://inst.eecs.berkeley.edu/~ee/ (Materials
More information15-744: Computer Networking P2P/DHT
15-744: Computer Networking P2P/DHT Overview P2P Lookup Overview Centralized/Flooded Lookups Routed Lookups Chord Comparison of DHTs 2 Peer-to-Peer Networks Typically each member stores/provides access
More informationScalability In Peer-to-Peer Systems. Presented by Stavros Nikolaou
Scalability In Peer-to-Peer Systems Presented by Stavros Nikolaou Background on Peer-to-Peer Systems Definition: Distributed systems/applications featuring: No centralized control, no hierarchical organization
More informationBadri Nath Rutgers University
lookup services Badri Nath Rutgers University badri@cs.rutgers.edu 1. CAN: A scalable content addressable network, Sylvia Ratnasamy et.al. SIGCOMM 2001 2. Chord: A scalable peer-to-peer lookup protocol
More informationCSCI-1680 P2P Rodrigo Fonseca
CSCI-1680 P2P Rodrigo Fonseca Based partly on lecture notes by Ion Stoica, Sco5 Shenker, Joe Hellerstein Today Overlay networks and Peer-to-Peer Motivation Suppose you want to write a routing protocol
More informationLecture 6: Overlay Networks. CS 598: Advanced Internetworking Matthew Caesar February 15, 2011
Lecture 6: Overlay Networks CS 598: Advanced Internetworking Matthew Caesar February 15, 2011 1 Overlay networks: Motivations Protocol changes in the network happen very slowly Why? Internet is shared
More informationP2P: Distributed Hash Tables
P2P: Distributed Hash Tables Chord + Routing Geometries Nirvan Tyagi CS 6410 Fall16 Peer-to-peer (P2P) Peer-to-peer (P2P) Decentralized! Hard to coordinate with peers joining and leaving Peer-to-peer (P2P)
More informationPage 1. Key Value Storage"
Key Value Storage CS162 Operating Systems and Systems Programming Lecture 14 Key Value Storage Systems March 12, 2012 Anthony D. Joseph and Ion Stoica http://inst.eecs.berkeley.edu/~cs162 Handle huge volumes
More informationDistributed lookup services
Distributed lookup services lookup services Badri Nath Rutgers University badri@cs.rutgers.edu A set of nodes cooperating Peers Run special purpose algorithms/software Doesn t have to be deployed at every
More informationP2P Network Structured Networks: Distributed Hash Tables. Pedro García López Universitat Rovira I Virgili
P2P Network Structured Networks: Distributed Hash Tables Pedro García López Universitat Rovira I Virgili Pedro.garcia@urv.net Index Introduction to DHT s Origins of structured overlays Case studies Chord
More informationOverlay and P2P Networks. Structured Networks and DHTs. Prof. Sasu Tarkoma
Overlay and P2P Networks Structured Networks and DHTs Prof. Sasu Tarkoma 6.2.2014 Contents Today Semantic free indexing Consistent Hashing Distributed Hash Tables (DHTs) Thursday (Dr. Samu Varjonen) DHTs
More informationDistributed Hash Table
Distributed Hash Table P2P Routing and Searching Algorithms Ruixuan Li College of Computer Science, HUST rxli@public.wh.hb.cn http://idc.hust.edu.cn/~rxli/ In Courtesy of Xiaodong Zhang, Ohio State Univ
More informationFlooded Queries (Gnutella) Centralized Lookup (Napster) Routed Queries (Freenet, Chord, etc.) Overview N 2 N 1 N 3 N 4 N 8 N 9 N N 7 N 6 N 9
Peer-to-Peer Networks -: Computer Networking L-: PP Typically each member stores/provides access to content Has quickly grown in popularity Bulk of traffic from/to CMU is Kazaa! Basically a replication
More informationMarch 10, Distributed Hash-based Lookup. for Peer-to-Peer Systems. Sandeep Shelke Shrirang Shirodkar MTech I CSE
for for March 10, 2006 Agenda for Peer-to-Peer Sytems Initial approaches to Their Limitations CAN - Applications of CAN Design Details Benefits for Distributed and a decentralized architecture No centralized
More informationCSCI-1680 Web Performance, Content Distribution P2P Rodrigo Fonseca
CSCI-1680 Web Performance, Content Distribution P2P Rodrigo Fonseca Based partly on lecture notes by Scott Shenker and John Jannotti Last time HTTP and the WWW Today: HTTP Performance Persistent Connections,
More informationCSCI-1680 Web Performance, Content Distribution P2P John Jannotti
CSCI-1680 Web Performance, Content Distribution P2P John Jannotti Based partly on lecture notes by Sco2 Shenker and Rodrigo Fonseca Last time HTTP and the WWW Today: HTTP Performance Persistent Connections,
More informationL3S Research Center, University of Hannover
, University of Hannover Structured Peer-to to-peer Networks Wolf-Tilo Balke and Wolf Siberski 3..6 *Original slides provided by K. Wehrle, S. Götz, S. Rieche (University of Tübingen) Peer-to-Peer Systems
More informationOverlay networks. To do. Overlay networks. P2P evolution DHTs in general, Chord and Kademlia. Turtles all the way down. q q q
Overlay networks To do q q q Overlay networks P2P evolution DHTs in general, Chord and Kademlia Turtles all the way down Overlay networks virtual networks Different applications with a wide range of needs
More informationDistributed Hash Tables Chord and Dynamo
Distributed Hash Tables Chord and Dynamo (Lecture 19, cs262a) Ion Stoica, UC Berkeley October 31, 2016 Today s Papers Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications, Ion Stoica,
More informationDistributed Hash Tables: Chord
Distributed Hash Tables: Chord Brad Karp (with many slides contributed by Robert Morris) UCL Computer Science CS M038 / GZ06 12 th February 2016 Today: DHTs, P2P Distributed Hash Tables: a building block
More informationTelematics Chapter 9: Peer-to-Peer Networks
Telematics Chapter 9: Peer-to-Peer Networks Beispielbild User watching video clip Server with video clips Application Layer Presentation Layer Application Layer Presentation Layer Session Layer Session
More informationCompSci 356: Computer Network Architectures Lecture 21: Overlay Networks Chap 9.4. Xiaowei Yang
CompSci 356: Computer Network Architectures Lecture 21: Overlay Networks Chap 9.4 Xiaowei Yang xwy@cs.duke.edu Overview Problem Evolving solutions IP multicast Proxy caching Content distribution networks
More informationP2P Network Structured Networks: Distributed Hash Tables. Pedro García López Universitat Rovira I Virgili
P2P Network Structured Networks: Distributed Hash Tables Pedro García López Universitat Rovira I Virgili Pedro.garcia@urv.net Index Description of CHORD s Location and routing mechanisms Symphony: Distributed
More information*Adapted from slides provided by Stefan Götz and Klaus Wehrle (University of Tübingen)
Distributed Hash Tables (DHT) Jukka K. Nurminen *Adapted from slides provided by Stefan Götz and Klaus Wehrle (University of Tübingen) The Architectures of st and nd Gen. PP Client-Server Peer-to-Peer.
More informationCS 640 Introduction to Computer Networks. Today s lecture. What is P2P? Lecture30. Peer to peer applications
Introduction to Computer Networks Lecture30 Today s lecture Peer to peer applications Napster Gnutella KaZaA Chord What is P2P? Significant autonomy from central servers Exploits resources at the edges
More information08 Distributed Hash Tables
08 Distributed Hash Tables 2/59 Chord Lookup Algorithm Properties Interface: lookup(key) IP address Efficient: O(log N) messages per lookup N is the total number of servers Scalable: O(log N) state per
More information416 Distributed Systems. Mar 3, Peer-to-Peer Part 2
416 Distributed Systems Mar 3, Peer-to-Peer Part 2 Scaling Problem Millions of clients server and network meltdown 2 P2P System Leverage the resources of client machines (peers) Traditional: Computation,
More informationLECT-05, S-1 FP2P, Javed I.
A Course on Foundations of Peer-to-Peer Systems & Applications LECT-, S- FPP, javed@kent.edu Javed I. Khan@8 CS /99 Foundation of Peer-to-Peer Applications & Systems Kent State University Dept. of Computer
More informationCS514: Intermediate Course in Computer Systems
Distributed Hash Tables (DHT) Overview and Issues Paul Francis CS514: Intermediate Course in Computer Systems Lecture 26: Nov 19, 2003 Distributed Hash Tables (DHT): Overview and Issues What is a Distributed
More informationCS 3516: Advanced Computer Networks
Welcome to CS 3516: Advanced Computer Networks Prof. Yanhua Li Time: 9:00am 9:50am M, T, R, and F Location: Fuller 320 Fall 2017 A-term 1 Some slides are originally from the course materials of the textbook
More informationCPSC 426/526. P2P Lookup Service. Ennan Zhai. Computer Science Department Yale University
CPSC 4/5 PP Lookup Service Ennan Zhai Computer Science Department Yale University Recall: Lec- Network basics: - OSI model and how Internet works - Socket APIs red PP network (Gnutella, KaZaA, etc.) UseNet
More informationPeer-to-Peer Systems and Distributed Hash Tables
Peer-to-Peer Systems and Distributed Hash Tables CS 240: Computing Systems and Concurrency Lecture 8 Marco Canini Credits: Michael Freedman and Kyle Jamieson developed much of the original material. Selected
More informationPeer-to-Peer Internet Applications: A Review
Peer-to-Peer Internet Applications: A Review Davide Quaglia 01/14/10 Introduction Key points Lookup task Outline Centralized (Napster) Query flooding (Gnutella) Distributed Hash Table (Chord) Simulation
More informationOverlay and P2P Networks. Structured Networks and DHTs. Prof. Sasu Tarkoma
Overlay and P2P Networks Structured Networks and DHTs Prof. Sasu Tarkoma 2.2.2015 Contents Today Geometries revisited Distributed Hash Tables (DHTs) Thursday Power-law networks Geometries revisited The
More informationIntroduction to P2P Computing
Introduction to P2P Computing Nicola Dragoni Embedded Systems Engineering DTU Compute 1. Introduction A. Peer-to-Peer vs. Client/Server B. Overlay Networks 2. Common Topologies 3. Data Location 4. Gnutella
More informationOverlay networks. Today. l Overlays networks l P2P evolution l Pastry as a routing overlay example
Overlay networks Today l Overlays networks l P2P evolution l Pastry as a routing overlay eample Network virtualization and overlays " Different applications with a range of demands/needs network virtualization
More informationDistributed Hash Tables
Distributed Hash Tables Chord Smruti R. Sarangi Department of Computer Science Indian Institute of Technology New Delhi, India Smruti R. Sarangi Chord 1/29 Outline Overview 1 Overview 2 3 Smruti R. Sarangi
More informationA Scalable Content- Addressable Network
A Scalable Content- Addressable Network In Proceedings of ACM SIGCOMM 2001 S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Presented by L.G. Alex Sung 9th March 2005 for CS856 1 Outline CAN basics
More informationPeer-to-Peer Systems. Chapter General Characteristics
Chapter 2 Peer-to-Peer Systems Abstract In this chapter, a basic overview is given of P2P systems, architectures, and search strategies in P2P systems. More specific concepts that are outlined include
More informationMaking Gnutella-like P2P Systems Scalable
Making Gnutella-like P2P Systems Scalable Y. Chawathe, S. Ratnasamy, L. Breslau, N. Lanham, S. Shenker Presented by: Herman Li Mar 2, 2005 Outline What are peer-to-peer (P2P) systems? Early P2P systems
More informationToday. Why might P2P be a win? What is a Peer-to-Peer (P2P) system? Peer-to-Peer Systems and Distributed Hash Tables
Peer-to-Peer Systems and Distributed Hash Tables COS 418: Distributed Systems Lecture 7 Today 1. Peer-to-Peer Systems Napster, Gnutella, BitTorrent, challenges 2. Distributed Hash Tables 3. The Chord Lookup
More informationArchitectures for Distributed Systems
Distributed Systems and Middleware 2013 2: Architectures Architectures for Distributed Systems Components A distributed system consists of components Each component has well-defined interface, can be replaced
More informationCycloid: A constant-degree and lookup-efficient P2P overlay network
Performance Evaluation xxx (2005) xxx xxx Cycloid: A constant-degree and lookup-efficient P2P overlay network Haiying Shen a, Cheng-Zhong Xu a,, Guihai Chen b a Department of Electrical and Computer Engineering,
More informationPeer-to-Peer (P2P) Systems
Peer-to-Peer (P2P) Systems What Does Peer-to-Peer Mean? A generic name for systems in which peers communicate directly and not through a server Characteristics: decentralized self-organizing distributed
More informationChord : A Scalable Peer-to-Peer Lookup Protocol for Internet Applications
: A Scalable Peer-to-Peer Lookup Protocol for Internet Applications Ion Stoica, Robert Morris, David Liben-Nowell, David R. Karger, M. Frans Kaashock, Frank Dabek, Hari Balakrishnan March 4, 2013 One slide
More informationKademlia: A P2P Informa2on System Based on the XOR Metric
Kademlia: A P2P Informa2on System Based on the XOR Metric Today! By Petar Mayamounkov and David Mazières, presented at IPTPS 22 Next! Paper presentation and discussion Image from http://www.vs.inf.ethz.ch/about/zeit.jpg
More information: Scalable Lookup
6.824 2006: Scalable Lookup Prior focus has been on traditional distributed systems e.g. NFS, DSM/Hypervisor, Harp Machine room: well maintained, centrally located. Relatively stable population: can be
More informationDebunking some myths about structured and unstructured overlays
Debunking some myths about structured and unstructured overlays Miguel Castro Manuel Costa Antony Rowstron Microsoft Research, 7 J J Thomson Avenue, Cambridge, UK Abstract We present a comparison of structured
More informationPeer-to-peer computing research a fad?
Peer-to-peer computing research a fad? Frans Kaashoek kaashoek@lcs.mit.edu NSF Project IRIS http://www.project-iris.net Berkeley, ICSI, MIT, NYU, Rice What is a P2P system? Node Node Node Internet Node
More informationCS555: Distributed Systems [Fall 2017] Dept. Of Computer Science, Colorado State University
CS 555: DISTRIBUTED SYSTEMS [P2P SYSTEMS] Shrideep Pallickara Computer Science Colorado State University Frequently asked questions from the previous class survey Byzantine failures vs malicious nodes
More informationStructured Peer-to-Peer Networks
Structured Peer-to-Peer Networks The P2P Scaling Problem Unstructured P2P Revisited Distributed Indexing Fundamentals of Distributed Hash Tables DHT Algorithms Chord Pastry Can Programming a DHT Graphics
More information0!1. Overlaying mechanism is called tunneling. Overlay Network Nodes. ATM links can be the physical layer for IP
epartment of lectrical ngineering and omputer Sciences University of alifornia erkeley '!$$( network defined over another set of networks The overlay addresses its own nodes Links on one layer are network
More informationDistributed Systems. 17. Distributed Lookup. Paul Krzyzanowski. Rutgers University. Fall 2016
Distributed Systems 17. Distributed Lookup Paul Krzyzanowski Rutgers University Fall 2016 1 Distributed Lookup Look up (key, value) Cooperating set of nodes Ideally: No central coordinator Some nodes can
More informationReCord: A Distributed Hash Table with Recursive Structure
ReCord: A Distributed Hash Table with Recursive Structure Jianyang Zeng and Wen-Jing Hsu Abstract We propose a simple distributed hash table called ReCord, which is a generalized version of Randomized-
More informationDistributed Hash Tables
Distributed Hash Tables CS6450: Distributed Systems Lecture 11 Ryan Stutsman Material taken/derived from Princeton COS-418 materials created by Michael Freedman and Kyle Jamieson at Princeton University.
More informationAn Adaptive Stabilization Framework for DHT
An Adaptive Stabilization Framework for DHT Gabriel Ghinita, Yong Meng Teo Department of Computer Science National University of Singapore {ghinitag,teoym}@comp.nus.edu.sg Overview Background Related Work
More informationDecentralized Object Location In Dynamic Peer-to-Peer Distributed Systems
Decentralized Object Location In Dynamic Peer-to-Peer Distributed Systems George Fletcher Project 3, B649, Dr. Plale July 16, 2003 1 Introduction One of the key requirements for global level scalability
More informationA Survey of Peer-to-Peer Content Distribution Technologies
A Survey of Peer-to-Peer Content Distribution Technologies Stephanos Androutsellis-Theotokis and Diomidis Spinellis ACM Computing Surveys, December 2004 Presenter: Seung-hwan Baek Ja-eun Choi Outline Overview
More informationIntroduction to Peer-to-Peer Systems
Introduction Introduction to Peer-to-Peer Systems Peer-to-peer (PP) systems have become extremely popular and contribute to vast amounts of Internet traffic PP basic definition: A PP system is a distributed
More informationPeer to peer systems: An overview
Peer to peer systems: An overview Gaurav Veda (Y1148) gveda@cse.iitk.ac.in Computer Science & Engineering Indian Institute of Technology Kanpur, UP, INDIA - 208016 Abstract Peer-to-peer (p2p) systems is
More informationHandling Churn in a DHT
Handling Churn in a DHT Sean Rhea, Dennis Geels, Timothy Roscoe, and John Kubiatowicz UC Berkeley and Intel Research Berkeley What s a DHT? Distributed Hash Table Peer-to-peer algorithm to offering put/get
More informationSearching for Shared Resources: DHT in General
1 ELT-53207 P2P & IoT Systems Searching for Shared Resources: DHT in General Mathieu Devos Tampere University of Technology Department of Electronics and Communications Engineering Based on the original
More informationP2P Network Structured Networks (IV) Distributed Hash Tables. Pedro García López Universitat Rovira I Virgili
P2P Network Structured Networks (IV) Distributed Hash Tables Pedro García López Universitat Rovira I Virgili Pedro.garcia@urv.net Index Koorde: Degree optimal DHT Constructing overlay applications and
More informationOverlay networks. T o do. Overlay networks. P2P evolution DHTs in general, Chord and Kademlia. q q q. Turtles all the way down
Overlay networks T o do q q q Overlay networks P2P evolution DHTs in general, Chord and Kademlia Turtles all the way down Overlay networks virtual networks Different applications with a wide range of needs
More informationPage 1. P2P Traffic" P2P Traffic" Today, around 18-20% (North America)! Big chunk now is video entertainment (e.g., Netflix, itunes)!
P2P Traffic" CS162 Operating Systems and Systems Programming Lecture 25 Capstone: P2P Systems, Review" 2004: some Internet Service Providers (ISPs) claimed that over 50% of their traffic was peer-to-peer
More informationEffect of Links on DHT Routing Algorithms 1
Effect of Links on DHT Routing Algorithms 1 Futai Zou, Liang Zhang, Yin Li, Fanyuan Ma Department of Computer Science and Engineering Shanghai Jiao Tong University, 200030 Shanghai, China zoufutai@cs.sjtu.edu.cn
More informationDHT Overview. P2P: Advanced Topics Filesystems over DHTs and P2P research. How to build applications over DHTS. What we would like to have..
DHT Overview P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar DHTs provide a simple primitive put (key,value) get (key) Data/Nodes distributed over a key-space High-level idea: Move
More informationPeer to Peer I II 1 CS 138. Copyright 2015 Thomas W. Doeppner, Rodrigo Fonseca. All rights reserved.
Peer to Peer I II 1 Roadmap This course will feature key concepts in Distributed Systems, often illustrated by their use in example systems Start with Peer-to-Peer systems, which will be useful for your
More informationDistributed Systems. 16. Distributed Lookup. Paul Krzyzanowski. Rutgers University. Fall 2017
Distributed Systems 16. Distributed Lookup Paul Krzyzanowski Rutgers University Fall 2017 1 Distributed Lookup Look up (key, value) Cooperating set of nodes Ideally: No central coordinator Some nodes can
More informationSearching for Shared Resources: DHT in General
1 ELT-53206 Peer-to-Peer Networks Searching for Shared Resources: DHT in General Mathieu Devos Tampere University of Technology Department of Electronics and Communications Engineering Based on the original
More informationPeer-to-Peer Systems. Network Science: Introduction. P2P History: P2P History: 1999 today
Network Science: Peer-to-Peer Systems Ozalp Babaoglu Dipartimento di Informatica Scienza e Ingegneria Università di Bologna www.cs.unibo.it/babaoglu/ Introduction Peer-to-peer (PP) systems have become
More informationCRESCENDO GEORGE S. NOMIKOS. Advisor: Dr. George Xylomenos
CRESCENDO Implementation of Hierarchical Chord (Crescendo), according to Canon paradigm and evaluation, via simulation over realistic network topologies, of the Crescendo's advantages in comparison to
More informationPeer-to-peer systems and overlay networks
Complex Adaptive Systems C.d.L. Informatica Università di Bologna Peer-to-peer systems and overlay networks Fabio Picconi Dipartimento di Scienze dell Informazione 1 Outline Introduction to P2P systems
More informationSurvey on Distributed Hash Tables 1
Survey on Distributed Hash Tables 1 Filipe Araújo and Luís Rodrigues Univ. de Coimbra, Univ. de Lisboa filipius@dei.uc.pt, ler@di.fc.ul.pt Abstract In this paper we review distributed hash tables (DHTs)
More informationThe Impact of DHT Routing Geometry on Resilience and Proximity
The Impact of DHT Routing Geometry on Resilience and Proximity K. Gummadi, R. Gummadi, S. Gribble, S. Ratnasamy, S. Shenker, I. Stoica, ABSTRACT The various proposed DHT routing algorithms embody several
More informationContent Overlays. Nick Feamster CS 7260 March 12, 2007
Content Overlays Nick Feamster CS 7260 March 12, 2007 Content Overlays Distributed content storage and retrieval Two primary approaches: Structured overlay Unstructured overlay Today s paper: Chord Not
More informationScalable overlay Networks
overlay Networks Dr. Samu Varjonen 1 Lectures MO 15.01. C122 Introduction. Exercises. Motivation. TH 18.01. DK117 Unstructured networks I MO 22.01. C122 Unstructured networks II TH 25.01. DK117 Bittorrent
More informationAn Expresway over Chord in Peer-to-Peer Systems
An Expresway over Chord in Peer-to-Peer Systems Hathai Tanta-ngai Technical Report CS-2005-19 October 18, 2005 Faculty of Computer Science 6050 University Ave., Halifax, Nova Scotia, B3H 1W5, Canada An
More informationPart 1: Introducing DHTs
Uni Innsbruck Informatik - Peer-to to-peer Systems Structured PP file sharing systems Michael Welzl michael.welzl@uibk.ac.at DPS NSG Team http://dps.uibk.ac.at dps.uibk.ac.at/nsg Institute of Computer
More informationLecture 6: Securing Distributed and Networked Systems. CS 598: Network Security Matthew Caesar March 12, 2013
Lecture 6: Securing Distributed and Networked Systems CS 598: Network Security Matthew Caesar March 12, 2013 1 Today: Distributed Internet Services Previous cycle: how to build Internet services that run
More informationDISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES
DISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES Dr. Jack Lange Computer Science Department University of Pittsburgh Fall 2015 Outline System Architectural Design Issues Centralized Architectures Application
More informationScaling Out Systems. COS 518: Advanced Computer Systems Lecture 6. Kyle Jamieson
Scaling Out Systems COS 518: Advanced Computer Systems Lecture 6 Kyle Jamieson [Credits: Selected content adapted from M. Freedman, B. Karp, R. Morris] Today 1. Scaling up: Reliable multicast 2. Scaling
More informationDistriubted Hash Tables and Scalable Content Adressable Network (CAN)
Distriubted Hash Tables and Scalable Content Adressable Network (CAN) Ines Abdelghani 22.09.2008 Contents 1 Introduction 2 2 Distributed Hash Tables: DHT 2 2.1 Generalities about DHTs............................
More informationMinimizing Churn in Distributed Systems
Minimizing Churn in Distributed Systems by P. Brighten Godfrey, Scott Shenker, and Ion Stoica appearing in SIGCOMM 2006 presented by Todd Sproull Introduction Problem: nodes joining or leaving distributed
More informationCS 347 Parallel and Distributed Data Processing
CS 347 Parallel and Distributed Data Processing Spring 2016 Notes 9: Peer-to-Peer Systems Previous Topics Data Database design Queries Query processing Localization Operators Optimization Transactions
More informationWinter CS454/ Assignment 2 Instructor: Bernard Wong Due date: March 12 15, 2012 Group size: 2
Winter CS454/654 2012 Assignment 2 Instructor: Bernard Wong Due date: March 12 15, 2012 Group size: 2 Distributed systems is a rapidly evolving field. Most of the systems we have studied (or will study)
More informationKen Birman. Cornell University. CS5410 Fall 2008.
Ken Birman Cornell University. CS5410 Fall 2008. What is a Distributed Hash Table (DHT)? Exactly that A service, distributed over multiple machines, with hash h table semantics Put(key, value), Value =
More informationA Structured Overlay for Non-uniform Node Identifier Distribution Based on Flexible Routing Tables
A Structured Overlay for Non-uniform Node Identifier Distribution Based on Flexible Routing Tables Takehiro Miyao, Hiroya Nagao, Kazuyuki Shudo Tokyo Institute of Technology 2-12-1 Ookayama, Meguro-ku,
More informationPeer-To-Peer Techniques
PG DynaSearch Markus Benter 31th October, 2013 Introduction Centralized P2P-Networks Unstructured P2P-Networks Structured P2P-Networks Misc 1 What is a Peer-to-Peer System? Definition Peer-to-peer systems
More informationDepartment of Computer Science Institute for System Architecture, Chair for Computer Networks. File Sharing
Department of Computer Science Institute for System Architecture, Chair for Computer Networks File Sharing What is file sharing? File sharing is the practice of making files available for other users to
More informationPeer-to-Peer Protocols and Systems. TA: David Murray Spring /19/2006
Peer-to-Peer Protocols and Systems TA: David Murray 15-441 Spring 2006 4/19/2006 P2P - Outline What is P2P? P2P System Types 1) File-sharing 2) File distribution 3) Streaming Uses & Challenges 2 Problem:
More information