Semantic Multicast for Content-based Stream Dissemination

Similar documents
Architecture and Implementation of a Content-based Data Dissemination System

XFlow: Dynamic Dissemination Trees for Extensible In-Network Stream Processing

Extensible Optimization in Overlay Dissemination Trees

Application-aware Overlay Networks for Data Dissemination

IP Multicast Technology Overview

IP Multicast. Overview. Casts. Tarik Čičić University of Oslo December 2001

Adaptive In-Network Query Deployment for Shared Stream Processing Environments

Multicast Communications. Tarik Čičić, 4. March. 2016

Multicast Communications

XML Filtering Technologies

! Naive n-way unicast does not scale. ! IP multicast to the rescue. ! Extends IP architecture for efficient multi-point delivery. !

Overlay Multicast. Application Layer Multicast. Structured Overlays Unstructured Overlays. CAN Flooding Centralised. Scribe/SplitStream Distributed

What is Multicasting? Multicasting Fundamentals. Unicast Transmission. Agenda. L70 - Multicasting Fundamentals. L70 - Multicasting Fundamentals

Bayeux: An Architecture for Scalable and Fault Tolerant Wide area Data Dissemination

A Routing Scheme for Content-Based Networking

DATA COMMUNICATOIN NETWORKING

SFilter: A Simple and Scalable Filter for XML Streams

Design of Next Generation Internet Based on Application-Oriented Networking

IP Multicast Technology Overview

Subscription Aggregation for Scalability and Efficiency in XPath/XML-Based Publish/Subscribe Systems

Scalable Application Layer Multicast

ASM. Engineering Workshops

Architectural Approaches for Social Networks. Presenter: Qian Li

Staying FIT: Efficient Load Shedding Techniques for Distributed Stream Processing

BGP. Daniel Zappala. CS 460 Computer Networking Brigham Young University

Chapter 11 Chapter 6

References. Forwarding. Introduction...

Delaunay Triangulation Overlays

IPv6 PIM. Based on the forwarding mechanism, IPv6 PIM falls into two modes:

Indirect Communication

Arvind Krishnamurthy Fall 2003

On Minimizing Packet Loss Rate and Delay for Mesh-based P2P Streaming Services

Implementing Multicast Service Reflection

Distributed Core Multicast (DCM): a multicast routing protocol for many groups with few receivers

Distributed Conditional Multicast Access for IP TV in High-Speed Wireless Networks (Destination Specific Multicast)

Distributed Core Multicast (DCM): a multicast routing protocol for many groups with few receivers

Contents. Overview Multicast = Send to a group of hosts. Overview. Overview. Implementation Issues. Motivation: ISPs charge by bandwidth

Table of Contents 1 PIM Configuration 1-1

Overlay Multicast/Broadcast

Enhancement of the CBT Multicast Routing Protocol

Many-to-Many Communications in HyperCast

CCD: Efficient Customized Content Dissemination in Distributed Publish/Subscribe

A Self-Organizing Crash-Resilient Topology Management System for Content-Based Publish/Subscribe

Overlay Networks for Multimedia Contents Distribution

Broadcast Routing. Multicast. Flooding. In-network duplication. deliver packets from source to all other nodes source duplication is inefficient:

Top-Down Network Design, Ch. 7: Selecting Switching and Routing Protocols. Top-Down Network Design. Selecting Switching and Routing Protocols

Data Communication. Guaranteed Delivery Based on Memorization

Adaptive Content-Based Routing in General Overlay Topologies

Configuring IP Multicast Routing

CSCE 463/612 Networks and Distributed Processing Spring 2018

Configuring a Rendezvous Point

IP Multicast. What is multicast?

Configuring IP Multicast Routing

Peer-to-Peer Overlay Broker Networks in an Event-Based Middleware

Content. 1. Introduction. 2. The Ad-hoc On-Demand Distance Vector Algorithm. 3. Simulation and Results. 4. Future Work. 5.

Lab 7-3 Routing IP Multicast with PIM Sparse Mode

Adaptive Content-based Routing In General Overlay Topologies

FSOS Multicast Configuration Guide

Multicast Technology White Paper

Configuring Bidirectional PIM

PUBLISHER Subscriber system is an event notification service

Multicast routing Draft

Configuring multicast VPN

Financial Services Design for High Availability

Rethinking the Design of Distributed Stream Processing Systems

Multicast EECS 122: Lecture 16

Configuring SSM. Finding Feature Information. Prerequisites for Configuring SSM

Abstract In this paper we describe a new network service called Speccast. Speccast offers a generalized addressing and

Tag Switching. Background. Tag-Switching Architecture. Forwarding Component CHAPTER

Multicast VPN C H A P T E R. Introduction to IP Multicast

05 Indirect Communication

Top-Down Network Design

Hierarchical Routing. Our routing study thus far - idealization all routers identical network flat not true in practice

Exercises to Communication Systems

Multicast. Presented by Hakim Weatherspoon. Slides borrowed from Qi Huang 2009 and Tom Ternquist 2010

Network Configuration Example

Application Layer Multicast For Efficient Peer-to-Peer Applications

YFilter: an XML Stream Filtering Engine. Weiwei SUN University of Konstanz

Multicast overview. Introduction to multicast. Information transmission techniques. Unicast

Routing over Low Power and Lossy Networks

List of groups known at each router. Router gets those using IGMP. And where they are in use Where members are located. Enhancement to OSPF

ICS 351: Today's plan. distance-vector routing game link-state routing OSPF

ROUTING ALGORITHMS Part 1: Data centric and hierarchical protocols

Scalable Overlay Multicast Tree Construction for Media Streaming

Network Configuration Example

Recent Advances and Challenges in XML Document Routing

SCAN: Scalable Content routing for

Broadcast and Multicast Routing

Routing protocols in WSN

QoS-Aware Hierarchical Multicast Routing on Next Generation Internetworks

Enhanced Cores Based Tree for Many-to-Many IP Multicasting

ITEC310 Computer Networks II

Configuring IP Multicast Routing

Path Sharing and Predicate Evaluation for High- Performance XML Filtering

Rendezvous Point Engineering

PIM Configuration. Page 1 of 9

Table of Contents 1 Multicast VPN Configuration 1-1

Chapter 7 CONCLUSION

Why multicast? The concept of multicast Multicast groups Multicast addressing Multicast routing protocols MBONE Multicast applications Conclusions

Indirect Communication

Transcription:

Semantic Multicast for Content-based Stream Dissemination Olga Papaemmanouil Brown University Uğur Çetintemel Brown University

Stream Dissemination Applications Clients Data Providers Push-based applications Environmental monitoring Real-time financial services Characteristics High data volume Fast stream rates Highly dispersed sources & destinations 2

Content-based Dissemination Centralized filtering Single node gathers profiles Unicast to destinations e.g., XFilter [Altinel et al., 2000] Distributed content-based routing Interest-based routing Predefined acyclic overlay network e.g. SIENA [Carzaniga et al., 2001] Predicate-based filtering network Upstream profile aggregation Content filtering of each message at each hop x>10 A x<20 B x>10 C 1 S 1 x<5 D x<5 C 2 C 1 x>10 C 2 x<5 3

Content-based routing overhead Processing cost Local filtering of each message Compression/decompression for XML streams Bandwidth consumption Missed tree optimization opportunities Different client sets request different sub-streams One spanning tree of the whole network is not the best solution for low cost 4

Our approach: Semantic Multicast Constructs content-based (a.k.a. semantic) multicast channels Independent overlay dissemination trees Channels characterized by their content Advantages Decreases processing cost Eliminates local filtering at interior brokers Decreases overall bandwidth requirements Allows QoS-aware multicast trees 5

Content-based Channelization SemCast decides: Number of channels Content of channels Clients subscriptions to channels Channel implementation Operational goals No false exclusion Low run-time cost : Overall bandwidth consumption Minimize redundancy among channels content Create efficient multicast trees 6

System Model Publishers Source brokers (S) Receive XML streams from publishers Gateway brokers (GB) Receive XPath profiles from subscribers Rendezvous points (RP) Roots of channels Interior brokers (I) Forward incoming messages Coordinator Identifies content of channels Coordinator S S S RP RP RP I GB GB GB GB I GB Subscribers I 7

SemCast Overview Membership Management Syntactical analysis of profiles Dynamic Channelization Exploit statistical information Reorganize channels 8

Membership Management Adding subscriptions Gateway broker forwards unsatisfied subscriptions to coordinator Coordinator creates a new channel for any subscription not covered by existing channels Removing subscriptions Gateway broker forwards request to upstream brokers Interior brokers remove channel entries from routing tables Coordinator removes channel if there are no more remaining clients 9

Profile Containment Hierarchies Identify channels covering a profile based on profile syntax Containment algorithms for XPath expressions. e.g. [Wood, 2003] Maintain syntax-based containment hierarchies Parent profile covers children Root is the channel s content expression Channel1 Channel 2 a//* d//* a/b a/c d/*/g d/f/g 10

Dynamic Channelization Syntactic analysis might cause high data redundancy Undiscovered full/partial overlapping profiles Assign similar profiles to different channels Forward matching messages to more than one channel Use statistics to re-evaluate channelization Profile overlap Run-time stream rate 11

Profile Overlapping Relations SemCast exploits also partial overlap among profiles Partial overlap P i k-overlaps with P j : Containment match( PP ) k i j Pi! Pj, k = match( P) Special case of partial overlap, k=1 P j covers P i :messages matching P i are subset of those matching P j i 12

Rate-based Hierarchies P j is parent of P i if { k }! P # P and k = max k r P # P 1 i j j i j i P, i" j j r = rate of non-overlapping part between P and P j! i j i P j is more general than P i If multiple candidate parents exist Maximum overlap Low stream rate of redundant messages 13

Hierarchy Merging Highly diverse profiles may increase cost High message replication Large number of channels Large routing tables at interior brokers Merging hierarchies with partial overlap reduces cost Use a cost-based model Merge pairs with maximum benefit in bandwidth High profile overlap Efficient single multicast tree 14

Multicast Tree Construction Base low cost heuristic Request channel s destinations from RP Find min-cost path to all destinations in the channel Connect to closest one C 2 RP S C 1 C 3 15

Simulation Metrics Processing cost Eliminate need for local filtering (not in this paper) Bandwidth efficiency Approaches Unicast approach SPT: Shortest Path Tree approach Distributed pub-sub system [Carzaniga et al., 2001] SemCast: Distributed content-based channelization Optimal: Centralized Steiner tree construction Simulation environment Random graph generated by GT-ITM Up to 700 nodes and 7000 profiles 16

Bandwidth efficiency % cost degradation over Optimal 160 140 120 100 80 60 40 20 0 0 0.025 0.05 0.06 Unicast SPT SemCas t Heuris tic 0.125 0.18 0.25 0.5 0.7 profile selectivity percentage 1 5 10 20 Network size =700 #profiles =7000 17

Scalability % cost degradation over Optimal 90 80 70 60 50 40 30 20 10 0 Unicast SPT SemCast Heuristic 200 400 600 netw ork size (profile selectivity factor = 0.7%) 18

Related Work Publish-Subscribe Systems Centralized approaches: XML Filtering XFilter [Altinel et al., 2000], YFilter [Diao et al., 2002] XTrie [Chan et al., 2002] Distributed approaches: Content- based Routing Gryphon [Opyrchal et al., 2000] SIENA [Carzaniga et al., 2001] Application-Level Multicast SCRIBE [Castro et al., 2002] CAN-based Multicast [Ratnasamy et al., 2001] 19

Conclusions & Ongoing Work SemCast Performs semantic split of incoming streams Eliminates local filtering in interior brokers Improves bandwidth consumption Ongoing work SemCast prototype Demonstrate processing cost benefit Investigate affects of network topology Storage-oriented model: caching XML nodes 20