Mutually Exclusive Data Dissemination in the Mobile Publish/Subscribe System

Similar documents
Center for Networked Computing

Center for Networked Computing

NSFA: Nested Scale-Free Architecture for Scalable Publish/Subscribe over P2P Networks

Heterogeneous Community-based Routing in Opportunistic Mobile Social Networks

InterestSpread: An Efficient Method for Content Transmission in Mobile Social Networks

Homing Spread: Community Home-based Multi-copy Routing in Mobile Social Networks

4.1 Interval Scheduling

Outsourcing Privacy-Preserving Social Networks to a Cloud

A Multi-Copy Delegation Forwarding Based On Short-term and Long-Term Speed in DTNs

Social-Aware Routing in Delay Tolerant Networks

Community-home-based Multi-copy Routing in Mobile Social Networks

Multi-resource Energy-efficient Routing in Cloud Data Centers with Network-as-a-Service

A Joint Replication-Migration-based Routing in Delay Tolerant Networks

GUI: GPS-Less Traffic Congestion Avoidance in Urban Area with Inter-Vehicular Communication

Virtual Network Embedding with Substrate Support for Parallelization

Absorbing Random walks Coverage

Absorbing Random walks Coverage

A O (log n ) Distributed Algorithm to Construct Routing Structures for Pub/Sub Systems. Volker Turau. November 6th, 2018

Topology Control in Wireless Networks 4/24/06

WSN Routing Protocols

IEEE TRANSACTIONS ON BIG DATA 1. Cost-efficient Heterogeneous Worker Recruitment under Coverage Requirement in Spatial Crowdsourcing

Building an Internet-Scale Publish/Subscribe System

Up-and-Down Routing Through Nested Core-Periphery Hierarchy in Mobile Opportunistic Social Networks

Cost-Aware SEcure Routing (CASER) Protocol Design for Wireless Sensor Networks

End-To-End Delay Optimization in Wireless Sensor Network (WSN)

CCD: Efficient Customized Content Dissemination in Distributed Publish/Subscribe

On Efficient Data Collection and Event Detection with Delay Minimization in Deep Sea

Linear Programming Problems: Geometric Solutions

Graph Algorithms. Many problems in networks can be modeled as graph problems.

Routing protocols in WSN

Energy aware geographic routing in wireless sensor networks with anchor nodes. Mircea Cretu Stancu Utrecht University Computing Science May 2013

Cross-Monotonic Multicast

Protocol Design and Optimization for

On Efficient Data Collection Using Autonomous Underwater Vehicles

QoS-Aware Service Selection in Geographically Distributed Clouds

Connected Point Coverage in Wireless Sensor Networks using Robust Spanning Trees

Energy-Aware Scheduling for Acyclic Synchronous Data Flows on Multiprocessors

AM 221: Advanced Optimization Spring 2016

Routing in Large-Scale Buses Ad Hoc Networks

Approximately Uniform Random Sampling in Sensor Networks

APPM 4120/5120 Exam #2 Practice Solutions Spring 2015

CSE 417 Network Flows (pt 4) Min Cost Flows

Channel Dynamics Matter: Forwarding Node Set Selection in Cognitive Radio Networks

Energy-aware Scheduling for Frame-based Tasks on Heterogeneous Multiprocessor Platforms

Information Brokerage

On Performance Evaluation of Reliable Topology Control Algorithms in Mobile Ad Hoc Networks (Invited Paper)

Complex Contagions Models in Opportunistic Mobile Social Networks

References. Introduction. Publish/Subscribe paradigm. In a wireless sensor network, a node is often interested in some information, but

Achieving Fast BGP Reroute with Traffic Engineering Using Multiple Routing Planes

ISSN Vol.04,Issue.12, September-2016, Pages:

Reducing Power Consumption in Data Centers by Jointly Considering VM Placement and Flow Scheduling

Virtual Network Function Deployment in Tree-structured Networks

Basic Switch Organization

Evaluation of Seed Selection Strategies for Vehicle to Vehicle Epidemic Information Dissemination

Efficient Broadcasting in Ad Hoc Wireless Networks Using Directional Antennas

Advanced Topics on Wireless Ad Hoc Networks. Lecture 4: Energy-aware Routing Algorithms

BUBBLE RAP: Social-Based Forwarding in Delay-Tolerant Networks

Outline. (single-source) shortest path. (all-pairs) shortest path. minimum spanning tree. Dijkstra (Section 4.4) Bellman-Ford (Section 4.

Architectures for Distributed Systems

Viral Marketing and Outbreak Detection. Fang Jin Yao Zhang

Algorithmic Problems in Epidemiology

Graph Algorithms. Many problems in networks can be modeled as graph problems.

Broadcasting with Hard Deadlines in Wireless Multi-hop Networks Using Network Coding

CASER Protocol Using DCFN Mechanism in Wireless Sensor Network

Community-Aware Opportunistic Routing in Mobile Social Networks

Erasure Codes for Heterogeneous Networked Storage Systems

Efficient Synthesis of Production Schedules by Optimization of Timed Automata

CS555: Distributed Systems [Fall 2017] Dept. Of Computer Science, Colorado State University

An Approach Using Ant-like Agents. Alice Forehand Robert Pienta

Mathematics for Decision Making: An Introduction. Lecture 18

Artificial Intelligence

Computer Science & Engineering 423/823 Design and Analysis of Algorithms

Chalmers University of Technology. Network Models. April 11, Birgit Grohe

Network Models - Examples. Network Models. Definition and Terminology. The Minimum Spanning Tree (MST) Problem

BIPAR: BImodal Power-Aware Routing Protocol For Wireless Sensor Networks

Backbone Discovery In Thick Wireless Linear Sensor Netorks

SMITE: A Stochastic Compressive Data Collection. Sensor Networks

Adaptive Routing of QoS-Constrained Media Streams over Scalable Overlay Topologies

Community-Based Adaptive Buffer Management Strategy in Opportunistic Network

EBRP: Energy Band based Routing Protocol for Wireless Sensor Networks

Techniques for Content Subscription Anonymity with Distributed Brokers

Analysis of Hop Limit in Opportunistic Networks by

Energy-efficient routing algorithms for Wireless Sensor Networks

Load Balanced Link Reversal Routing in Mobile Wireless Ad Hoc Networks

Scalable Clustering of Signed Networks Using Balance Normalized Cut

CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks

Lecture 6: Vehicular Computing and Networking. Cristian Borcea Department of Computer Science NJIT

Local Search. (Textbook Chpt 4.8) Computer Science cpsc322, Lecture 14. May, 30, CPSC 322, Lecture 14 Slide 1

Web Structure Mining Community Detection and Evaluation

Implementation of Near Optimal Algorithm for Integrated Cellular and Ad-Hoc Multicast (ICAM)

Checkout Success Page v2.x Configuration for Magento 2

Project: IEEE P Working Group for Wireless Personal Area Networks (WPANs)

Why dynamic route? (1)

Load Balanced Short Path Routing in Wireless Networks Jie Gao, Stanford University Li Zhang, Hewlett-Packard Labs

Femto-Matching: Efficient Traffic Offloading in Heterogeneous Cellular Networks

Delay Tolerant Networks

Towards Green Cloud Computing: Demand Allocation and Pricing Policies for Cloud Service Brokerage Chenxi Qiu

Surveying Formal and Practical Approaches for Optimal Placement of Replicas on the Web

Interconnection Networks: Routing. Prof. Natalie Enright Jerger

R/Kademlia: Recursive and Topology-aware Overlay Routing

Transcription:

Mutually Exclusive Data Dissemination in the Mobile Publish/Subscribe System Ning Wang and Jie Wu Dept. of Computer and Info. Sciences Temple University

Road Map Introduction Problem and challenge Centralized solution Distributed solution Experiments Conclusion and future work

Introduction Data dissemination in Mobile Social Networks Distributed system Proximity-based communication, no cellular network Pub/sub paradigm Data are labeled with topics. Mobile user identifies its interest topics (e.g., Music or Computer Science ) to filter data. No specific destination set Application scenarios Service advertising, news spreading, environmental alert

Introduction Mutually exclusive data dissemination A user might be interested in multiple topics, receiving anyone and only one is enough. Potential applications: Digital ads (receiving too much will impact the user's experience.) Electric coupon system (only apply one at checkout) Carpool invitation (only choose one car)

Introduction A budget-constrained data dissemination Data amount under each topic is pre-determined. E.g., Facebook advertisement Real example in facebook advertisement

Problem and challenge Challenge How to assign data to a node with multiple interest topics leads to a unique challenge. Problem Given a pre-defined dissemination budget {n, n,, n k } in k topics, n i = N, and nodes interest information in a mobile social network, How to minimize the maximum data delivery delay with the mutually exclusive delivery requirement?

Problem and challenge Different data assignment leads to different forwarding delay. A motivational example u carries 4 data ( sports, music ) Assign u, sports or music u s buffer 5 min

Problem and challenge Different data assignment of u Assign u sports, Optimal delay: 5 min Assign u music, Optimal delay: 5 min

Centralized solution The network topology and contact frequency between nodes are known. Three sub-problems How many reachable nodes are within time, T? Shortest-path algorithm Is there a feasible assignment within time, T? Assignment problem (solved it this paper) How to find the minimal T? Binary search

Centralized solution How many reachable nodes are within time, T? Build the shortest-path tree from source node algorithm in the contact graph, the number in the edge is the average contact interval. min 9 min 0 min 8 min 4 min 4 min Build a table based on the reachable time Reach time (min) 0 4 4 9 8 Node ID 3 4 6 5

Centralized solution Assignment problem Is there a feasible assignment in T? An assignment with mutually exclusive delivery requirement between different types of user and different topics An example M M M 3 Number of mobile user in each type topic budget

Centralized solution Max-flow formulation Two sets of nodes Mobile user node set and the topic set Each type of user has connection to its interests The capacity from the user node to its interested topic is the amount of each type of mobile user. User node set Topic node set M M M 3

Centralized solution Max-flow formulation Add virtual source and sink The capacity from source to each type of user equals to each type of user to topic. The capacity from each topic node to sink equals to the budget in each topic. Number of user in each type topic Budget Src. M M Sink M 3

Centralized solution Max-flow formulation Correctness Mutually exclusive delivery requirement: each user can be assigned at most once. source link capacity = user node to topic node capacity Feasibility: A max-flow will consume all the capacity from the topic nodes to sink. Finishing all the assignment. Src. M M Sink M 3

Centralized solution Greedy algorithm Further reduce the algorithm complexity A simple idea If the data amount under a topic is close to the number of users which can consume it, this type of data should be assigned to the user first, otherwise, later, it might not be able to find enough user to consume it.

Centralized solution Most-unbalanced-first algorithm Three definition (from the topic side) Supply level of topic i: Remaining amount of data in topic i Consumption level of topic i: Remaining amount of mobile users which subscribe topic i Unbalance level of topic i: Difference between the consumption level and supply level of topic i 3 Topic Supply Consumption Unbalance 3 3 3 3 5 3 3

Centralized solution Node s interest amount matters An example The unbalance level for three topics is,,, respectively. Topic 3 is the most unbalanced. We can assign user in M or M 3 to consume data in topic 3 first. 3 M 3 M M 3 M 3 M 3 3 M 3 3 Assign M 3 first, M cannot be further assigned Assign M first M mobile user has only one choice, should have higher priority!

Centralized solution Greedy_plus algorithm Jointly consider from the user and topic (two sides) While we can do data assignment Find the set of mobile users with the fewest topic. The most unbalanced topic is assigned from the selected set of mobile users. Theorem: if each mobile user subscribes at most two topics, the user assignment amount of greedy_plus algorithm is maximum. Insight: maximize the the assignment feasibility from two sides

Distributed scenario The network topology and contact frequency between nodes are unknown. Two main challenges Forwarding ability estimation How many data copy and which type of data copy should be forwarded to the encountered node? Local data assignment For the encountered node with multiple interests, which type of data should the encountered node consume?

Distributed scenario Forwarding ability estimation Nodes record and exchange their interest information with neighbors to get the network estimation for different topics overtime. An illustration Two topics, M and in the network 4 different type of users { M,, M/S, }. If the encountering probability of a node to these four different type nodes are {0., 0., 0.5, 0.}, its forwarding ability vector is V = {0., 0., 0.5, 0.}. Interested topics Already received data or not? Forwarding ability vector { M,, M/S, } = {0., 0., 0.5, 0.}

Distributed scenario Local data assignment Encountering node with single interest Buffer M Amount 9 encountering Buffer M u Amount 0 0 u 0 V { M,, M/S, } = {0.7, 0., 0., 0.} Average forwarding ability {0.6, 0., 0., 0.} V { M,, M/S, } = {0.5, 0.3, 0., 0.} Direct data consumption Buffer M Amount 8 encountering Buffer M u u Amount 0 0 Data copy splitting (proportional to V/(V + V) and V/(V + V)): The expected delay for M /0.6 = 0 smaller than 8/0. = 40, therefore, we will regard the utility of M/S nodes as utility of nodes Buffer M Amount 3 7 encountering Buffer M u Amount 5 5 u

Distributed scenario Local data assignment Encountering node with multiple interest Buffer M Amount 9 encountering u Amount 0 0 u Direct data consumption (min-max volume algorithm) Consume the data with the largest amount. Buffer M M 0 0 V { M,, M/S, } = {0.7, 0., 0., 0.} V { M,, M/S, } = {0.5, 0.3, 0., 0.} Average forwarding ability {0.6, 0., 0., 0.} Buffer M Amount 9 encountering Buffer M u Amount 0 0 u M 0 Data copy splitting (proportional splitting): Buffer M Amount 3 6 encountering Buffer M u Amount 6 5 u M 0

Distributed scenario Local data assignment Encountering node with multiple interest Min-max speed algorithm Increase the data delivery delay in the slowest topic as much as possible. To minimize the maximum data delivery delay, the bottleneck is the lowest delivery delay of a topic. Walter-filling condition Theorem: if different types of mobile users are uniformly distributed in the network, a schedule which makes max ratio between data budget and the amount of interested users in any topic minimized, it is the optimal assignment.

Distributed scenario Local data assignment An example of the min-max speed algorithm Buffer M Amount 9 Direct data consumption: encountering u Amount 0 0 u The expected delay for M /0.6 = 0 smaller than 9/(0. + 0.) = 30, even we regard all M/S nodes s as nodes, therefore, we will consume data to increase its delivery speed. Buffer M Amount 8 encountering Buffer M Buffer M u Amount 0 0 u M 0 0 V { M,, M/S, } = {0.7, 0., 0., 0.} V { M,, M/S, } = {0.5, 0.3, 0., 0.} Average forwarding ability {0.6, 0., 0., 0.} M 0 Data copy splitting (proportional splitting): Buffer M Amount 3 7 encountering Buffer M u Amount 5 5 u M 0

Distributed scenario Forwarding ability estimation (an extension) Record whether a user is interested in a topic For example, if we have two topics, M and in total, the forwarding ability vector will be like {0.4, 0.6}. Advantage: reduce the forwarding utility vector attributes. Disadvantage: Do not have the estimation for nodes with multiple interests Positive estimation Meeting a user with n interests regards as meeting n users with single interest in utility calculation. Negative estimation Meeting a user with n interests regards as meeting /n users with single interest in utility calculation.

Experiments Trace setting: Synthetic trace 00 nodes, 60000 contacts Randomly contact, average contact frequency s Each node randomly has 6 different topics Average subscription number for a topic 0 to 40 Data budget, 0 to 80 INFOCOM 06 trace 78 nodes, 7657 contacts Questionnaire to get the user s interests with 35 different topics Average subscription number for a topic is Data budget, 0 to 40

Experiments Algorithm comparison: Centralized algorithm Max-flow: proposed max-flow data assignment Greedy_plus: proposed modified greedy algorithm Greedy: most-unbalanced-first algorithm Random: randomly assign data Distributed algorithm Min-max speed: proposed data assignment Min-max volume: same as min-max speed, instead of assigning the node with multiple data with data with the largest amount. Random: random assignment

Experiments Centralized environment Performance of centralized data assignment algorithms in synthetic trace Randomly generate users interests and data budget, compare the average data delivered number of user in time 00 seconds. amount of data (n) 70 60 50 40 30 Maxflow Greedy_plus Greedy Random 0 4 4.5 5 5.5 6 number of the topics (n) processing time (s) 0.8 0.6 0.4 0. Maxflow Greedy_plus Greedy Random 0 4 4.5 5 5.5 6 number of the topics (n) Max-flow algorithm achieves the best performance with max processing time. The Greedy algorithm improves performance with a low processing time.

Experiments Distributed environment Performance of distributed data assignment in synthetic trace The average delay (s) Given the total budget, random generation data budget in different topics 000 800 600 400 Min-max speed Min-max volume Random 00 0 5 30 35 40 The number of the data (n) average delay (s) Given the budget assignment, random generation node s interests 390 380 370 360 Min-max speed Min-max volume Random 350 0 0 0 30 40 mobile users with multiple interests The proposed min-max speed algorithm balances the data delivery speed in different topics, therefore, the maximum delay is minimized.

Experiments Distributed environment Performance of the different utility estimation in INFOCOM trace average delay (hour) 6 4 different topics are randomly selected from 35 topics Single source, random data budget in different topics. Compare the min-max speed algorithm in different utility estimation methods Global_plus Global_positive Golbal_negative Local 0 5 0 5 30 number of the data (n) The total data budget Global_plus: vector of each type of user + vector exchange Global_positive: vector of each type of topic + vector exchange Global_negative: vector of each type of topic + vector exchange Local: vector of each type of user The proposed global estimation is the most effective approach. The positive/negative estimation is better than local estimation.

Conclusions Proposed the Mutually exclusive data dissemination Unique data consumption selection problem for a node with multiple interest Centralized solution Max-flow formulation Distributed solution Local data assignment selection Water-filling condition Distributed utility estimation

Thanks! ning.wang@temple.edu jiewu@temple.edu