SOCIAL networking is among the fastest growing information

Similar documents
On the End-to-end Call Acceptance and the Possibility of Deterministic QoS Guarantees in Ad hoc Wireless Networks

Session 4.2. Switching planning. Switching/Routing planning

Matrix-Matrix Multiplication Using Systolic Array Architecture in Bluespec

Time Synchronization in WSN: A survey Vikram Singh, Satyendra Sharma, Dr. T. P. Sharma NIT Hamirpur, India

Semi-analytic Evaluation of Quality of Service Parameters in Multihop Networks

Research on Neural Network Model Based on Subtraction Clustering and Its Applications

Minimize Congestion for Random-Walks in Networks via Local Adaptive Congestion Control

Connectivity in Fuzzy Soft graph and its Complement

Bit-level Arithmetic Optimization for Carry-Save Additions

Microprocessors and Microsystems

A Novel Dynamic and Scalable Caching Algorithm of Proxy Server for Multimedia Objects

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT

Significance of Eigenvector Centrality for Routing in a Delay Tolerant Network

MobileGrid: Capacity-aware Topology Control in Mobile Ad Hoc Networks

ABHELSINKI UNIVERSITY OF TECHNOLOGY Networking Laboratory

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Efficient Distributed File System (EDFS)

RAP. Speed/RAP/CODA. Real-time Systems. Modeling the sensor networks. Real-time Systems. Modeling the sensor networks. Real-time systems:

Interval uncertain optimization of structures using Chebyshev meta-models

Problem Set 3 Solutions

Network Coding as a Dynamical System

Measurement and Calibration of High Accuracy Spherical Joints

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network

Simulation Based Analysis of FAST TCP using OMNET++

Color Texture Classification using Modified Local Binary Patterns based on Intensity and Color Information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Load Balancing for Hex-Cell Interconnection Network

Link Graph Analysis for Adult Images Classification

Performance Evaluation of TreeQ and LVQ Classifiers for Music Information Retrieval

Unsupervised Learning

Cluster ( Vehicle Example. Cluster analysis ( Terminology. Vehicle Clusters. Why cluster?

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

Private Information Retrieval (PIR)

Optimal shape and location of piezoelectric materials for topology optimization of flextensional actuators

An Optimal Algorithm for Prufer Codes *

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

Performance Evaluation of Information Retrieval Systems

A Fast Way to Produce Optimal Fixed-Depth Decision Trees

A Real-Time Detecting Algorithm for Tracking Community Structure of Dynamic Networks

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions

Scalable Parametric Runtime Monitoring

Wishing you all a Total Quality New Year!

Multilabel Classification with Meta-level Features

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Simulation of Electromagnetic Suspension System Based on the Finite Element Analysis

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Module Management Tool in Software Development Organizations

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

The Codesign Challenge

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

Efficient Routing Algorithms Combining History and Social Predictors in Mobile Social Networks

Dynamic Bandwidth Provisioning with Fairness and Revenue Considerations for Broadband Wireless Communication

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION

Design Level Performance Modeling of Component-based Applications. Yan Liu, Alan Fekete School of Information Technologies University of Sydney

Cognitive Radio Resource Management Using Multi-Agent Systems

ARTICLE IN PRESS. Signal Processing: Image Communication

TCP Performance over Current Cellular Access: A Comprehensive Analysis

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory

Efficient QoS Provisioning at the MAC Layer in Heterogeneous Wireless Sensor Networks

Analysis of Collaborative Distributed Admission Control in x Networks

A Topology-aware Random Walk

Hermite Splines in Lie Groups as Products of Geodesics

MULTIHOP wireless networks are a paradigm in wireless

A MPAA-Based Iterative Clustering Algorithm Augmented by Nearest Neighbors Search for Time-Series Data Streams

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

LOCAL BINARY PATTERNS AND ITS VARIANTS FOR FACE RECOGNITION

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Video Proxy System for a Large-scale VOD System (DINA)

with `ook-ahead for Broadcast WDM Networks TR May 14, 1996 Abstract

Constructing Minimum Connected Dominating Set: Algorithmic approach

IP mobility support is becoming very important as the

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

A Semi-Distributed Load Balancing Architecture and Algorithm for Heterogeneous Wireless Networks

Efficient Content Distribution in Wireless P2P Networks

On Maximizing the Lifetime of Delay-Sensitive Wireless Sensor Networks with Anycast

CS 534: Computer Vision Model Fitting

Improving Low Density Parity Check Codes Over the Erasure Channel. The Nelder Mead Downhill Simplex Method. Scott Stransky

DUE to the recent popularization of hand-held mobile

Mobility Based Routing Protocol with MAC Collision Improvement in Vehicular Ad Hoc Networks

Pattern Classification: An Improvement Using Combination of VQ and PCA Based Techniques

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

Virtual Machine Migration based on Trust Measurement of Computer Node

TAR based shape features in unconstrained handwritten digit recognition

Routing in Delay Tolerant Networks Comprising Heterogeneous Node Populations

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments

5 The Primal-Dual Method

Concurrent Apriori Data Mining Algorithms

Routing in Degree-constrained FSO Mesh Networks

A Load-balancing and Energy-aware Clustering Algorithm in Wireless Ad-hoc Networks

Using Particle Swarm Optimization for Enhancing the Hierarchical Cell Relay Routing Protocol

Bottom-Up Fuzzy Partitioning in Fuzzy Decision Trees

Progressive scan conversion based on edge-dependent interpolation using fuzzy logic

Fuzzy Modeling for Multi-Label Text Classification Supported by Classification Algorithms

A Clustering Algorithm for Chinese Adjectives and Nouns 1

FUZZY SEGMENTATION IN IMAGE PROCESSING

A Semi-parametric Approach for Analyzing Longitudinal Measurements with Non-ignorable Missingness Using Regression Spline

Steganalysis of DCT-Embedding Based Adaptive Steganography and YASS

Programming in Fortran 90 : 2017/2018

AADL : about scheduling analysis

Transcription:

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBTED SYSTEMS, VOL. 26, NO. 5, MAY 2015 1301 Effent Data Query n Intermttently-Conneted Moble Ad Ho Soal Networks Yang Lu, Student Member, IEEE, Yanyan Han, Zhpeng Yang, Student Member, IEEE, and Hongy Wu, Member, IEEE Abstrat Ths work addresses the problem of how to enable effent data query n a Moble Ad-ho SOal Network (MASON), formed by moble users who share smlar nterests and onnet wth one another by explotng Bluetooth and/or WF onnetons. The data query n MASONs faes several unque hallenges nludng opportunst lnk onnetvty, autonomous omputng and storage, and unknown or naurate data provders. Our goal s to determne an optmal transmsson strategy that supports the desred query rate wthn a delay budget and at the same tme mnmzes the total ommunaton ost. To ths end, we propose a entralzed optmzaton model that offers useful theoret nsghts and develop a dstrbuted data query protool for pratal applatons. To demonstrate the feasblty and effeny of the proposed sheme and to gan useful empral nsghts, we arry out a testbed experment by usng 25 off-the-shelf Dell Streak tablets for a perod of 15 days. Moreover, extensve smulatons are arred out to learn the performane trend under varous network settngs, whh are not pratal to buld and evaluate n laboratores. Index Terms Data query, moble ad ho soal networks, entralzed optmzaton model, dstrbuted protool, testbed experment, smulatons Ç 1 INTRODCTION SOCIAL networkng s among the fastest growng nformaton tehnologes, as evdened by the popularty of suh onlne soal network stes as Faebook, Twtter, LnkedIn and Google+ that ontnue to experene explosve growth. In ontrast to the popular web-based onlne soal networks that rely on the Internet nfrastruture (nludng ellular systems) for ommunaton, ths paper fouses on Moble Ad-ho SOal Network (MASON), an autonomous soal network formed by moble users who share smlar nterests and onnet wth one another by explotng the Bluetooth and/or WF onnetons of ther moble phones or portable tablets. A MASON s often reated for a loal ommunty where the partpants have frequent nteratons, e.g., people lvng n an urban neghborhood, students studyng n a ollege, or toursts vstng an arhaeologal ste. Its sze vares from a large group (for nstane, all the students n a unversty) to a small luster (suh as members of a shool band). It may serve a ommunty over a long span of years, or be temporary to last for as short as a few hours only (e.g., for soal networkng among a group of toursts). 1.1 System Overvew An ndvdual MASON s nomparable wth onlne soal networks n terms of the populaton of partpants, the The authors are wth the Center for Advaned Computer Studes, nversty of Lousana at Lafayette, Lafayette, LA70504. E-mal: {yxl0782, wu}@as.lousana.edu, yanyan.ull@gmal.om, zhpengyang@hotmal.om. Manusrpt reeved 23 Jan. 2014; revsed 21 Apr. 2014; aepted 23 Apr. 2014. Date of publaton 28 Apr. 2014; date of urrent verson 8 Apr. 2015. Reommended for aeptane by J. Lloret. For nformaton on obtanng reprnts of ths artle, please send e-mal to: reprnts@eee.org, and referene the Dgtal Objet Identfer below. Dgtal Objet Identfer no. 10.1109/TPDS.2014.2320922 number of soal onnetons and the amount of soal meda. However MASONs gan sgnfant value by servng as a supplement and augment to onlne soal networks and by effetvely supportng loal ommunty-based ad-ho soal networkng. For example, t helps dsover and update soal lnks that are not aptured by onlne soal networks and allows a user to query loalzed data suh as loal knowledge, ontats and expertse, surroundng news and photos, or other nformaton that people usually annot or do not bother to report to onlne webstes but may temporarly keep on ther portable deves or generate upon a request. Ths work addresses the problem of how to enable effent data query n MASONs. Consder a MASON wth N nodes. Eah node an be a query ssuer or a data provder, or more ommonly at n both roles for dfferent query requests. The queres fall nto C ategores. Eah node has ertan expertse to answer a query. Let E denote the expertse matrx, where E ndates the expertse of Node to answer a query n Category,.e., the probablty that Node an provde a satsfatory answer to a query n Category. A query s reated by a query ssuer. It s delvered by the network toward the nodes that an suessfully provde an answer (.e., data provders). If a data provder reeves the query, t sends the query reply to the query ssuer. Our goal s to determne an optmal transmsson strategy that supports the desred query rate and at the same tme mnmzes the total ommunaton ost. The formal problem formulaton wll be gven n Seton 2.1. 1.2 nque Challenges The use of free, short-range rado s hghly desred for a dversty of MASON applatons. At the same tme, however, t results n a dstntve ommunaton paradgm haraterzed by ntermttent lnk onnetvty and 1045-9219 ß 2014 IEEE. Personal use s permtted, but republaton/redstrbuton requres IEEE permsson. See http://www.eee.org/publatons_standards/publatons/rghts/ndex.html for more nformaton.

1302 IEEE TRANSACTIONS ON PARALLEL AND DISTRIBTED SYSTEMS, VOL. 26, NO. 5, MAY 2015 autonomous omputng and storage. More spefally, the data query n MASONs faes the followng unque hallenges. (1) Opportunst lnk onnetvty. The onnetvty of MASONs s very low and ntermttent, formng a sparse network where a node s onneted to other nodes only oasonally. Ths s n a sharp ontrast to onlne soal networks, where users always have relable Internet onnetons. The data delvery delay n MASONs s potentally long, due to the loose onnetvty among nodes. Fortunately, suh delay, thought not desrable, s usually tolerable by many data query applatons n MASONs that are often delay nsenstve. (2) Autonomous omputng and storage. Central servers are employed to store and proess user data n onlne soal networks. Suh servers are, however, no longer avalable n MASONs, where ndvdual portable deves must perform dstrbuted data storage and omputaton. It s well known that portable deves have lmted omputng, storage and energy apaty. Nevertheless, suh onstrants are partularly dsadvantageous to MASONs, beause a node must proess data n a dstrbuted manner and store them loally for a muh longer tme before sendng them to another node, due to ntermttent onnetvty. (3) nknown or naurate expertse. When a node ssues a query, t s often unaware of the nodes that have suffent expertse to answer the query, sne the ost s prohbtvely hgh to onstrut a struture to ndex data and data provders lke P2P networks. It s obvously neffent ether to frequently flood queres, whh are expensve and often onsdered spams. Worse yet, n prate, a moble node hardly knows ts probablty to answer queres n eah ategory presely. It may ntally lam ts expertse based on the moble user s soal roles and avalable resoures. But suh ntally lamed expertse s often naurate. The above haratersts make the data query n MASONs a very unque, nterestng, and hallengng problem, renderng not only onventonal data query shemes for well-onneted omputer systems but also dstrbuted solutons for moble ad ho networks [1], [2], [3], [4], [5], [6] and moble (onlne) soal networks [7] napplable here. Only a handful of works have onsdered data query n opportunst network settngs. For example, Osmoss [8] employs an epdem approah to perform fle lookup n poket swthed networks. Whle t s smple and relable, the ommunaton overhead s very hgh due to the floodng-lke epdem routng. DelQue [9] ams to query geoloaton-based nformaton. It assumes eah node moves aordng to a gven shedule and adopts a sem-markov model to predt nodal meetng events, n order to dentfy a proper relay to arry the query to the target loaton and brng the nterested nformaton bak to the soure. Yang et al. [10] proposes a dstrbuted database query framework based on several ommunaton and omputng tehnques spefally talored for RFID networks. Nether of them effently support data queres n MASONs. On the other hand, although several routng algorthms have been proposed for opportunst networks by explotng soal relatons among moble users to aheve effent routng [11], [12], [13], [14], [15], [16], they are developed for gener ommunatons, wthout onsderaton of the unque needs and onstrants n data query. Among them, Zhu et al. [16] s the most reent one, where explots a dstrbuted ommunty parttonng algorthm to dvde a DTN nto smaller ommuntes. For ntra-ommunty ommunaton, a utlty funton onvolutng soal smlarty and soal entralty wth a deay fator s used to hoose relay nodes. For nterommunty ommunaton, the nodes movng frequently aross ommuntes are hosen as relays to arry data to destnaton effently. Although Zhu et al. [16] ntrodues a soluton for DTNs whh leverages soal propertes and moblty haratersts of users, t s not truly applable for the data query n MASONs, beause t does not apture the nherent features for the query delvery n MASONs, hene the nodes are not helpful for eah other by makng the orret desons to arry queres to satsfatory nodes. 1.3 Our Contrbuton In ths work, we frst formulate the optmzaton problem for the data query n MASONs. Then, we develop a statedagram-based analyt model to derve the ommunaton overhead and query rate as the funtons of transmsson strategy. A branh and bound algorthm s adopted to dsover the optmal transmsson strategy that mnmzes the total ommunaton ost whle ahevng a target query rate. The optmzaton model s entralzed, thus unpratal for real world mplementaton. However, t offers useful nsghts for the development of a dstrbuted data query protool. The proposed protool s based on two key tehnques. Frst, as motvated by the analyss, t employs reahable expertse as the routng metr to gude the transmsson of query requests. Seond, t explots the redundany n query transmsson, whh an effetvely mprove the query delvery rate n prate f t s properly ontrolled. To demonstrate the feasblty and effeny of the proposed data query protool and to gan useful empral nsghts, we have arred out a testbed experment usng off-the-shelf Dell Streak tablets. Our experment nvolves 25 volunteers and lasts for 15 days. Moreover, extensve smulatons (based on odes extrated from our prototype mplementaton) are arred out to learn the performane trend under varous network settngs, whh are not pratal to buld and evaluate n laboratores. The remander of ths paper s organzed as follows. Seton 2 ntrodues the proposed data query sheme, nludng a theoret optmzaton model and a pratal protool. Seton 3 presents a testbed experment and results. Seton 4 dsusses large-sale smulatons under real-world moblty traes and power-law moblty model. Fnally, Seton 5 onludes the paper. 2 PROPOSED DATA QERY SCHEME Whle MASONs offer nterestng opportuntes to support ad ho data query, ts apaty s unsurprsngly low ompared to many other data networks due to ts extremely lmted and nondetermnst ommunaton opportuntes. To learn the essene of optmal query delvery and to understand the performane upper bound, we frst arry out a prelmnary analyt study of the data query n MASONs before movng nto the detaled protool desgn and

LI ET AL.: EFFICIENT DATA QERY IN INTERMITTENTLY-CONNECTED MOBILE AD HOC SOCIAL NETWORKS 1303 evaluaton. Based on the nsghts ganed from our analyss, a dstrbuted data query protool s proposed, amng to enable hghly effent ad ho query under pratal MASON settngs. 2.1 Prelmnary Analyss In ths seton, we frst formulate the problem, and then ntrodue an analyt model to obtan an optmal soluton, followed by dsussons on numer results and nsghts for developng a pratal dstrbuted data query protool. 2.1.1 Assumptons and Problem Formulaton The ommunaton n MASONs depends on nodal meetng events. The tme nterval between two onseutve meetng events between two nodes, e.g., Nodes and j, s denoted by a random varable T j. We onsder gener nodal moblty n ths work wthout assumng any spef dstrbuton of T j. 1 For analyt tratablty, we make the followng two assumptons n data transmsson. Assumpton 1. Gven the ntermttent network settng, we assume that the ommunaton apaty s ruled by nodal meetng opportuntes. In other words, we assume the ommunaton delay s domnated by nodal meetng ntervals. When two nodes meet, the hannel bandwdth s suffent for them to exhange data pakets wth neglgble delay. Assumpton 2. Multple opes of a query request may exst n the network, but we assume a node reeves and forwards the same request only one. For a query request, we defne a bnary transmsson matrx, X, where X j ¼ 1 ndates that Node sends the request to Node j, f the former holds the request when t meets the latter. If X j ¼ 0, then Node does not send the request to Node j even f t arres the request upon meetng Node j. Note that X j and X j an be dfferent. The transmsson of query reply follows standard oneto-one routng wth known soure and destnaton. Its expeted transmsson delay has been well studed [11], [12], [13], [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33], [34], [35], [36], [37]. Therefore we fous on the delvery of query requests only n ths analyss (but a omplete protool s desgned and mplemented as to be dsussed n later setons). Our goal s to determne the optmal X suh that the total ommunaton ost, defned as CðXÞ, s mnmzed, whle at the same tme the probablty to delver the query to a data provder wthn a gven delay budget d (denoted as P ðxþ) reahes the desred threshold, b. More spefally, the problem s formulated as follows: Objetve : S:t: : mnðcðxþþ; P ðxþ b: nder the assumpton of suffent hannel bandwdth, dfferent query requests do not ontend for ommunaton 1. It s out the sope of ths researh to model user mobltes. The readers are referred to the lteratures [17], [18], [19] that have reported extensve studes on moblty modelng. (1) Fg. 1. The state dagram shows all possble transmssons of a query n a network wth four nodes. Wthout loss of generalty, we let Node 1 be the query ssuer. Sne we fous on a sngle query, the ategory of the query (.e., ) s not shown n the state dagram. resoure and thus an be treated ndependently. To ths end, we next smply analyze the delvery of a sngle query request to derve CðXÞ and PðXÞ, and to optmze X. 2.1.2 Analyt and Optmzaton Model Our analyss s based on a state dagram. Eah state s a vetor wth N elements,.e., S ¼½s 1 ;s 2 ;...;s N Š, where s ¼ 1 sgnfes Node has not reeved the query request, s ¼ 0 ndates Node s arryng the query request but has never transmtted t to another node, and s ¼ 1 means Node has reeved the query request and forwarded t one. Fg. 1 llustrates an example of the state dagram for a network wth four nodes. Sne we fous on a sngle query n ths analyss, the ategory of the query (.e., ) s not shown n the dagram. Wthout loss of generalty, we let Node 1 be the query ssuer. Thus the ntal state s S ¼½0; 1; 1;...; 1Š (see Fg. 1). The state transts, as depted by an arrow n the dagram, when the query request s transmtted from one node to another, e.g., from Node to Node j. Suh a state transton s denoted by L j. Sne we have assumed that a node transmts and reeves the same request only one, L j s possble only f s ¼ 0 and s j ¼ 1,.e., Node s arryng the query request but has never transmtted t whle Node j has not reeved the query request. Note that ths s the neessary ondton to enable a transmsson, but Node does not have to transmt the query to Node j f suh a transmsson s deemed neffent. As a matter of fat, t s the goal of our analyss and optmzaton to determne the most effent transmssons. The state dagram (nludng the states and transtons) forms a tree struture. The ntal state s the root of the tree. As shown n Fg. 1, a state may appear multple tmes due to dfferent transton paths to reah t. We treat suh dental states separately (as f they are dfferent states). A state an be n three status,.e., atve, negatvely termnated,orpostvely termnated, as defned below. Defnton 1. An atve state s a state that allows further transtons. The query request wll be further transmtted under an atve state. An atve state must have a 0 element,.e., s ¼ 0, whh s also alled an atve element. Defnton 2. Node s an atve element of an atve state f and only f s ¼ 0. Fg. 1 depts atve states only. Eah atve state may reman atve or be termnated n two possble ways, resultng n a negatvely termnated state or a postvely termnated state.

1304 IEEE TRANSACTIONS ON PARALLEL AND DISTRIBTED SYSTEMS, VOL. 26, NO. 5, MAY 2015 Defnton 3. An atve state beomes a postvely termnated state f the query request s answered by the atve element. pon reevng a query request n Category, Node has a probablty of E to answer the query. If the query s answered wthn ts delay budget, t wll no longer be forwarded t to another node. Ths essentally termnates the query proess gven the assumpton that a node forwards the same request only one. Otherwse, wth a probablty of 1 E, Node may reman atve and arry the request that s ready to be transmtted to another node. Defnton 4. An atve state beomes a negatvely termnated state f the query request s dropped by the atve element. A query request s dropped f ts delay budget expres. As a result, the state nludes no atve element and the query proess s termnated wthout a suessful reply. Fg. 1 depts all possble transmssons of a query n a network. Our goal s not to exeute all suh transmssons whh lead to hgh ommunaton overhead, but nstead to perform seletve transmssons to mnmze the ommunaton ost. To ths end, we have ntrodued the transmsson matrx, X, where X j ¼ 1 sgnfes that Node wll send the request to Node j, when the former meets the latter wth s ¼ 0 and s j ¼ 1; or the transmsson wll not be performed otherwse. Wth X as the varable to be optmzed, we now analyze the probablty to reah eah state. Sne the state dagram forms a tree struture, there s a unque path from the root (.e., the ntal state) to a gven state S ¼½s 1 ;s 2 ;...;s N Š. One agan, dental states may appear n the tree and they are treated as separate states. Let L S denote the path from the root to S, whh onssts of a sequene of transmssons fl S 1 2 S ;L S 2 3 S ;...;L S K 1 K S g, where f1 S, S 2 ;..., S K g are the set of nodes nvolved n the transmssons n sequene. For example, the path from the root to State S 14 ½1; 0; 1; 1Š nludes suh lnks as L 13, L 34, and L 42. Eah lnk ntrodues a transmsson delay. The total delay to reah the state s P K 1 ¼1 T S þ1. S We are nterested n postvely termnated states. A state S an be postvely termnated f and only f the followng three ondtons are satsfed. Frst, the transmsson matrx s onfgured suh that X S þ1 S ¼ 1, 81 K 1, formng a vald path from the root to State S. Seond, the total delay to reah the state s no greater than the delay budget d,.e., P K 1 ¼1 T S d. Note that d an be very large under S þ1 pratal settngs. As a matter of fat, MASON users are often not anxous to reeve prompt query reples and thus may tolerate long delay. In ths ase, the delay dstrbutons n the above formula an be redued to smple probabltes as to be dsussed n the next seton. Thrd, the last node on the path (.e., Node K S ) an answer the query whle others along the path (.e., Nodes S, 81 K 1) annot. Therefore the probablty that State S s postvely termnated s P S ðxþ ¼ YK 1 X S þ1 S ¼1 E S K YK 1 ¼1 ( K 1 X Pr T S þ1 S d ¼1 1 E S ) : (2) Sne the states are unorrelated, the total probablty to reah a postvely termnated state,.e., the probablty to delver the query request to a node that an answer the query s PðXÞ ¼ P S P SðXÞ. Wth proper manpulaton, we arrve at the followng formula: P ðxþ ¼ XN 1 X N X N X N Y 1 X kj k jþ1 k 6¼k 0 ;k 2 ;...;k 1 j¼0 ¼1 k 1 6¼k 0 k 2 6¼k 0 ;k 1 ( ) Y 1 X 1 Ek 1 E kj k jþ1 Pr T kj k jþ1 d : j¼0 Note that, the total number of add operatons n the above equaton equals the number of states, whh we have found to be N S ¼ P N 1 ¼1 A N 1, where A N 1 s the permutaton of out of N 1 elements,.e., A N 1 ¼!=ðN 1 Þ! The ommunaton ost n a wreless network s often proportonal to the number of transmssons. The more the transmssons, the hgher the onsumpton of energy and storage spae. It s out the sope of ths work to defne the best ost funton. We smply let CðXÞ be the total number of transmssons nvolved n the delvery of a query request. Let DðSÞ denote the depth of State S n the tree. Obvously, DðSÞ represents the number of transmssons (.e., the ost) needed to transt from the ntal state to S. Note that, the ost of DðSÞ s nurred as long as State S s reahed, no matter whether the query an be answered or not. Thus CðXÞ s gven below: CðXÞ ¼ X S j¼0 (3) DðSÞ P SðXÞ E : (4) K S P ðxþ and CðXÞ are obtaned va Eqs. (3) and (4), and then plugged nto Eq. (1). A branh and bound algorthm [38] s adopted here to dsover the optmal transmsson matrx X, n order to mnmze CðXÞ whle ensurng PðXÞ b. 2.1.3 Numer Results and Insghts Fg. 2 shows the numer results of the optmzaton model based on Haggle trae [39] (to be detaled n Seton 1). b s set to be 0:5. nderverysmalld, many queres annot be delvered wthn the delay budget, resultng n low delvery rate. When d < 30, the delvery rate s n fat lower than the expeted threshold (.e., b). By followng the optmzaton model, the nodes do not make unneessary attempts to transmt them, and aordngly the overhead s low. Wth a longer delay budget, more queres an reah the orrespondng data provders and thus the delvery rate naturally nreases. At the same tme, delay and overhead nrease too beause more transmssons wth longer delay are aggressvely attempted. However, when d s suffently large, many optons of routng paths beome avalable (that all satsfy the delay budget), allowng the optmzaton model to hoose the one wth the lowest overhead (.e., the one that nvolves the least transmssons). Ths explans why overhead dereases under large d. Suh a hoe often sarfes delay, as long as t does not exeed the allowed budget. Therefore, the average delay monotonally nreases wth d.

LI ET AL.: EFFICIENT DATA QERY IN INTERMITTENTLY-CONNECTED MOBILE AD HOC SOCIAL NETWORKS 1305 Fg. 2. Numer results of optmzaton. Whle the above optmzaton model an be mplemented by eah ndvdual node, t s ntrnsally entralzed (requrng global nformaton), and thus unpratal for real world mplementaton. However, t offers useful nsghts for the development of a dstrbuted data query protool. In partular, the essene of the optmzaton s to frst learn the probabltes to delver the query along dfferent paths to dfferent nodes, and then dede the optmal paths by strkng the balane between ost and delvery probablty. Ths nsght stmulates us to develop a dstrbuted sheme to learn reahable expertse, whh an effetvely gude the transmsson of a query to the node(s) wth suffent expertse to answer t. 2.2 Protool Desgn In ths seton, we ntrodue a dstrbuted protool for the data query n MASONs. It s based on two key tehnques. Frst, as motvated by our analyt and optmzaton model dsussed above, t employs reahable expertse as the routng metr to gude the transmsson of query requests. Seond, t explots the redundany n query transmsson. Redundany s not onsdered n the analyss due to ts ntratablty, but an effetvely mprove the query delvery rate n prate f t s properly ontrolled. 2.2.1 Routng Metr The delvery of query depends on a routng metr, whh s updated routnely and mantaned separately from the routng algorthm tself. We frst ntrodue suh a metr,.e., reahable expertse, that gudes query transmsson. As ntrodued n Seton 2.1.2, eah node has ertan expertse to answer a query. We let E denote the expertse of Node to answer a query n Category. We have assumed E s known n our analyss. In prate, however, t s nontrval to properly defne the expertse, beause a moble node hardly knows presely ts probablty to answer queres n eah ategory. It may ntally lam ts expertse based on the moble user s soal roles (e.g., professons), nterests, and avalable resoures. But suh ntally lamed expertse s often naurate. Therefore, after ntalzaton, the expertse should be updated aordng to the feedbaks from other nodes, espeally the query ssuers. In ths researh, we adopt the exponentally weghted movng average (EWMA) to mantan and update expertse. More spefally, we have E ð1 mþ E þ mf ; (5) where 0 m 1 s a onstant weght to keep partal memory of hstor status, ½E Š s the expertse before t s updated, and F s the feedbak sore for queres that Node has answered n Category. Varous feedbak ratng shemes an be adopted to determne F. In ths researh, we employ a smple sheme, whh supports quk onvergene as to be dsussed n Seton 3 and shown n Fg. 3h. The expertse ndates the apablty of a node to answer queres, but tself s nsuffent to gude query transmsson. For example, a node may have hgh expertse, but s not reahable by the query ssuer and thus beomes less helpful to answer the query. To ths end, we defne k-hop reahable expertse. As dsussed earler, the MASON users an often tolerate long delay. Thus, the delay random varables n Eq. (3),.e., T S þ1, S may be redued to smple nodal ontat probabltes. Let p j denote the probablty that Nodes and j meet. The mantenane and update of p j have been dsussed extensvely n DTN networks [20], [21], [24], [25]. The k-hop reahable expertse s alulated as follows: E ðkþ ¼1 Y j2f 1 pj E j ðk 1Þ ; (6) where F s the set of nodes that Node meets frequently. E ðkþ ndates the probablty that Node an delver the query wthn k hops to a node that an answer the query. Clearly, E ð0þ ¼E and E ð1þ ¼1 Q j2f ð1 p j Ej Þ. Node ollets fej ðk 1Þj8 j 2 F and 0 <kng whenever t meets other nodes, and perodally makes an update on E ðkþ aordng to Eq. (6). Based on the k-hop reahable expertse, we defne the aggregated reahable expertse, AE ¼ 1 YN k¼1 1 E ðkþ ; (7) whh ndates the overall ablty of Node to help a query n Category to be answered. It serves as the routng metr to gude query delvery as to be dsussed next. 2.2.2 Routng wth Dynam Redundany Control Based on the routng metr,.e., reahable expertse, we now ntrodue the routng algorthm. The delvery of a query s guded by the aggregated reahable expertse, where the query s generally forwarded from the node wth a lower aggregated reahable expertse to the node wth a hgher

1306 IEEE TRANSACTIONS ON PARALLEL AND DISTRIBTED SYSTEMS, VOL. 26, NO. 5, MAY 2015 Fg. 3. Expermental results.

LI ET AL.: EFFICIENT DATA QERY IN INTERMITTENTLY-CONNECTED MOBILE AD HOC SOCIAL NETWORKS 1307 one. In ontrast to the onventonal store-and-forward data transmsson where a sngle opy of data s transmtted aross the network, redundany s often employed n opportunst networks. Whle redundany s not addressed n the analyss due to ts ntratablty, t s mportant n prate to aheve the desred query delvery rate. Generally speakng, the hgher the redundany, the hgher probablty the query s answered suessfully. However, redundany must be properly ontrolled as exessve redundany may exhaust network apaty and thus degrade the performane. A nave approah s to reate a fxed amount of redundany for eah query. For example, a predetermned number of opes of the query an be reated and dstrbuted to other nodes n the network. Ths approah, however, s often neffent, beause the effetveness of redundany depends on the nodes that reeve, arry and forward the query. In an extreme ase, all redundant opes of the query may be transmtted and arred by the nodes that have lttle hane to meet the node(s) that an answer the query and thus beome neffetve. As a matter of fat, the effetveness of redundany hghly depends on the reahable expertse of the nodes that arry the redundant opes. To ths end, we ntrodue a parameter to dynamally reflet the effetve redundany. More spefally, the proposed routng algorthm wth dynam redundany ontrol s outlned below. Let R q denote the redundany of Query q as observed by Node. The parameter s the estmated probablty that at least one opy of the query s answered by any other nodes n the network. It s mantaned and updated n a dstrbuted way. Assume Query q s n Category. R q s ntalzed to zero when the query s reated and subsequently updated durng ts transmsson. Sne ommunaton opportunty s low, transmsson s often between two nodes only. If more than two nodes are wthn ommunaton range, we assume an underlyng medum aess ontrol protool that randomly selets one node as the sender. Therefore we onsder a general senaro where Node meets Node j n the followng dsussons. Frst, Nodes and j exhange ther k-hop reahable expertse and update ther aggregated reahable expertse aordng to Eqs. (6) and (7). Then, Node fethes the query wth the lowest redundany n ts queue. The queue holds the queres that Node reates or reeves from other nodes. It s sorted aordng to the redundany level suh that the query wth the lowest redundany (denoted as Query q n Category ) s at the head of the queue. If Node j has a hgh expertse for queres n Category (.e., Ej a where a s a predefned onstant), t dretly answers the query by reatng and sendng a query reply to the query ssuer. Sne the destnaton of the query reply (.e., the query ssuer) s known, t an be delvered va any exstng routng protool for opportunst networks [11], [12], [13], [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33], [34], [35], [40], [41], [42], [43]. We adopt the sheme proposed n [24] n our mplementaton. Note that the answer of Node j s not always satsfatory. It has a probablty of Ej to be satsfed by the query ssuer. Therefore, Node removes the query from ts queue wth a probablty of Ej. Otherwse, f Node j annot answer the query dretly (.e., Ej < a), Node heks the redundany of Query q. If R q b where b s the desred query delvery probablty, t mples that hgh enough redundany has been reated for thequery.thusnoatonwllbetaken.nodesmply holds the query untl t meets a node that an dretly answer the query or the query must be dropped due to queue overflow. An overflow happens when a new query s added nto the queue whh s already full. In ths ase, thequerywththehghestredundany(.e.,attheendof the queue) s dropped. If R q < b, the query should be further propagated. But t s transmtted to Node j only f AE <AEj (.e., the latter has a better hane to delver the query). Ths transmsson reates two opes of the query, eah sharng partal responsblty to get the answer. The redundanes for the two opes are assgned as follows: R q j 1 1 R q 1 E ð0þ ; (8) R q 1 1 R q 1 E j ð0þ : (9) In both formulas, ð1 R q Þ denotes the estmated probablty that none of other nodes (exept Nodes and j) an get the answer for Query q, and ð1 E ð0þþ and ð1 E jð0þþ gve the probablty that Node and Node j annot dretly answer the query. Therefore the updated R q (or R q j ) ndates the probablty that at least one opy of the query an be answered by other nodes exept Node (or Node j). The transmsson of Query q ontnues upon future ommunaton opportuntes untl, as dsussed earler, the query s answered by a node or dropped due to queue overflow. pon reevng the query reply, the query ssuer evaluates t and onstruts a feedbak paket, whh s delvered to the node that answers the query, agan, va an exstng routng protool for opportunst networks. The latter then updates ts expertse aordng to Eq. (5). 3 PROTOTYPE AND EXPERIMENT To demonstrate the feasblty and effeny of the proposed data query protool and to gan useful empral nsghts, we have arred out a testbed experment usng off-the-shelf Dell Streak tablets. In ths seton, we frst ntrodue our testbed setup and then present expermental results. 3.1 Prototype and Testbed Setup We have developed a prototype system by usng Dell Streak 5 and 7 tablets that are of the smartphone/tablet PC hybrd operatng on Androd 2:2. The ommunaton between the tablets s enabled va Bluetooth. A Streak tablet has 16 GB nternal storage adequate to keep large amounts of expermental data. We have mplemented our proposed data query protool by usng standard Androd APIs, losely followng the desrpton n the prevous seton. In order to save power, eah node ntates neghbor dsovery one every a random nterval (between 5 to 10 mnutes). Our experment nvolves twenty fve volunteers nludng faulty members and students. They are marked as Node 0 to 24. In the experment, we defne three ategores

1308 IEEE TRANSACTIONS ON PARALLEL AND DISTRIBTED SYSTEMS, VOL. 26, NO. 5, MAY 2015 TABLE 1 Results under Experment of queres,.e., hstory, sene and arts (whh are named Category 0, 1 and 2, respetvely). Eah partpant has a lamed ntal expertse for answerng queres n eah ategory and generates twelve queres per day n randomly hosen ategores. Note that the ntal expertse s not aurate. The true expertse s arbtrarly set by lettng a small set of nodes to have an expertse of 1 to answer queres n eah ategory. More spefally, Nodes 3, 13, and 19 an answer queres n Category 0; Nodes 4, 14, and 20 an answer queres n Category 2; and Node 1 an answer queres n any ategory. Other nodes ntalze expertse to be ð0 1Þ randomly and learn and update ther aggregated expertse durng the experment. The experment had run for ffteen days, startng from Monday 16:00 p.m. n the frst week to Monday 17:00 p.m n the thrd week. We ompare several varatons of the proposed sheme and related shemes. In the followng dsussons, zerohop, one-hop, two-hop, and three-hop stand for the proposed sheme that allows up to zero hop, one hop, two hops, and three hops relayng, respetvely; No Feedbak means the proposed sheme (wth two-hop relay) but wthout the feedbak mehansm to retfy expertse; Floodng s the smple floodng sheme for query delvery; Gossp [44] onsders multple ategores and assgns the queres n eah ategory a transmsson probablty for data transmsson; Wllngness [45] s a sheme that a query s delvered based on wllngness, whh s the degree to whh a node atvely engages n tryng to re-transmt a query; Spray and Wat [33] s onsdered as a baselne opportunst delvery protool; Soal-based [16] s a soal-based routng sheme. We are prmarly nterested n two parameters: (1) the suess query rate,.e., the rato of suessfully answered queres to the total generated queres, and (2) the query delay whh s the perod from the tme when a node generates the query to the tme when t reeves the answer. 3.2 Expermental Results Table 1 shows the overall performane of dfferent shemes. The two-hop sheme aheves the hghest query rate. It s not surprsng to fnd the one-hop sheme wth a lower query rate sne a node merely tres to answer the queres va up to one hop relay. The zero-hop sheme has the lowest query rate as a query an be answered only when the query ssuer meets the data provder dretly. On the other hand, t seems ant-ntutve that allowng a longer relay path (e.g., the three-hop sheme) leads to a negatve gan. But ths s reasonable beause exessve redundany s reated when too many nodes are nvolved n relayng queres, subsequently overloadng the network and resultng n degraded performane. For a smlar reason, the Floodng sheme has a even lower query rate gven ts extremely hgh redundany. nder the No Feedbak sheme, the naurate expertse s not retfed, resultng n msleadng reahable expertse and thus lower query rate. Gossp [44] onsders multple ategores and assgns the queres n eah ategory a transmsson probablty for data transmsson. However, asagosspngapproah,ts data transmsson s randomzed. Therefore a query s often answered and arred by nodes wth nsuffent expertse, thus ndung many non-satsfatory reples. Wllngness [45] s a sheme that a query s delvered based on wllngness, whh s the degree to whh a node atvely engages n tryng to re-transmt a query. The wllngness does not reflet the expertse based on whh a node reples queres, therefore the nodes are not helpful for eah other to arry queres to nodes wth suffent expertse. We also ompare wth Spray and Wat [33] whh s onsdered as a baselne opportunst delvery protool. [33] fxes the number of opes for eah query whh lmts the queres to go through orret paths to be repled by nodes wth suffent expertse, makng query rate even lower. Soal-based [16] explots a dstrbuted ommunty parttonng algorthm to dvde a DTN nto smaller ommuntes. For ntra-ommunty ommunaton, a utlty funton onvolutng soal smlarty and soal entralty wth a deay fator s used to hoose relay nodes. For nter-ommunty ommunaton, the nodes movng frequently aross ommuntes are hosen as relays to arry data to destnaton effently. Although Zhu et al. [16] ntrodues a soluton for DTNs whh leverages soal propertes and moblty haratersts of users, t s not truly applable for the data query n MASONs, beause when a node ssues a query, t s often unaware of the nodes that have suffent expertse to answer the query. The ost s prohbtvely hgh to onstrut a struture to ndex data and data provders lke P2P networks. It s obvously neffent ether to frequently flood queres, whh are expensve and often onsdered spams. It s not surprsng that our proposed sheme has better performane than Soal-based, sne Soal-based does not apture the nherent features for the query delvery n MASONs, hene the nodes are not helpful for eah

LI ET AL.: EFFICIENT DATA QERY IN INTERMITTENTLY-CONNECTED MOBILE AD HOC SOCIAL NETWORKS 1309 other by makng the orret desons to arry queres to satsfatory nodes. In general, when more hops are allowed n relayng queres, the overhead nreases, beause a query s more aggressvely propagated. As a result, more opes of the query are transmtted n the network and the query ssuer often reeves more reples. At the same tme, sne a query may potentally go through a longer path to reah the data provder, the average delay also nreases. Compared wth the two-hop sheme, No Feedbak has longer delay and more number of reples beause norret expertse often leads the queres to wrong routes. More than 96 perent queres are answered suessfully. The unanswered queres are all generated durng the fnal hours of the experment. Fg. 3a llustrates the number of queres answered on eah day of the experment. As an be seen, the results vary among days, refletng the movng patterns of the partpants. More queres are answered durng weekdays than weekends due to the lower nteratve atvtes of students and faulty on Saturday and Sunday. In fat, many queres annot be answered durng weekends and have to wat untl Monday of the next week. Ths explans the peak on the seond Monday. It s worth mentonng that the frst and the thrd Monday are not the whole days, hene the number of answered queres s less than the seond Monday. The atvty pattern s also evdened by the delay varaton shown n Fg. 3b. Queres generated n weekends have longer delay ompared wth those n weekdays. The delay of queres generated on Frday s also hgh beause no lasses are sheduled on Frday afternoon and many offes are losed after 1:00 p.m.. Fg. 3b also shows the total traff n the network, whh follows a smlar pattern of nodal atvtes. Fgs. 3 and 3d further zoom n to show the results n eah hour of a day. The data are averaged over 15 days. Both the network traff and the number of answered queres are hgh durng daytme and low at nght, whh agan shows the query heavly depends on the atvty of students and faulty who arry the tablets. Lkewse, we expet lower delay durng daytme. However, the results are just the opposte (as depted n Fg. 3d). Suh ant-ntutve observaton s due to the queres from a few nodes, whh experened extremely long delay that domnates the overall performane. In fat, most queres generated durng daytme ndeed have short delay. But a set of nodes (nludng Nodes 3-12) rely on asnglenode(node2) to arry ther queres n Category 2 to orrespondng data provders. Suh delvery happens around 9:00 a.m. daly. The queres generated after 9:00 a.m. must wat untl the next day, thus ndung unusually long lateny that sgnfantly elevates the overall average. If we exlude suh queres (see the lower purple bars n Fg. 3d), the average delay beomes muh lower, and the daytme delay s generally shorter than that durng nght. The average delay and traff of dfferent nodes are llustrated n Fg. 3e. In general, delay and traff vary among dfferent nodes due to the randomness n nodal moblty and query generaton and transmsson. Node 0 has extremely poor onnetvty (ether dretly or ndretly) to the nodes wth hgh expertse, resultng n very long delay ompared wth other nodes. Contrarly, sne Node 1 s able to answer all the queres of three ategores, t has the mnmum delay. In addton, Node 2 has the heavest traff load beause t frequently meets other nodes, whle Node 0 arres the least traff due to few nteratons between t and other nodes. The delay dstrbuton s shown n Fg. 3f. More than 65 perent queres are answered wthn two hours. The queres wth longer delays are ether generated by Nodes 3-12 as dsussed above, or generated durng weekends and thus annot be repled untl the next Monday. Fg. 3g llustrates the dstrbuton of path length. All queres are answered wthn three hops. Fg. 3h shows the onvergene of the lamed expertse to the ground truth. A node s hosen as an example, whle smlar results are observed n other nodes as well. As we an see, the feedbak mehansm effetvely adjusts the node s expertse, gradually approahng to the true value wthn a few hours. 4 SIMLATION RESLTS Besdes the experment dsussed above, extensve smulatons are arred out to learn the performane trend of the proposed data query algorthm under varous network settngs, whh are not pratal to evaluate by usng lab equpments. The smulaton odes are extrated from our prototype mplementaton, and the smulaton results are obtaned under real-world traes and powerlaw moblty model. Eah node mantans a maxmum queueszeof1;000. 4.1 Smulaton under Haggle Trae We have evaluated our proposed sheme under several real-world traes avalable at CRAWDAD [46]. Table 2 shows the results based on Haggle trae [39], whh nludes 98 partpants arryng small deves (Motes) durng Infoom 2006. We run the smulaton for a perod of 342; 916 seonds (or about 4 days). Eah node generates 1:08 queres per hour. The queres fall nto fve ategores, and eah ategory s assoated wth three expert nodes that an provde satsfatory reples. Smlar to the results n Table 1, Table 2 shows the results under Haggle trae. The dstrbutons of query rate and delay are llustrated n Fg. 4. About 90 perent of nodes an aheve a query rate of 80 perent or hgher under the proposed sheme. At the same tme, more than half of the queres are answered wthn an hour. 4.2 Smulaton under Power-Law Moblty Model Besdes the above results based on Haggle trae, we have arred out smulatons under power-law moblty model, whh enables onvenent study of performane trend wth the varaton of several network parameters. More spefally, we smulate an area that s parttoned nto a grd of 20 20 ells. Eah node s assoated wth a randomly-hosen home ell, n whh t ntally resdes. In a tme slot, t may move n one of the four dretons,.e., up, down, left and rght, or stay n ts home ell. Let P ðxþ denote the probablty for Node to be at Cell x. P ðxþ ¼k ð 1 d ðxþ Þs where k s

1310 IEEE TRANSACTIONS ON PARALLEL AND DISTRIBTED SYSTEMS, VOL. 26, NO. 5, MAY 2015 TABLE 2 Results under Haggle Trae a onstant, s s the exponent of the power-law dstrbuton and d ðxþ denotes the dstane between Cell x and Node s home ell. The default network parameters nlude a network of 100 nodes, a s of 2, 10 ategores of queres, fve experts per ategory, and a generaton rate of 0:02 queres per tme unt per node. In an opportunst network, the ommunaton apaty hghly depends on the meetng opportuntes among moble nodes. As shown n Fg. 5a, query reply rate grows wth the nrease of the network densty, beause the nodes have more opportuntes to meet eah other and exhange ther queres. Fg. 5b depts the mpat of the power-law fator s. When s s large, the nodes tend to stay n ther home ells,.e., have low moblty, resultng n small probabltes to meet eah other and onsequently small network apaty. Therefore the query reply rate s low. When s s extremely large, the query reply rate may approah as low as zero. On the other hand, when s s small, the nodes have unform moblty,.e., smlar probabltes to vst all ells and aordngly smlar routng metr (.e., k-hop reahable expertse), renderng routng neffetve. nder the smulated network settng, s ¼ 2 results n the best performane. The mpat of traff load s llustrated n Fg. 5. Whle the query reply rate keeps stable at the begnnng under all shemes, t starts to drop when the generaton rate exeeds 0:03. In general, wth a hgher query generaton rate, the overall traff load nreases, resultng n more frequent queue overflow and onsequently lower query reply rate. As dsussed n Seton 2, a threshold b s employed for dynam redundany ontrol. A larger b allows more redundany to be reated, amng to aheve a hgher query reply rate. However, f b s too large, the exessve redundany degrades the utlzaton of ommunaton and storage resoures and lowers the overall performane aordngly (see the Fg. 5d). Fg. 5e shows that a hgher query reply rate s aheved wth the nrease of queue sze, beause more queres and reples an be kept n the queue untl they are delvered. The number of experts for eah ategory s also studed n ths work. As shown n Fg. 5f, more experts for a ategory result n hgher query reply rate beause more nodes an answer the queres n ths ategory. 5 CONCLSION We have studed the problem of data query n a Moble Adho SOal Network, amng to determne an optmal transmsson strategy that supports the desred query rate wthn Fg. 4. Dstrbuton under Haggle trae. Fg. 5. Performane trend under power-law moblty model.

LI ET AL.: EFFICIENT DATA QERY IN INTERMITTENTLY-CONNECTED MOBILE AD HOC SOCIAL NETWORKS 1311 a delay budget and at the same tme mnmzes the total ommunaton ost. We have proposed a entralzed optmzaton model that offers useful theoret nsghts and developed a dstrbuted data query protool for pratal applatons. To demonstrate the feasblty and effeny of the proposed sheme and to gan useful empral nsghts, we have arred out a testbed experment by usng 25 off-the-shelf Dell Streak tablets for a perod of 15 days. Moreover, we have run extensve smulatons to learn the performane trend under varous network settngs, whh are not pratal to buld and evaluate n laboratores. REFERENCES [1] B. Yang and A. Hurson, A ontent-aware multmeda aessng model n ad ho networks, n Pro. 11th Int. Conf. Parallel Dstrb. Syst., 2005, pp. 613 619. [2] C. Avn and C. Brto, Effent and robust query proessng n dynam envronments usng random walk tehnques, n Pro. 3rd Int. Symp. Inf. Proess. Sens. Netw., 2004, pp. 277 286. [3] N. Chang and M. Lu, Controlled floodng searh n a large network, IEEE/ACM Trans. Netw., vol. 15, no. 2, pp. 436 449, Apr. 2007. [4] T. Hara, Effetve repla alloaton n ad ho networks for mprovng data aessblty, n Pro. IEEE 20th Annu. Jont Conf. Comput. Commun., 2001, pp. 1568 1576. [5] W. W. Terpstra, J. Kangasharju, C. Leng, and A. P. Buhmann, BubbleStorm: Reslent, probablst, and exhaustve peer-topeer searh, n Pro. ACM Conf. Appl., Tehnol., Arht., Protools Comput. Commun., 2007, pp. 49 60. [6] A. N. Shferaw, V.-M. Sutur, and L. Brune, Interest-awareness for nformaton sharng n MANETs, n Pro. 11th Int. Conf. Moble Data Manage., 2010, pp. 342 347. [7] Z. L, H. Shen, G. Lu, and J. L, SOS: A dstrbuted moble QA system based on soal networks, n Pro. IEEE 32nd Int. Conf. Dstrb. Comput. Syst., 2012, pp. 627 636. [8] P. Hu, J. Leguay, J. Crowroft, J. Sott, T. Fredman, and V. Conan, Osmoss n poket swthed networks, n Pro. 1st Int. Conf. Commun. Netw. Chna, 2006, pp. 1 6. [9] J. Fan, J. Chen, Y. Du, P. Wang, and Y. Sun, DelQue: A soally aware delegaton query sheme n delay-tolerant networks, IEEE Trans. Vehular Tehnol., vol. 60, no. 5, pp. 2181 2193, Jun. 2011. [10] Z. Yang, T. Nng, and H. Wu, Dstrbuted data query n ntermttently onneted passve RFID networks, IEEE Trans. Parallel Dstrb. Syst., vol. 24, no. 10, pp. 1972 1982, Ot. 2013. [11] A. Me, G. Morabto, P. Sant, and J. Stefa, Soal-aware stateless forwardng n poket swthed networks, n Pro. IEEE Conf. Comput. Commun., 2011, pp. 251 255. [12] W. Gao and G. Cao, ser-entr data dssemnaton n dsrupton tolerant networks, n Pro. IEEE Conf. Comput. Commun., 2011, pp. 3119 3127. [13] Z. L and H. Shen, SEDM: Explotng soal networks n utltybased dstrbuted routng for DTNs, IEEE Trans. Comput., vol. 62, no. 1, pp. 83 97, Jan. 2011. [14] E. M. Daly and M. Haahr. (2007). Soal network analyss for routng n dsonneted delay-tolerant MANETs Pro. 8th ACM Int. Symp. Moble Ad Ho Netw. Comput., pp. 32 40. [Onlne]. Avalable: http://do.am.org/10.1145/1288107.1288113 [15] P. Hu, J. Crowroft, and E. Yonek. (2009)., Bubble Rap: Soalbased forwardng n delay tolerant networks n Pro. 9th ACM Int. Symp. Moble Ad Ho Netw. Comput., pp. 241 250 [Onlne]. Avalable: http://do.am.org/10.1145/1374618.1374652 [16] K. Zhu, W. L, and X. Fu, SMART: A soal and moble aware routng strategy for dsrupton tolerant networks, IEEE Trans. Vehular Teh., vol. PP, no. 99, pp. 1 1, 2014. [17] A. Chantreau, P. Hu, J. Crowroft, C. Dot, R. Gass, and J. Sott, Impat of human moblty on the desgn of opportunst forwardngalgorthms, npro.ieeeconf.comput.commun.,2006,pp.1 13. [18] M. Km, D. Kotz, and S. Km, Extratng a moblty model from real user traes, n Pro. IEEE Conf. Comput. Commun., 2006, pp. 1 13. [19] T. Spyropoulos, K. Psouns, and C. Raghavendra, Performane analyss of moblty-asssted routng, n Pro. 7th ACM Int. Symp. Moble Ad Ho Netw. Comput., 2006, pp. 49 60. [20] K. Fall, A delay-tolerant network arhteture for hallenged nternets, n Pro. ACM Conf. Appl., Tehnol., Arht., Protools Comput. Commun., 2003, pp. 27 34. [21] A. Balasubramanan, B. N. Levne, and A. Venkataraman, DTN routng as a resoure alloaton problem, n Pro. ACM Conf. Appl., Tehnol., Arht., Protools Comput. Commun., 2007, pp. 373 384. [22] D. Gunawardena, T. Karaganns, A. Proutere, E. Santos-Neto, and M. Vojnov, Soop: Deentralzed and opportunst multastng of nformaton streams, n Pro. 11th ACM Int. Conf. Moble Comput. Netw., 2011, pp. 169 180. [23] T. Spyropoulos, K. Psouns, and C. S. Raghavendra, Effent routng n ntermttently onneted moble networks: The sngleopy ase, IEEE/ACM Trans. Netw., vol. 16, no. 1, pp. 77 90, Feb. 2008. [24] P. Juang, H. Ok, Y. Wang, M. Martonos, L.-S. Peh, and D. Rubensten, Energy-effent omputng for wldlfe trakng: Desgn tradeoffs and early experenes wth Zebranet, n Pro. Annu. Conf. Arht. Support Program Languages Operatng Syst., 2002, pp. 96 107. [25] T. Small and Z. J. Haas, The shared wreless nfostaton model A new ad ho networkng paradgm, n Pro. 4th ACM Int. Symp. Moble Ad Ho Netw. Comput., 2003, pp. 233 244. [26] B. Burns, O. B. Bran, and N. Levne, MV routng and apaty buldng n dsrupton tolerant networks, n Pro. IEEE Conf. Comput. Commun., 2005, pp. 398 408. [27] N. Banerjee, M. D. Corner, D. Towsley, and B. N. Levne, Relays, base statons, and meshes: Enhanng moble networks wth nfrastruture, n Pro. 14th ACM Int. Conf. Moble Comput. Netw, 2008, pp. 81 91. [28] P. Hu, A. Chantreau, J. Sott, R. Gass, J. Crowroft, and C. Dot, Poket swthed networks and human moblty n onferene envronments, n Pro. ACM SIGCOMM Workshop Delay-Tolerant Netw., 2005, pp. 244 251. [29] S. B. Esenman, E. Mluzzo, N. D. Lane, R. A. Peterson, G.-S. Ahn, and A. T. Campbell, The BkeNet moble sensng system for ylst experene mappng, n Pro. 5th Int. Conf. Embedded Netw. Sens. Syst., 2007, pp. 87 101. [30] P. Klasnja, B. L. Harrson, L. LeGrand, A. LaMara, J. Froehlh, and S. E. Hudson, sng wearable sensors and real tme nferene to understand human reall of routne atvtes, n Pro. 10th Int. Conf. bqutous Comput., 2008, pp. 154 163. [31] E. Mluzzo, N. D. Lane, K. Fodor, R. A. Peterson, H. Lu, M. Musoles, S. B. Esenman, X. Zheng, and A. T. Campbell, Sensng meets moble soal networks: The desgn, mplementaton and evaluaton of the CeneMe applaton, n Pro. 6th ACM Conf. Embedded Netw. Sens. Syst., 2008, pp. 337 350. [32] P. Mohan, V. Padmanabhan, and R. Ramjee, Nerell: Rh montorng of road and traff ondtons usng moble smartphones, n Pro. 6th ACM Conf. Embedded Netw. Sens. Syst., 2008, pp. 323 336. [33] T. Spyropoulos, K. Psouns, and C. S. Raghavendra, Spray and wat: An effent routng sheme for ntermttently onneted moble networks, n Pro. ACM SIGCOMM Workshop Delay-Tolerant Netw., 2005, pp. 252 259. [34] A. Lndgren, A. Dora, and O. Shelen, Probablst routng n ntermttently onneted networks, ACM SIGMOBILE Moble Comput. Commun. Rev., vol. 7, no. 3, pp. 19 20, 2003. [35] S. Jan, M. Demmer, R. Patra, and K. Fall, sng redundany to ope wth falures n a delay tolerant network, n Pro. ACM Conf. Appl., Tehnol., Arht., Protools Comput. Commun., 2005, pp. 109 120. [36] A. A. Hanbal, P. Nan, and E. Altman, Performane of ad ho networks wth two hop relay routng and lmted paket lfetme, Performane Eval., vol. 65, no. 6, pp. 463 483, 2006. [37] X. Te, A. Venkataraman, and A. Balasubramanan, R3: Robust replaton routng n wreless networks wth dverse onnetvty haratersts, n Pro. 17th Annu. Int. Conf. Moble Comput. Netw., 2011, pp. 181 192. [38] A. H. Land and A. G. Dog, An automat method of solvng dsrete programmng problems, Eonometra, vol. 28, no. 3, pp. 497 520, 1960. [39] J. Sott, R. Gass, J. Crowroft, P. Hu, C. Dot, and A. Chantreau. (2009). CRAWDAD trae ambrdge/haggle/mote/nfoom2006 (v. 2009-05-29), [Onlne]. Avalable: http://rawdad.s. dartmouth.edu/

1312 IEEE TRANSACTIONS ON PARALLEL AND DISTRIBTED SYSTEMS, VOL. 26, NO. 5, MAY 2015 [40] Y. Wang and H. Wu, DFT-MSN: The delay fault tolerant moble sensor network for pervasve nformaton gatherng, n Pro. IEEE Conf. Comput. Commun., 2006, pp. 1 12. [41] Y. Wang and H. Wu, Delay/fault-tolerant moble sensor network (DFT-MSN): A new paradgm for pervasve nformaton gatherng, IEEE Trans. Moble Comput., vol. 6, no. 9, pp. 1021 1034, Sep. 2007. [42] Y. Wang, H. Wu, F. Ln, and N.-F. Tzeng, Cross-layer protool desgn and optmzaton for delay/fault-tolerant moble sensor networks, IEEE J. Sel. Areas Commun., vol. 26, no. 5, pp. 809 819, Jun. 2008. [43] H. Dang and H. Wu, Clusterng and luster-based routng protool for delay-tolerant moble networks, IEEE Trans. Wreless Commun., vol. 9, no. 6, pp. 1874 1881, Jun. 2010. [44] Z. J. Haas, J. Y. Halpern, and L. L, Gossp-based ad ho routng, IEEE/ACM Trans. Netw., vol. 14, no. 3, pp. 479 491, Jun. 2006. [45] K. A. Harras and K. C. Almeroth, Controlled floodng n dsonneted sparse moble networks, Wreless Commun. Moble Comput., vol. 9, no. 1, pp. 21 33, 2009. [46] [Onlne]. Avalable: http://www.rawdad.org/, 2014. Yang Lu (S 11) reeved the BE degree n eletral engneerng and ts automaton, and the ME degree n ontrol theory and ontrol engneerng from Harbn Engneerng nversty, Chna, n 2008 and 2010, respetvely. He has been workng toward the PhD degree n omputer engneerng at the Center for Advaned Computer Studes, nversty of Lousana at Lafayette, sne 2011. Hs urrent researh nterests nlude delay-tolerant networks, rado frequeny dentfaton (RFID) systems, and wreless sensor networks. He s a student member of the IEEE. Yanyan Han reeved the BS degree n eletron nformaton engneerng from Shandong nversty, Jnan, Chna, n 2008, and has been workng toward the PhD degree n omputer sene at the Center for Advaned Computer Studes (CACS), nversty of Lousana at Lafayette (L Lafayette), sne 2011. Her urrent researh nterests nlude moble ad ho networks and delay-tolerant networks. Zhpeng Yang (S 07) reeved the BS and MS degrees n omputer sene from Tanjn nversty, Chna, n 2001 and 2004, respetvely. He has been workng toward the PhD degree n omputer sene at the Center for Advaned Computer Studes (CACS), nversty of Lousana at Lafayette (L Lafayette), sne 2007. From 2004 to 2006, he was a software engneer n Nortel and Luent Chna. Hs urrent researh nterests nlude delay-tolerant networks, rado frequeny dentfaton (RFID) systems, and wreless sensor networks. He s a student member of the IEEE. Hongy Wu (M 02) reeved the BS degree n sentf nstruments from Zhejang nversty, Hangzhou, Chna, n 1996, and the MS degree n eletral engneerng and the PhD degree n omputer sene from the State nversty of New York (SNY) at Buffalo n 2000 and 2002, respetvely. Sne then, he has been wth the Center for Advaned Computer Studes (CACS), nversty of Lousana at Lafayette (L Lafayette), where he s urrently a professor and hold the Alfred and Helen Lamson Endowed professorshp n omputer sene. Hs researh nterests nlude delay-tolerant networks, rado frequeny dentfaton (RFID) systems, wreless sensor networks, and ntegrated heterogeneous wreless systems. He reeved the S Natonal Sene Foundaton (NSF) CAREER Award n 2004 and the L Lafayette Dstngushed Professor Award n 2011. He s a member of the IEEE. " For more nformaton on ths or any other omputng top, please vst our Dgtal Lbrary at www.omputer.org/publatons/dlb.