Needed Information to do Allocation

Similar documents
Estimating Costs of Path Expression Evaluation in Distributed Object Databases

Efficient Distributed File System (EDFS)

Cost-efficient deployment of distributed software services

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

A Proactive Non-Cooperative Game-theoretic Framework for Data Replication in Data Grids

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

Efficient Content Distribution in Wireless P2P Networks

Annales UMCS Informatica AI 1 (2003) UMCS. Designing of multichannel optical communication systems topologies criteria optimization

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Concurrent Apriori Data Mining Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Private Information Retrieval (PIR)

DUE to the recent popularization of hand-held mobile

Contact Duration Aware Data Replication in Delay Tolerant Networks

Routing in Degree-constrained FSO Mesh Networks

A HEURISTIC METHOD FOR RELIABILITY REDUNDANCY OPTIMIZATION OF FLOW NETWORKS

Algorithms for data warehouse design to enhance decision-making

MOBILE Cloud Computing (MCC) extends the capabilities

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

Oracle Database: SQL and PL/SQL Fundamentals Certification Course

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION

Repeater Insertion for Two-Terminal Nets in Three-Dimensional Integrated Circuits

Parallel matrix-vector multiplication

Parallelism for Nested Loops with Non-uniform and Flow Dependences

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

Attribute Allocation in Large Scale Sensor Networks

Virtual Machine Migration based on Trust Measurement of Computer Node

Real-time Fault-tolerant Scheduling Algorithm for Distributed Computing Systems

3. CR parameters and Multi-Objective Fitness Function

Chapter 1. Introduction

ARTICLE IN PRESS. Signal Processing: Image Communication

A Decentralized Lifetime Maximization Algorithm for Distributed Applications in Wireless Sensor Networks

An Optimal Algorithm for Prufer Codes *

4/11/17. Agenda. Princeton University Computer Science 217: Introduction to Programming Systems. Goals of this Lecture. Storage Management.

On the Security of a Digital Signature with Message Recovery Using Self-certified Public Key

Data-Aware Scheduling Strategy for Scientific Workflow Applications in IaaS Cloud Computing

MobileGrid: Capacity-aware Topology Control in Mobile Ad Hoc Networks

A Competitive Replica Placement Methodology for Ad Hoc Networks

MILP. LP: max cx ' MILP: some integer. ILP: x integer BLP: x 0,1. x 1. x 2 2 2, c ,

Minimum Cost Optimization of Multicast Wireless Networks with Network Coding

Delay Variation Optimized Traffic Allocation Based on Network Calculus for Multi-path Routing in Wireless Mesh Networks

Distributed Middlebox Placement Based on Potential Game

Copyright 1978, by the author(s). All rights reserved.

Support Vector Machines

Harmonic Placement: File System Support for Scalable Streaming of Layer Encoded Object

AGGREGATED MODELS TECHNIQUE FOR INTEGRATING PLANNING AND SCHEDULING OF PRODUCTION TASKS

MATHEMATICS FORM ONE SCHEME OF WORK 2004

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING

An efficient iterative source routing algorithm

Load-Balanced Anycast Routing

Efficient Video Coding with R-D Constrained Quadtree Segmentation

Classification Method in Integrated Information Network Using Vector Image Comparison

On the Network Partitioning of Large Urban Transportation Networks

A Semi-Distributed Axiomatic Game Theoretical Mechanism for Replicating Data Objects in Large Distributed Computing Systems

CLOUD computing has evolved as an important and

ATYPICAL SDN consists of a logical controller in the

Distributed Resource Scheduling in Grid Computing Using Fuzzy Approach

Pose, Posture, Formation and Contortion in Kinematic Systems

A Pure Nash Equilibrium Guaranteeing Game Theoretical Replica Allocation Method for Reducing Web Access Time

QOS AWARE HW/SW PARTITIONING ON RUN-TIME RECONFIGURABLE MULTIMEDIA PLATFORMS

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

TripS: Automated Multi-tiered Data Placement in a Geo-distributed Cloud Environment

Lecture 7 Real Time Task Scheduling. Forrest Brewer

Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybrid Clouds

Control strategies for network efficiency and resilience with route choice

Greedy Technique - Definition

Optimization Methods: Integer Programming Integer Linear Programming 1. Module 7 Lecture Notes 1. Integer Linear Programming

Closed form Solution for Scheduling Arbitrarily Divisible Load Model in Data Grid Applications: Multiple Sources

QoE-Driven Mobile Edge Caching Placement for Adaptive Video Streaming

Research of Dynamic Access to Cloud Database Based on Improved Pheromone Algorithm

LARRY SNYDER DEPT. OF INDUSTRIAL AND SYSTEMS ENGINEERING CENTER FOR VALUE CHAIN RESEARCH LEHIGH UNIVERSITY

Real-Time Guarantees. Traffic Characteristics. Flow Control

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

Hengming Zou and Farnam Jahanian. The University of Michigan. Ann Arbor, Michigan fzou,

LECTURE NOTES Duality Theory, Sensitivity Analysis, and Parametric Programming

Maintaining temporal validity of real-time data on non-continuously executing resources

(1) The control processes are too complex to analyze by conventional quantitative techniques.

Network Coding as a Dynamical System

Remote display of large raster images using JPEG2000 and the rectangular FishEye-View

A Time-driven Data Placement Strategy for a Scientific Workflow Combining Edge Computing and Cloud Computing

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

Design and Evaluation of a Window-Consistent Replication Service

Support Strong Consistency for Mobile Dynamic Contents Delivery Network

A Hybrid Genetic Algorithm for Routing Optimization in IP Networks Utilizing Bandwidth and Delay Metrics

CHAPTER 4 PARALLEL PREFIX ADDER

A Binarization Algorithm specialized on Document Images and Photos

Concurrent models of computation for embedded software

Local Quaternary Patterns and Feature Local Quaternary Patterns

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network

A Saturation Binary Neural Network for Crossbar Switching Problem

Decision Support for the Dynamic Reconfiguration of Machine Layout and Part Routing in Cellular Manufacturing

Solving two-person zero-sum game by Matlab

Design and Experimental Evaluation of a Cross-Layer Deadline-Based Joint Routing and Spectrum Allocation Algorithm

Consistency and Update in Mobile Overlay Networks. Zhou SU, Jiro KATTO, Yasuhiko YASUDA, and Yu CHEN, Members, IEEE

QoS-aware routing for heterogeneous layered unicast transmissions in wireless mesh networks with cooperative network coding

AN INDEXING METHOD FOR SUPPORTING SPATIAL QUERIES IN STRUCTURED PEER-TO-PEER SYSTEMS

Priority-Based Scheduling Algorithm for Downlink Traffics in IEEE Networks

Mathematical Modeling of Earthwork Optimization Problems

Music Information Retrieval Schemes in Peer-to-Peer Environments

Transcription:

Complexty n the Database Allocaton Desgn Must tae relatonshp between fragments nto account Cost of ntegrty enforcements Constrants on response-tme, storage, and processng capablty Needed Informaton to do Allocaton Database nformaton: tuple sze, cardnalty of fragment Applcaton nformaton: #updates/#retreval a query performs on a fragment Ste Informaton: storage and processng capabltes and cost of processng a unt of wor Networ Informaton: communcaton cost to transfer a bloc of data between stes and 1

No models developed to date can handle all the constrants Current models smplfy assumptons and wor wth some specfc stuatons ormulaton of DAP DAP can be formulated as an optmzaton problem Mn(Total Cost) Subect to response tme constrant Storage constrant Processng load constrant DAP s NP-complete and several heurstcs have been proposed 2

Q q Constrants Q: Set of all queres S: Set of all stes STC : Storage cost of fragment at S Executon tme constrant Executon tme of q <= maxmum response tme of q for all q n Q Storage constrant STC storage capacty at ste S, S Processng constrant processng load of q at ste S S processng capacty of S, S S Cost Computaton Decson varable x defned as x 1f fragment s stored at ste S = 0 otherwse Total Cost = query processng cost + storage cost Solve the optmzaton constrant for x 3

Cost Model Total cost=query processng cost + storage cost TOC = q Q QPC + S S STC STC sze( = USC sze( ) = card( Unt cost of storng data at S ) x )* length( ) Query Processng Cost QPC = PC + TC Computaton cost of q Transfer cost of q PC = AC + IE + CC access cost of q + ntegrty enforcement cost of q + concurrency control cost of q 4

LPC : Cost of processng one unt of wor at ste S RR : Number of read accesses a query q maes to a fragment UR : #of update accesses a query q maes to a fragment Access cost of query q AC = ( u UR + r RR ) x S S Assume cost of an update same as cost of retreval 1 u = 0 1 r = 0 f query q f query q LPC 0 otherwse retreves from 0 otherwse updates Transmsson Cost Model Update cost: Need to perform updates to all replcas; no large results sent bac S S TCU = u * x * g g, o( ) TC = TCU + TCR o( ), + u * x * S S Cost of update message to all replcas that are nvolved n q Cost of confrmaton message bac to g : communcaton cost per message between S and S 5

Retreval Cost Model TCR = S S sel ( )* length( ) mn ( r * x * go( ), + r * x * * g, o( ) ) fsze Cost of sendng a query Cost of sendng the results bac Pc the least cost ste among all stes wth the replcas g : communcaton cost per message between S and S f sze : #Bytes n a message length( ): #bytes n fragment Sel ( ): Selectvty actor of q on Heurstc Approaches Allocaton of Horzontal ragments Allocaton of Vertcal ragments (Materal not n the textboo) 6

: : : fragment ndex ste ndex applcaton ndex ALLOCATION Notatons f : r : u : n = requency of applcaton at ste Number of retreval references of applcaton to fragment. Number of update references of applcaton to fragment. r + u (Number of accesses of applcaton to fragment ragment r Ste u Applcaton /w freq. f Allocaton of Horzontal ragments (1) No replcaton: Best t Strategy The number of local references of R at ste s B = R s allocated at ste * such that B * s maxmum. f n Advantage: A fragment s allocated to a ste that needs t most. Dsadvantage: It dsregards the mutual effect of placng a fragment at a gven ste f a related fragment s also at that ste. 7

Allocaton of Horzontal ragments (2) All benefcal stes approach (replcaton) B = f r c ' f ' u Cost of retreval references Cost of update references from other stes R s allocated at all stes * such that B * > 0. When all B s are negatve, a sngle copy of R s placed at the ste such that B * s maxmum. Allocaton of Horzontal ragments (3) Another Replcaton Approach: d β(d ) The degree of redundancy of R The relablty and avalablty beneft of havng R fully replcated. The relablty and avalablty beneft when the fragment has d copes. 1 d 3 ( ) = (1 ) (1) = 0, (2) = β d 2 β β, β (3) =, 2 4 The beneft of ntroducng a new copy of R at ste : B = f r c f ' Same as All Benefcal Stes approach ' u + β ( d ) Also taes nto account the beneft of replcaton 8

Allocaton of Horzontal ragments (4) All Benefcal Stes Approach: 1. Determne the set of all stes where the beneft of allocatng one copy of the fragment s hgher than the cost. 2. Allocate a copy of the fragment to each ste n the set. Alternatvely: 1. Determne the soluton of the non-replcated problem. 2. Progressvely ntroduce replcated copes startng from the most benefcal; the process s termnated when no addtonal replcaton s benefcal. How about Heurstcs for Vertcal Allocaton? 9

SUMMARY Desgn of a dstrbuted DB conssts of four phases: Phase 1: Global schema desgn (same as n centralzed DB desgn) Phase 2: ragmentaton Horzontal ragmentaton Prmary: Determne a complete and mnmal set of predcates Derved: Use semon Vertcal ragmentaton Identfy fragments such that many applcatons can be executed usng ust one fragment. Phase 3: Allocaton The prmary goal s to mnmze the number of remote accesses. Phase 4: Physcal schema desgn (same as n centralzed DB desgn). 10