Efficient Distributed File System (EDFS)

Similar documents
Simulation Based Analysis of FAST TCP using OMNET++

Quantifying Performance Models

Goals and Approach Type of Resources Allocation Models Shared Non-shared Not in this Lecture In this Lecture

Real-Time Guarantees. Traffic Characteristics. Flow Control

Load-Balanced Anycast Routing

Efficient Load-Balanced IP Routing Scheme Based on Shortest Paths in Hose Model. Eiji Oki May 28, 2009 The University of Electro-Communications

ELEC 377 Operating Systems. Week 6 Class 3

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory

Load Balancing for Hex-Cell Interconnection Network

Scheduling and queue management. DigiComm II

Application of VCG in Replica Placement Strategy of Cloud Storage

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

CS 268: Lecture 8 Router Support for Congestion Control

Internet Traffic Managers

Needed Information to do Allocation

Positive Semi-definite Programming Localization in Wireless Sensor Networks

Cost-efficient deployment of distributed software services

A New Transaction Processing Model Based on Optimistic Concurrency Control

The Impact of Delayed Acknowledgement on E-TCP Performance In Wireless networks

Cache Performance 3/28/17. Agenda. Cache Abstraction and Metrics. Direct-Mapped Cache: Placement and Access

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Gateway Algorithm for Fair Bandwidth Sharing

Why Congestion Control. Congestion Control and Active Queue Management. TCP Congestion Control Behavior. Generic TCP CC Behavior: Additive Increase

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION

Virtual Machine Migration based on Trust Measurement of Computer Node

TECHNICAL REPORT AN OPTIMAL DISTRIBUTED PROTOCOL FOR FAST CONVERGENCE TO MAXMIN RATE ALLOCATION. Jordi Ros and Wei K Tsai

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION

Support Strong Consistency for Mobile Dynamic Contents Delivery Network

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

High-Boost Mesh Filtering for 3-D Shape Enhancement

Constructing Minimum Connected Dominating Set: Algorithmic approach

Self-Tuning, Bandwidth-Aware Monitoring for Dynamic Data Streams

A Distributed Three-hop Routing Protocol to Increase the Capacity of Hybrid Wireless Networks

Dynamic Bandwidth Provisioning with Fairness and Revenue Considerations for Broadband Wireless Communication

Parallelism for Nested Loops with Non-uniform and Flow Dependences

CS 534: Computer Vision Model Fitting

Efficient Content Distribution in Wireless P2P Networks

MobileGrid: Capacity-aware Topology Control in Mobile Ad Hoc Networks

Pricing Network Resources for Adaptive Applications in a Differentiated Services Network

Research of Dynamic Access to Cloud Database Based on Improved Pheromone Algorithm

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

Adaptive Power-Aware Prefetch in Wireless Networks

RAP. Speed/RAP/CODA. Real-time Systems. Modeling the sensor networks. Real-time Systems. Modeling the sensor networks. Real-time systems:

Design and Analysis of Algorithms

Evaluation of TCP Variants and Bandwidth on Demand over Next Generation Satellite Network

A Model Based on Multi-agent for Dynamic Bandwidth Allocation in Networks Guang LU, Jian-Wen QI

Neural Network Control for TCP Network Congestion

Algorithms for Routing and Centralized Scheduling to Provide QoS in IEEE Mesh Networks

A KIND OF ROUTING MODEL IN PEER-TO-PEER NETWORK BASED ON SUCCESSFUL ACCESSING RATE

Routing in Degree-constrained FSO Mesh Networks

ATYPICAL SDN consists of a logical controller in the

A Distributed Dynamic Bandwidth Allocation Algorithm in EPON

ARTICLE IN PRESS. Signal Processing: Image Communication

Comparisons of Packet Scheduling Algorithms for Fair Service among Connections on the Internet

Self-Tuning, Bandwidth-Aware Monitoring for Dynamic Data Streams

Performance Evaluation of Information Retrieval Systems

A Semi-Distributed Load Balancing Architecture and Algorithm for Heterogeneous Wireless Networks

CE 221 Data Structures and Algorithms

IJCTA Nov-Dec 2016 Available

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) , Fax: (370-5) ,

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments

Installation and User Guide. Digidim Remote Control (303) Product description. Switching Lights On/Off using Digidim 303 Remote

Scalable Download Protocols

Smoothing Spline ANOVA for variable screening

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Analysis of Collaborative Distributed Admission Control in x Networks

ENERGY EFFICIENT ROUTING PROTOCOLS FOR WIRELESS AD HOC NETWORKS A SURVEY

Video Proxy System for a Large-scale VOD System (DINA)

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

Programming in Fortran 90 : 2017/2018

Module Management Tool in Software Development Organizations

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network

FAST TCP: Motivation, Architecture, Algorithms, Performance

Distributed Resource Scheduling in Grid Computing Using Fuzzy Approach

A Fair MAC Algorithm with Dynamic Priority for e WLANs

OPTIMAL CONFIGURATION FOR NODES IN MIXED CELLULAR AND MOBILE AD HOC NETWORK FOR INET

K-means and Hierarchical Clustering

IP mobility support is becoming very important as the

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

State of the Art in Differentiated

Modeling TCP Throughput: A Simple Model and its Empirical Validation

Cluster Analysis of Electrical Behavior

A Genetic Algorithm Based Dynamic Load Balancing Scheme for Heterogeneous Distributed Systems

IP Camera Configuration Software Instruction Manual

On the Exact Analysis of Bluetooth Scheduling Algorithms

Shared Running Buffer Based Proxy Caching of Streaming Sessions

High level vs Low Level. What is a Computer Program? What does gcc do for you? Program = Instructions + Data. Basic Computer Organization

A Decentralized Lifetime Maximization Algorithm for Distributed Applications in Wireless Sensor Networks

Energy-Efficient Workload Placement in Enterprise Datacenters

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 4, Issue 6, December 2014

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe

Wishing you all a Total Quality New Year!

APPLICATION OF PREDICTION-BASED PARTICLE FILTERS FOR TELEOPERATIONS OVER THE INTERNET

AADL : about scheduling analysis

Real-time interactive applications

DEAR: A DEVICE AND ENERGY AWARE ROUTING PROTOCOL FOR MOBILE AD HOC NETWORKS

Delay Variation Optimized Traffic Allocation Based on Network Calculus for Multi-path Routing in Wireless Mesh Networks

mquest Quickstart Version 11.0

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

Transcription:

Effcent Dstrbuted Fle System (EDFS) (Sem-Centralzed) Debessay(Debsh) Fesehaye, Rahul Malk & Klara Naherstedt Unversty of Illnos-Urbana Champagn

Contents Problem Statement, Related Work, EDFS Desgn Rate Montor (RM), Resource Allocator (RA) Servng Requests Implementaton approach Some Smulaton Results Concluson and Future Work Extra Sldes

Problem Statement To deal wth growng data storage and processng needs Mult-GB fles are common Increasng number of concurrent requests (clents) s more common To optmally utlze network, storage and processng resources Some nodes can be overloaded whle others are less loaded Resultng n hgher block transfer delay

Related Work Exstng Dstrbuted Fle Systems (GFS, HDFS) NameNode (Master) Keeps track of every block and replca operatons a bottleneck sngle pont of falure Lack effcent block allocaton and load balancng scheme Consder ust physcal dstance than rate to allocate Use TCP for data transfer Don t smultaneously look at all bottlenecks (dsk, lnk, CPU)

Our Aproach (EDFS Desgn) Keep an EDFS frontend To forward requests to correspondng NameNode hashng (lght weght) can handle more requests smultaneously New cross-layer (network & transport) protocol Adaptve, effcent block allocaton, load balancng and routng Consder all bottlenecks smultaneously

EDFS Desgn... Cont d A clent applcaton frst connects to the Front-End Server (FES). The FES chooses the correspondng Name Node Server (NNS) for the clent. The NNS asks the Resource Allocator (RA) for the best BS. The RA fnds the best BS based on the metrc montor t gets from the Resource Montor (RM) server n the BS and other RAs.

EDFS... Cont d

EDFS... Cont d

The Resource Montor (RM) The RM computes R d,u (the rates R d and R u )at whch R ts flow can send data to (uplnk) or receve data from (downlnk) other BS or clent as: d, u N T Q RTT L Q R d N N Q w Q And sends R d,u and S to ts resource allocator (RA) w RTT Where R = rate at whch a flow should send data from node or to node K = resource capacty at node Q = backlog (queue sze) ψ = packet (nstructon) sze R = sendng rate of node L = total number of packets d = control nterval S d N

The Resource Allocator(RA) Each RA gathers S from all ts RMs and computes the rates assocated wth ts swtch and lnks as : R d, u N Q S Where N = the number of RMs attached to the RA Keeps the maxmum R d,u max of the rates of ts chldren (for k-allocaton) and the IP addresses of the chldren wth these max values. And sends ts mn(r d,u, R u,d max ) and ts S to ts parent RA Ths contnues untl the hghest level (n-hghest) RA n the tree every control nterval d or at every trgger event.

The RA... Cont d The n-hghest level RA now sends (broadcasts) to ts chldren Its R d,u (at what maxmum rate a node can send to and receve from the outsde clent) Its mn(r d,u, R u,d max ) (at what maxmum rate a BS n the cloud can send to and receve from another BS) Each RA passes these values to ts chldren RAs untl t fnally reaches the parent RA of the RMs whch keeps ths nformaton. The above values can also be requested drectly from the RA-n. Decreased overhead of broadcastng the values to ts chldren (pro) The hghest level RA (RA-n) can be overloaded wth too many requests (con) Each RA also acts as k-hghest level RA and repeat the above steps for k-level allocaton purpose.

Servng Requests Two types of requests to wrte, read and nteract data An external user request to use the cloud Internal BS (RA) request to replcate data, for mantenance and load balancng An external user wth a read weght 0 w r 1 conects to the FES whch forwards ts request to the selected NNS to wrte some data to the cloud. The NNS asks the RA (n ts rack) for the best BS The RA chooses the best level n (hghest ) value V r = w r R u +(1-w r )R d BS and tells the user to use R d,u Here each RA mantans V r for some common w r values

The Interactve (read) weght w r If the RA algorthm runs at every ntal wrte request wth w r gven by every applcaton It can be expensve to run Optmzaton each RA mantans V r for w r = 0 (nfrequent read), 0.5 (nteractve), 0.25 (less read), 0.75 (more read for data replcated n many BS) The RA whch has the n-level R d and R u can also obtan V r = w r R u +(1-w r )R d after an applcaton provdes t The cached frequency of user read/wrtes can also be used to obtan the w r

Servng Requests (Internal) When the BS has a request to another BS (to read/wrte data), ts RA chooses the BS wth the hghest V r (For better update) n ts rack (local swtch) (1-level allocaton) or n a branch of the tree k levels above (k-level allocaton). When the RA wants to upload (replcate) data from ts BS to another BS the value of w r can be 0 or a weghted average of the R d and R u values. When each BS nteract wth almost smlar read and wrte rates, w r = 0.5

Servng Requests: Smple examples External request: If user wants to push data to or pull data from the cloud The RA chooses the BS wth hghest R d or R u resp. If the user wants to do push+pull, The RA chooses the the BS wth the hghest V r value Internal request: If a BS wants to push data to or pull data from a BS, The RA chooses the BS wth hghest mn(r u, R d max ) or hghest mn(r d, R u max ) value resp. If the user wants to do push+pull It chooses the BS wth hghest V r value

Implementng EDFS Among other thngs EDFS answers two man questons Where n the cloud to store data and In the BS whch offers the best V r value. At what rate to transfer data Internal or external transfer? Internal transfer (wthn the cloud) TCP can be modfed to send at the rate obtaned from the RA Settng TCP beta and alpha or the cwnd parameters. For external transfer (no knowledge of the clent path capacty) If a clent uploads data to the cloud (cloud download) Set the clent cloud TCP receve wndow to R d max x RTT If a clent downloads data from the cloud (cloud uplnk) Set the cloud TCP maxmum cwnd to R u max x RTT

Some Smulaton Results Evaluatng the Front End Server (FES ) External requests Number of clents Posson dstrbuted Sze of requests Pareto dstrbuted Update rate Read + wrte Mem access tme = 10ns log 2GB 64 B

Some Smulaton Results... cont d Evaluatg the Resource Allocator (RA) Internal transfer (At what rate?)

Concluson and Future Work Presented an effcent dstrbuted fle system To overcome the sngle NameNode bottleneck ssue of exstng dstrbuted fle systems An effcent block allocaton and load balancng Ongong work System mplementaton of EDFS Comparson wth GFS and HDFS

Extra Sldes Jont Congeston Control and Routng for Dstrbuted Systems Debessay(Debsh) Fesehaye & Klara Naherstedt Unversty of Illnos-Urbana Champagn

Contents Motvaton Dervaton of the rate metrc Applcatons of such scheme

Motvaton TCP takes many rounds to fully utlze the lnk capacty Very hgh download tme TCP uses packet loss and delay as a congeston sgnal A temporary loss or delay due to channel error means congeston and reducng the congeston wndow No good routng metrc Do not effectvely take nto account dynamc delay and congeston on lnks

Exact dervaton of a rate metrc Notatons C = Capacty of Lnk Q = Queue sze at lnk RTT = RTT of flow durng control nterval d d C Q = total load whch has to be shared among all connectons crossng lnk L = total number of packets that arrve at router durng control nterval d w = current cwnd of correspondng to packet w = next cwnd of flow a = cwnd per packet = number of packets the source sends at an arrval of ACKs of each of the w packets

Exact dervaton of a rate metrc Then we have Hence for some constant k. But whch mples that Hence the throughput share assocated wth packet s

Applcatons Applcaton to the Internet Routng Metrc Mtgatng DDoS & DoS Overlay Networks TEEVE: Tele-mmersve Envronment for EVErybody Other Applcatons TCIPG (Power Grd), etc Large Scale Dstrbuted Fle System Desgn (Today s topc) The End!