A two-level distributed architecture for Web content adaptation and delivery

Size: px

Start display at page:

Download "A two-level distributed architecture for Web content adaptation and delivery"

Mabel Gallagher
6 years ago
Views:

1 A two-level distributed architecture for Web content adaptation and delivery Claudia Canali University of Parma Valeria Cardellini University of Rome Tor vergata Michele Colajanni University of Modena Philip S. Yu IBM T.J. Watson research center Riccardo Lancellotti University of Modena

2 The evolving Web scenario Heterogeneous clients Different display, CPU, network Heterogeneous user behavior Request for sophisticated,personalized services Content adaptation (transcoding, personalization)

adaptation is computationally expensive Typical

3 Content adaptation Possible approaches: Server-based Client-based Intermediary-based Issue: Content adaptation is computationally expensive Typical solutions Caching reduce adaptation operations Replication load sharing

4 Issues for efficient content adaptation Caching issue multiple version of every resource working set size grows locality improves caching effectiveness Replication issue provide adequate load sharing Contribution: We propose a novel architecture for distributed content adaptation that preserves locality and provides load sharing

5 Functions in content adaptation Main functions: Gateway Location Adaptation Fetch Cache F G Architectures differ in the mapping of these functions

6 Two-level content adaptation architecture Origin server Client G F Edge node G F F Internal nodes G Client Hash(URL)

7 Benefit of hashing Hash computed only on URL (no version) every version of the same resource is on the same node (simplify lookup) improve locality URL-space partitioned no cache duplicates (efficient use of cache) Hash-based request distribution evenly distributed requests (load sharing)

Two-level content adaptation architecture Few powerful internal nodes: From edge nodes Improves security (easy to control) Improves privacy (sensitive information) Solves management

8 Two-level content adaptation architecture Few powerful internal nodes: From edge nodes Improves security (easy to control) Improves privacy (sensitive information) Solves management issues (few nodes) Solves data consistency issues (hashing) Internal nodes can be locally replicated to further increase computational power (cluster) Web switch To origin server Internal node

9 Two-level content adaptation architecture Many simple edge nodes no management required no computational power required can be highly distributed Drawback of having two levels Two steps for every request We compare the two-level architecture with a flat architecture

10 Flat content adaptation architecture G,F Origin server Client G,F G,F Edge node Client Query-based lookup

11 Flat content adaptation architecture Pros: Highly distributed No need for two steps Cons: Privacy issues Data consistency issues Does not guarantee load sharing

12 Performance evaluation Workload models: Working set with heavy impact on adaptation Synthetically generated traces We compare: Two-level architecture Flat architecture No cooperation architecture Two network scenario: Real network scenario WAN-emulated scenario

13 Real network scenario

14 Hit rate local remote global exact useful exact useful No_Coop 8.3% 6.4% n/a n/a 14.7% Flat 8.0% 7.0% 25.1% 26.8% 66.9% Two-level n/a n/a 60.2% 21.0% 81.2% Two-level provides the highest hit rate Hash-based partition is effective in optimizing cache usage

15 Summary of findings Two-level architecture: Hashing avoids duplicates efficient cache space usage, high hit rate High cache hit rate load is reduced Focus on two-step penalty: sensitivity to network

16 Two-step penalty

17 Sensitivity to network parameters Resp. time [ms] Sensitivity to delay to origin server Flat Two-level Sensitivity to delay between intermediate nodes 0 Two-level: Delay to origin server[ms] High hit rate less sensitive to delay to origin server Resp. time [ms] Delay between intermediate nodes [ms] Two steps sensitivity to delay between edge and internal nodes

18 Conclusion Two-level architecture constantly outperforms other architecture in our experiments Two-level architecture is sensitive to network delays between the nodes of the intermediate infrastructure Two level architecture is less sensitive than flat architecture to delay to origin server

19 Future work Flat architecture and two-level are two extreme cases Flat: every node provides every function Two-level: node functions partitioned In medio stat virtus Intermediate hybrid architectures are a whole new space of investigation

20 A two-level distributed architecture for Web content adaptation and delivery Claudia Canali University of Parma Valeria Cardellini University of Rome Tor vergata Michele Colajanni University of Modena Philip S. Yu IBM T.J. Watson research center Riccardo Lancellotti University of Modena

A Distributed Architecture of Edge Proxy Servers for Cooperative Transcoding

A Distributed Architecture of Edge Proxy Servers for Cooperative Transcoding Valeria Cardellini University of Roma Tor Vergata cardellini@ing.uniroma2.it Michele Colajanni University of Modena colajanni@unimo.it