: Fred Douglis, Brian D. Davison
: Web Content Caching and Distribution
: Kluwer Academic Publishers
: 9781402022586
: 1
: CHF 91.40
:
: Sonstiges
: English
: 363
: DRM
: PC/MAC/eReader/Tablet
: PDF
Web caching and content delivery technologies provide the infrastructure on which systems are built for the scalable distribution of information. This proceedings of the eighth annual workshop, captures a cross-section of the latest issues and techniques of interest to network architects and researchers in large-scale content delivery. Topics covered include the distribution of streaming multimedia, edge caching and computation, multicast, delivery of dynamic content, enterprise content delivery, streaming proxies and servers, content transcoding, replication and caching strategies, peer-to-peer content delivery, and Web prefetching.

Web Content Caching and Distribution encompasses all areas relating to the intersection of storage and networking for Internet content services. The book is divided into eight parts: mobility, applications, architectures, multimedia, customization, peer-to-peer, performance and measurement, and delta encoding.  
SERVER-FRIENDLY DELTA COMPRESSION FOR EFFICIENT WEB ACCESS (p. 303-304)

Anubhav Savant and Torsten Suel
CIS Department, Polytechnic University


Abstract

A number of researchers have studied delta compression techniques for improving the efficiency of web page accesses over slow communication links. Most of these schemes exploit the fact that updated web pages often change only very slightly, thus resulting in very small sizes for the transmitted deltas. However, these schemes are only applicable to a minority of page accesses, and require web or proxy servers to retain potentially many different outdated versions of pages for use as reference files in the encoding. Another approach, studied by Chan and Woo [4], encodes a page with respect to similar files located on the same web server that are already in the client’s browser cache. Based on the latter approach, we study different delta compression policies for web access. Our emphasis is on web and proxy server-friendly policies that do not require the maintenance of multiple older versions of a page, but only use reference files accessed by the client within the last few minutes. We compare several policies for identifying appropriate reference files and evaluate their performance on a set of traces. We show that there are very simple policies that achieve significant benefits over gzip compression on most web accesses, and that can be efficiently implemented at web or proxy servers. We also study the potential of file synchronization techniques such as rsync [28] for web access.


1. Introduction

Delta compression (delta encoding) is the process of encoding a target file with respect to one or several, usually similar, referenceles. This encoding, called a delta, describes the target file in terms of the reference files, and a recipient that receives the encoding and already knows the reference files can thus efficiently reconstruct the target. Delta compression has numerous applications in scenarios where there are several versions of a file or many similar files, such as software revision control systems, distribution of software updates, content distribution networks, or efficient storage of related files. Several tools for delta compression, such as bdiff, vcdiff [10, 13], Xdelta [14], and zdelta [25], are freely available. We refer to [23] for an overview of delta compression techniques and applications.


1.1 Delta compression for Web access

A number of authors have proposed the use of delta compression techniques to improve the ef.ciency of web access [1, 4, 7, 9, 16, 17, 21, 27, 29]. In particular, when web pages are updated, they typically do not change by much, and thus delta compression can be used to very succinctly encode the difference between a new version of a web page and an outdated version already in the client’s browser cache. Most proposals focus on encodings between different versions located at the same URL, which results in small sizes for the deltas but is restricted to pages that have been previously visited by the client. One exception is the work by Chan andWoo [4], which proposes to use as reference files other pages on the same site recently visited by the client, which tend to have a significant degree of similarity due to common layout features and HTML structure. In general, delta compression schemes for web access can be distinguished along the following axes.
Contents6
A Message from the Workshop Chairs10
Credits12
Contributing Authors14
Mobility-aware server selection for mobile streaming multimedia content distribution networks18
1. Introduction and Overview18
2. Mobility Based Server Selection19
2.1 Layout of servers in content distribution network20
2.2 Considerations in mobility based server selection21
2.3 Measurement of mobility rate and server residence time estimation21
2.4 Server load and QoS information collection22
2.5 Server selection algorithm23
3. Simulation Setup25
3.1 Mobility simulation26
3.2 CDN layout27
3.3 Simulation scenarios28
4. Results30
5. Related Work33
6. Conclusions33
References34
Performance of PEPs in cellular wireless networks36
1. Introduction36
2. RelatedWork37
3. Overview of Cellular Networks38
4. Wireless PEP39
5. Latency Components40
6. Transport/Session Optimizations41
6.1 TCP Tuning42
6.2 TCP Connection Sharing45
6.3 Number of TCP connections46
6.4 Temporal Block Flow Release48
6.5 Session-level overheads: DNS49
7. Application Level Optimizations50
7.1 Compression Results50
7.2 Acceleration Results51
7.3 Impact of Pipelining52
8. Comparison53
9. Conclusions54
References55
Edge caching for directory based Web applications: Algorithms and performance56
1. Introduction56
2. Notations58
3. LDAP Caching Framework59
4. LDAP Query Containment60
4.1 Query containment problem60
4.2 General filter containment60
4.3 Template based filter containment61
4.4 Query containment algorithm62
5. LDAP Caching Algorithms63
6. Directory Server Extensions65
7. Application Offload and Prefetching66
7.1 Directory application modeling: Example67
7.2 Prefetching68
8. Performance of Caching Algorithms68
9. Conclusions71
References72
Computing on the edge: A platform for replicating Internet applications74
1. Introduction74
2. Issues75
3. Architecture Overview76
4. Application Distribution Framework77
4.1 The metafile77
4.2 Replica creation78
4.3 Replica deletion79
4.4 Consistency maintenance79
5. Algorithms80
5.1 Content placement algorithm81
5.2 Request distribution algorithm83
6. Performance85
6.1 Request distribution85
6.2 Content placement86
6.3 Redeployment threshold89
7. RelatedWork91
8. Conclusions93
Acknowledgments93
References93
Scalable consistency maintenance for edge query caches: Exploiting templates in Web applications96
1. Introduction96
2. Semantic caching over theWeb97
2.1 DBProxy overview97
2.2 Common Local Store98
3. Consistency Management99
3.1 Update propagation approaches100
4. Basic filtering100
5. Template-based filtering102
5.1 Template-based filtering: Single cache case102
5.2 Template-based filtering: Multiple caches105
6. Related work106
7. Conclusions106
References107
Proxy+: Simple proxy augmentation for dynamic content processing108
1. Introduction108
2. Related Work109
3. Summary of Previous Result110
4. Proxy+ Architecture111
4.1 ASP.NET output caching112
4.2 Cache key generation113
4.3 Tag generation and fragment caching114
4.4 Cache keys notification and page composition117
4.5 Summary of the protocol119
5. Application Modifications120
6. Experimental Results122
7. Security aspect124
8. Conclusions124
References125
Multicast cloud with integrated multicast and unicast content distribution routing126
1. Introduction126
3. Multicast Cloud129
4. Integrated Channel Routing130
5. Application-layer Traffic Control132
6. Related Work133
7. Conclusion134
8. References135
A large enterprise content distribution network: Design, implementation and operation136
1. Introduction136
2. Service Perspective and Network Infrastructure137
3. CDN Architecture138
3.1 Redirection System140