A Fast File System for Caching Web Objects


by Jonathan Ledlie and Matt McCormick

Given the increasing performance of network connections and the slow nature of disk drives, the authors propose a new file system for web proxy servers that cache files on a local disk. This file system, referred to as the cache file system (CFS), utilizes the facts that cached web pages do not change in size once they are cached, do not have changing permissions, and do have a back-up version at the original server. Immutable files allow all data in a file to be written in contiguous blocks on disk. Reading and writing to contiguous blocks allows the best utilization of a disk's bandwidth and results in the lowest possible seek times. Because all data is backed up on remote disks, CFS can store all meta information in memory and not perform any checkpointing. Should the system crash and this data be lost, users will still get the correct data - it will just be coming from the remote server until the cache is reloaded. By taking advantage of these invariants, CFS is able to out perform a Unix file system by at least 50% on several benchmarks.
View paper in pdf format or in ps format.