Let us consider an infinite arrival stream and consider a request for a given page i. The quantity of interest is the probability distribution d(k) that the next request for that page is k requests later (i.e. , that the request for page i is followed by k-1 requests for pages other than page i, followed by another request for page i). Assuming that page requests are independent, we find that
Figure 3 shows a plot of the probability distribution for page request interarrival times d(k) produced by our model and the distribution for a cache trace of actual requests for web pages. The cache trace was extracted from the trace file for a DEC Web server that serviced roughly 1700 workstation over a period of 25 days [DEC96]. Once again the model predicted by Zipf's law is consistent with data observed at operational web servers [RV98,CI97], that is, the probability that a document will be referenced k requests after it was last referenced is proportional to 1/k. Note that our model predicts higher values for d(k) when k < 102 but is quite accurate when k > 102.