We consider the case where the cache has unlimited storage, so all previously requested pages remain in the cache. In this case, we consider a finite request stream of R requests, and wish to determine the probability that the next request, the R+1'th request, is a request for a page that already resides in the cache. The hit-ratio H(R) can be calculated as follows. If the R+1'th request is for page i then the probability that this page is in the cache is given by (1 - ( 1 - PN(i))R). Thus, we have:
The asymptotic behavior of the hit-ratio is
. This behavior can be seen more directly
by approximating the sum for H(R) by assuming the function
is given,
for
, by:
Figure 1 shows the hit-ratio H(R), the
approximate hit-ratio and the hit-ratio for a
cache trace as a function of
.The cache trace data was collected by Pei et al. using a
cache simulator and a Web cache trace file [CI97].
Figure 4 of [CI97] actually plots the hit-ratio as
a function of group size, but in the simulation the number
of page requests is proportional to the group size.