He expands at some length on the stochastic properties of this function: unsurprisingly, the arithmetic mean of samples of this function doesn't converge with more samples, but the geometric mean does. What's a little more surprising is that the distribution isn't log-normal. One might expect that since upon taking logs, we're averaging a collection of random numbers between 1 and log n, we might expect the average of the logs to display normality, but that doesn't happen.
The explanation is a nice take-way nugget from this article. In short, work by William Reed and Barry Hughes from 2002 shows (this example taken from their abstract) that if you take an exponentially growing process (for example ), and truncate it at a random time given by an exponential process (for example, with parameter ), then the resulting random variable exhibits the power law distribution
I'd just like to point out that this is also related to my work on "Double Pareto Distributions" (which made use of the work of Reed and Reed/Hughes),
ReplyDeleteDynamic Models for File Sizes and Double Pareto Distributions
www.internetmathematics.org/volumes/1/3/Mitzenmacher.pdf
Michael