Processing math: 100%

August 28, 2021

The Pareto distribution and the 20/80 rule

I mentioned in the previous post Pareto's 20/80 rule. Here, I will discuss Pareto's distribution, insisting on how (and in what conditions) it gives rise to this result. I had some trouble understanding the derivation as presented in various sources, so I will go through it in detail.

The functional form of the Pareto distribution is a power law, over an interval (L,H) such that 0<L<H. I will use the notations of the Wikipedia page unless stated otherwise. Its probability density function (PDF) p(x) and cumulative distribution function (CDF) F(x) are (α is real and strictly positive):

p(x)=α1(L/H)α1x(Lx)α;F(x)=1(L/x)α1(L/H)α

One often uses the complementary CDF (or survival function) defined as:

S(x)=1F(x)=11(L/H)α[(Lx)α(LH)α]

Note that the survival function is very similar to the PDF multiplied by x: S(x)xαp(x), the difference being due only to the final truncation term. However, this is only true for power laws, as one can easily check by writing p(x)=F(x) and solving the resulting ODE. We should therefore carefully distinguish xp(x) (which is, for instance, the integrand to use for computing the mean of the distribution) and S(x) which "has already been integrated", so to speak.

Let us use this continuous model to describe the distribution of publications (neglecting for now its intrinsically discrete character). x stands for the number of publications by one author, bounded by L and H. The number of authors that published x books is given by N0p(x). N0 is the total number of authors. 

  • The first question is: who are the first f more prolific authors (in Pareto's case, f=0.2=20%)? More precisely, what is the threshold number of publications xf separating them from the less prolific ones?
This is quite easy: if we go through the list of authors (ordered by increasing x) when we reach xf we will have counted the lower fraction, so xfLp(x)dx=F(xf)=1f. Thus, the survival function is S(xf)=Hxfp(x)dx=f and we can simply invert this dependency to get xf=S1(f).
  •  The second question is: how many publications did these top f authors contribute?
We need to count the authors again, but with an additional factor of x, since there are N0p(x) authors with exactly x publications, for a total contribution of xN0p(x). The fraction of publications contributed by the top f authors v is then:
v=HxfxN0p(x)dxHLxN0p(x)dx=Hxfxp(x)dxμ
where μ is the mean of the distribution and N0μ is the total number of publications.

In the simple case H= (which requires α>1), one has:
p(x)=αx(Lx)α,withμ=αα1L
f=S(xf)=(Lxf)αxf=Lf1/α
 
Plugging the above into the equation for v yields:
v=1μxfxp(x)dx=αμxf(Lx)αdx=(Lxf)α1=fα1αf=vαα1
 
Pareto's rule f=0.2 and v=0.8 requires α1.161: a power law with this exponent will obey the rule, irrespective of the values of L and N0. Despite the neat coincidence in the established statement of the principle, there is absolutely no need that f+v=1! For instance, the same α implies that, for v=0.5, f0.065, a result I have already used in the previous post.

No comments:

Post a Comment