As detailed in the previous post, the ratio f of the top authors that publish a fraction v of all publications is independent from the total number of authors N0. Of course, this result is incompatible with Price's law (that for v=0.5, f=1/√N0). This issue has been discussed by Price and co-workers [1], but I will take here a slightly different approach.
I had assumed in my derivation that he domain of the distribution was unbound above (H=∞), and that the exponent α was higher than 1. One can relax these assumptions and check their effect on f by:
- imposing a finite upper bound H and
- by setting α=1. Note that 2. also requires 1.
Role of the upper bound
In the finite H case one must use the full expressions (containing H and L) for the various quantities. In this section, we will continue to assume that α>1. Since L acts everywhere as a scale factor for x (and H) I will set it to 1 in the following. It is also reasonable to assume that the least productive authors have one publication (why truncate at a higher value?!) Consequently, all results will also depend on H, but presumably not explicitly on N0, which is a prefactor for the PDF and should cancel out of all expectation calculations. It is, however, quite likely that H itself will depend on N0, since more authors will lead to a higher maximum publication number!
In my opinion, the most reasonable assumption is that there is only one author with H publications, so that N0p(H)=1⇒H≃(N0α)1α+1, neglecting the normalization prefactor of p(x).
The threshold number xf is easy to obtain directly from S(x):
xf=[f+(1−f)H−α]−1/α
From its definition, the fraction v is given by: v=αμ11−H−α1α−1(x1−αf−H1−α). Note that we need here the complete expression for the mean [2]:
μ=αα−1L1−H1−α1−H−α
Plugging xf and μ in the definition of v and setting v=1/2 yields:
f=f∞(1+H1−α)αα−1−2αα−1H−α1−H−α,with f∞=(12)αα−1,and we assume that the upper bound is given by:
H=(N0α)1α+1.Exponent α=1
Let us rewrite the PDF, CDF and survival function in this particular case:
p(x)=11−H−11x2;F(x)=1−x−11−H−1;S(x)=1−F(x)=x−1−H−11−H−1
xf=S−1(f)=1f+(1−f)H−1
v=12=1−ln(xf)ln(H)⇒xf=√Hand, since H=√N0,xf=N1/40
Putting it all together yields f=N1/40−1N1/20−1 and, in the high N0 limit, f∼N−1/40, so the number of "prolific" authors Np=fN0=N3/40, a result also obtained by Price et al. [1] using the discrete distribution. They also showed that other power laws (from N1/20 to N10) can be obtained, depending on the exact dependence of H on N0.
1 Allison, P. D. et al., Lotka's Law: A Problem in Its Interpretation and Application Social Studies of Science 6, 269-276, (1976).↩