RESEARCH ARTICLE
Consistency pays off in science
Şirag Erkol1
, Satyaki Sikdar1
, Filippo Radicchi1
, and Santo Fortunato1,2
1Center for Complex Networks and Systems Research, Luddy School of Informatics, 计算, and Engineering,
Indiana University Bloomington, 布卢明顿, 在, 美国
2Indiana University Network Science Institute (IUNI), Indiana University Bloomington, 布卢明顿, 在, 美国
开放访问
杂志
关键词: citation, Nobel Prize, science of science, 成功
抽象的
The exponentially growing number of scientific papers stimulates a discussion on the interplay
between quantity and quality in science. 尤其, one may wonder which publication
strategy may offer more chances of success: publishing lots of papers, producing a few hit
文件, or something in between. Here we tackle this question by studying the scientific
portfolios of Nobel Prize laureates. A comparative analysis of different citation-based
indicators of individual impact suggests that the best path to success may rely on consistently
producing high-quality work. Such a pattern is especially rewarded by a new metric, 这
E-index, which identifies excellence better than state-of-the-art measures.
引文: Erkol, Ş., Sikdar, S。, Radicchi,
F。, & Fortunato, S. (2023). Consistency
pays off in science. Quantitative
Science Studies, 4(2), 491–500. https://
doi.org/10.1162/qss_a_00252
DOI:
https://doi.org/10.1162/qss_a_00252
Peer Review:
https://www.webofscience.com/api
/gateway/wos/peer-review/10.1162
/qss_a_00252
支持信息:
https://doi.org/10.1162/qss_a_00252
1.
介绍
已收到: 9 十一月 2022
公认: 12 二月 2023
通讯作者:
Santo Fortunato
santo@indiana.edu
处理编辑器:
Ludo Waltman
版权: © 2023 Şirag Erkol, Satyaki
Sikdar, Filippo Radicchi, and Santo
Fortunato. Published under a Creative
Commons Attribution 4.0 国际的
(抄送 4.0) 执照.
麻省理工学院出版社
The number of scientific papers has been growing exponentially for over a century (Dong, Ma
等人。, 2017; Fortunato, Bergstrom et al., 2018). The number of papers per author has been
relatively stable for a long time, but it has been increasing over the past decades (Dong
等人。, 2017), favored by the growing tendency of scientists to work in teams (Wuchty, 琼斯,
& Uzzi, 2007).
Such increased productivity is incentivized by career evaluation criteria that typically reward
large outputs, making scientists less risk averse when choosing research directions (Franzoni &
Rossi-Lamastra, 2017). 这, 然而, may come at the expense of the quality of research out-
来了 (Bornmann & Tekles, 2019; Sunahara, Perc, & Ribeiro, 2021). 的确, it has been shown
that the exponential growth of the number of publications corresponds to a much slower
increase in the number of new or disruptive ideas (Chu & 埃文斯, 2021; Milojević, 2015).
然而, although scholars should focus on quality, it is unclear whether it is more reward-
ing to pursue rare hit papers, have a consistent track record of valuable outputs, or be in
between these scenarios. Analyzing the careers of arguably the most successful class of scien-
奶嘴, Nobel Prize laureates, may help address this issue. 尤其, we would like to check if
there is a dominant path to success in the careers of such illustrious scholars.
To that effect, we consider a broad range of evaluation metrics that reward one-hit wonders
alongside those that favor a consistent production of high-quality research and investigate their
effectiveness in identifying Nobelists from within a more extensive set of similarly productive
科学家. We find that the best-performing metrics are indeed the ones that prioritize a con-
sistent stream of high-quality research.
The rest of this article is organized as follows. We first describe the data collection and
curation in Section 2. 然后, we briefly review some popularly adopted impact metrics and
我
D
哦
w
n
哦
A
d
e
d
F
r
哦
米
H
t
t
p
:
/
/
d
我
r
e
C
t
.
米
我
t
.
/
e
d
你
q
s
s
/
A
r
t
我
C
e
–
p
d
我
F
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
A
_
0
0
2
5
2
p
d
/
.
F
乙
y
G
你
e
s
t
t
哦
n
0
7
S
e
p
e
米
乙
e
r
2
0
2
3
Consistency pays off in science
introduce two new ones. 在部分 3, we describe and discuss the two sets of experiments we
used to check which of the two competing scenarios is more common. 最后, we give our
conclusions in Section 4.
2. 方法
2.1. 数据
We consider three fields in which the Nobel Prize is awarded: Physics, Chemistry, and Phys-
iology or Medicine (abbreviated henceforth as Medicine).
The publication records of scientists are obtained from two sources. For Nobelists, 我们用
the hand-curated data set with explicit annotations for prize-winning papers (李, Yin et al.,
2019). As a baseline, we consider scientists with verified Google Scholar (GS) profiles tagged
with Physics, Chemistry, Physiology, or Medicine as of May 2021.
We use the 2017 version of the Web of Science ( WoS) database to compile the citation
statistics of the articles. We rely on gathering data from different sources on purpose, 作为
WoS and GS complement each other well. GS offers the possibility of obtaining accurate pub-
lication records of individual scientists without the need to perform name disambiguation
(Radicchi & Castellano, 2013). WoS lets us reconstruct the citation history of individual papers.
Both ingredients are necessary for the type of analysis that we perform in this paper.
We adopt a similar methodology to that of Sinatra, Wang et al. (2016) to match papers
across databases. Given a paper ^p written by author a in GS, we list the papers Pa in WoS
authored by people with the same last name as a. From Pa, we select the paper p with the
highest normalized Levenshtein similarity between the corresponding paper titles (Levenshtein,
1966). We consider it a successful match only if the similarity exceeds 90%. 否则, 我们
discard ^p from further analysis. Following this procedure, we could match 78.1% of papers
by Nobelists and 49.6% of papers by baseline scientists, 分别. For our analysis, 我们只
consider scientists who published their first paper after 1960 and have a portfolio with at least
10 文件. Detailed statistics are provided in Table 1.
2.2. 指标
Let us consider a portfolio P = {c1, ……, cN} of N = |磷| papers that collectively receive Ctot
citations (IE。, Ctot =
氮
i ci). We consider the following metrics:
磷
氮: total number of papers.
Ctot: total number of citations.
Cavg: average number of citations (IE。, Cavg(磷) = Ctot
氮 ).
Cmax: citations received by the most cited paper (IE。, Cmax(磷) = max{c1, ⋯, cN}).
H: H-index (IE。, the largest number H of the top-cited papers with at least H citations;
Hirsch, 2005).
G: G-index (IE。, the largest number G of the top-cited papers with at least G2 combined
citations; Egghe, 2006).
桌子 1. Number of scientists in each category and field
类别
Nobelists
Baseline scientists
Physics
55
4,081
Chemistry
51
3,330
药品
56
2,715
492
Quantitative Science Studies
我
D
哦
w
n
哦
A
d
e
d
F
r
哦
米
H
t
t
p
:
/
/
d
我
r
e
C
t
.
米
我
t
.
/
e
d
你
q
s
s
/
A
r
t
我
C
e
–
p
d
我
F
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
A
_
0
0
2
5
2
p
d
/
.
F
乙
y
G
你
e
s
t
t
哦
n
0
7
S
e
p
e
米
乙
e
r
2
0
2
3
Consistency pays off in science
(西德:4)
问: Q-index, proposed by Sinatra et al. (2016), 问(磷) = exp
,
up to a constant factor, where Θ is the Heaviside function (IE。, Θ(X) = 1 if x > 0 和 0
否则), and c10,i is the citations gained by paper i within 10 years of publication. 我们
normalize c10,i by dividing it with the average c10 of all papers published in the same
discipline and year as paper i (Sinatra et al., 2016).
log c10;我
Q~: a variant of the unnormalized Q-index, where we use the total number of citations ci
ð
Θ c10;我
氮
i=1
1磷
氮
i=1
Þ
磷
(西德:2)
Θ c10;我
(西德:3)
(西德:5)
instead of c10,i.
We observe that these measures have their unique preferences for ranking portfolios.
一些, like Cmax, appear to reward one-hit wonders, 和别的, like H, reward consistency.
One of the goals of this work is to identify and differentiate Nobelists from baseline scientists.
所以, we argue that we need a new, 简单的, yet interpretable metric covering the whole
portfolio spectrum.
2.3. Citation Moment and E-Index
Given a publication portfolio P, one may consider the following extreme scenarios:
(西德:129) Citations are equally distributed among the papers, with each paper having Ctot/N
citations.
(西德:129) A single paper accounts for all citations.
In the first case, there is a sustained production of work of similar quality, while the second
represents a one-hit-wonder situation.
2.3.1. Citation moment
We propose the citation moment Mα, a new parametric measure that can reward both scenar-
ios, as well as the ones in between, depending on the value of the parameter α. It is defined as
Mα Pð Þ ¼ 1
氮
XN
i=1
cα
我
;
(1)
where α is a real positive number. We remark that Mα is essentially an average of the citation
scores of the papers, where the weight of each score is modulated by the exponent α. 我们可以
make the following observations of the behavior of our metric for different values of α.
α → 0: Mα behaves like Q~ as c α ≈ log c, but unlike Q~, it accounts for uncited papers.
0 < α < 1: Mα is higher for balanced portfolios (i.e., ones with a more uniform distribution of
citations).
α = 1: Mα becomes identical to Cavg.
α > 1: Mα is higher for unbalanced portfolios.
α → ∞: Mα closely imitates Cmax.
2.3.2.
E-index
We also propose an additional parameter-free measure that, like Mα, is sensitive to the distri-
bution of citations. We call this metric E-index, 定义为
E Pð Þ ¼ − 1
氮
XN
ci log
i=1
词
Ctot
;
(2)
493
Quantitative Science Studies
我
D
哦
w
n
哦
A
d
e
d
F
r
哦
米
H
t
t
p
:
/
/
d
我
r
e
C
t
.
米
我
t
.
/
e
d
你
q
s
s
/
A
r
t
我
C
e
–
p
d
我
F
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
A
_
0
0
2
5
2
p
d
.
/
F
乙
y
G
你
e
s
t
t
哦
n
0
7
S
e
p
e
米
乙
e
r
2
0
2
3
Consistency pays off in science
桌子 2. Values of metrics for portfolios with N papers with Ctot citations, of which n are equally
cited and N − n are uncited
Metric
H
G
Q~
Mα
乙
Value
min{⌊Ctot/n⌋, n}
p
ffiffiffiffiffiffiffiffi
Ctot
min{⌊
⌋, ⌊Ctot/n⌋, 氮}
C tot/n
C α
tot
N nα−1
Ctot
N log n
which reaches its maximum Cavg log N when citations are distributed equally among papers,
favoring authors with large average numbers of citations. 实际上, 乙(磷) is just the product of the
average number of citations Cavg and of the Shannon entropy of the citation distribution.
2.4. Behavior of Metrics on Stylized Portfolios
To better understand the behavior of the different metrics in our analysis, we consider a port-
folio with n cited papers with Ctot /n citations each and N − n uncited papers. 表中 2, 我们
show the values that several key metrics take in this case.
We see that the citation moment Mα (for α ≠ 0, 1), E-index, and G-index depend on n, 氮,
and Ctot. The H-index and the Q~ depend only on the cited papers. 所以, 例如, two port-
folios with identical values of Ctot and n would have the same H-index, regardless of the num-
ber of uncited papers. 此外, even though the G-index depends on all three parameters,
it depends on them in a somehow undesirable way. 例如, a portfolio with more
uncited papers may have a G-index value greater than or equal to the G-index of another
portfolio with identical Ctot and n values. 反而, ranking the portfolio with fewer uncited
works higher (lower N − n), as Mα and E would, seems more intuitive.
我
D
哦
w
n
哦
A
d
e
d
F
r
哦
米
H
t
t
p
:
/
/
d
我
r
e
C
t
.
米
我
t
.
/
e
d
你
q
s
s
/
A
r
t
我
C
e
–
p
d
我
F
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
A
_
0
0
2
5
2
p
d
.
/
3. 结果
图中 1, we plot Nobelists and baseline scientists according to their number of papers and
the total number of citations. 正如预期的那样, most Nobelists lie in the top right region, indicating
high levels of both productivity and impact. 然而, there appear to be a few Nobelists in
the top left, indicating that they only produced a handful of high-impact papers. To further
illustrate this difference, we consider two Nobelists in Physics, David J. 总的 (2004) and John
中号. Kosterlitz (2016), and plot their publication timelines in Figure 2. Gross has a consistent
production of high-impact works, but Kosterlitz stands out for having a single big paper.
F
乙
y
G
你
e
s
t
t
哦
n
0
7
S
e
p
e
米
乙
e
r
2
0
2
3
数字 1. Total number of citations vs. total number of papers for Nobelists (purple dots) and baseline scientists (gray dots).
Quantitative Science Studies
494
Consistency pays off in science
我
D
哦
w
n
哦
A
d
e
d
F
r
哦
米
H
t
t
p
:
/
/
d
我
r
e
C
t
.
米
我
t
.
/
e
d
你
q
s
s
/
A
r
t
我
C
e
–
p
d
我
F
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
A
_
0
0
2
5
2
p
d
.
/
F
乙
y
G
你
e
s
t
t
哦
n
0
7
S
e
p
e
米
乙
e
r
2
0
2
3
数字 2. Consistency versus single-hit scenario. On the x-axis, we indicate the temporal sequence
of papers, and on the y-axis the citations accrued by each paper. The two panels show the profiles
of D. J. 总的 (顶部) 和 J. 中号. Kosterlitz (底部). The former has a portfolio with multiple highly cited
文件, and the latter has one highly cited paper. D. J. 总的: N= 122, Ctot = 24,144, Cavg = 197.9,
E = 768.6. J. 中号. Kosterlitz: N= 63, Ctot = 11,688, Cavg = 185.5, E = 348.8.
We now focus on two tasks: portfolio classification and future Nobelist identification.
3.1. Portfolio Classification
We test the performance of the metrics in distinguishing the portfolios of Nobelists from those
of the baseline scientists. We consider two subtasks which we describe below. We use the area
under the precision-recall curve (AUC-PR) in each task as the performance metric. This curve
shows the trade-off between precision and recall at different thresholds. Bounded between 0
和 1, higher AUC-PR values indicate better classification performance. For random predic-
系统蒸发散, AUC-PR is the fraction of positive samples. AUC-PR is better suited for imbalanced data
sets than the area under the receiver operating characteristic curve (ROC-AUC) (Saito &
Rehmsmeier, 2015). Results for the ROC-AUC are reported in the Supplementary material
and are consistent with the analysis done using AUC-PR.
(西德:129) Full. We use the entire portfolio of the scientists described in Section 2.1.
(西德:129) Preaward. We construct the preaward portfolio of Nobelists (IE。, the set of papers
published until the year of the prize-winning paper), discarding those with fewer than
10 文件. We find that 15 (27%), 28 (55%), 和 22 (39%) of Nobelists in Physics,
Chemistry, and Medicine, 分别, satisfy the above criteria.
具体来说, for a Nobelist who published their first paper in year y0 and wrote their prize-
winning article in year yp, we consider the papers published and citations accrued between
years y0 and yp − 1. We then pair the Nobelist with 20 baseline scientists who published their
first papers around the year y0 and wrote at least 10 papers in their careers’ first yp − y0 years.
(西德:129) Optimal α selection. Recall that, unlike other measures, Mα has a tunable parameter α.
所以, for each task, we record the performance of Mα across a range of α values
Quantitative Science Studies
495
Consistency pays off in science
我
D
哦
w
n
哦
A
d
e
d
F
r
哦
米
H
t
t
p
:
/
/
d
我
r
e
C
t
.
米
我
t
.
/
e
d
你
q
s
s
/
A
r
t
我
C
e
–
p
d
我
F
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
A
_
0
0
2
5
2
p
d
.
/
F
乙
y
G
你
e
s
t
t
哦
n
0
7
S
e
p
e
米
乙
e
r
2
0
2
3
数字 3. Classification performance of Mα for varying α. Different symbols denote different fields.
The dashed line α = 1 separates the two regimes. We use the optimal values of α (α*) in our analyses.
AUC-PR values for the Full and Preaward (PA) portfolio classification tasks. The best-
桌子 3.
performing metrics for each field are marked in bold type. Mα and E are the standout performers.
Note that values across columns are not comparable as the baseline values are determined by the
respective class imbalance ratios
Physics
Chemistry
药品
Metric
氮
Ctot
Cavg
Cmax
H
G
Q~
问
Mα
乙
Full
0.03
0.21
0.42
0.24
0.12
0.15
0.30
0.08
0.43
0.44
PA
0.07
0.15
0.19
0.12
0.16
0.15
0.19
0.15
0.34
0.23
Full
0.13
0.43
0.32
0.25
0.44
0.41
0.32
0.13
0.49
0.53
PA
0.12
0.34
0.39
0.21
0.36
0.33
0.41
0.20
0.53
0.45
Full
0.06
0.52
0.68
0.49
0.50
0.48
0.67
0.26
0.78
0.75
PA
0.06
0.24
0.46
0.18
0.24
0.17
0.48
0.45
0.68
0.44
496
Quantitative Science Studies
Consistency pays off in science
and plot the results in Figure 3. We observe a slight dependence of the optimal α-value (α*)
on the task and the field. We use the corresponding α* values while comparing the per-
formance of Mα with other metrics. 在每种情况下, 然而, we find α* < 1, which indi-
cates that portfolios are most separable when the metric prioritizes consistent impact.
We record the metrics’ performance in Table 3. In the Supplementary material, we report
the classification results on the American Physical Society (APS) bibliographic data set.
Metrics agnostic to the distribution of citations appear to perform worse than their coun-
terparts across either task. This includes the total number of papers N, as well as total cita-
tions Ctot, and maximum citations Cmax. We highlight the performance of three metrics: N,
Cavg, and Cmax. N is consistently the worst performer because it does not account for the
impact, only volume. Cavg is among the top performers considering the whole portfolio. We
believe that is partly due to the nature of the distributions observed in Figure 1, where the
Top 20 baseline scholars with the largest E-index in each discipline. The ones marked in
Table 4.
bold type received the Nobel Prize between 2018 and 2022. Some authors are assigned multiple
labels, so they may appear in multiple lists
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
q
s
s
/
a
r
t
i
c
e
-
p
d
l
f
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
a
_
0
0
2
5
2
p
d
/
.
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Rank
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
Physics
H. Dai
A. L. Barabási
D. Finkbeiner
P. McEuen
I. Bloch
A. Ashkin
U. Seljak
S. Inouye
S. Manabe
M. Tegmark
J. R. Heath
L. Verde
S. G. Louie
D. I. Schuster
N. D. Lang
B. Hammer
D. Holmgren
M. Lazzeri
Chemistry
H. Dai
J. Godwin
R. Ruoff
K. L. Kelly
H. Wang
M. Egholm
L. Umayam
L. Zhang
R. Freeman
P. Cieplak
G. Church
Medicine
S. Kumar
R. A. Larson
A. L. Barabási
G. L. Semenza
A. S. Levey
S. Paabo
R. A. North
A. Patapoutian
J. Goldberger
M. Snyder
J. Magee
D. Macmillan
M. Houghton
G. Winter
J. Kuriyan
J. R. Heath
G. Loewenstein
S. Via
R. Jaeschke
E. H. Schroeter
G. Hollopeter
W. Lin
S. J. Wagner
W. L. Jorgensen
V. V. Fokin
L. P. Kouwenhoven
M. Buttiker
J. Clardy
D. Zhao
J. Allison
B. Moss
Quantitative Science Studies
497
Consistency pays off in science
Nobelists are likely to accumulate higher than average citations over their careers. However,
performance for the preaward portfolios is a bit worse, probably because we only consider
the preaward period of their careers. Winning the prize has been shown to provide a
tangible boost to the overall visibility of a scientist, resulting in more citations (Inhaber &
Przednowek, 1976). The number of citations of the most cited paper Cmax is among the
worst performers, which suggests that the one big-hit portfolio is not typical among Nobelists.
This finding supports the idea that scientists win the Nobel Prize after years of consistent,
high-quality work.
We now shift our focus to the other category of indicators (i.e., ones sensitive to the citation
distributions). We find that H records mediocre performance despite rewarding consistency.
Its dependence on productivity likely fails to account for the Nobelists with a few highly cited
papers. The Q-index performs poorly. However, its variant, Q~, fares considerably better, which
is consistent with the fact that it is similar to Mα for small α.
Mα and E consistently rank in the top two positions. This further supports the hypothesis that
Nobelists set themselves apart by producing a steady stream of high-impact work.
3.2.
Identifying Future Nobelists
As a test of the predictive power of the metrics, we check whether we can identify scholars
who received the Nobel Prize from 2018 to 2022 (i.e., the period not covered by our WoS
data set). First, we note that our set of baseline scientists may be missing some of these new
Nobelists, in which case we add them manually, provided they have a GS profile.
Then, for each metric, we construct a top 20 list of baseline scientists by ranking them in
descending order and highlighting the Nobelists. We report the table for the E -index in the
main text (Table 4), while the remaining lists can be found in the Supplementary material.
In Table 5, we show how many Nobelists appeared in the top 20 lists for each metric.
E -index outperforms all other indicators, proving particularly effective for Medicine.
Count of Nobelists awarded in the period [2018, 2022] identified in the top 20 lists of
Table 5.
various metrics. The numbers in parentheses indicate how many such Nobelists have a GS profile
Metric
N
Ctot
Cavg
Cmax
H
G
Q~
Q
Mα
E
Physics (9)
1
Chemistry (8)
0
Medicine (5)
0
2
1
0
2
2
1
0
1
2
0
0
0
1
0
1
1
0
2
3
3
2
2
3
3
1
2
5
Quantitative Science Studies
498
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
q
s
s
/
a
r
t
i
c
e
-
p
d
l
f
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
a
_
0
0
2
5
2
p
d
.
/
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Consistency pays off in science
E-index of Nobelists versus baseline scientists with comparable numbers of papers and citations. We see a prevalent trend towards
Figure 4.
larger E values for Nobelists. Some 58.2% (Physics), 86.3% (Chemistry), and 87.5% (Medicine) of Nobelists have larger E values than their
counterparts.
To further corroborate this conclusion, we matched each Nobelist with a baseline scientist
with (nearly) identical N and Ctot values. In Figure 4, we plot the E-index of each Nobelist and
matched baseline pair. We find that the E-index of Nobelists usually exceeds that of their
matches. Some exceptions correspond to Nobelists with a low number of highly cited papers.
Other outliers might be prominent scholars who have not yet received the award but might
receive it in the future.
4. CONCLUSION
In this work, we searched for productivity patterns in excellent scientific careers. Specifically,
we aimed to assess whether the output of high-profile scientists is more likely to be character-
ized by a low number of hit papers or by a consistent production of high-quality work. To
address this question, we have examined the scientific portfolios of Nobel Prize winners in
Physics, Chemistry, and Medicine and checked which citation-based metrics are most suitable
to recognize them among a much larger number of baseline scholars. In addition, we intro-
duced two new metrics, the E-index and Mα, that reward both consistency and high average
impact (when α < 1).
We found that the best-performing metrics are the ones that peak when citations are dis-
tributed among a considerable number of works rather than being concentrated on a few hit
papers. The E-index, in particular, proves especially effective in identifying future Nobelists. A
portal for the calculation of E-index and other scores of individual performance can be found
at e-index.net.
While there are Nobelists whose success relied on isolated hit papers, the most successful
scientists usually stayed on top of their game for most of their careers.
ACKNOWLEDGMENTS
We acknowledge Aditya Tandon’s help in this study’s initial phase. This work uses WoS data
by Clarivate Analytics provided by the Indiana University Network Science Institute and the
Cyberinfrastructure for Network Science Center at Indiana University.
COMPETING INTERESTS
The authors have no competing interests.
Quantitative Science Studies
499
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
q
s
s
/
a
r
t
i
c
e
-
p
d
l
f
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
a
_
0
0
2
5
2
p
d
.
/
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Consistency pays off in science
FUNDING INFORMATION
This project was partially supported by grants from the Army Research Office (#W911NF-21-1-
0194) and the Air Force Office of Scientific Research (#FA9550-19-1-0391, #FA9550-19-1-0354).
DATA AVAILABILITY
The data for Nobel laureates is available at Li et al. (2019). The disambiguated APS data set is
available at Sinatra et al. (2016). The raw data set for the APS can be requested at https://
journals.aps.org/datasets. The code is available at https://github.com/siragerkol/Consistency
-pays-off-in-science. WoS data are not publicly available.
REFERENCES
Bornmann, L., & Tekles, A. (2019). Productivity does not equal use-
fulness. Scientometrics, 118(2), 705–707. https://doi.org/10.1007
/s11192-018-2982-5
Chu, J. S., & Evans, J. A. (2021). Slowed canonical progress in large
fields of science. Proceedings of the National Academy of
Sciences, 118(41), e2021636118. https://doi.org/10.1073/pnas
.2021636118, PubMed: 34607941
Dong, Y., Ma, H., Shen, Z., & Wang, K. (2017). A century of sci-
ence: Globalization of scientific collaborations, citations, and
innovations. In Proceedings of the 23rd ACM SIGKDD Interna-
tional Conference on Knowledge Discovery and Data Mining
(pp. 1437–1446). https://doi.org/10.1145/3097983.3098016
Egghe, L. (2006). Theory and practise of the g-index. Scientomet-
rics, 69(1), 131–152. https://doi.org/10.1007/s11192-006-0144-7
Fortunato, S., Bergstrom, C. T., Börner, K., Evans, J. A., Helbing, D.,
… Barabási, A.-L. (2018). Science of science. Science, 359(6379),
eaao0185. https://doi.org/10.1126/science.aao0185, PubMed:
29496846
Franzoni, C., & Rossi-Lamastra, C. (2017). Academic tenure,
risk-taking and the diversification of scientific research. Industry
and Innovation, 24(7), 691–712. https://doi.org/10.1080
/13662716.2016.1264067
Hirsch, J. E. (2005). An index to quantify an individual’s scientific
research output. Proceedings of the National Academy of
Sciences, 102(46), 16569–16572. https://doi.org/10.1073/pnas
.0507655102, PubMed: 16275915
Inhaber, H., & Przednowek, K. (1976). Quality of research and the
Nobel Prizes. Social Studies of Science, 6(1), 33–50. https://doi
.org/10.1177/030631277600600102
Levenshtein, V. I. (1966). Binary codes capable of correcting dele-
tions, insertions and reversals. Soviet Physics Doklady, 10(8),
707–710.
Li, J., Yin, Y., Fortunato, S., & Wang, D. (2019). A dataset of publica-
tion records for Nobel laureates. Scientific Data, 6(1), 33. https://
doi.org/10.1038/s41597-019-0033-6, PubMed: 31000709
Milojević, S. (2015). Quantifying the cognitive extent of science.
Journal of Informetrics, 9(4), 962–973. https://doi.org/10.1016/j
.joi.2015.10.005
Radicchi, F., & Castellano, C. (2013). Analysis of bibliometric indi-
cators for individual scholars in a large data set. Scientometrics,
97(3), 627–637. https://doi.org/10.1007/s11192-013-1027-3
Saito, T., & Rehmsmeier, M. (2015). The precision-recall plot is
more informative than the ROC plot when evaluating binary
classifiers on imbalanced datasets. PLOS ONE, 10(3), e0118432.
https://doi.org/10.1371/journal.pone.0118432, PubMed:
25738806
Sinatra, R., Wang, D., Deville, P., Song, C., & Barabási, A.-L. (2016).
Quantifying the evolution of individual scientific impact.
Science, 354(6312), aaf5239. https://doi.org/10.1126/science
.aaf5239, PubMed: 27811240
Sunahara, A. S., Perc, M., & Ribeiro, H. V. (2021). Association
between productivity and journal impact across disciplines and
career age. Physical Review Research, 3(3), 033158. https://doi
.org/10.1103/PhysRevResearch.3.033158
Wuchty, S., Jones, B. F., & Uzzi, B. (2007). The increasing dominance
of teams in production of knowledge. Science, 316(5827),
1036–1039. https://doi.org/10.1126/science.1136099, PubMed:
17431139
Quantitative Science Studies
500
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
q
s
s
/
a
r
t
i
c
e
-
p
d
l
f
/
/
/
/
4
2
4
9
1
2
1
3
6
3
8
0
q
s
s
_
a
_
0
0
2
5
2
p
d
.
/
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3