a Research Center for Science Systems, Japan Society for the Promotion of Science (JSPS) , Kojimachi , Chiyoda-ku, Tokyo , Japan.
b RIKEN Advanced Institute for Computational Science , Kobe , Japan.
Account Res. 2018;25(4):212-219. doi: 10.1080/08989621.2018.1449651. Epub 2018 Mar 21.
Both the scientific community and the general public have expressed concern over scientific misconduct. The number of retracted articles has increased dramatically over the past 20 years and now comprises about .02% of the 2 million articles published each year. Retraction of publications available in large public databases can be analyzed as an objective measure for scientific misconduct and errors. In this project, we analyzed retractions of scientific publications using the Web of Science (WoS) and PubMed databases. We found that a power law is applicable to distributions of retracting authors and retracted publications with exponents of about -.6 and -3.0, respectively. Application of a power-law model for retracted publications implies that retraction is not a random event. Analysis of the retraction distributions suggests that a small fraction (1-2%) of retracting authors with ≧5 retractions are responsible for around 10% of retraction. The probabilities for their repeating retraction are calculated using a statistical model: 3-5% likelihood of repeat retraction for authors with a single retraction at five years after the latest retraction and 26-37% for authors with five retractions at five years after the latest retraction. By focusing on those with repeated retractions, this analysis could contribute to identification of measures to reduce such repetition of retractions.
科学界和公众都对科研不端行为表示关注。在过去的 20 年中,被撤回的文章数量急剧增加,现在每年发表的 200 万篇文章中约有.02%被撤回。从大型公共数据库中撤回的出版物可以被分析为科研不端行为和错误的客观衡量标准。在这个项目中,我们使用 Web of Science(WoS)和 PubMed 数据库分析了科学出版物的撤回情况。我们发现,幂律适用于撤回作者和撤回出版物的分布,其指数分别约为 -.6 和 -3.0。幂律模型在撤回出版物中的应用意味着撤回不是随机事件。对撤回分布的分析表明,一小部分(1-2%)有 ≧5 次撤回的撤回作者负责约 10%的撤回。使用统计模型计算了他们重复撤回的概率:对于在最新撤回后五年内有一次撤回的作者,重复撤回的可能性为 3-5%;对于在最新撤回后五年内有五次撤回的作者,重复撤回的可能性为 26-37%。通过关注那些有重复撤回的作者,这种分析有助于确定减少此类重复撤回的措施。