Science-Metrix Inc., Montréal, Québec, Canada.
Elsevier B.V., Amsterdam, Netherlands.
PLoS One. 2021 May 11;16(5):e0251493. doi: 10.1371/journal.pone.0251493. eCollection 2021.
Classification schemes for scientific activity and publications underpin a large swath of research evaluation practices at the organizational, governmental, and national levels. Several research classifications are currently in use, and they require continuous work as new classification techniques becomes available and as new research topics emerge. Convolutional neural networks, a subset of "deep learning" approaches, have recently offered novel and highly performant methods for classifying voluminous corpora of text. This article benchmarks a deep learning classification technique on more than 40 million scientific articles and on tens of thousands of scholarly journals. The comparison is performed against bibliographic coupling-, direct citation-, and manual-based classifications-the established and most widely used approaches in the field of bibliometrics, and by extension, in many science and innovation policy activities such as grant competition management. The results reveal that the performance of this first iteration of a deep learning approach is equivalent to the graph-based bibliometric approaches. All methods presented are also on par with manual classification. Somewhat surprisingly, no machine learning approaches were found to clearly outperform the simple label propagation approach that is direct citation. In conclusion, deep learning is promising because it performed just as well as the other approaches but has more flexibility to be further improved. For example, a deep neural network incorporating information from the citation network is likely to hold the key to an even better classification algorithm.
科学活动和出版物的分类方案是组织、政府和国家各级研究评估实践的基础。目前有几种研究分类方法正在使用,随着新的分类技术的出现和新的研究课题的出现,它们需要不断的工作。卷积神经网络是“深度学习”方法的一个子集,最近为分类大量文本语料库提供了新颖且高性能的方法。本文对超过 4000 万篇科学文章和数万个学术期刊进行了深度学习分类技术的基准测试。与文献耦合、直接引文和基于手动的分类进行了比较-这是文献计量学领域以及许多科学和创新政策活动(如资助竞争管理)中最广泛使用的方法。结果表明,这种深度学习方法的第一次迭代的性能与基于图的文献计量方法相当。所呈现的所有方法也与手动分类相当。有些令人惊讶的是,没有发现任何机器学习方法明显优于直接引文的简单标签传播方法。总之,深度学习很有前途,因为它的表现与其他方法一样好,但具有更大的灵活性,可以进一步改进。例如,包含引文网络信息的深度神经网络可能是实现更好分类算法的关键。