Wei Wenjie, Liu Hongxu, Sun Zhuanlan
Tongji University Library, Tongji University, Shanghai, 200092 China.
College of Electronics and Information Engineering, Tongji University, Shanghai, 200092 China.
Scientometrics. 2022;127(8):4315-4333. doi: 10.1007/s11192-022-04462-y. Epub 2022 Jul 18.
The detection of emerging trends is of great interest to many stakeholders such as government and industry. Previous research focused on the machine learning, network analysis and time series analysis based on the bibliometrics data and made a promising progress. However, these approaches inevitably have time delay problems. For the reason that leader papers of "emerging topics" share the similar characters with the "cover papers", this study present a novel approach to translate the "emerging topics" detection to "cover paper" prediction. By using "AdaBoost model" and topic model, we construct a machine learning framework to imitate the top journal (chief) editor's judgement to select cover paper from material science. The results of our prediction were validated by consulting with field experts. This approach was also suitable for the Nature, Science, and Cell journals.
新兴趋势的检测受到政府和行业等众多利益相关者的极大关注。以往的研究基于文献计量数据,聚焦于机器学习、网络分析和时间序列分析,并取得了有前景的进展。然而,这些方法不可避免地存在时间延迟问题。由于“新兴主题”的领先论文与“封面论文”具有相似特征,本研究提出了一种将“新兴主题”检测转化为“封面论文”预测的新颖方法。通过使用“AdaBoost模型”和主题模型,我们构建了一个机器学习框架,以模仿顶级期刊(主编)从材料科学中挑选封面论文的判断。我们的预测结果通过咨询领域专家得到了验证。这种方法也适用于《自然》《科学》和《细胞》等期刊。