• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于语义和元数据特征的科学论文被引潜力即时预测:以人工智能领域为例。

Instant prediction of scientific paper cited potential based on semantic and metadata features: Taking artificial intelligence field as an example.

作者信息

Zhu Hou, Shuhuai Li

机构信息

School of Information Management, Sun Yat-sen University, Guangzhou, China.

Australian Artificial Intelligence Institute, University of Technology Sydney, Sydney, NSW, Australia.

出版信息

PLoS One. 2024 Dec 2;19(12):e0312945. doi: 10.1371/journal.pone.0312945. eCollection 2024.

DOI:10.1371/journal.pone.0312945
PMID:39621723
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11611117/
Abstract

With the continuous increase in the number of academic researchers, the volume of scientific papers is also increasing rapidly. The challenge of identifying papers with greater potential academic impact from this large pool has received increasing attention. The citation frequency of a paper is often used as an objective indicator to gauge the academic influence of the paper. The task of citation frequency prediction based on historical citation data in previous studies can achieve high accuracy. However, it can only be executed after the paper has been published for a period. The delay is not conducive to timely discovery of papers with high citation frequency. In this paper, we propose a novel method for predicting cited potential of a paper based on the metadata and semantic information, which can predict the cited potential of academic paper instantly once it has been published. Specifically, the semantic information, such as abstract, semantic span and semantic inflection, is extracted to enhance the ability of the prediction model based on machine learning. To prove the effectiveness and rationality of cited potential prediction model, we conduct two experiments to validate the model and find the most effective combination of input information. The empirical experiments show that the prediction accuracy of our proposed model can reach 88% for the instant prediction of citation.

摘要

随着学术研究人员数量的不断增加,科学论文的数量也在迅速增长。从这大量的论文中识别出具有更大潜在学术影响力的论文所面临的挑战受到了越来越多的关注。论文的被引频次常被用作衡量论文学术影响力的客观指标。基于以往研究中的历史被引数据进行被引频次预测的任务能够实现较高的准确率。然而,这只能在论文发表一段时间后才能进行。这种延迟不利于及时发现高被引频次的论文。在本文中,我们提出了一种基于元数据和语义信息预测论文被引潜力的新方法,该方法能够在学术论文发表后立即预测其被引潜力。具体而言,提取诸如摘要、语义跨度和语义拐点等语义信息,以增强基于机器学习的预测模型的能力。为了证明被引潜力预测模型的有效性和合理性,我们进行了两项实验来验证该模型并找出最有效的输入信息组合。实证实验表明,我们提出的模型对于即时被引预测的准确率可达88%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/d54e36e6633c/pone.0312945.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/cd451b990cde/pone.0312945.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/4ec62b240ec5/pone.0312945.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/ff1703668fda/pone.0312945.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/86b076237b83/pone.0312945.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/a3c615d3e9db/pone.0312945.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/d54e36e6633c/pone.0312945.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/cd451b990cde/pone.0312945.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/4ec62b240ec5/pone.0312945.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/ff1703668fda/pone.0312945.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/86b076237b83/pone.0312945.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/a3c615d3e9db/pone.0312945.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f304/11611117/d54e36e6633c/pone.0312945.g006.jpg

相似文献

1
Instant prediction of scientific paper cited potential based on semantic and metadata features: Taking artificial intelligence field as an example.基于语义和元数据特征的科学论文被引潜力即时预测:以人工智能领域为例。
PLoS One. 2024 Dec 2;19(12):e0312945. doi: 10.1371/journal.pone.0312945. eCollection 2024.
2
Research hotspots and frontiers of machine learning in renal medicine: a bibliometric and visual analysis from 2013 to 2024.肾脏医学中机器学习的研究热点与前沿:2013年至2024年的文献计量学与可视化分析
Int Urol Nephrol. 2025 Mar;57(3):907-928. doi: 10.1007/s11255-024-04259-3. Epub 2024 Oct 30.
3
[The citation analysis of the publications in Chinese Journal of Preventive Medicine from 2014 to 2017].《2014年至2017年《中华预防医学杂志》发表论文的引文分析》
Zhonghua Yu Fang Yi Xue Za Zhi. 2020 Aug 6;54(8):867-874. doi: 10.3760/cma.j.cn112150-20200614-00876.
4
[The characteristics and citation analysis of the publications in during 2016].[2016年期间某刊物出版物的特征及引文分析] (这里原文中“during 2016”前面应该有具体刊物名称,翻译时按实际情况补充完整)
Zhonghua Yi Xue Za Zhi. 2020 Dec 29;100(48):3903-3910. doi: 10.3760/cma.j.cn112137-20201117-03122.
5
[The characteristics and citation analysis of the publications in the from 2017 to 2019].[2017年至2019年期间[具体来源]出版物的特征与引文分析] (注:原文中“the from”表述不完整,推测是有遗漏信息)
Zhonghua Yi Xue Za Zhi. 2023 Dec 26;103(48):3959-3966. doi: 10.3760/cma.j.cn112137-20231106-01027.
6
Application of artificial intelligence in rheumatic disease: a bibliometric analysis.人工智能在风湿性疾病中的应用:文献计量分析。
Clin Exp Med. 2024 Aug 23;24(1):196. doi: 10.1007/s10238-024-01453-6.
7
Artificial Intelligence and Machine Learning in Cancer Research: A Systematic and Thematic Analysis of the Top 100 Cited Articles Indexed in Scopus Database.人工智能和机器学习在癌症研究中的应用:Scopus 数据库中被引前 100 篇文章的系统和主题分析。
Cancer Control. 2022 Jan-Dec;29:10732748221095946. doi: 10.1177/10732748221095946.
8
Artificial intelligence applied to ophthalmology and optometry: A citation network analysis.人工智能在眼科学和视光学中的应用:引文网络分析。
J Optom. 2022;15 Suppl 1(Suppl 1):S82-S90. doi: 10.1016/j.optom.2022.06.005. Epub 2022 Sep 21.
9
A comparative bibliometric analysis of the top 150 cited papers in hypospadiology (1945-2013).尿道下裂(1945 - 2013年)领域被引用次数排名前150的论文的比较文献计量分析
J Pediatr Urol. 2015 Apr;11(2):85.e1-85.e11. doi: 10.1016/j.jpurol.2014.11.022. Epub 2015 Mar 4.
10
Explaining protein-protein interactions with knowledge graph-based semantic similarity.用基于知识图的语义相似度解释蛋白质-蛋白质相互作用。
Comput Biol Med. 2024 Mar;170:108076. doi: 10.1016/j.compbiomed.2024.108076. Epub 2024 Feb 1.

本文引用的文献

1
How quantifying the shape of stories predicts their success.如何通过量化故事的形状来预测其成功。
Proc Natl Acad Sci U S A. 2021 Jun 29;118(26). doi: 10.1073/pnas.2011695118.
2
The preeminence of ethnic diversity in scientific collaboration.科学合作中种族多样性的卓越性。
Nat Commun. 2018 Dec 4;9(1):5163. doi: 10.1038/s41467-018-07634-8.
3
Peering Into the Black Box of Artificial Intelligence: Evaluation Metrics of Machine Learning Methods.窥视人工智能的黑箱:机器学习方法的评估指标。
AJR Am J Roentgenol. 2019 Jan;212(1):38-43. doi: 10.2214/AJR.18.20224. Epub 2018 Oct 17.
4
Atypical combinations and scientific impact.非典型组合和科学影响。
Science. 2013 Oct 25;342(6157):468-72. doi: 10.1126/science.1240474.
5
Are shorter article titles more attractive for citations? Cross-sectional study of 22 scientific journals.较短的文章标题对引用更具吸引力吗?对22种科学期刊的横断面研究。
Croat Med J. 2010 Apr;51(2):165-70. doi: 10.3325/cmj.2010.51.165.
6
The history and meaning of the journal impact factor.期刊影响因子的历史与意义
JAMA. 2006 Jan 4;295(1):90-3. doi: 10.1001/jama.295.1.90.
7
An index to quantify an individual's scientific research output.一个用于量化个人科研产出的指标。
Proc Natl Acad Sci U S A. 2005 Nov 15;102(46):16569-72. doi: 10.1073/pnas.0507655102. Epub 2005 Nov 7.