• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于文中引用频率的相关研究论文推荐。

In-text citation's frequencies-based recommendations of relevant research papers.

作者信息

Shahid Abdul, Afzal Muhammad Tanvir, Alharbi Abdullah, Aljuaid Hanan, Al-Otaibi Shaha

机构信息

Institute of Computing, Kohat University of Science & Technology, Kohat, Pakistan.

Department of Computer Science, NAMAL Institute, Mianwali, Pakistan.

出版信息

PeerJ Comput Sci. 2021 Jun 4;7:e524. doi: 10.7717/peerj-cs.524. eCollection 2021.

DOI:10.7717/peerj-cs.524
PMID:34150995
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8189020/
Abstract

From the past half of a century, identification of the relevant documents is deemed an active area of research due to the rapid increase of data on the web. The traditional models to retrieve relevant documents are based on bibliographic information such as Bibliographic coupling, Co-citations, and Direct citations. However, in the recent past, the scientific community has started to employ textual features to improve existing models' accuracy. In our previous study, we found that analysis of citations at a deep level (i.e., content level) can play a paramount role in finding more relevant documents than surface level (i.e., just bibliography details). We found that cited and citing papers have a high degree of relevancy when in-text citations frequency of the cited paper is more than five times in the citing paper's text. This paper is an extension of our previous study in terms of its evaluation of a comprehensive dataset. Moreover, the study results are also compared with other state-of-the-art approaches i.e., content, metadata, and bibliography. For evaluation, a user study is conducted on selected papers from 1,200 documents (comprise about 16,000 references) of an online journal, Journal of Computer Science (J.UCS). The evaluation results indicate that in-text citation frequency has attained higher precision in finding relevant papers than other state-of-the-art techniques such as content, bibliographic coupling, and metadata-based techniques. The use of in-text citation may help in enhancing the quality of existing information systems and digital libraries. Further, more sophisticated measure may be redefined be considering the use of in-text citations.

摘要

在过去的半个世纪里,由于网络数据的迅速增长,相关文献的识别被视为一个活跃的研究领域。传统的检索相关文献的模型是基于诸如文献耦合、共被引和直接引用等书目信息。然而,最近科学界开始采用文本特征来提高现有模型的准确性。在我们之前的研究中,我们发现深入分析引用(即内容层面)在寻找比表面层面(即仅仅是书目细节)更相关的文献方面可以发挥至关重要的作用。我们发现,当被引论文在引用论文文本中的文内引用频率超过五次时,被引论文和引用论文具有高度相关性。本文是我们之前研究的扩展,对一个综合数据集进行了评估。此外,研究结果还与其他最先进的方法进行了比较,即内容、元数据和书目。为了进行评估,我们对在线期刊《计算机科学杂志》(J.UCS)的1200篇文档(包含约16000条参考文献)中挑选的论文进行了用户研究。评估结果表明,与其他最先进的技术(如内容、文献耦合和基于元数据的技术)相比,文内引用频率在查找相关论文方面具有更高的精度。文内引用的使用可能有助于提高现有信息系统和数字图书馆的质量。此外,考虑到文内引用的使用,可能会重新定义更复杂的度量标准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/0bbf19bc0622/peerj-cs-07-524-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/1c3690d991db/peerj-cs-07-524-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/00f47a41e13f/peerj-cs-07-524-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/dd601ae4cd50/peerj-cs-07-524-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/0bbf19bc0622/peerj-cs-07-524-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/1c3690d991db/peerj-cs-07-524-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/00f47a41e13f/peerj-cs-07-524-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/dd601ae4cd50/peerj-cs-07-524-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a01a/8189020/0bbf19bc0622/peerj-cs-07-524-g004.jpg

相似文献

1
In-text citation's frequencies-based recommendations of relevant research papers.基于文中引用频率的相关研究论文推荐。
PeerJ Comput Sci. 2021 Jun 4;7:e524. doi: 10.7717/peerj-cs.524. eCollection 2021.
2
Important citation identification by exploiting content and section-wise in-text citation count.利用内容和按节计算的内文引文计数来进行重要引文识别。
PLoS One. 2020 Mar 5;15(3):e0228885. doi: 10.1371/journal.pone.0228885. eCollection 2020.
3
Citation analysis of computer systems papers.计算机系统论文的引文分析。
PeerJ Comput Sci. 2023 May 16;9:e1389. doi: 10.7717/peerj-cs.1389. eCollection 2023.
4
Impact Factors and Prediction of Popular Topics in a Journal.期刊中热门话题的影响因素及预测
Ultraschall Med. 2016 Aug;37(4):343-5. doi: 10.1055/s-0042-111209. Epub 2016 Aug 4.
5
National bias in citations in urology journals: parochialism or availability?泌尿学杂志中引文的国家偏见:狭隘主义还是可得性?
BJU Int. 1999 Oct;84(6):601-3. doi: 10.1046/j.1464-410x.1999.00267.x.
6
Active research fields in anesthesia: a document co-citation analysis of the anesthetic literature.麻醉学的活跃研究领域:麻醉学文献的文献共被引分析
Anesth Analg. 2008 May;106(5):1524-33, table of contents. doi: 10.1213/ane.0b013e31816d18a1.
7
Multi-label classification of research articles using Word2Vec and identification of similarity threshold.基于 Word2Vec 的研究论文多标签分类及相似度阈值确定
Sci Rep. 2021 Nov 9;11(1):21900. doi: 10.1038/s41598-021-01460-7.
8
Tracing the wider impacts of biomedical research: a literature search to develop a novel citation categorisation technique.追踪生物医学研究的更广泛影响:一项旨在开发新型引文分类技术的文献检索
Scientometrics. 2012 Oct;93(1):125-134. doi: 10.1007/s11192-012-0642-8. Epub 2012 Feb 1.
9
Scientometrics Approach to Research in Ovine Mastitis from 1970 to 2019 (with a Complete List of Relevant Literature References).1970年至2019年绵羊乳腺炎研究的科学计量学方法(附相关文献完整列表)
Pathogens. 2020 Jul 17;9(7):585. doi: 10.3390/pathogens9070585.
10
[The citation analysis of the publications in Chinese Journal of Preventive Medicine from 2014 to 2017].《2014年至2017年《中华预防医学杂志》发表论文的引文分析》
Zhonghua Yu Fang Yi Xue Za Zhi. 2020 Aug 6;54(8):867-874. doi: 10.3760/cma.j.cn112150-20200614-00876.

本文引用的文献

1
The rate of growth in scientific publication and the decline in coverage provided by Science Citation Index.科学出版物的增长速度以及《科学引文索引》所提供覆盖范围的下降。
Scientometrics. 2010 Sep;84(3):575-603. doi: 10.1007/s11192-010-0202-z. Epub 2010 Mar 10.
2
The history and meaning of the journal impact factor.期刊影响因子的历史与意义
JAMA. 2006 Jan 4;295(1):90-3. doi: 10.1001/jama.295.1.90.
3
An index to quantify an individual's scientific research output.一个用于量化个人科研产出的指标。
Proc Natl Acad Sci U S A. 2005 Nov 15;102(46):16569-72. doi: 10.1073/pnas.0507655102. Epub 2005 Nov 7.