基于深度学习和引文网络链接预测的科学论文推荐系统。

Scientific paper recommender system using deep learning and link prediction in citation network.

作者信息

Li Weijuan

机构信息

Dean's Office, Yellow River Conservancy Technical Institute, Kaifeng, 475004, Henan, China.

出版信息

Heliyon. 2024 Jul 15;10(14):e34685. doi: 10.1016/j.heliyon.2024.e34685. eCollection 2024 Jul 30.

DOI:10.1016/j.heliyon.2024.e34685

PMID:39130403

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11315118/

Abstract

Today, the number of published scientific articles is increasing day by day, and this has made the process of searching for articles more difficult. The need to provide specific recommender systems (RSs) for suggesting scientific articles is strongly felt in this situation. Because searching for articles based only on matching the titles or content of other articles is not an efficient process. In this research, the combination of two content analysis and citation network is used to design an RS for scientific articles (RECSA). In RECSA, natural language processing and deep learning techniques are used to process the titles and extract the content attributes of the articles. For this purpose, first, the titles of the articles are pre-processed, and by using the Term Frequency Inverse Document Frequency (TF-IDF) criterion, the importance of each word in the title is estimated. Then the dimensions of the obtained attributes are reduced by using a convolutional neural network (CNN). Then, by using the cosine similarity criterion, the content similarity matrix of the articles is calculated based on the attribute vectors. Also, the link prediction approach is used to analyze the connections of scientific articles' citation network. Finally, in the third step of RECSA, the two similarity matrices calculated in the previous steps are combined using an influence coefficient parameter to obtain the final similarity matrix, and the recommendation operation is based on the highest similarity value. The efficiency of RECSA has been evaluated from different aspects and the results have been compared with previous works. According to the results, utilizing the combination of TF-IDF and CNN for analyzing content-based features, leads to at least 0.32 % improvement in terms of precision compared to previous works. Also, by integrating citation and content-based data, the precision of first suggestion in RECSA would be 99.01 % which indicates the minimum improvement of 0.9 % compared to compared methods. The results show that by using RECSA, the recommendation can be done with higher accuracy and efficiency.

摘要

如今，已发表的科学文章数量与日俱增，这使得文章检索过程变得更加困难。在这种情况下，人们强烈感受到需要提供特定的推荐系统（RS）来推荐科学文章。因为仅基于文章标题或内容匹配来搜索文章并非高效的过程。在本研究中，将两种内容分析与引文网络相结合，用于设计科学文章推荐系统（RECSA）。在RECSA中，使用自然语言处理和深度学习技术来处理文章标题并提取文章的内容属性。为此，首先对文章标题进行预处理，并使用词频逆文档频率（TF-IDF）准则来估计标题中每个单词的重要性。然后使用卷积神经网络（CNN）来降低所得属性的维度。接着，使用余弦相似度准则，基于属性向量计算文章的内容相似度矩阵。此外，采用链接预测方法来分析科学文章引文网络的连接。最后，在RECSA的第三步中，使用影响系数参数将前两步计算得到的两个相似度矩阵进行组合，以获得最终的相似度矩阵，并基于最高相似度值进行推荐操作。从不同方面对RECSA的效率进行了评估，并将结果与先前的工作进行了比较。结果表明，与先前的工作相比，利用TF-IDF和CNN的组合来分析基于内容的特征，在精度方面至少提高了0.32%。此外，通过整合基于引文和内容的数据，RECSA中首次推荐的精度将达到99.01%，这表明与比较方法相比至少提高了0.9%。结果表明，使用RECSA可以实现更高准确率和效率的推荐。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2d2/11315118/2af5235c8a95/gr1.jpg

相似文献

Scientific paper recommender system using deep learning and link prediction in citation network.基于深度学习和引文网络链接预测的科学论文推荐系统。

Heliyon. 2024 Jul 15;10(14):e34685. doi: 10.1016/j.heliyon.2024.e34685. eCollection 2024 Jul 30.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

How Does ChatGPT Use Source Information Compared With Google? A Text Network Analysis of Online Health Information.ChatGPT 与谷歌相比如何使用来源信息？在线健康信息的文本网络分析。

Clin Orthop Relat Res. 2024 Apr 1;482(4):578-588. doi: 10.1097/CORR.0000000000002995. Epub 2024 Mar 1.

A CNN-Based Framework for Predicting Public Emotion and Multi-Level Behaviors Based on Network Public Opinion.一种基于卷积神经网络的、基于网络舆情预测公众情绪和多层次行为的框架。

Front Psychol. 2022 Jun 23;13:909439. doi: 10.3389/fpsyg.2022.909439. eCollection 2022.

Heterogeneous deep graph convolutional network with citation relational BERT for COVID-19 inline citation recommendation.用于COVID-19内联引用推荐的具有引用关系BERT的异构深度图卷积网络。

Expert Syst Appl. 2023 Mar 1;213:118841. doi: 10.1016/j.eswa.2022.118841. Epub 2022 Sep 17.

Scientific text citation analysis using CNN features and ensemble learning model.基于 CNN 特征和集成学习模型的科技文本引文分析

PLoS One. 2024 May 28;19(5):e0302304. doi: 10.1371/journal.pone.0302304. eCollection 2024.

Recommender System for the Efficient Treatment of COVID-19 Using a Convolutional Neural Network Model and Image Similarity.基于卷积神经网络模型和图像相似度的新冠高效治疗推荐系统

Diagnostics (Basel). 2022 Nov 5;12(11):2700. doi: 10.3390/diagnostics12112700.

In the pursuit of a semantic similarity metric based on UMLS annotations for articles in PubMed Central Open Access.在为美国国立医学图书馆医学主题词表（UMLS）注释的基于PubMed Central开放获取文章的语义相似性度量标准的研究中。

J Biomed Inform. 2015 Oct;57:204-18. doi: 10.1016/j.jbi.2015.07.015. Epub 2015 Aug 1.

SVD-CNN: A Convolutional Neural Network Model with Orthogonal Constraints Based on SVD for Context-Aware Citation Recommendation.SVD-CNN：一种基于奇异值分解（SVD）的具有正交约束的卷积神经网络模型，用于上下文感知引用推荐。

Comput Intell Neurosci. 2020 Oct 22;2020:5343214. doi: 10.1155/2020/5343214. eCollection 2020.

Clustering more than two million biomedical publications: comparing the accuracies of nine text-based similarity approaches.对两百多万篇生物医学文献进行聚类：比较九种基于文本的相似度方法的准确性。

PLoS One. 2011 Mar 17;6(3):e18029. doi: 10.1371/journal.pone.0018029.

本文引用的文献

Emati: a recommender system for biomedical literature based on supervised learning.Emati：一种基于监督学习的生物医学文献推荐系统。

Database (Oxford). 2022 Dec 9;2022. doi: 10.1093/database/baac104.

Adaptive sigmoid-like and PReLU activation functions for all-optical perceptron.用于全光感知器的自适应类Sigmoid和PReLU激活函数。

Opt Lett. 2021 May 1;46(9):2003-2006. doi: 10.1364/OL.422930.

A content-based literature recommendation system for datasets to improve data reusability - A case study on Gene Expression Omnibus (GEO) datasets.基于内容的文献推荐系统，用于数据集，以提高数据可重用性 - 以基因表达综合 (GEO) 数据集为例。

J Biomed Inform. 2020 Apr;104:103399. doi: 10.1016/j.jbi.2020.103399. Epub 2020 Mar 6.

Text feature extraction based on deep learning: a review.基于深度学习的文本特征提取：综述。

EURASIP J Wirel Commun Netw. 2017;2017(1):211. doi: 10.1186/s13638-017-0993-1. Epub 2017 Dec 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于深度学习和引文网络链接预测的科学论文推荐系统。

Scientific paper recommender system using deep learning and link prediction in citation network.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献