• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SeEn:用于序列感知推荐的连续丰富数据集。

SeEn: Sequential enriched datasets for sequence-aware recommendations.

机构信息

LASIGE, Departamento de Informática, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal.

CENTRA, Departamento de Física, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal.

出版信息

Sci Data. 2022 Aug 4;9(1):478. doi: 10.1038/s41597-022-01598-7.

DOI:10.1038/s41597-022-01598-7
PMID:35927282
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9352715/
Abstract

The recommendation of items based on the sequential past users' preferences has evolved in the last few years, mostly due to deep learning approaches, such as BERT4Rec. However, in scientific fields, recommender systems for recommending the next best item are not widely used. The main goal of this work is to improve the results for the recommendation of the next best item in scientific domains using sequence aware datasets and algorithms. In the first part of this work, we present the adaptation of a previous method (LIBRETTI) for creating sequential recommendation datasets for scientific fields. The results were assessed in Astronomy and Chemistry. In the second part of this work, we propose a new approach to improve the datasets, not the algorithms, to obtain better recommendations. The new hybrid approach is called sequential enrichment (SeEn), which consists of adding to a sequence of items the n most similar items after each original item. The results show that the enriched sequences obtained better results than the original ones. The Chemistry dataset improved by approximately seven percentage points and the Astronomy dataset by 16 percentage points for Hit Ratio and Normalized Discounted Cumulative Gain.

摘要

基于用户历史偏好的推荐物品在过去几年中得到了发展,主要得益于深度学习方法,如 BERT4Rec。然而,在科学领域,推荐下一个最佳物品的推荐系统并未得到广泛应用。本工作的主要目标是使用序列感知数据集和算法改进科学领域中推荐下一个最佳物品的结果。在本工作的第一部分,我们介绍了对以前的方法(LIBRETTI)的适应,用于为科学领域创建序列推荐数据集。结果在天文学和化学领域进行了评估。在本工作的第二部分,我们提出了一种新的方法来改进数据集,而不是算法,以获得更好的推荐。新的混合方法称为序列增强(SeEn),它由在每个原始项目之后向项目序列中添加 n 个最相似的项目组成。结果表明,与原始序列相比,增强序列获得了更好的结果。化学数据集的命中率和归一化折扣累积增益分别提高了约七个百分点,天文学数据集提高了 16 个百分点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/779030a27678/41597_2022_1598_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/1958f694c36e/41597_2022_1598_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/0c929a40b925/41597_2022_1598_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/ced3c911231e/41597_2022_1598_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/6097f4781512/41597_2022_1598_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/88e039c773da/41597_2022_1598_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/be6553b0f352/41597_2022_1598_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/f10cd316677d/41597_2022_1598_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/779030a27678/41597_2022_1598_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/1958f694c36e/41597_2022_1598_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/0c929a40b925/41597_2022_1598_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/ced3c911231e/41597_2022_1598_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/6097f4781512/41597_2022_1598_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/88e039c773da/41597_2022_1598_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/be6553b0f352/41597_2022_1598_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/f10cd316677d/41597_2022_1598_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9329/9352715/779030a27678/41597_2022_1598_Fig8_HTML.jpg

相似文献

1
SeEn: Sequential enriched datasets for sequence-aware recommendations.SeEn:用于序列感知推荐的连续丰富数据集。
Sci Data. 2022 Aug 4;9(1):478. doi: 10.1038/s41597-022-01598-7.
2
Self-Attention Based Time-Rating-Aware Context Recommender System.基于自注意力的时间评分感知上下文推荐系统。
Comput Intell Neurosci. 2022 Sep 17;2022:9288902. doi: 10.1155/2022/9288902. eCollection 2022.
3
Deep Reinforcement Learning Framework for Category-Based Item Recommendation.用于基于类别的项目推荐的深度强化学习框架
IEEE Trans Cybern. 2022 Nov;52(11):12028-12041. doi: 10.1109/TCYB.2021.3089941. Epub 2022 Oct 17.
4
Collaborative Filtering Recommendation on Users' Interest Sequences.基于用户兴趣序列的协同过滤推荐
PLoS One. 2016 May 19;11(5):e0155739. doi: 10.1371/journal.pone.0155739. eCollection 2016.
5
Multi-level sequence denoising with cross-signal contrastive learning for sequential recommendation.基于跨信号对比学习的多层次序列去噪在序列推荐中的应用。
Neural Netw. 2024 Nov;179:106480. doi: 10.1016/j.neunet.2024.106480. Epub 2024 Jun 29.
6
Sequential Recommendation Based on Multivariate Hawkes Process Embedding With Attention.基于带注意力机制的多元霍克斯过程嵌入的序列推荐
IEEE Trans Cybern. 2022 Nov;52(11):11893-11905. doi: 10.1109/TCYB.2021.3077361. Epub 2022 Oct 17.
7
Hierarchical User Intention-Preference for Sequential Recommendation with Relation-Aware Heterogeneous Information Network Embedding.基于关系感知异质信息网络嵌入的层次化用户意图-偏好序贯推荐
Big Data. 2022 Oct;10(5):466-478. doi: 10.1089/big.2021.0395. Epub 2022 Aug 24.
8
Deep Bi-LSTM Networks for Sequential Recommendation.用于序列推荐的深度双向长短期记忆网络
Entropy (Basel). 2020 Aug 7;22(8):870. doi: 10.3390/e22080870.
9
HAM: Hybrid Associations Models for Sequential Recommendation.HAM:用于序列推荐的混合关联模型
IEEE Trans Knowl Data Eng. 2022 Oct;34(10):4838-4853. doi: 10.1109/tkde.2021.3049692. Epub 2021 Jan 6.
10
Remembering past and predicting future: a hybrid recurrent neural network based recommender system.铭记过去,预测未来:基于混合循环神经网络的推荐系统。
J Ambient Intell Humaniz Comput. 2022 Sep 4:1-12. doi: 10.1007/s12652-022-04375-x.

本文引用的文献

1
Health Recommender Systems: Systematic Review.健康推荐系统:系统评价。
J Med Internet Res. 2021 Jun 29;23(6):e18035. doi: 10.2196/18035.
2
Hybrid semantic recommender system for chemical compounds in large-scale datasets.大规模数据集中化合物的混合语义推荐系统。
J Cheminform. 2021 Feb 23;13(1):15. doi: 10.1186/s13321-021-00495-2.
3
DSPLMF: A Method for Cancer Drug Sensitivity Prediction Using a Novel Regularization Approach in Logistic Matrix Factorization.DSPLMF:一种在逻辑矩阵分解中使用新型正则化方法进行癌症药物敏感性预测的方法。
Front Genet. 2020 Feb 27;11:75. doi: 10.3389/fgene.2020.00075. eCollection 2020.
4
A New Weighted Imputed Neighborhood-Regularized Tri-Factorization One-Class Collaborative Filtering Algorithm: Application to Target Gene Prediction of Transcription Factors.一种新的加权拟近邻正则化三因子化单类协同过滤算法:在转录因子靶基因预测中的应用。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):126-137. doi: 10.1109/TCBB.2020.2968442. Epub 2021 Feb 3.
5
STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity.STS-NLSP:一种基于网络的标签空间划分方法,用于利用结构和语义相似性的混合特征预测膜转运体底物的特异性
Front Bioeng Biotechnol. 2019 Nov 6;7:306. doi: 10.3389/fbioe.2019.00306. eCollection 2019.
6
On the Unreported-Profile-is-Negative Assumption for Predictive Cheminformatics.基于未报告特征为阴性的预测化学信息学假设
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1352-1363. doi: 10.1109/TCBB.2019.2913855. Epub 2019 Apr 30.
7
Human Disease Ontology 2018 update: classification, content and workflow expansion.人类疾病本体论 2018 更新:分类、内容和工作流程扩展。
Nucleic Acids Res. 2019 Jan 8;47(D1):D955-D962. doi: 10.1093/nar/gky1032.
8
The Gene Ontology Resource: 20 years and still GOing strong.《基因本体论资源:20 年,持续强大》
Nucleic Acids Res. 2019 Jan 8;47(D1):D330-D338. doi: 10.1093/nar/gky1055.
9
Dual-Layer Strengthened Collaborative Topic Regression Modeling for Predicting Drug Sensitivity.双层强化协同主题回归建模预测药物敏感性。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):587-598. doi: 10.1109/TCBB.2018.2864739. Epub 2018 Aug 10.
10
Datasets2Tools, repository and search engine for bioinformatics datasets, tools and canned analyses.Datasets2Tools,生物信息数据集、工具和预分析的存储库和搜索引擎。
Sci Data. 2018 Feb 27;5:180023. doi: 10.1038/sdata.2018.23.