
Hybrid semantic recommender system for chemical compounds in large-scale datasets.

Authors

Barros Marcia, Moitinho Andre, Couto Francisco M

Affiliations

LASIGE, Departamento de Informática, Faculdade de Ciências, Universidade de Lisboa, 1749-016, Lisboa, Portugal.

CENTRA, Departamento de Física, Faculdade de Ciências, Universidade de Lisboa, 1749-016, Lisboa, Portugal.

Publication information

J Cheminform. 2021 Feb 23;13(1):15. doi: 10.1186/s13321-021-00495-2.

DOI:10.1186/s13321-021-00495-2
PMID:33622374
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7903631/
Abstract

The large and growing number of chemical compounds makes datasets of such compounds difficult to explore. In this work, we propose using recommender systems to identify compounds of interest to scientific researchers. Our approach is a hybrid recommender model that is suited to implicit feedback datasets and focuses on retrieving a list of items ranked by relevance. The model integrates collaborative-filtering algorithms for implicit feedback (Alternating Least Squares and Bayesian Personalized Ranking) with a new content-based algorithm that uses the semantic similarity between chemical compounds in the ChEBI ontology. The algorithms were assessed on CheRM-20, an implicit feedback dataset with more than 16,000 items (chemical compounds). The hybrid model improved on the results of the collaborative-filtering algorithms by more than ten percentage points in most of the evaluation metrics assessed.
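To make the idea of a hybrid ranking concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not say how the collaborative-filtering and content-based scores are combined, so the weighted sum, the `alpha` parameter, the min-max normalization, and all function names here are assumptions for illustration. The collaborative-filtering scores stand in for the output of a model such as Alternating Least Squares, and the toy similarity matrix stands in for semantic similarities computed over the ChEBI ontology.

```python
import numpy as np

def min_max(x):
    """Scale a score vector to [0, 1]; a constant vector maps to all zeros."""
    span = x.max() - x.min()
    return (x - x.min()) / span if span > 0 else np.zeros_like(x, dtype=float)

def content_scores(interactions, item_sim, user):
    """Score each item by its mean similarity to the items the user interacted with."""
    seen = np.flatnonzero(interactions[user])
    if seen.size == 0:
        return np.zeros(item_sim.shape[0])
    return item_sim[:, seen].mean(axis=1)

def hybrid_rank(cf_scores, cb_scores, seen_mask, alpha=0.5, k=3):
    """Rank unseen items by a weighted sum of normalized CF and content scores.

    alpha=1.0 uses only collaborative filtering; alpha=0.0 only content.
    This combination rule is an assumption, not taken from the paper.
    """
    combined = alpha * min_max(cf_scores) + (1 - alpha) * min_max(cb_scores)
    combined = np.where(seen_mask, -np.inf, combined)  # never recommend seen items
    order = np.argsort(-combined, kind="stable")
    n_unseen = int((~seen_mask).sum())
    return order[:min(k, n_unseen)].tolist()

# Toy implicit-feedback matrix: rows are users, columns are compounds (1 = interacted).
interactions = np.array([[1, 0, 0, 1, 0],
                         [0, 1, 1, 0, 0]])

# Placeholder item-item similarities; a real system would derive these from an ontology.
item_sim = np.array([[1.0, 0.1, 0.2, 0.7, 0.9],
                     [0.1, 1.0, 0.8, 0.2, 0.1],
                     [0.2, 0.8, 1.0, 0.3, 0.2],
                     [0.7, 0.2, 0.3, 1.0, 0.6],
                     [0.9, 0.1, 0.2, 0.6, 1.0]])

# Hypothetical collaborative-filtering scores for user 0 (e.g. from a factorization model).
cf_scores = np.array([0.9, 0.2, 0.6, 0.8, 0.1])

user = 0
seen_mask = interactions[user].astype(bool)
cb_scores = content_scores(interactions, item_sim, user)

hybrid_top = hybrid_rank(cf_scores, cb_scores, seen_mask, alpha=0.5)
cf_only_top = hybrid_rank(cf_scores, cb_scores, seen_mask, alpha=1.0)
print(hybrid_top)   # [4, 2, 1]
print(cf_only_top)  # [2, 1, 4]
```

In this toy example, compound 4 has the lowest collaborative-filtering score but is highly similar to the compounds user 0 already interacted with, so the content-based term moves it to the top of the hybrid ranking. Whether such a shift improves recommendations depends on the data and on how the weight is chosen.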


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/1182e603209d/13321_2021_495_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/924fc463d1e6/13321_2021_495_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/0d8aa752b83a/13321_2021_495_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/35e47a2669ba/13321_2021_495_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/49c4856dd942/13321_2021_495_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/36f683112f8b/13321_2021_495_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/3a6b60e63df5/13321_2021_495_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/0df1ed13f421/13321_2021_495_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/7657e51cfb3d/13321_2021_495_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/9102f885277f/13321_2021_495_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e73/7903631/1683cb296397/13321_2021_495_Fig11_HTML.jpg


Cited by

1
SeEn: Sequential enriched datasets for sequence-aware recommendations.
Sci Data. 2022 Aug 4;9(1):478. doi: 10.1038/s41597-022-01598-7.
