• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过特征处理方案优化药物-靶标亲和力预测方法。

Optimization of drug-target affinity prediction methods through feature processing schemes.

机构信息

Department of Computer Science, University of Tsukuba, Tsukuba, Japan.

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China.

出版信息

Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad615.

DOI:10.1093/bioinformatics/btad615
PMID:37812388
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10636279/
Abstract

MOTIVATION

Numerous high-accuracy drug-target affinity (DTA) prediction models, whose performance is heavily reliant on the drug and target feature information, are developed at the expense of complexity and interpretability. Feature extraction and optimization constitute a critical step that significantly influences the enhancement of model performance, robustness, and interpretability. Many existing studies aim to comprehensively characterize drugs and targets by extracting features from multiple perspectives; however, this approach has drawbacks: (i) an abundance of redundant or noisy features; and (ii) the feature sets often suffer from high dimensionality.

RESULTS

In this study, to obtain a model with high accuracy and strong interpretability, we utilize various traditional and cutting-edge feature selection and dimensionality reduction techniques to process self-associated features and adjacent associated features. These optimized features are then fed into learning to rank to achieve efficient DTA prediction. Extensive experimental results on two commonly used datasets indicate that, among various feature optimization methods, the regression tree-based feature selection method is most beneficial for constructing models with good performance and strong robustness. Then, by utilizing Shapley Additive Explanations values and the incremental feature selection approach, we obtain that the high-quality feature subset consists of the top 150D features and the top 20D features have a breakthrough impact on the DTA prediction. In conclusion, our study thoroughly validates the importance of feature optimization in DTA prediction and serves as inspiration for constructing high-performance and high-interpretable models.

AVAILABILITY AND IMPLEMENTATION

https://github.com/RUXIAOQING964914140/FS_DTA.

摘要

动机

许多高精度药物-靶标亲和力(DTA)预测模型,其性能严重依赖于药物和靶标特征信息,这些模型的开发代价是复杂性和可解释性。特征提取和优化是一个关键步骤,它会显著影响模型性能、鲁棒性和可解释性的提升。许多现有的研究旨在通过从多个角度提取特征来全面描述药物和靶标;然而,这种方法有两个缺点:(i)存在大量冗余或嘈杂的特征;(ii)特征集往往具有高维度。

结果

在这项研究中,为了获得具有高精度和强可解释性的模型,我们利用各种传统和前沿的特征选择和降维技术来处理自相关特征和相邻相关特征。然后,将这些优化后的特征输入到学习排序中,以实现高效的 DTA 预测。在两个常用数据集上的广泛实验结果表明,在各种特征优化方法中,基于回归树的特征选择方法最有利于构建性能良好、鲁棒性强的模型。然后,通过利用 Shapley Additive Explanations 值和增量特征选择方法,我们得出高质量特征子集由前 150D 特征组成,前 20D 特征对 DTA 预测有突破性影响。总之,我们的研究充分验证了特征优化在 DTA 预测中的重要性,并为构建高性能和高可解释性模型提供了启示。

可用性和实现

https://github.com/RUXIAOQING964914140/FS_DTA。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/a67779b69ca8/btad615f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/243d8270c0f5/btad615f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/7c3f002d65d5/btad615f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/9673618dc8b8/btad615f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/5ff738e09586/btad615f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/a67779b69ca8/btad615f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/243d8270c0f5/btad615f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/7c3f002d65d5/btad615f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/9673618dc8b8/btad615f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/5ff738e09586/btad615f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68ff/10636279/a67779b69ca8/btad615f5.jpg

相似文献

1
Optimization of drug-target affinity prediction methods through feature processing schemes.通过特征处理方案优化药物-靶标亲和力预测方法。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad615.
2
NerLTR-DTA: drug-target binding affinity prediction based on neighbor relationship and learning to rank.NerLTR-DTA:基于邻域关系和排序学习的药物-靶点结合亲和力预测
Bioinformatics. 2022 Mar 28;38(7):1964-1971. doi: 10.1093/bioinformatics/btac048.
3
Drug-target affinity prediction method based on multi-scale information interaction and graph optimization.基于多尺度信息交互和图优化的药物-靶标亲和力预测方法。
Comput Biol Med. 2023 Dec;167:107621. doi: 10.1016/j.compbiomed.2023.107621. Epub 2023 Oct 29.
4
NHGNN-DTA: a node-adaptive hybrid graph neural network for interpretable drug-target binding affinity prediction.NHGNN-DTA:一种用于可解释药物-靶标结合亲和力预测的节点自适应混合图神经网络。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad355.
5
MDF-DTA: A Multi-Dimensional Fusion Approach for Drug-Target Binding Affinity Prediction.MDF-DTA:一种用于药物-靶标结合亲和力预测的多维融合方法。
J Chem Inf Model. 2024 Jul 8;64(13):4980-4990. doi: 10.1021/acs.jcim.4c00310. Epub 2024 Jun 18.
6
MSGNN-DTA: Multi-Scale Topological Feature Fusion Based on Graph Neural Networks for Drug-Target Binding Affinity Prediction.MSGNN-DTA:基于图神经网络的多尺度拓扑特征融合的药物-靶标结合亲和力预测
Int J Mol Sci. 2023 May 5;24(9):8326. doi: 10.3390/ijms24098326.
7
AttentionMGT-DTA: A multi-modal drug-target affinity prediction using graph transformer and attention mechanism.AttentionMGT-DTA:一种基于图变换和注意力机制的多模态药物-靶标亲和力预测方法。
Neural Netw. 2024 Jan;169:623-636. doi: 10.1016/j.neunet.2023.11.018. Epub 2023 Nov 11.
8
MFF-DTA: Multi-scale feature fusion for drug-target affinity prediction.MFF-DTA:用于药物-靶标亲和力预测的多尺度特征融合。
Methods. 2024 Nov;231:1-7. doi: 10.1016/j.ymeth.2024.08.008. Epub 2024 Aug 30.
9
GSAML-DTA: An interpretable drug-target binding affinity prediction model based on graph neural networks with self-attention mechanism and mutual information.GSAML-DTA:一种基于图神经网络和自注意力机制以及互信息的可解释药物-靶标结合亲和力预测模型。
Comput Biol Med. 2022 Nov;150:106145. doi: 10.1016/j.compbiomed.2022.106145. Epub 2022 Oct 4.
10
BiComp-DTA: Drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach.BiComp-DTA:基于互补生物相关和压缩特征化方法的药物-靶标结合亲和力预测。
PLoS Comput Biol. 2023 Mar 31;19(3):e1011036. doi: 10.1371/journal.pcbi.1011036. eCollection 2023 Mar.

引用本文的文献

1
HNF-DDA: subgraph contrastive-driven transformer-style heterogeneous network embedding for drug-disease association prediction.HNF-DDA:用于药物-疾病关联预测的基于子图对比驱动的变压器式异构网络嵌入
BMC Biol. 2025 Apr 16;23(1):101. doi: 10.1186/s12915-025-02206-x.
2
A comprehensive review of deep learning-based approaches for drug-drug interaction prediction.基于深度学习的药物相互作用预测方法的全面综述。
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae052.
3
Drug-target interaction prediction with collaborative contrastive learning and adaptive self-paced sampling strategy.

本文引用的文献

1
Molecular language models: RNNs or transformer?分子语言模型:RNN 还是转换器?
Brief Funct Genomics. 2023 Jul 17;22(4):392-400. doi: 10.1093/bfgp/elad012.
2
Deep generative model for drug design from protein target sequence.基于蛋白质靶点序列的药物设计深度生成模型。
J Cheminform. 2023 Mar 28;15(1):38. doi: 10.1186/s13321-023-00702-2.
3
NerLTR-DTA: drug-target binding affinity prediction based on neighbor relationship and learning to rank.NerLTR-DTA:基于邻域关系和排序学习的药物-靶点结合亲和力预测
基于协同对比学习和自适应自步采样策略的药物-靶标相互作用预测。
BMC Biol. 2024 Sep 27;22(1):216. doi: 10.1186/s12915-024-02012-x.
4
A comprehensive review of the recent advances on predicting drug-target affinity based on deep learning.基于深度学习预测药物-靶点亲和力的最新进展综述
Front Pharmacol. 2024 Apr 2;15:1375522. doi: 10.3389/fphar.2024.1375522. eCollection 2024.
Bioinformatics. 2022 Mar 28;38(7):1964-1971. doi: 10.1093/bioinformatics/btac048.
4
FusionDTA: attention-based feature polymerizer and knowledge distillation for drug-target binding affinity prediction.FusionDTA:基于注意力的特征聚合器和知识蒸馏在药物-靶标结合亲和力预测中的应用。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab506.
5
MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors.MathFeature:基于数学描述符的 DNA、RNA 和蛋白质序列特征提取包。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab434.
6
Deep drug-target binding affinity prediction with multiple attention blocks.基于多注意力块的深度药物-靶标结合亲和力预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab117.
7
DeepPurpose: a deep learning library for drug-target interaction prediction.DeepPurpose:用于药物-靶标相互作用预测的深度学习库。
Bioinformatics. 2021 Apr 1;36(22-23):5545-5547. doi: 10.1093/bioinformatics/btaa1005.
8
CatBoost for big data: an interdisciplinary review.用于大数据的CatBoost:跨学科综述
J Big Data. 2020;7(1):94. doi: 10.1186/s40537-020-00369-8. Epub 2020 Nov 4.
9
GraphDTA: predicting drug-target binding affinity with graph neural networks.GraphDTA:基于图神经网络的药物-靶标结合亲和力预测。
Bioinformatics. 2021 May 23;37(8):1140-1147. doi: 10.1093/bioinformatics/btaa921.
10
ivis Dimensionality Reduction Framework for Biomacromolecular Simulations.用于生物大分子模拟的iVis降维框架。
J Chem Inf Model. 2020 Oct 26;60(10):4569-4581. doi: 10.1021/acs.jcim.0c00485. Epub 2020 Sep 1.