• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于融合多种特征、数据平衡和特征选择技术提高药物-靶标相互作用预测。

Improving prediction of drug-target interactions based on fusing multiple features with data balancing and feature selection techniques.

机构信息

Department of Computer Engineering, University of Zanjan, Zanjan, Iran.

School of Biological Sciences Institute for Research in Fundamental Sciences (IPM), Tehran, Iran.

出版信息

PLoS One. 2023 Aug 3;18(8):e0288173. doi: 10.1371/journal.pone.0288173. eCollection 2023.

DOI:10.1371/journal.pone.0288173
PMID:37535616
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10399861/
Abstract

Drug discovery relies on predicting drug-target interaction (DTI), which is an important challenging task. The purpose of DTI is to identify the interaction between drug chemical compounds and protein targets. Traditional wet lab experiments are time-consuming and expensive, that's why in recent years, the use of computational methods based on machine learning has attracted the attention of many researchers. Actually, a dry lab environment focusing more on computational methods of interaction prediction can be helpful in limiting search space for wet lab experiments. In this paper, a novel multi-stage approach for DTI is proposed that called SRX-DTI. In the first stage, combination of various descriptors from protein sequences, and a FP2 fingerprint that is encoded from drug are extracted as feature vectors. A major challenge in this application is the imbalanced data due to the lack of known interactions, in this regard, in the second stage, the One-SVM-US technique is proposed to deal with this problem. Next, the FFS-RF algorithm, a forward feature selection algorithm, coupled with a random forest (RF) classifier is developed to maximize the predictive performance. This feature selection algorithm removes irrelevant features to obtain optimal features. Finally, balanced dataset with optimal features is given to the XGBoost classifier to identify DTIs. The experimental results demonstrate that our proposed approach SRX-DTI achieves higher performance than other existing methods in predicting DTIs. The datasets and source code are available at: https://github.com/Khojasteh-hb/SRX-DTI.

摘要

药物发现依赖于预测药物-靶标相互作用(DTI),这是一项具有挑战性的任务。DTI 的目的是识别药物化学化合物与蛋白质靶标的相互作用。传统的湿实验室实验既耗时又昂贵,这就是为什么近年来,基于机器学习的计算方法的使用引起了许多研究人员的关注。实际上,一个专注于交互预测的计算方法的干实验室环境有助于限制湿实验室实验的搜索空间。在本文中,提出了一种称为 SRX-DTI 的用于 DTI 的新型多阶段方法。在第一阶段,从蛋白质序列和编码药物的 FP2 指纹中提取各种描述符的组合作为特征向量。由于缺乏已知的相互作用,该应用中的一个主要挑战是数据不平衡,在这方面,在第二阶段,提出了 One-SVM-US 技术来解决这个问题。接下来,开发了一种称为 FFS-RF 的前向特征选择算法,与随机森林(RF)分类器相结合,以最大程度地提高预测性能。该特征选择算法可删除不相关的特征,从而获得最佳特征。最后,将具有最佳特征的平衡数据集提供给 XGBoost 分类器以识别 DTI。实验结果表明,我们提出的方法 SRX-DTI 在预测 DTI 方面的性能优于其他现有方法。数据集和源代码可在:https://github.com/Khojasteh-hb/SRX-DTI 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/255f56064a2c/pone.0288173.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/35dd1f678c74/pone.0288173.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/7d9366ca249f/pone.0288173.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/4e95773f0b9c/pone.0288173.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/255f56064a2c/pone.0288173.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/35dd1f678c74/pone.0288173.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/7d9366ca249f/pone.0288173.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/4e95773f0b9c/pone.0288173.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ff/10399861/255f56064a2c/pone.0288173.g004.jpg

相似文献

1
Improving prediction of drug-target interactions based on fusing multiple features with data balancing and feature selection techniques.基于融合多种特征、数据平衡和特征选择技术提高药物-靶标相互作用预测。
PLoS One. 2023 Aug 3;18(8):e0288173. doi: 10.1371/journal.pone.0288173. eCollection 2023.
2
Prediction of drug-target interaction based on protein features using undersampling and feature selection techniques with boosting.基于蛋白质特征的药物-靶标相互作用预测,采用欠采样和特征选择技术与提升相结合。
Anal Biochem. 2020 Jan 15;589:113507. doi: 10.1016/j.ab.2019.113507. Epub 2019 Nov 15.
3
DeepStack-DTIs: Predicting Drug-Target Interactions Using LightGBM Feature Selection and Deep-Stacked Ensemble Classifier.DeepStack-DTIs:使用 LightGBM 特征选择和深度堆叠集成分类器预测药物-靶标相互作用。
Interdiscip Sci. 2022 Jun;14(2):311-330. doi: 10.1007/s12539-021-00488-7. Epub 2021 Nov 3.
4
A Machine Learning Approach for Drug-target Interaction Prediction using Wrapper Feature Selection and Class Balancing.基于包装特征选择和类别平衡的药物-靶标相互作用预测的机器学习方法。
Mol Inform. 2020 May;39(5):e1900062. doi: 10.1002/minf.201900062. Epub 2020 Feb 11.
5
DTI-CDF: a cascade deep forest model towards the prediction of drug-target interactions based on hybrid features.DTI-CDF:一种基于混合特征的药物-靶标相互作用预测的级联深度森林模型。
Brief Bioinform. 2021 Jan 18;22(1):451-462. doi: 10.1093/bib/bbz152.
6
PreDTIs: prediction of drug-target interactions based on multiple feature information using gradient boosting framework with data balancing and feature selection techniques.PreDTIs:基于梯度提升框架使用数据平衡和特征选择技术,基于多种特征信息预测药物-靶标相互作用。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab046.
7
Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure.基于进化信息和化学结构的 Lasso 与随机森林预测药物-靶标相互作用。
Genomics. 2019 Dec;111(6):1839-1852. doi: 10.1016/j.ygeno.2018.12.007. Epub 2018 Dec 11.
8
Drug-Target Interaction Prediction Based on Drug Fingerprint Information and Protein Sequence.基于药物指纹信息和蛋白质序列的药物-靶标相互作用预测。
Molecules. 2019 Aug 19;24(16):2999. doi: 10.3390/molecules24162999.
9
ICAN: Interpretable cross-attention network for identifying drug and target protein interactions.ICAN:用于识别药物和靶蛋白相互作用的可解释交叉注意力网络。
PLoS One. 2022 Oct 24;17(10):e0276609. doi: 10.1371/journal.pone.0276609. eCollection 2022.
10
Semi-supervised heterogeneous graph contrastive learning for drug-target interaction prediction.基于半监督异质图对比学习的药物-靶标相互作用预测。
Comput Biol Med. 2023 Sep;163:107199. doi: 10.1016/j.compbiomed.2023.107199. Epub 2023 Jun 22.

引用本文的文献

1
Fair molecular feature selection unveils universally tumor lineage-informative methylation sites in colorectal cancer.合理的分子特征选择揭示了结直肠癌中普遍存在的肿瘤谱系信息性甲基化位点。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i150-i159. doi: 10.1093/bioinformatics/btaf237.
2
Predicting drug protein interactions based on improved support vector data description in unbalanced data.基于改进的支持向量数据描述在不平衡数据中预测药物-蛋白质相互作用
Bioimpacts. 2024 Dec 30;15:30468. doi: 10.34172/bi.30468. eCollection 2025.
3
BindingSiteDTI: differential-scale binding site modelling for drug-target interaction prediction.

本文引用的文献

1
Evaluating molecular fingerprint-based models of drug side effects against a statistical control.针对统计对照评估基于分子指纹的药物副作用模型。
Drug Discov Today. 2022 Nov;27(11):103364. doi: 10.1016/j.drudis.2022.103364. Epub 2022 Sep 14.
2
A novel circRNA-miRNA association prediction model based on structural deep neural network embedding.基于结构深度神经网络嵌入的新型环状 RNA-miRNA 关联预测模型。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac391.
3
RoFDT: Identification of Drug-Target Interactions from Protein Sequence and Drug Molecular Structure Using Rotation Forest.
BindingSiteDTI:用于药物-靶标相互作用预测的差分尺度结合位点建模。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae308.
4
Towards explainable interaction prediction: Embedding biological hierarchies into hyperbolic interaction space.迈向可解释的交互预测:将生物层次结构嵌入双曲交互空间。
PLoS One. 2024 Mar 21;19(3):e0300906. doi: 10.1371/journal.pone.0300906. eCollection 2024.
5
Improving structure-based protein-ligand affinity prediction by graph representation learning and ensemble learning.通过图表示学习和集成学习提高基于结构的蛋白配体亲和力预测。
PLoS One. 2024 Jan 17;19(1):e0296676. doi: 10.1371/journal.pone.0296676. eCollection 2024.
RoFDT:利用旋转森林从蛋白质序列和药物分子结构中识别药物-靶点相互作用
Biology (Basel). 2022 May 13;11(5):741. doi: 10.3390/biology11050741.
4
Comparing protein-protein interaction networks of SARS-CoV-2 and (H1N1) influenza using topological features.比较 SARS-CoV-2 和(H1N1)流感的蛋白质-蛋白质相互作用网络,使用拓扑特征。
Sci Rep. 2022 Apr 7;12(1):5867. doi: 10.1038/s41598-022-08574-6.
5
Identifying drug-target interactions via heterogeneous graph attention networks combined with cross-modal similarities.通过异质图注意网络与跨模态相似性相结合来识别药物-靶标相互作用。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac016.
6
A deep learning method for repurposing antiviral drugs against new viruses via multi-view nonnegative matrix factorization and its application to SARS-CoV-2.一种通过多视图非负矩阵分解对新病毒进行抗病毒药物再利用的深度学习方法及其在 SARS-CoV-2 中的应用。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab526.
7
Drug-target interaction prediction using unifying of graph regularized nuclear norm with bilinear factorization.基于图正则化核范数与双线性分解的统一方法进行药物-靶标相互作用预测。
BMC Bioinformatics. 2021 Nov 17;22(1):555. doi: 10.1186/s12859-021-04464-2.
8
The legacy of the Human Genome Project.人类基因组计划的遗产。
Science. 2021 Sep 24;373(6562):1442-1443. doi: 10.1126/science.abl5403. Epub 2021 Sep 23.
9
SANE: A sequence combined attentive network embedding model for COVID-19 drug repositioning.SANE:一种用于COVID-19药物重新定位的序列组合注意力网络嵌入模型。
Appl Soft Comput. 2021 Nov;111:107831. doi: 10.1016/j.asoc.2021.107831. Epub 2021 Aug 23.
10
A heterogeneous network embedding framework for predicting similarity-based drug-target interactions.一种基于异质网络嵌入的预测基于相似性的药物-靶标相互作用的框架。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab275.