• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DTI-BERT:基于BERT和深度学习方法识别细胞网络中的药物-靶点相互作用

DTI-BERT: Identifying Drug-Target Interactions in Cellular Networking Based on BERT and Deep Learning Method.

作者信息

Zheng Jie, Xiao Xuan, Qiu Wang-Ren

机构信息

Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen, China.

出版信息

Front Genet. 2022 Jun 8;13:859188. doi: 10.3389/fgene.2022.859188. eCollection 2022.

DOI:10.3389/fgene.2022.859188
PMID:35754843
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9213727/
Abstract

Drug-target interactions (DTIs) are regarded as an essential part of genomic drug discovery, and computational prediction of DTIs can accelerate to find the lead drug for the target, which can make up for the lack of time-consuming and expensive wet-lab techniques. Currently, many computational methods predict DTIs based on sequential composition or physicochemical properties of drug and target, but further efforts are needed to improve them. In this article, we proposed a new sequence-based method for accurately identifying DTIs. For target protein, we explore using pre-trained Bidirectional Encoder Representations from Transformers (BERT) to extract sequence features, which can provide unique and valuable pattern information. For drug molecules, Discrete Wavelet Transform (DWT) is employed to generate information from drug molecular fingerprints. Then we concatenate the feature vectors of the DTIs, and input them into a feature extraction module consisting of a batch-norm layer, rectified linear activation layer and linear layer, called BRL block and a Convolutional Neural Networks module to extract DTIs features further. Subsequently, a BRL block is used as the prediction engine. After optimizing the model based on contrastive loss and cross-entropy loss, it gave prediction accuracies of the target families of G Protein-coupled receptors, ion channels, enzymes, and nuclear receptors up to 90.1, 94.7, 94.9, and 89%, which indicated that the proposed method can outperform the existing predictors. To make it as convenient as possible for researchers, the web server for the new predictor is freely accessible at: https://bioinfo.jcu.edu.cn/dtibert or http://121.36.221.79/dtibert/. The proposed method may also be a potential option for other DITs.

摘要

药物-靶点相互作用(DTIs)被视为基因组药物发现的重要组成部分,DTIs的计算预测可以加速为靶点找到先导药物,这可以弥补耗时且昂贵的湿实验室技术的不足。目前,许多计算方法基于药物和靶点的序列组成或物理化学性质来预测DTIs,但仍需进一步努力改进。在本文中,我们提出了一种新的基于序列的方法来准确识别DTIs。对于靶蛋白,我们探索使用预训练的来自变换器的双向编码器表示(BERT)来提取序列特征,这可以提供独特且有价值的模式信息。对于药物分子,采用离散小波变换(DWT)从药物分子指纹中生成信息。然后我们连接DTIs的特征向量,并将它们输入到一个由批归一化层、修正线性激活层和线性层组成的特征提取模块,称为BRL块和一个卷积神经网络模块,以进一步提取DTIs特征。随后,使用BRL块作为预测引擎。基于对比损失和交叉熵损失对模型进行优化后,它对G蛋白偶联受体、离子通道、酶和核受体等靶标家族的预测准确率分别高达90.1%、94.7%、94.9%和89%,这表明所提出的方法优于现有预测器。为了让研究人员尽可能方便地使用,新预测器的网络服务器可通过以下网址免费访问:https://bioinfo.jcu.edu.cn/dtibert 或 http://121.36.221.79/dtibert/。所提出的方法也可能是其他药物-靶点相互作用研究的一个潜在选择。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/d8f117759eeb/fgene-13-859188-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/3568d60ddae2/fgene-13-859188-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/38fb6664e28e/fgene-13-859188-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/c12c4b9f7274/fgene-13-859188-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/c067db163d3a/fgene-13-859188-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/6a450a07ba32/fgene-13-859188-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/d8f117759eeb/fgene-13-859188-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/3568d60ddae2/fgene-13-859188-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/38fb6664e28e/fgene-13-859188-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/c12c4b9f7274/fgene-13-859188-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/c067db163d3a/fgene-13-859188-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/6a450a07ba32/fgene-13-859188-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd9c/9213727/d8f117759eeb/fgene-13-859188-g006.jpg

相似文献

1
DTI-BERT: Identifying Drug-Target Interactions in Cellular Networking Based on BERT and Deep Learning Method.DTI-BERT:基于BERT和深度学习方法识别细胞网络中的药物-靶点相互作用
Front Genet. 2022 Jun 8;13:859188. doi: 10.3389/fgene.2022.859188. eCollection 2022.
2
Using BERT to identify drug-target interactions from whole PubMed.使用 BERT 从整个 PubMed 中识别药物-靶标相互作用。
BMC Bioinformatics. 2022 Jun 21;23(1):245. doi: 10.1186/s12859-022-04768-x.
3
A Robust Drug-Target Interaction Prediction Framework with Capsule Network and Transfer Learning.一种基于胶囊网络和迁移学习的稳健药物-靶点相互作用预测框架。
Int J Mol Sci. 2023 Sep 14;24(18):14061. doi: 10.3390/ijms241814061.
4
Advancing drug-target interaction prediction: a comprehensive graph-based approach integrating knowledge graph embedding and ProtBert pretraining.推进药物-靶标相互作用预测:一种综合基于图的方法,整合知识图嵌入和 ProtBert 预训练。
BMC Bioinformatics. 2023 Dec 19;24(1):488. doi: 10.1186/s12859-023-05593-6.
5
DeepStack-DTIs: Predicting Drug-Target Interactions Using LightGBM Feature Selection and Deep-Stacked Ensemble Classifier.DeepStack-DTIs:使用 LightGBM 特征选择和深度堆叠集成分类器预测药物-靶标相互作用。
Interdiscip Sci. 2022 Jun;14(2):311-330. doi: 10.1007/s12539-021-00488-7. Epub 2021 Nov 3.
6
Predicting Drug-Target Interactions with Deep-Embedding Learning of Graphs and Sequences.基于图和序列深度学习嵌入预测药物-靶标相互作用。
J Phys Chem A. 2021 Jul 1;125(25):5633-5642. doi: 10.1021/acs.jpca.1c02419. Epub 2021 Jun 18.
7
Predicting drug-target interaction network using deep learning model.利用深度学习模型预测药物-靶标相互作用网络。
Comput Biol Chem. 2019 Jun;80:90-101. doi: 10.1016/j.compbiolchem.2019.03.016. Epub 2019 Mar 25.
8
EDC-DTI: An end-to-end deep collaborative learning model based on multiple information for drug-target interactions prediction.EDC-DTI:一种基于多源信息的端到端深度协同学习模型,用于药物-靶标相互作用预测。
J Mol Graph Model. 2023 Jul;122:108498. doi: 10.1016/j.jmgm.2023.108498. Epub 2023 Apr 21.
9
A fingerprints based molecular property prediction method using the BERT model.一种使用BERT模型基于指纹的分子性质预测方法。
J Cheminform. 2022 Oct 21;14(1):71. doi: 10.1186/s13321-022-00650-3.
10
Semi-supervised heterogeneous graph contrastive learning for drug-target interaction prediction.基于半监督异质图对比学习的药物-靶标相互作用预测。
Comput Biol Med. 2023 Sep;163:107199. doi: 10.1016/j.compbiomed.2023.107199. Epub 2023 Jun 22.

引用本文的文献

1
Bridging artificial intelligence and biological sciences: a comprehensive review of large language models in bioinformatics.连接人工智能与生物科学:生物信息学中大型语言模型的全面综述
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf357.
2
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
3
CHL-DTI: A Novel High-Low Order Information Convergence Framework for Effective Drug-Target Interaction Prediction.

本文引用的文献

1
ProteinBERT: a universal deep-learning model of protein sequence and function.蛋白质 BERT:一种通用的蛋白质序列和功能深度学习模型。
Bioinformatics. 2022 Apr 12;38(8):2102-2110. doi: 10.1093/bioinformatics/btac020.
2
iCDI-W2vCom: Identifying the Ion Channel-Drug Interaction in Cellular Networking Based on word2vec and node2vec.iCDI-W2vCom:基于词向量和节点向量识别细胞网络中的离子通道-药物相互作用
Front Genet. 2021 Sep 9;12:738274. doi: 10.3389/fgene.2021.738274. eCollection 2021.
3
BERT-m7G: A Transformer Architecture Based on BERT and Stacking Ensemble to Identify RNA N7-Methylguanosine Sites from Sequence Information.
CHL-DTI:一种用于有效药物-靶标相互作用预测的新型高低阶信息融合框架。
Interdiscip Sci. 2024 Sep;16(3):568-578. doi: 10.1007/s12539-024-00608-z. Epub 2024 Mar 14.
4
Advancing bioinformatics with large language models: components, applications and perspectives.利用大语言模型推进生物信息学:组件、应用与展望
ArXiv. 2025 Jan 31:arXiv:2401.04155v2.
5
Stacking-ac4C: an ensemble model using mixed features for identifying n4-acetylcytidine in mRNA.Stacking-ac4C:一种使用混合特征的集成模型,用于识别 mRNA 中的 N4-乙酰胞苷。
Front Immunol. 2023 Nov 29;14:1267755. doi: 10.3389/fimmu.2023.1267755. eCollection 2023.
6
A Robust Drug-Target Interaction Prediction Framework with Capsule Network and Transfer Learning.一种基于胶囊网络和迁移学习的稳健药物-靶点相互作用预测框架。
Int J Mol Sci. 2023 Sep 14;24(18):14061. doi: 10.3390/ijms241814061.
BERT-m7G:一种基于 BERT 和堆叠集成的转换器架构,用于从序列信息中识别 RNA N7-甲基鸟苷位点。
Comput Math Methods Med. 2021 Aug 25;2021:7764764. doi: 10.1155/2021/7764764. eCollection 2021.
4
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning.ProtTrans:通过自监督学习理解生命语言。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7112-7127. doi: 10.1109/TPAMI.2021.3095381. Epub 2022 Sep 14.
5
A review on compound-protein interaction prediction methods: Data, format, representation and model.复合蛋白相互作用预测方法综述:数据、格式、表示与模型
Comput Struct Biotechnol J. 2021 Mar 10;19:1541-1556. doi: 10.1016/j.csbj.2021.03.004. eCollection 2021.
6
BOW-GBDT: A GBDT Classifier Combining With Artificial Neural Network for Identifying GPCR-Drug Interaction Based on Wordbook Learning From Sequences.BOW-GBDT:一种基于从序列中学习词典的结合人工神经网络的GBDT分类器,用于识别GPCR-药物相互作用。
Front Cell Dev Biol. 2021 Feb 1;8:623858. doi: 10.3389/fcell.2020.623858. eCollection 2020.
7
A transformer architecture based on BERT and 2D convolutional neural network to identify DNA enhancers from sequence information.基于 BERT 和二维卷积神经网络的变压器架构,用于从序列信息中识别 DNA 增强子。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab005.
8
Deep learning-based cross-classifications reveal conserved spatial behaviors within tumor histological images.基于深度学习的跨分类揭示了肿瘤组织学图像中保守的空间行为。
Nat Commun. 2020 Dec 11;11(1):6367. doi: 10.1038/s41467-020-20030-5.
9
State-of-the-art augmented NLP transformer models for direct and single-step retrosynthesis.最先进的增强型自然语言处理转换器模型,用于直接和单步逆合成。
Nat Commun. 2020 Nov 4;11(1):5575. doi: 10.1038/s41467-020-19266-y.
10
TransformerCPI: improving compound-protein interaction prediction by sequence-based deep learning with self-attention mechanism and label reversal experiments.TransformerCPI:通过基于序列的深度学习、自注意力机制和标签反转实验提高化合物-蛋白质相互作用预测。
Bioinformatics. 2020 Aug 15;36(16):4406-4414. doi: 10.1093/bioinformatics/btaa524.