• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CSI:用于交互预测的对比数据分层及其在化合物-蛋白质相互作用预测中的应用。

CSI: Contrastive data Stratification for Interaction prediction and its application to compound-protein interaction prediction.

机构信息

Department of Computer Science, Tufts University, Medford, MA 02155, United States.

Google Research, Cambridge, MA 02142, Unites States.

出版信息

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad456.

DOI:10.1093/bioinformatics/btad456
PMID:37490457
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10423023/
Abstract

MOTIVATION

Accurately predicting the likelihood of interaction between two objects (compound-protein sequence, user-item, author-paper, etc.) is a fundamental problem in Computer Science. Current deep-learning models rely on learning accurate representations of the interacting objects. Importantly, relationships between the interacting objects, or features of the interaction, offer an opportunity to partition the data to create multi-views of the interacting objects. The resulting congruent and non-congruent views can then be exploited via contrastive learning techniques to learn enhanced representations of the objects.

RESULTS

We present a novel method, Contrastive Stratification for Interaction Prediction (CSI), to stratify (partition) a dataset in a manner that can be exploited via Contrastive Multiview Coding to learn embeddings that maximize the mutual information across congruent data views. CSI assigns a key and multiple views to each data point, where data partitions under a particular key form congruent views of the data. We showcase the effectiveness of CSI by applying it to the compound-protein sequence interaction prediction problem, a pressing problem whose solution promises to expedite drug delivery (drug-protein interaction prediction), metabolic engineering, and synthetic biology (compound-enzyme interaction prediction) applications. Comparing CSI with a baseline model that does not utilize data stratification and contrastive learning, and show gains in average precision ranging from 13.7% to 39% using compounds and sequences as keys across multiple drug-target and enzymatic datasets, and gains ranging from 16.9% to 63% using reaction features as keys across enzymatic datasets.

AVAILABILITY AND IMPLEMENTATION

Code and dataset available at https://github.com/HassounLab/CSI.

摘要

动机

准确预测两个对象(化合物-蛋白质序列、用户-项目、作者-论文等)之间相互作用的可能性是计算机科学中的一个基本问题。当前的深度学习模型依赖于学习交互对象的准确表示。重要的是,交互对象之间的关系或交互的特征为划分数据提供了机会,从而创建交互对象的多视图。然后,可以通过对比学习技术利用这些一致和不一致的视图来学习对象的增强表示。

结果

我们提出了一种新方法,即交互预测的对比分层 (CSI),以分层(分区)数据集的方式,通过对比多视图编码来利用,以学习最大程度地提高一致数据视图之间互信息的嵌入。CSI 为每个数据点分配一个键和多个视图,其中特定键下的数据分区形成数据的一致视图。我们通过将 CSI 应用于化合物-蛋白质序列相互作用预测问题来展示其有效性,这是一个紧迫的问题,其解决方案有望加速药物输送(药物-蛋白质相互作用预测)、代谢工程和合成生物学(化合物-酶相互作用预测)应用。将 CSI 与不利用数据分层和对比学习的基线模型进行比较,并在多个药物-靶标和酶数据集上使用化合物和序列作为键,平均精度提高 13.7%至 39%,在酶数据集上使用反应特征作为键,平均精度提高 16.9%至 63%。

可用性和实现

代码和数据集可在 https://github.com/HassounLab/CSI 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af8/10423023/0854e92bb136/btad456f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af8/10423023/8998190590a3/btad456f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af8/10423023/ac9abb7d18f8/btad456f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af8/10423023/0854e92bb136/btad456f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af8/10423023/8998190590a3/btad456f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af8/10423023/ac9abb7d18f8/btad456f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af8/10423023/0854e92bb136/btad456f3.jpg

相似文献

1
CSI: Contrastive data Stratification for Interaction prediction and its application to compound-protein interaction prediction.CSI:用于交互预测的对比数据分层及其在化合物-蛋白质相互作用预测中的应用。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad456.
2
Boost-RS: boosted embeddings for recommender systems and its application to enzyme-substrate interaction prediction.Boost-RS:用于推荐系统的增强嵌入及其在酶-底物相互作用预测中的应用。
Bioinformatics. 2022 May 13;38(10):2832-2838. doi: 10.1093/bioinformatics/btac201.
3
Multi-task prediction-based graph contrastive learning for inferring the relationship among lncRNAs, miRNAs and diseases.基于多任务预测的图对比学习推断 lncRNAs、miRNAs 和疾病之间的关系。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad276.
4
CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning.CMMS-GCL:基于图对比学习的跨模态代谢稳定性预测。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad503.
5
Similarity measures-based graph co-contrastive learning for drug-disease association prediction.基于相似性度量的图协同对比学习在药物-疾病关联预测中的应用。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad357.
6
Predicting microbe-drug associations with structure-enhanced contrastive learning and self-paced negative sampling strategy.利用结构增强对比学习和自步负采样策略预测微生物-药物关联
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbac634.
7
Molecular property prediction by semantic-invariant contrastive learning.基于语义不变对比学习的分子性质预测。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad462.
8
Semi-supervised heterogeneous graph contrastive learning for drug-target interaction prediction.基于半监督异质图对比学习的药物-靶标相互作用预测。
Comput Biol Med. 2023 Sep;163:107199. doi: 10.1016/j.compbiomed.2023.107199. Epub 2023 Jun 22.
9
Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。
Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.
10
GCFMCL: predicting miRNA-drug sensitivity using graph collaborative filtering and multi-view contrastive learning.GCFMCL:基于图协同过滤和多视图对比学习的 miRNA 药物敏感性预测
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad247.

引用本文的文献

1
Protein-DNA binding sites prediction based on pre-trained protein language model and contrastive learning.基于预训练蛋白质语言模型和对比学习的蛋白质-DNA 结合位点预测。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad488.

本文引用的文献

1
Enzyme Promiscuity Prediction Using Hierarchy-Informed Multi-Label Classification.基于层次信息的多标签分类的酶多功能性预测。
Bioinformatics. 2021 Aug 4;37(14):2017–2024. doi: 10.1093/bioinformatics/btab054. Epub 2021 Jan 30.
2
BRENDA, the ELIXIR core data resource in 2021: new developments and updates.BRENDA,2021 年的 ELIXIR 核心数据资源:新的发展和更新。
Nucleic Acids Res. 2021 Jan 8;49(D1):D498-D508. doi: 10.1093/nar/gkaa1025.
3
KEGG: integrating viruses and cellular organisms.KEGG:整合病毒和细胞生物。
Nucleic Acids Res. 2021 Jan 8;49(D1):D545-D551. doi: 10.1093/nar/gkaa970.
4
GraphDTA: predicting drug-target binding affinity with graph neural networks.GraphDTA:基于图神经网络的药物-靶标结合亲和力预测。
Bioinformatics. 2021 May 23;37(8):1140-1147. doi: 10.1093/bioinformatics/btaa921.
5
MolTrans: Molecular Interaction Transformer for drug-target interaction prediction.MolTrans:用于药物-靶标相互作用预测的分子相互作用转换器。
Bioinformatics. 2021 May 5;37(6):830-836. doi: 10.1093/bioinformatics/btaa880.
6
Thermodynamics and Kinetics of Drug-Target Binding by Molecular Simulation.分子模拟研究药物-靶标结合的热力学和动力学。
Chem Rev. 2020 Dec 9;120(23):12788-12833. doi: 10.1021/acs.chemrev.0c00534. Epub 2020 Oct 2.
7
Deep Learning in Drug Target Interaction Prediction: Current and Future Perspectives.深度学习在药物靶点相互作用预测中的应用:现状与未来展望。
Curr Med Chem. 2021;28(11):2100-2113. doi: 10.2174/0929867327666200907141016.
8
Machine learning approaches and databases for prediction of drug-target interaction: a survey paper.机器学习方法和数据库在药物-靶标相互作用预测中的应用:综述论文。
Brief Bioinform. 2021 Jan 18;22(1):247-269. doi: 10.1093/bib/bbz157.
9
DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences.DeepConv-DTI:基于蛋白质序列卷积的深度学习预测药物-靶标相互作用
PLoS Comput Biol. 2019 Jun 14;15(6):e1007129. doi: 10.1371/journal.pcbi.1007129. eCollection 2019 Jun.
10
Revealing Drug-Target Interactions with Computational Models and Algorithms.揭示药物-靶标相互作用的计算模型和算法。
Molecules. 2019 May 2;24(9):1714. doi: 10.3390/molecules24091714.