• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用对抗性领域自适应推断 RNA 结合蛋白靶标偏好。

Inferring RNA-binding protein target preferences using adversarial domain adaptation.

机构信息

College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, China.

Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada.

出版信息

PLoS Comput Biol. 2022 Feb 24;18(2):e1009863. doi: 10.1371/journal.pcbi.1009863. eCollection 2022 Feb.

DOI:10.1371/journal.pcbi.1009863
PMID:35202389
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8870515/
Abstract

Precise identification of target sites of RNA-binding proteins (RBP) is important to understand their biochemical and cellular functions. A large amount of experimental data is generated by in vivo and in vitro approaches. The binding preferences determined from these platforms share similar patterns but there are discernable differences between these datasets. Computational methods trained on one dataset do not always work well on another dataset. To address this problem which resembles the classic "domain shift" in deep learning, we adopted the adversarial domain adaptation (ADDA) technique and developed a framework (RBP-ADDA) that can extract RBP binding preferences from an integration of in vivo and vitro datasets. Compared with conventional methods, ADDA has the advantage of working with two input datasets, as it trains the initial neural network for each dataset individually, projects the two datasets onto a feature space, and uses an adversarial framework to derive an optimal network that achieves an optimal discriminative predictive power. In the first step, for each RBP, we include only the in vitro data to pre-train a source network and a task predictor. Next, for the same RBP, we initiate the target network by using the source network and use adversarial domain adaptation to update the target network using both in vitro and in vivo data. These two steps help leverage the in vitro data to improve the prediction on in vivo data, which is typically challenging with a lower signal-to-noise ratio. Finally, to further take the advantage of the fused source and target data, we fine-tune the task predictor using both data. We showed that RBP-ADDA achieved better performance in modeling in vivo RBP binding data than other existing methods as judged by Pearson correlations. It also improved predictive performance on in vitro datasets. We further applied augmentation operations on RBPs with less in vivo data to expand the input data and showed that it can improve prediction performances. Lastly, we explored the predictive interpretability of RBP-ADDA, where we quantified the contribution of the input features by Integrated Gradients and identified nucleotide positions that are important for RBP recognition.

摘要

准确鉴定 RNA 结合蛋白 (RBP) 的靶标位点对于了解其生化和细胞功能非常重要。大量的实验数据是通过体内和体外方法产生的。从这些平台确定的结合偏好具有相似的模式,但这些数据集之间存在明显的差异。在一个数据集上训练的计算方法并不总是在另一个数据集上有效。为了解决这个类似于深度学习中的经典“域转移”问题,我们采用了对抗性域自适应 (ADDA) 技术,并开发了一个框架 (RBP-ADDA),可以从体内和体外数据集的整合中提取 RBP 结合偏好。与传统方法相比,ADDA 的优势在于可以使用两个输入数据集,因为它为每个数据集单独训练初始神经网络,将两个数据集投影到特征空间,并使用对抗框架得出一个最佳网络,以实现最佳的判别预测能力。在第一步中,对于每个 RBP,我们仅包括体外数据来预训练源网络和任务预测器。接下来,对于同一个 RBP,我们使用源网络启动目标网络,并使用对抗性域自适应来使用体内和体外数据更新目标网络。这两个步骤有助于利用体外数据来提高对体内数据的预测能力,这通常具有挑战性,因为信噪比较低。最后,为了进一步利用融合的源和目标数据的优势,我们使用两种数据来微调任务预测器。我们通过 Pearson 相关性来判断,与其他现有方法相比,RBP-ADDA 在模拟体内 RBP 结合数据方面的性能更好。它还提高了对体外数据集的预测性能。我们进一步对体内数据较少的 RBP 进行扩充操作,以扩大输入数据,并表明它可以提高预测性能。最后,我们探索了 RBP-ADDA 的预测可解释性,通过集成梯度来量化输入特征的贡献,并确定对 RBP 识别重要的核苷酸位置。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/1463ae52b360/pcbi.1009863.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/c8cc4b85187a/pcbi.1009863.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/236ef7427338/pcbi.1009863.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/f9fc76231017/pcbi.1009863.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/1463ae52b360/pcbi.1009863.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/c8cc4b85187a/pcbi.1009863.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/236ef7427338/pcbi.1009863.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/f9fc76231017/pcbi.1009863.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/952e/8870515/1463ae52b360/pcbi.1009863.g004.jpg

相似文献

1
Inferring RNA-binding protein target preferences using adversarial domain adaptation.利用对抗性领域自适应推断 RNA 结合蛋白靶标偏好。
PLoS Comput Biol. 2022 Feb 24;18(2):e1009863. doi: 10.1371/journal.pcbi.1009863. eCollection 2022 Feb.
2
Deep neural networks for interpreting RNA-binding protein target preferences.用于解释 RNA 结合蛋白靶标偏好的深度神经网络。
Genome Res. 2020 Feb;30(2):214-226. doi: 10.1101/gr.247494.118. Epub 2020 Jan 28.
3
Deep neural networks for inferring binding sites of RNA-binding proteins by using distributed representations of RNA primary sequence and secondary structure.利用 RNA 一级序列和二级结构的分布式表示来推断 RNA 结合蛋白结合位点的深度神经网络。
BMC Genomics. 2020 Dec 17;21(Suppl 13):866. doi: 10.1186/s12864-020-07239-w.
4
Prediction of the RBP binding sites on lncRNAs using the high-order nucleotide encoding convolutional neural network.使用高阶核苷酸编码卷积神经网络预测长链非编码RNA上的RBP结合位点
Anal Biochem. 2019 Oct 15;583:113364. doi: 10.1016/j.ab.2019.113364. Epub 2019 Jul 16.
5
Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.使用深度卷积和递归神经网络预测 RNA-蛋白质序列和结构的结合偏好。
BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.
6
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘
BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.
7
RNAProt: an efficient and feature-rich RNA binding protein binding site predictor.RNAProt:一种高效且功能丰富的 RNA 结合蛋白结合位点预测器。
Gigascience. 2021 Aug 18;10(8). doi: 10.1093/gigascience/giab054.
8
Inferring RNA sequence preferences for poorly studied RNA-binding proteins based on co-evolution.基于共进化推断研究较少的 RNA 结合蛋白的 RNA 序列偏好。
BMC Bioinformatics. 2018 Mar 12;19(1):96. doi: 10.1186/s12859-018-2091-8.
9
RNA-binding protein recognition based on multi-view deep feature and multi-label learning.基于多视图深度特征和多标签学习的 RNA 结合蛋白识别。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa174.
10
A deep learning framework for modeling structural features of RNA-binding protein targets.一种用于对RNA结合蛋白靶点的结构特征进行建模的深度学习框架。
Nucleic Acids Res. 2016 Feb 29;44(4):e32. doi: 10.1093/nar/gkv1025. Epub 2015 Oct 13.

引用本文的文献

1
Emerging RNA-centric technologies to probe RNA-protein interactions: importance in decoding the life cycle of positive sense single strand RNA viruses and antiviral discovery.用于探测RNA-蛋白质相互作用的新兴RNA中心技术:在解读正链单链RNA病毒生命周期及抗病毒发现中的重要性。
Front Cell Infect Microbiol. 2025 May 21;15:1580337. doi: 10.3389/fcimb.2025.1580337. eCollection 2025.
2
A systematic benchmark of machine learning methods for protein-RNA interaction prediction.一种蛋白质- RNA 相互作用预测的机器学习方法的系统基准测试。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad307.

本文引用的文献

1
RNANetMotif: Identifying sequence-structure RNA network motifs in RNA-protein binding sites.RNANetMotif:在 RNA-蛋白质结合位点中识别序列-结构 RNA 网络基序。
PLoS Comput Biol. 2022 Jul 12;18(7):e1010293. doi: 10.1371/journal.pcbi.1010293. eCollection 2022 Jul.
2
Genome-wide landscape of RNA-binding protein target site dysregulation reveals a major impact on psychiatric disorder risk.全基因组范围内 RNA 结合蛋白靶位失调的研究揭示了其对精神疾病风险的重要影响。
Nat Genet. 2021 Feb;53(2):166-173. doi: 10.1038/s41588-020-00761-3. Epub 2021 Jan 18.
3
A Survey of Regulatory Interactions Among RNA Binding Proteins and MicroRNAs in Cancer.
癌症中RNA结合蛋白与微小RNA之间调控相互作用的综述
Front Genet. 2020 Sep 8;11:515094. doi: 10.3389/fgene.2020.515094. eCollection 2020.
4
Measuring Domain Shift for Deep Learning in Histopathology.深度学习在组织病理学中的领域迁移测量。
IEEE J Biomed Health Inform. 2021 Feb;25(2):325-336. doi: 10.1109/JBHI.2020.3032060. Epub 2021 Feb 5.
5
Deep transfer learning for reducing health care disparities arising from biomedical data inequality.基于深度迁移学习减少生物医学数据不平等导致的医疗保健差异
Nat Commun. 2020 Oct 12;11(1):5131. doi: 10.1038/s41467-020-18918-3.
6
A large-scale binding and functional map of human RNA-binding proteins.人类 RNA 结合蛋白的大规模结合和功能图谱。
Nature. 2020 Jul;583(7818):711-719. doi: 10.1038/s41586-020-2077-3. Epub 2020 Jul 29.
7
Searching large-scale scRNA-seq databases via unbiased cell embedding with Cell BLAST.通过无偏倚的细胞嵌合进行 Cell BLAST,在大规模 scRNA-seq 数据库中搜索。
Nat Commun. 2020 Jul 10;11(1):3458. doi: 10.1038/s41467-020-17281-7.
8
Deep neural networks for interpreting RNA-binding protein target preferences.用于解释 RNA 结合蛋白靶标偏好的深度神经网络。
Genome Res. 2020 Feb;30(2):214-226. doi: 10.1101/gr.247494.118. Epub 2020 Jan 28.
9
Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms.对抗性域自适应在细胞电子冷冻断层图像中跨数据源大分子的原位结构分类中的应用。
Bioinformatics. 2019 Jul 15;35(14):i260-i268. doi: 10.1093/bioinformatics/btz364.
10
Discovering sequence and structure landscapes in RNA interaction motifs.发现 RNA 相互作用基序中的序列和结构景观。
Nucleic Acids Res. 2019 Jun 4;47(10):4958-4969. doi: 10.1093/nar/gkz250.