• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LPItabformer:通过表格Transformer增强lncRNA-蛋白质相互作用预测中的泛化能力。

LPItabformer: Enhancing generalization in predicting lncRNA-protein interactions via a tabular Transformer.

作者信息

Lin Qin, Sheng Jie, Zhou Chang, Xiao Tao, Meng Yilei, Lu Mingxin, Zhang Junfang, Yan Xueyun, Peng Luying, Cao Huaming, Li Li

机构信息

State Key Laboratory of Cardiovascular Diseases and Medical Innovation Center, Shanghai East Hospital, School of Medicine, Tongji University, Shanghai 200120, China.

Shanghai Arrhythmias Research Center, Shanghai East Hospital, Tongji University School of Medicine, Shanghai 200120, China.

出版信息

Comput Struct Biotechnol J. 2025 May 29;27:2323-2335. doi: 10.1016/j.csbj.2025.05.050. eCollection 2025.

DOI:10.1016/j.csbj.2025.05.050
PMID:40520600
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12167069/
Abstract

Long-noncoding RNAs (LncRNAs) play important roles in physiological and pathological processes. Accurately predicting lncRNA-protein interactions (LPIs) is vital strategy for clarify functions and pathogenic mechanisms of lncRNAs. Current computational methods for evaluating LPIs with their utility and generalization have significant room for improvement. In this study, data splitting by incorporating protein clusters as group information reveals that lots of LPI prediction methods suffer from generalization flaws due to data leakage caused by ignoring LPI biological properties. To address the issue, we present LPItabformer, a tabular Transformer framework for predicting LPIs, that incorporates a domain shifts with uncertainty (DSU) module for generalization enhancement. The LPItabformer demonstrates a capacity to alleviate the generalization challenges associated with biases in LPI data and preferences in protein binding patterns. In addition, LPItabformer shows greater robustness and generalization on human and mouse LPI datasets compared to state-of-the-art methods. Ultimately, we have verified that the LPItabformer is capable of predicting novel LPIs. Code is available at https://github.com/Ci-TJ/LPItabformer.

摘要

长链非编码RNA(LncRNAs)在生理和病理过程中发挥着重要作用。准确预测长链非编码RNA-蛋白质相互作用(LPI)是阐明长链非编码RNA功能和致病机制的关键策略。目前用于评估LPI及其效用和泛化能力的计算方法有很大的改进空间。在本研究中,通过将蛋白质簇作为分组信息进行数据拆分发现,许多LPI预测方法由于忽略LPI生物学特性导致数据泄露而存在泛化缺陷。为了解决这个问题,我们提出了LPItabformer,一种用于预测LPI的表格Transformer框架,它包含一个用于增强泛化能力的域转移不确定性(DSU)模块。LPItabformer展示了缓解与LPI数据偏差和蛋白质结合模式偏好相关的泛化挑战的能力。此外,与现有方法相比,LPItabformer在人类和小鼠LPI数据集上表现出更强的稳健性和泛化能力。最终,我们验证了LPItabformer能够预测新的LPI。代码可在https://github.com/Ci-TJ/LPItabformer获取。

相似文献

1
LPItabformer: Enhancing generalization in predicting lncRNA-protein interactions via a tabular Transformer.LPItabformer:通过表格Transformer增强lncRNA-蛋白质相互作用预测中的泛化能力。
Comput Struct Biotechnol J. 2025 May 29;27:2323-2335. doi: 10.1016/j.csbj.2025.05.050. eCollection 2025.
2
Capsule-LPI: a LncRNA-protein interaction predicting tool based on a capsule network.胶囊-LPI:一种基于胶囊网络的 LncRNA-蛋白质相互作用预测工具。
BMC Bioinformatics. 2021 May 13;22(1):246. doi: 10.1186/s12859-021-04171-y.
3
LPI-HyADBS: a hybrid framework for lncRNA-protein interaction prediction integrating feature selection and classification.LPI-HyADBS:一种集成特征选择和分类的 lncRNA-蛋白质相互作用预测的混合框架。
BMC Bioinformatics. 2021 Nov 26;22(1):568. doi: 10.1186/s12859-021-04485-x.
4
LPI-deepGBDT: a multiple-layer deep framework based on gradient boosting decision trees for lncRNA-protein interaction identification.LPI-deepGBDT:基于梯度提升决策树的多层深度框架,用于 lncRNA-蛋白质相互作用识别。
BMC Bioinformatics. 2021 Oct 4;22(1):479. doi: 10.1186/s12859-021-04399-8.
5
Predicting lncRNA-protein interactions using a hybrid deep learning model with dinucleotide-codon fusion feature encoding.使用具有二核苷酸-密码子融合特征编码的混合深度学习模型预测长链非编码RNA-蛋白质相互作用。
BMC Genomics. 2024 Dec 28;25(1):1253. doi: 10.1186/s12864-024-11168-3.
6
LPI-CSFFR: Combining serial fusion with feature reuse for predicting LncRNA-protein interactions.LPI-CSFFR:结合序列融合与特征重用进行长链非编码 RNA-蛋白质相互作用预测。
Comput Biol Chem. 2022 Aug;99:107718. doi: 10.1016/j.compbiolchem.2022.107718. Epub 2022 Jun 27.
7
LPI-SKMSC: Predicting LncRNA-Protein Interactions with Segmented k-mer Frequencies and Multi-space Clustering.LPI-SKMSC:基于分段 k--mer 频率和多空间聚类的长链非编码 RNA-蛋白质相互作用预测。
Interdiscip Sci. 2024 Jun;16(2):378-391. doi: 10.1007/s12539-023-00598-4. Epub 2024 Jan 11.
8
LPI-SKF: Predicting lncRNA-Protein Interactions Using Similarity Kernel Fusions.LPI-SKF:使用相似性核融合预测长链非编码RNA与蛋白质的相互作用。
Front Genet. 2020 Dec 9;11:615144. doi: 10.3389/fgene.2020.615144. eCollection 2020.
9
Prediction of LncRNA-protein Interactions Using Auto-Encoder, SE-ResNet Models and Transfer Learning.使用自动编码器、SE-ResNet模型和迁移学习预测长链非编码RNA与蛋白质的相互作用
Microrna. 2024;13(2):155-165. doi: 10.2174/0122115366288068240322064431.
10
Finding lncRNA-Protein Interactions Based on Deep Learning With Dual-Net Neural Architecture.基于双网络神经架构深度学习的长链非编码RNA-蛋白质相互作用研究
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3456-3468. doi: 10.1109/TCBB.2021.3116232. Epub 2022 Dec 8.

本文引用的文献

1
RNA-protein interaction prediction using network-guided deep learning.利用网络引导的深度学习进行RNA-蛋白质相互作用预测。
Commun Biol. 2025 Feb 16;8(1):247. doi: 10.1038/s42003-025-07694-9.
2
ZeRPI: A graph neural network model for zero-shot prediction of RNA-protein interactions.ZeRPI:一种用于RNA-蛋白质相互作用零样本预测的图神经网络模型。
Methods. 2025 Mar;235:45-52. doi: 10.1016/j.ymeth.2025.01.014. Epub 2025 Jan 30.
3
Simulating 500 million years of evolution with a language model.用语言模型模拟5亿年的进化历程。
Science. 2025 Feb 21;387(6736):850-858. doi: 10.1126/science.ads0018. Epub 2025 Jan 16.
4
A foundation model of transcription across human cell types.一种跨人类细胞类型的转录基础模型。
Nature. 2025 Jan;637(8047):965-973. doi: 10.1038/s41586-024-08391-z. Epub 2025 Jan 8.
5
Chromatin remodelling drives immune cell-fibroblast communication in heart failure.染色质重塑驱动心力衰竭中免疫细胞-成纤维细胞通讯。
Nature. 2024 Nov;635(8038):434-443. doi: 10.1038/s41586-024-08085-6. Epub 2024 Oct 23.
6
An efficient, not-only-linear correlation coefficient based on clustering.一种基于聚类的高效、非线性相关系数。
Cell Syst. 2024 Sep 18;15(9):854-868.e3. doi: 10.1016/j.cels.2024.08.005. Epub 2024 Sep 6.
7
The silencing transcription factor REST targets UCHL1 to regulate inflammatory response and fibrosis during cardiac hypertrophy.沉默转录因子REST靶向UCHL1以调节心肌肥厚过程中的炎症反应和纤维化。
Genes Dis. 2023 Nov 25;11(6):101183. doi: 10.1016/j.gendis.2023.101183. eCollection 2024 Nov.
8
Guiding questions to avoid data leakage in biological machine learning applications.指导问题以避免生物机器学习应用中的数据泄露。
Nat Methods. 2024 Aug;21(8):1444-1453. doi: 10.1038/s41592-024-02362-y. Epub 2024 Aug 9.
9
CoRegNet: unraveling gene co-regulation networks from public RNA-Seq repositories using a beta-binomial statistical model.CoRegNet:利用贝塔二项式统计模型从公共 RNA-Seq 存储库中解析基因共调控网络。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad380.
10
Long non-coding RNAs in cardiac hypertrophy and heart failure: functions, mechanisms and clinical prospects.长链非编码 RNA 在心肌肥厚和心力衰竭中的作用、机制及临床前景。
Nat Rev Cardiol. 2024 May;21(5):326-345. doi: 10.1038/s41569-023-00952-5. Epub 2023 Nov 20.