• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

circ2LO:基于LucaOne大型模型的环状RNA鉴定

circ2LO: Identification of CircRNA Based on the LucaOne Large Model.

作者信息

Yu Haihao, Yu Yue, Xia Yanling

机构信息

Computer Science and Technology College, Heilongjiang Institute of Technology, No. 999 Hongqi Street, Harbin 150009, China.

College of Animal Science, Jilin University, No. 1977 Xinzhu Road, Changchun 130012, China.

出版信息

Genes (Basel). 2025 Mar 31;16(4):413. doi: 10.3390/genes16040413.

DOI:10.3390/genes16040413
PMID:40282373
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12026638/
Abstract

Circular RNA is a type of noncoding RNA with a special covalent bond structure. As an endogenous RNA in animals and plants, it is formed through RNA splicing. The 5' and 3' ends of the exons form circular RNA at the back-splicing sites. Circular RNA plays an important regulatory role in diseases by interacting with the associated miRNAs. Accurate identification of circular RNA can enrich the data on circular RNA and provide new ideas for drug development. At present, mainstream circular RNA recognition algorithms are divided into two categories: those based on RNA sequence position information and those based on RNA sequence biometric information. Herein, we propose a method for the recognition of circular RNA, called circ2LO, which utilizes the LucaOne large model for feature embedding of the splicing sites of RNA sequences as well as their upstream and downstream sequences to prevent semantic information loss caused by the traditional one-hot encoding method. Subsequently, it employs a convolutional layer to extract features and a self-attention mechanism to extract interactive features to accurately capture the core features of the circular RNA at the splicing sites. Finally, it uses a fully connected layer to identify circular RNA. The accuracy of circ2LO on the human dataset reached 95.47%, which is higher than the values shown by existing methods. It also achieved accuracies of 97.04% and 72.04% on the Arabidopsis and mouse datasets, respectively, demonstrating good robustness. Through rigorous validation, the circ2LO model has proven its high-precision identification capability for circular RNAs, marking it as a potentially transformative analytical platform in the circRNA research field.

摘要

环状RNA是一种具有特殊共价键结构的非编码RNA。作为动植物中的内源性RNA,它通过RNA剪接形成。外显子的5'端和3'端在反向剪接位点形成环状RNA。环状RNA通过与相关的微小RNA相互作用,在疾病中发挥重要的调节作用。准确识别环状RNA可以丰富环状RNA的数据,并为药物开发提供新思路。目前,主流的环状RNA识别算法分为两类:基于RNA序列位置信息的算法和基于RNA序列生物特征信息的算法。在此,我们提出了一种环状RNA识别方法,称为circ2LO,该方法利用LucaOne大模型对RNA序列及其上游和下游序列的剪接位点进行特征嵌入,以防止传统独热编码方法导致的语义信息丢失。随后,它采用卷积层提取特征,并采用自注意力机制提取交互特征,以准确捕捉剪接位点处环状RNA的核心特征。最后,它使用全连接层来识别环状RNA。circ2LO在人类数据集上的准确率达到了95.47%,高于现有方法的值。它在拟南芥和小鼠数据集上的准确率分别达到了97.04%和72.04%,显示出良好的稳健性。通过严格验证,circ2LO模型已证明其对环状RNA的高精度识别能力,标志着它是环状RNA研究领域中一个潜在的变革性分析平台。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/0ff696060866/genes-16-00413-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/09b4f82d3a6b/genes-16-00413-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/809645eccb56/genes-16-00413-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/fbb57189445e/genes-16-00413-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/a36d8e38af22/genes-16-00413-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/6ffae8a93e63/genes-16-00413-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/1916c39cc072/genes-16-00413-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/0ff696060866/genes-16-00413-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/09b4f82d3a6b/genes-16-00413-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/809645eccb56/genes-16-00413-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/fbb57189445e/genes-16-00413-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/a36d8e38af22/genes-16-00413-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/6ffae8a93e63/genes-16-00413-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/1916c39cc072/genes-16-00413-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf71/12026638/0ff696060866/genes-16-00413-g007.jpg

相似文献

1
circ2LO: Identification of CircRNA Based on the LucaOne Large Model.circ2LO:基于LucaOne大型模型的环状RNA鉴定
Genes (Basel). 2025 Mar 31;16(4):413. doi: 10.3390/genes16040413.
2
A Representation Learning Approach for Predicting circRNA Back-Splicing Event via Sequence-Interaction-Aware Dual Encoder.基于序列交互感知双编码器的 circRNA 反向剪接事件预测的表示学习方法。
IEEE Trans Nanobioscience. 2024 Oct;23(4):603-611. doi: 10.1109/TNB.2024.3454079. Epub 2024 Oct 15.
3
circRNA-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier.基于多视图深度学习、子空间学习和多视图分类器的 circRNA 结合蛋白位点预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab394.
4
CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach.环状 RNA 与 RNA 结合蛋白相互作用位点的鉴定:基于集成神经网络方法。
PLoS Comput Biol. 2022 Jan 20;18(1):e1009798. doi: 10.1371/journal.pcbi.1009798. eCollection 2022 Jan.
5
PCirc: random forest-based plant circRNA identification software.PCirc:基于随机森林的植物 circRNA 鉴定软件。
BMC Bioinformatics. 2021 Jan 6;22(1):10. doi: 10.1186/s12859-020-03944-1.
6
Deep learning of the back-splicing code for circular RNA formation.环状 RNA 形成的剪接码深度学习。
Bioinformatics. 2019 Dec 15;35(24):5235-5242. doi: 10.1093/bioinformatics/btz382.
7
Predicting associations between CircRNA and diseases through structure-aware graph transformer and path-integral convolution.通过结构感知图变换和路径积分卷积预测环状 RNA 与疾病之间的关联。
Anal Biochem. 2024 Sep;692:115554. doi: 10.1016/j.ab.2024.115554. Epub 2024 May 4.
8
CRBPSA: CircRNA-RBP interaction sites identification using sequence structural attention model.利用序列结构注意力模型识别 circRNA-RBP 相互作用位点。
BMC Biol. 2024 Nov 14;22(1):260. doi: 10.1186/s12915-024-02055-0.
9
Ularcirc: visualization and enhanced analysis of circular RNAs via back and canonical forward splicing.Ularcirc:通过反向和规范的前体剪接可视化和增强环状 RNA 的分析。
Nucleic Acids Res. 2019 Nov 18;47(20):e123. doi: 10.1093/nar/gkz718.
10
Predicting circRNA-RBP Binding Sites Using a Hybrid Deep Neural Network.基于混合深度神经网络的 circRNA-RBP 结合位点预测。
Interdiscip Sci. 2024 Sep;16(3):635-648. doi: 10.1007/s12539-024-00616-z. Epub 2024 Feb 21.

引用本文的文献

1
Circular RNAs as Targets for Developing Anticancer Therapeutics.环状RNA作为开发抗癌疗法的靶点
Cells. 2025 Jul 18;14(14):1106. doi: 10.3390/cells14141106.

本文引用的文献

1
Predicting the toxic side effects of drug interactions using chemical structures and protein sequences.利用化学结构和蛋白质序列预测药物相互作用的毒副作用。
Sci Rep. 2024 Dec 28;14(1):31503. doi: 10.1038/s41598-024-82981-9.
2
DrugReAlign: a multisource prompt framework for drug repurposing based on large language models.DrugReAlign:一种基于大型语言模型的药物重定位多源提示框架。
BMC Biol. 2024 Oct 8;22(1):226. doi: 10.1186/s12915-024-02028-3.
3
Evaluating the capability of ChatGPT in predicting drug-drug interactions: Real-world evidence using hospitalized patient data.
评估 ChatGPT 预测药物相互作用的能力:基于住院患者数据的真实世界证据。
Br J Clin Pharmacol. 2024 Dec;90(12):3361-3366. doi: 10.1111/bcp.16275. Epub 2024 Oct 2.
4
Knowledge Graphs for drug repurposing: a review of databases and methods.用于药物再利用的知识图谱:数据库和方法综述。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae461.
5
DTI-LM: language model powered drug-target interaction prediction.DTI-LM:基于语言模型的药物-靶标相互作用预测。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae533.
6
Can large language models understand molecules?大语言模型能理解分子吗?
BMC Bioinformatics. 2024 Jun 26;25(1):225. doi: 10.1186/s12859-024-05847-x.
7
Hierarchical multimodal self-attention-based graph neural network for DTI prediction.基于分层多模态自注意力的图神经网络用于 DTI 预测。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae293.
8
MSI-DTI: predicting drug-target interaction based on multi-source information and multi-head self-attention.基于多源信息和多头自注意力的药物-靶点相互作用预测(MSI-DTI)
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae238.
9
Leveraging generative AI to prioritize drug repurposing candidates for Alzheimer's disease with real-world clinical validation.利用生成式人工智能对阿尔茨海默病药物再利用候选药物进行优先级排序,并进行真实世界临床验证。
NPJ Digit Med. 2024 Feb 26;7(1):46. doi: 10.1038/s41746-024-01038-3.
10
Computational approaches for circRNAs prediction and in silico characterization.环状 RNA 的计算方法预测和计算机分析。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad154.