• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iPseU-TWSVM:基于孪生支持向量机的RNA假尿苷位点识别

iPseU-TWSVM: Identification of RNA pseudouridine sites based on TWSVM.

作者信息

Chen Mingshuai, Zhang Xin, Ju Ying, Liu Qing, Ding Yijie

机构信息

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China.

Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China.

出版信息

Math Biosci Eng. 2022 Sep 19;19(12):13829-13850. doi: 10.3934/mbe.2022644.

DOI:10.3934/mbe.2022644
PMID:36654069
Abstract

Biological sequence analysis is an important basic research work in the field of bioinformatics. With the explosive growth of data, machine learning methods play an increasingly important role in biological sequence analysis. By constructing a classifier for prediction, the input sequence feature vector is predicted and evaluated, and the knowledge of gene structure, function and evolution is obtained from a large amount of sequence information, which lays a foundation for researchers to carry out in-depth research. At present, many machine learning methods have been applied to biological sequence analysis such as RNA gene recognition and protein secondary structure prediction. As a biological sequence, RNA plays an important biological role in the encoding, decoding, regulation and expression of genes. The analysis of RNA data is currently carried out from the aspects of structure and function, including secondary structure prediction, non-coding RNA identification and functional site prediction. Pseudouridine (У) is the most widespread and rich RNA modification and has been discovered in a variety of RNAs. It is highly essential for the study of related functional mechanisms and disease diagnosis to accurately identify У sites in RNA sequences. At present, several computational approaches have been suggested as an alternative to experimental methods to detect У sites, but there is still potential for improvement in their performance. In this study, we present a model based on twin support vector machine (TWSVM) for У site identification. The model combines a variety of feature representation techniques and uses the max-relevance and min-redundancy methods to obtain the optimum feature subset for training. The independent testing accuracy is improved by 3.4% in comparison to current advanced У site predictors. The outcomes demonstrate that our model has better generalization performance and improves the accuracy of У site identification. iPseU-TWSVM can be a helpful tool to identify У sites.

摘要

生物序列分析是生物信息学领域一项重要的基础研究工作。随着数据的爆炸式增长,机器学习方法在生物序列分析中发挥着越来越重要的作用。通过构建预测分类器,对输入的序列特征向量进行预测和评估,从大量序列信息中获取基因结构、功能和进化等知识,为研究人员开展深入研究奠定基础。目前,许多机器学习方法已应用于生物序列分析,如RNA基因识别和蛋白质二级结构预测。RNA作为一种生物序列,在基因的编码、解码、调控和表达中发挥着重要的生物学作用。目前对RNA数据的分析主要从结构和功能方面进行,包括二级结构预测、非编码RNA识别和功能位点预测。假尿苷(Ψ)是分布最广泛、含量最丰富的RNA修饰,已在多种RNA中被发现。准确识别RNA序列中的Ψ位点对于相关功能机制研究和疾病诊断至关重要。目前,已有多种计算方法被提出作为检测Ψ位点的实验方法的替代方法,但其性能仍有提升空间。在本研究中,我们提出了一种基于孪生支持向量机(TWSVM)的Ψ位点识别模型。该模型结合了多种特征表示技术,并采用最大相关最小冗余方法获取用于训练的最优特征子集。与当前先进的Ψ位点预测器相比,独立测试准确率提高了3.4%。结果表明,我们的模型具有更好的泛化性能,提高了Ψ位点识别的准确性。iPseU-TWSVM可以成为识别Ψ位点的有用工具。

相似文献

1
iPseU-TWSVM: Identification of RNA pseudouridine sites based on TWSVM.iPseU-TWSVM:基于孪生支持向量机的RNA假尿苷位点识别
Math Biosci Eng. 2022 Sep 19;19(12):13829-13850. doi: 10.3934/mbe.2022644.
2
PseU-KeMRF: A Novel Method for Identifying RNA Pseudouridine Sites.PseU-KeMRF:一种识别 RNA 假尿嘧啶位点的新方法。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1423-1435. doi: 10.1109/TCBB.2024.3389094. Epub 2024 Oct 9.
3
A Feature Fusion Predictor for RNA Pseudouridine Sites with Particle Swarm Optimizer Based Feature Selection and Ensemble Learning Approach.基于粒子群优化算法特征选择和集成学习方法的 RNA 假尿嘧啶位点特征融合预测器。
Curr Issues Mol Biol. 2021 Nov 1;43(3):1844-1858. doi: 10.3390/cimb43030129.
4
PseUI: Pseudouridine sites identification based on RNA sequence information.PseUI:基于 RNA 序列信息的假尿嘧啶核苷位点鉴定。
BMC Bioinformatics. 2018 Aug 29;19(1):306. doi: 10.1186/s12859-018-2321-0.
5
iPseU-Layer: Identifying RNA Pseudouridine Sites Using Layered Ensemble Model.iPseU-Layer:使用分层集成模型识别 RNA 假尿嘧啶位点。
Interdiscip Sci. 2020 Jun;12(2):193-203. doi: 10.1007/s12539-020-00362-y. Epub 2020 Mar 13.
6
Porpoise: a new approach for accurate prediction of RNA pseudouridine sites.海豚:一种准确预测 RNA 假尿嘧啶位点的新方法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab245.
7
Is There Any Sequence Feature in the RNA Pseudouridine Modification Prediction Problem?RNA假尿苷修饰预测问题中是否存在任何序列特征?
Mol Ther Nucleic Acids. 2020 Mar 6;19:293-303. doi: 10.1016/j.omtn.2019.11.014. Epub 2019 Nov 21.
8
iPseU-CNN: Identifying RNA Pseudouridine Sites Using Convolutional Neural Networks.iPseU-CNN:使用卷积神经网络识别RNA假尿苷位点。
Mol Ther Nucleic Acids. 2019 Jun 7;16:463-470. doi: 10.1016/j.omtn.2019.03.010. Epub 2019 Apr 11.
9
iPseU-NCP: Identifying RNA pseudouridine sites using random forest and NCP-encoded features.iPseU-NCP:基于随机森林和 NCP 编码特征识别 RNA 假尿嘧啶位点。
BMC Genomics. 2019 Dec 30;20(Suppl 10):971. doi: 10.1186/s12864-019-6357-y.
10
PseUpred-ELPSO Is an Ensemble Learning Predictor with Particle Swarm Optimizer for Improving the Prediction of RNA Pseudouridine Sites.PseUpred-ELPSO是一种带有粒子群优化器的集成学习预测器,用于改进RNA假尿苷位点的预测。
Biology (Basel). 2024 Apr 8;13(4):248. doi: 10.3390/biology13040248.

引用本文的文献

1
RSCNN-PseU: random searching-based convolutional neural network model for identifying RNA pseudouridine.RSCNN-PseU:基于随机搜索的用于识别RNA假尿苷的卷积神经网络模型。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf417.