• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FexRNA:非编码 RNA 的探索性数据分析和特征选择。

FexRNA: Exploratory Data Analysis and Feature Selection of Non-Coding RNA.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2795-2801. doi: 10.1109/TCBB.2021.3057128. Epub 2021 Dec 8.

DOI:10.1109/TCBB.2021.3057128
PMID:33539302
Abstract

Non-coding RNA (ncRNA) is involved in many biological processes and diseases in all species. Many ncRNA datasets exist that provide ncRNA data in FASTA format which is well suited for biomedical purposes. However, for ncRNA analysis and classification, statistical learning methods require hidden numerical features from the data. Furthermore, in the literature, a wealth of sequence intrinsic features has been proposed for ncRNA identification. The extraction of hidden features, their analysis, and usage of a suitable set of features is crucial for the performance of any statistical learning method. To alleviate the posed challenges, we generated 96 feature datasets from ncRNA widely used features. The feature datasets are based on RNACentral and consist of species, ncRNA types, and expert databases that are available on the FexRNA platform. Additionally, the feature datasets are explored and analysed to provide statistical information, univariate, and bivariate analysis. We sought to determine which of these 17 features would be most appropriate to use in developing ncRNA classification approaches. For feature selection (FS), a two-phase hierarchical FS framework based on correlation and majority voting is proposed and evaluated on 5 species. The FexRNA platform provides information about ncRNA feature analysis and selection.

摘要

非编码 RNA(ncRNA)参与了所有物种的许多生物过程和疾病。许多 ncRNA 数据集以 FASTA 格式提供 ncRNA 数据,非常适合生物医学用途。然而,对于 ncRNA 分析和分类,统计学习方法需要从数据中提取隐藏的数值特征。此外,在文献中,已经提出了大量用于 ncRNA 识别的序列固有特征。隐藏特征的提取、分析以及使用合适的特征集对于任何统计学习方法的性能都是至关重要的。为了缓解这些挑战,我们从广泛使用的 ncRNA 特征中生成了 96 个特征数据集。这些特征数据集基于 RNACentral,包括物种、ncRNA 类型以及 FexRNA 平台上可用的专家数据库。此外,还对这些特征数据集进行了探索和分析,以提供统计信息、单变量和双变量分析。我们试图确定在开发 ncRNA 分类方法时,哪些特征最适合使用。对于特征选择(FS),提出了一种基于相关性和多数投票的两阶段层次 FS 框架,并在 5 个物种上进行了评估。FexRNA 平台提供了关于 ncRNA 特征分析和选择的信息。

相似文献

1
FexRNA: Exploratory Data Analysis and Feature Selection of Non-Coding RNA.FexRNA:非编码 RNA 的探索性数据分析和特征选择。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2795-2801. doi: 10.1109/TCBB.2021.3057128. Epub 2021 Dec 8.
2
Advances in Computational Methodologies for Classification and Sub-Cellular Locality Prediction of Non-Coding RNAs.计算方法在非编码 RNA 分类和亚细胞定位预测中的研究进展。
Int J Mol Sci. 2021 Aug 13;22(16):8719. doi: 10.3390/ijms22168719.
3
BioDeepfuse: a hybrid deep learning approach with integrated feature extraction techniques for enhanced non-coding RNA classification.BioDeepfuse:一种混合深度学习方法,结合了集成特征提取技术,用于增强非编码 RNA 分类。
RNA Biol. 2024 Jan;21(1):1-12. doi: 10.1080/15476286.2024.2329451. Epub 2024 Mar 25.
4
MFPINC: prediction of plant ncRNAs based on multi-source feature fusion.MFPINC:基于多源特征融合的植物 ncRNAs 预测。
BMC Genomics. 2024 May 30;25(1):531. doi: 10.1186/s12864-024-10439-3.
5
RPITER: A Hierarchical Deep Learning Framework for ncRNA⁻Protein Interaction Prediction.RPITER:一种用于 ncRNA-蛋白质相互作用预测的分层深度学习框架。
Int J Mol Sci. 2019 Mar 1;20(5):1070. doi: 10.3390/ijms20051070.
6
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
7
BGFE: A Deep Learning Model for ncRNA-Protein Interaction Predictions Based on Improved Sequence Information.BGFE:一种基于改进序列信息的 ncRNA-蛋白质相互作用预测深度学习模型。
Int J Mol Sci. 2019 Feb 23;20(4):978. doi: 10.3390/ijms20040978.
8
MFPred: prediction of ncRNA families based on multi-feature fusion.MFPred:基于多特征融合的 ncRNA 家族预测。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad303.
9
Exploring Non-Coding RNAs in RNAcentral.探索 RNAcentral 中的非编码 RNA。
Curr Protoc Bioinformatics. 2020 Sep;71(1):e104. doi: 10.1002/cpbi.104.
10
Chain-RNA: a comparative ncRNA search tool based on the two-dimensional chain algorithm.链 RNA:一种基于二维链算法的比较 ncRNA 搜索工具。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):274-85. doi: 10.1109/TCBB.2012.137.

引用本文的文献

1
GeneAI 3.0: powerful, novel, generalized hybrid and ensemble deep learning frameworks for miRNA species classification of stationary patterns from nucleotides.GeneAI 3.0:强大的、新颖的、通用的混合和集成深度学习框架,用于对核苷酸静止模式的 miRNA 物种进行分类。
Sci Rep. 2024 Mar 26;14(1):7154. doi: 10.1038/s41598-024-56786-9.
2
A Machine Learning Decision Support System (DSS) for Neuroendocrine Tumor Patients Treated with Somatostatin Analog (SSA) Therapy.用于接受生长抑素类似物(SSA)治疗的神经内分泌肿瘤患者的机器学习决策支持系统(DSS)
Diagnostics (Basel). 2021 Apr 28;11(5):804. doi: 10.3390/diagnostics11050804.