Simple statistical models predict C-to-U edited sites in plant mitochondrial RNA.

Authors

Cummings Michael P, Myers Daniel S

Affiliation

Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742-3360, USA.

Publication information

BMC Bioinformatics. 2004 Sep 16;5:132. doi: 10.1186/1471-2105-5-132.

DOI:10.1186/1471-2105-5-132

PMID:15373947

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC521485/

Abstract

BACKGROUND

RNA editing is the process whereby an RNA sequence is modified from the sequence of the corresponding DNA template. In the mitochondria of land plants, some cytidines are converted to uridines before translation. Despite substantial study, the molecular biological mechanism by which C-to-U RNA editing proceeds remains relatively obscure, although several experimental studies have implicated a role for cis-recognition. A highly non-random distribution of nucleotides is observed in the immediate vicinity of edited sites (within 20 nucleotides 5' and 3'), but no precise consensus motif has been identified.

RESULTS

Data for analysis were derived from the complete mitochondrial genomes of Arabidopsis thaliana, Brassica napus, and Oryza sativa; additionally, a combined data set of observations across all three genomes was generated. We selected datasets based on the 20 nucleotides 5' and the 20 nucleotides 3' of edited sites and an equivalently sized and appropriately constructed null-set of non-edited sites. We used tree-based statistical methods and random forests to generate models of C-to-U RNA editing based on the nucleotides surrounding the edited/non-edited sites and on the estimated folding energies of those regions. Tree-based statistical methods based on primary sequence data surrounding edited/non-edited sites and estimates of free energy of folding yield models with optimistic re-substitution-based estimates of approximately 0.71 accuracy, approximately 0.64 sensitivity, and approximately 0.88 specificity. Random forest analysis yielded better models and more exact performance estimates with approximately 0.74 accuracy, approximately 0.72 sensitivity, and approximately 0.81 specificity for the combined observations.
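
As a rough illustration of the data layout and the three reported metrics, the following Python sketch extracts the 41-nt window (20 nt 5', the candidate C, 20 nt 3') around a cytidine and computes accuracy, sensitivity, and specificity from true and predicted labels. The function names, the toy sequence, and the labels are hypothetical and are not taken from the paper. The authors' actual pipeline, null-set construction, and folding-energy estimation are not reproduced here.

```python
FLANK = 20  # nucleotides taken on each side of the candidate C


def window(seq, pos, flank=FLANK):
    """Return the (2*flank + 1)-nt window centred on seq[pos], or None.

    None is returned when the site is not a C or lies too close to
    either end of the sequence for a full window.
    """
    if seq[pos] != "C" or pos < flank or pos + flank >= len(seq):
        return None
    return seq[pos - flank : pos + flank + 1]


def metrics(y_true, y_pred):
    """Accuracy, sensitivity and specificity for binary labels (1 = edited)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),  # fraction of edited sites recovered
        "specificity": tn / (tn + fp),  # fraction of non-edited sites recovered
    }


# Toy example (made-up sequence and labels, for illustration only).
toy_seq = "A" * 20 + "C" + "G" * 20
w = window(toy_seq, 20)
m = metrics([1, 1, 1, 0, 0], [1, 1, 0, 0, 1])
```

Sensitivity and specificity are reported separately because a model can reach a reasonable overall accuracy while still missing many edited sites, as the gap between 0.64 sensitivity and 0.88 specificity for the tree-based models suggests.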

CONCLUSIONS

Simple models do moderately well in predicting which cytidines will be edited to uridines, and provide the first quantitative predictive models for RNA edited sites in plant mitochondria. Our analysis shows that the identity of the nucleotide -1 to the edited C and the estimated free energy of folding for a 41 nt region surrounding the edited C are the most important variables that distinguish most edited from non-edited sites. However, the results suggest that primary sequence data and simple free energy of folding calculations alone are insufficient to make highly accurate predictions.
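
The finding that the nucleotide at position -1 is among the most informative variables can be illustrated with a one-variable rule (a decision stump). The sketch below learns, from labelled 41-nt windows, which -1 nucleotides are more often edited than not, and predicts accordingly. It is not the authors' tree or random forest model, it ignores folding energy, and the windows and labels are made up for illustration, so they say nothing about which nucleotides actually favour editing.

```python
from collections import Counter

CENTER = 20  # index of the candidate C within a 41-nt window


def fit_stump(windows, labels):
    """Return the set of -1 nucleotides for which edited sites are the majority."""
    edited, total = Counter(), Counter()
    for w, y in zip(windows, labels):
        nt = w[CENTER - 1]  # nucleotide immediately 5' of the C
        total[nt] += 1
        edited[nt] += y
    return {nt for nt in total if edited[nt] * 2 > total[nt]}


def predict(stump, w):
    """Predict 1 (edited) if the -1 nucleotide is in the learned set, else 0."""
    return int(w[CENTER - 1] in stump)


def make_window(minus_one):
    """Build a toy 41-nt window with a chosen -1 nucleotide (illustration only)."""
    return "A" * 19 + minus_one + "C" + "A" * 20


# Made-up training data; labels do not reflect real editing frequencies.
train = [make_window(nt) for nt in "UUUGGA"]
labels = [1, 1, 0, 0, 0, 1]
stump = fit_stump(train, labels)
```

A rule this simple uses only one of the 40 flanking positions, which is consistent with the abstract's caution that primary sequence and simple folding-energy features alone do not support highly accurate prediction.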
