The influence of ignoring secondary structure on divergence time estimates from ribosomal RNA genes.

Author information

Ludwig-Maximilians-University Munich, Department of Earth & Environmental Sciences, Palaeontology & Geobiology, Molecular Geo- & Palaeobiology Lab, Richard-Wagner-Str. 10, 80333 Munich, Germany.

Publication information

Mol Phylogenet Evol. 2014 Feb;71:214-23. doi: 10.1016/j.ympev.2013.12.003. Epub 2013 Dec 19.

Abstract

Genes coding for ribosomal RNA molecules (rDNA) are among the most popular markers in molecular phylogenetics and evolution. However, coevolution of sites that code for pairing regions (stems) in the RNA secondary structure can make it challenging to obtain accurate results from such loci. While the influence of ignoring secondary structure on multiple sequence alignment and tree topology has been investigated in numerous studies, its effect on molecular divergence time estimates is still poorly known. Here, I investigate this issue in Bayesian Markov Chain Monte Carlo (BMCMC) and penalized likelihood (PL) frameworks, using empirical datasets from dragonflies (Odonata: Anisoptera) and glass sponges (Porifera: Hexactinellida). My results indicate that highly biased inferences under substitution models that ignore secondary structure only occur if maximum-likelihood estimates of branch lengths are used as input to PL dating, whereas in a BMCMC framework and in PL dating based on Bayesian consensus branch lengths, the effect is far less severe. I conclude that accounting for coevolution of paired sites in molecular dating studies is not as important as previously suggested, as long as the estimates are based on Bayesian consensus branch lengths instead of ML point estimates. This finding is especially relevant for studies where computational limitations do not allow the use of secondary-structure-specific substitution models, or where accurate consensus structures cannot be predicted. I also found that the magnitude and direction (over- vs. underestimating node ages) of bias in age estimates when secondary structure is ignored were not distributed randomly across the nodes of the phylogenies, a phenomenon that requires further investigation.
