Suppr超能文献

nQuack:一个使用基于位点的杂合性从序列数据预测倍性水平的R软件包。

nQuack: An R package for predicting ploidal level from sequence data using site-based heterozygosity.

作者信息

Gaynor Michelle L, Landis Jacob B, O'Connor Timothy K, Laport Robert G, Doyle Jeff J, Soltis Douglas E, Ponciano José Miguel, Soltis Pamela S

机构信息

Florida Museum of Natural History University of Florida Gainesville 32611 Florida USA.

Department of Biology University of Florida Gainesville 32611 Florida USA.

出版信息

Appl Plant Sci. 2024 Jul 14;12(4):e11606. doi: 10.1002/aps3.11606. eCollection 2024 Jul-Aug.

Abstract

PREMISE

Traditional methods of ploidal-level estimation are tedious; using DNA sequence data for cytotype estimation is an ideal alternative. Multiple statistical approaches to leverage sequence data for ploidy inference based on site-based heterozygosity have been developed. However, these approaches may require high-coverage sequence data, use inappropriate probability distributions, or have additional statistical shortcomings that limit inference abilities. We introduce nQuack, an open-source R package that addresses the main shortcomings of current methods.

METHODS AND RESULTS

nQuack performs model selection for improved ploidy predictions. Here, we implement expectation maximization algorithms with normal, beta, and beta-binomial distributions. Using extensive computer simulations that account for variability in sequencing depth, as well as real data sets, we demonstrate the utility and limitations of nQuack.

CONCLUSIONS

Inferring ploidy based on site-based heterozygosity alone is difficult. Even though nQuack is more accurate than similar methods, we suggest caution when relying on any site-based heterozygosity method to infer ploidy.

摘要

前提

传统的倍性水平估计方法繁琐;利用DNA序列数据进行细胞型估计是一种理想的替代方法。已经开发了多种基于位点杂合性利用序列数据进行倍性推断的统计方法。然而,这些方法可能需要高覆盖度的序列数据,使用不适当的概率分布,或者存在其他限制推断能力的统计缺陷。我们引入了nQuack,一个解决当前方法主要缺点的开源R包。

方法与结果

nQuack进行模型选择以改进倍性预测。在这里,我们实现了具有正态分布、贝塔分布和贝塔二项分布的期望最大化算法。通过考虑测序深度变异性的广泛计算机模拟以及真实数据集,我们展示了nQuack的实用性和局限性。

结论

仅基于位点杂合性推断倍性是困难的。尽管nQuack比类似方法更准确,但我们建议在依靠任何基于位点杂合性的方法推断倍性时要谨慎。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/f5830f1196e6/APS3-12-e11606-g002.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验