• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

nQuack:一个使用基于位点的杂合性从序列数据预测倍性水平的R软件包。

nQuack: An R package for predicting ploidal level from sequence data using site-based heterozygosity.

作者信息

Gaynor Michelle L, Landis Jacob B, O'Connor Timothy K, Laport Robert G, Doyle Jeff J, Soltis Douglas E, Ponciano José Miguel, Soltis Pamela S

机构信息

Florida Museum of Natural History University of Florida Gainesville 32611 Florida USA.

Department of Biology University of Florida Gainesville 32611 Florida USA.

出版信息

Appl Plant Sci. 2024 Jul 14;12(4):e11606. doi: 10.1002/aps3.11606. eCollection 2024 Jul-Aug.

DOI:10.1002/aps3.11606
PMID:39184199
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11342224/
Abstract

PREMISE

Traditional methods of ploidal-level estimation are tedious; using DNA sequence data for cytotype estimation is an ideal alternative. Multiple statistical approaches to leverage sequence data for ploidy inference based on site-based heterozygosity have been developed. However, these approaches may require high-coverage sequence data, use inappropriate probability distributions, or have additional statistical shortcomings that limit inference abilities. We introduce nQuack, an open-source R package that addresses the main shortcomings of current methods.

METHODS AND RESULTS

nQuack performs model selection for improved ploidy predictions. Here, we implement expectation maximization algorithms with normal, beta, and beta-binomial distributions. Using extensive computer simulations that account for variability in sequencing depth, as well as real data sets, we demonstrate the utility and limitations of nQuack.

CONCLUSIONS

Inferring ploidy based on site-based heterozygosity alone is difficult. Even though nQuack is more accurate than similar methods, we suggest caution when relying on any site-based heterozygosity method to infer ploidy.

摘要

前提

传统的倍性水平估计方法繁琐;利用DNA序列数据进行细胞型估计是一种理想的替代方法。已经开发了多种基于位点杂合性利用序列数据进行倍性推断的统计方法。然而,这些方法可能需要高覆盖度的序列数据,使用不适当的概率分布,或者存在其他限制推断能力的统计缺陷。我们引入了nQuack,一个解决当前方法主要缺点的开源R包。

方法与结果

nQuack进行模型选择以改进倍性预测。在这里,我们实现了具有正态分布、贝塔分布和贝塔二项分布的期望最大化算法。通过考虑测序深度变异性的广泛计算机模拟以及真实数据集,我们展示了nQuack的实用性和局限性。

结论

仅基于位点杂合性推断倍性是困难的。尽管nQuack比类似方法更准确,但我们建议在依靠任何基于位点杂合性的方法推断倍性时要谨慎。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/02ab7ffc4783/APS3-12-e11606-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/f5830f1196e6/APS3-12-e11606-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/e5d4c594e259/APS3-12-e11606-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/62105c597279/APS3-12-e11606-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/02ab7ffc4783/APS3-12-e11606-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/f5830f1196e6/APS3-12-e11606-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/e5d4c594e259/APS3-12-e11606-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/62105c597279/APS3-12-e11606-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c09/11342224/02ab7ffc4783/APS3-12-e11606-g004.jpg

相似文献

1
nQuack: An R package for predicting ploidal level from sequence data using site-based heterozygosity.nQuack:一个使用基于位点的杂合性从序列数据预测倍性水平的R软件包。
Appl Plant Sci. 2024 Jul 14;12(4):e11606. doi: 10.1002/aps3.11606. eCollection 2024 Jul-Aug.
2
Cytotype coexistence in the field cannot be explained by inter-cytotype hybridization alone: linking experiments and computer simulations in the sexual species Pilosella echioides (Asteraceae).野外细胞型共存不能仅通过细胞型间杂交来解释:连接有性物种绒毛千里光(菊科)中的实验与计算机模拟。
BMC Evol Biol. 2017 Mar 23;17(1):87. doi: 10.1186/s12862-017-0934-y.
3
Genotyping single nucleotide polymorphisms and inferring ploidy by amplicon sequencing for polyploid, ploidy-variable organisms.通过扩增子测序对多倍体、倍性可变生物进行单核苷酸多态性基因分型和倍性推断。
Mol Ecol Resour. 2021 Oct;21(7):2288-2298. doi: 10.1111/1755-0998.13431. Epub 2021 Jun 6.
4
Detection of individual ploidy levels with genotyping-by-sequencing (GBS) analysis.利用基因组测序(GBS)分析进行个体倍性水平的检测。
Mol Ecol Resour. 2017 Nov;17(6):1156-1167. doi: 10.1111/1755-0998.12657. Epub 2017 Mar 9.
5
Inferring single-cell copy number profiles through cross-cell segmentation of read counts.通过读取计数的跨细胞分割推断单细胞拷贝数谱。
BMC Genomics. 2024 Jan 2;25(1):25. doi: 10.1186/s12864-023-09901-5.
6
Ploidy inference from single-cell data: application to human and mouse cell atlases.从单细胞数据推断倍性:在人类和小鼠细胞图谱中的应用。
Genetics. 2024 Jun 5;227(2). doi: 10.1093/genetics/iyae061.
7
Inference of haplotypic phase and missing genotypes in polyploid organisms and variable copy number genomic regions.多倍体生物和可变拷贝数基因组区域中单体型相位及缺失基因型的推断。
BMC Bioinformatics. 2008 Dec 1;9:513. doi: 10.1186/1471-2105-9-513.
8
Inferring Heterozygosity from Ancient and Low Coverage Genomes.从古代低覆盖度基因组推断杂合性
Genetics. 2017 Jan;205(1):317-332. doi: 10.1534/genetics.116.189985. Epub 2016 Nov 7.
9
Inferring Variation in Copy Number Using High Throughput Sequencing Data in R.利用R语言中的高通量测序数据推断拷贝数变异
Front Genet. 2018 Apr 13;9:123. doi: 10.3389/fgene.2018.00123. eCollection 2018.
10
Inference of individual ploidy level using codominant markers.利用共显性标记推断个体的倍性水平。
Mol Ecol Resour. 2019 Sep;19(5):1218-1229. doi: 10.1111/1755-0998.13032. Epub 2019 Jul 25.

引用本文的文献

1
Ancient polyploidization events influence the evolution of the ginseng family (Araliaceae).古代多倍体化事件影响人参科(五加科)的进化。
Front Plant Sci. 2025 Jun 13;16:1595321. doi: 10.3389/fpls.2025.1595321. eCollection 2025.
2
Exploring chromosomal variations in garden roses: Insights from high-density SNP array data and a new tool, Qploidy.探索花园玫瑰中的染色体变异:来自高密度SNP阵列数据和新工具Qploidy的见解
Plant Genome. 2025 Jun;18(2):e70044. doi: 10.1002/tpg2.70044.
3
Harnessing Genomics for Breeding L.: Genotyping and Ploidy Testing of Clonal Lines Through ddRADseq Applications.

本文引用的文献

1
Ploidy inference from single-cell data: application to human and mouse cell atlases.从单细胞数据推断倍性:在人类和小鼠细胞图谱中的应用。
Genetics. 2024 Jun 5;227(2). doi: 10.1093/genetics/iyae061.
2
Anatomy of a mega-radiation: Biogeography and niche evolution in Astragalus.巨型辐射的解剖学:黄芪属的生物地理学和生态位进化。
Am J Bot. 2024 Mar;111(3):e16299. doi: 10.1002/ajb2.16299. Epub 2024 Feb 28.
3
ploidyfrost: Reference-free estimation of ploidy level from whole genome sequencing data based on de Bruijn graphs.ploidyfrost:基于 de Bruijn 图的无参全基因组测序数据的倍性水平估计。
利用基因组学进行育种L.:通过ddRADseq应用对克隆系进行基因分型和倍性检测。
Int J Mol Sci. 2025 May 20;26(10):4898. doi: 10.3390/ijms26104898.
Mol Ecol Resour. 2023 Feb;23(2):499-510. doi: 10.1111/1755-0998.13720. Epub 2022 Nov 1.
4
Measuring the Invisible: The Sequences Causal of Genome Size Differences in Eyebrights () Revealed by k-mers.测量无形之物:通过k-mer揭示小米草属植物基因组大小差异的序列因果关系
Front Plant Sci. 2022 Jul 29;13:818410. doi: 10.3389/fpls.2022.818410. eCollection 2022.
5
Phylogenomics of the genus Glycine sheds light on polyploid evolution and life-strategy transition.属 Glycine 的系统基因组学揭示了多倍体进化和生活策略转变。
Nat Plants. 2022 Mar;8(3):233-244. doi: 10.1038/s41477-022-01102-4. Epub 2022 Mar 14.
6
ModEst: Accurate estimation of genome size from next generation sequencing data.ModEst:通过下一代测序数据准确估计基因组大小。
Mol Ecol Resour. 2022 May;22(4):1454-1464. doi: 10.1111/1755-0998.13570. Epub 2021 Dec 27.
7
Application-based guidelines for best practices in plant flow cytometry.基于应用的植物流式细胞术最佳实践指南。
Cytometry A. 2022 Sep;101(9):749-781. doi: 10.1002/cyto.a.24499. Epub 2021 Sep 29.
8
Errors in Statistical Inference Under Model Misspecification: Evidence, Hypothesis Testing, and AIC.模型设定错误下统计推断中的误差:证据、假设检验与赤池信息准则
Front Ecol Evol. 2019;7. doi: 10.3389/fevo.2019.00372. Epub 2019 Oct 21.
9
Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.
10
RepeatModeler2 for automated genomic discovery of transposable element families.RepeatModeler2 用于自动发现转座元件家族的基因组。
Proc Natl Acad Sci U S A. 2020 Apr 28;117(17):9451-9457. doi: 10.1073/pnas.1921046117. Epub 2020 Apr 16.