Exploiting next-generation sequencing to solve the haplotyping puzzle in polyploids: a simulation study.

Affiliations

Bioinformatics Group, Wageningen University and Research, The Netherlands.

Wageningen UR Plant Breeding, The Netherlands.

Publication information

Brief Bioinform. 2018 May 1;19(3):387-403. doi: 10.1093/bib/bbw126.

DOI:10.1093/bib/bbw126
PMID:28065918
Abstract

Haplotypes are the units of inheritance in an organism, and many genetic analyses depend on their precise determination. Methods for haplotyping single individuals use the phasing information available in next-generation sequencing reads, by matching overlapping single-nucleotide polymorphisms while penalizing post hoc nucleotide corrections made. Haplotyping diploids is relatively easy, but the complexity of the problem increases drastically for polyploid genomes, which are found in both model organisms and in economically relevant plant and animal species. Although a number of tools are available for haplotyping polyploids, the effects of the genomic makeup and the sequencing strategy followed on the accuracy of these methods have hitherto not been thoroughly evaluated.

We developed the simulation pipeline haplosim to evaluate the performance of three haplotype estimation algorithms for polyploids: HapCompass, HapTree and SDhaP, in settings varying in sequencing approach, ploidy levels and genomic diversity, using tetraploid potato as the model. Our results show that sequencing depth is the major determinant of haplotype estimation quality, that 1 kb PacBio circular consensus sequencing reads and Illumina reads with large insert-sizes are competitive and that all methods fail to produce good haplotypes when ploidy levels increase. Comparing the three methods, HapTree produces the most accurate estimates, but also consumes the most resources. There is clearly room for improvement in polyploid haplotyping algorithms.
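The idea of matching reads to haplotypes while penalizing nucleotide corrections can be illustrated with a small sketch. The abstract does not name a specific objective, so the code below assumes one common formalization, the minimum error correction (MEC) score: each read is assigned to the candidate haplotype it disagrees with least, and the disagreements are summed. The function names, the 0/1 allele encoding, and the toy tetraploid data are all hypothetical and are not taken from HapCompass, HapTree, SDhaP or haplosim.

```python
from typing import Optional, Sequence

# A read is a list with one entry per SNP site:
# 0 or 1 for the observed allele, None where the read does not cover the site.
Read = Sequence[Optional[int]]
Haplotype = Sequence[int]


def mismatches(read: Read, hap: Haplotype) -> int:
    """Count covered SNP sites where the read disagrees with the haplotype."""
    return sum(1 for r, h in zip(read, hap) if r is not None and r != h)


def mec_score(reads: Sequence[Read], haplotypes: Sequence[Haplotype]) -> int:
    """Sum, over all reads, of the fewest corrections needed to make the
    read consistent with some candidate haplotype (assumed MEC objective)."""
    return sum(min(mismatches(r, h) for h in haplotypes) for r in reads)


# Toy tetraploid example: four candidate haplotypes over four SNP sites.
haplotypes = [
    [0, 0, 1, 1],
    [0, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 0],
]

reads = [
    [0, 0, 1, None],   # consistent with the first haplotype
    [None, 1, 0, 1],   # consistent with the second haplotype
    [1, 0, 1, None],   # needs one correction against its best match
    [1, 1, None, 0],   # consistent with the fourth haplotype
]

score = mec_score(reads, haplotypes)
print(score)  # 1
```

A lower score means the candidate haplotypes explain the reads with fewer corrections. With more haplotypes per individual, a short read that covers only a few SNPs is more likely to fit several haplotypes equally well, which gives some intuition for why the problem becomes harder as ploidy increases.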
