利用双交群体构建遗传图谱时，考虑低覆盖率高通量测序数据中的错误。

Accounting for Errors in Low Coverage High-Throughput Sequencing Data When Constructing Genetic Maps Using Biparental Outcrossed Populations.

机构信息

Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand

Invermay Agricultural Centre, AgResearch, Mosgiel 9053, New Zealand.

出版信息

Genetics. 2018 May;209(1):65-76. doi: 10.1534/genetics.117.300627. Epub 2018 Feb 27.

DOI:10.1534/genetics.117.300627

PMID:29487138

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5937187/

Abstract

Next-generation sequencing is an efficient method that allows for substantially more markers than previous technologies, providing opportunities for building high-density genetic linkage maps, which facilitate the development of nonmodel species' genomic assemblies and the investigation of their genes. However, constructing genetic maps using data generated via high-throughput sequencing technology (, genotyping-by-sequencing) is complicated by the presence of sequencing errors and genotyping errors resulting from missing parental alleles due to low sequencing depth. If unaccounted for, these errors lead to inflated genetic maps. In addition, map construction in many species is performed using full-sibling family populations derived from the outcrossing of two individuals, where unknown parental phase and varying segregation types further complicate construction. We present a new methodology for modeling low coverage sequencing data in the construction of genetic linkage maps using full-sibling populations of diploid species, implemented in a package called GUSMap. Our model is based on the Lander-Green hidden Markov model but extended to account for errors present in sequencing data. We were able to obtain accurate estimates of the recombination fractions and overall map distance using GUSMap, while most existing mapping packages produced inflated genetic maps in the presence of errors. Our results demonstrate the feasibility of using low coverage sequencing data to produce genetic maps without requiring extensive filtering of potentially erroneous genotypes, provided that the associated errors are correctly accounted for in the model.

摘要

下一代测序是一种高效的方法，它可以比以前的技术产生更多的标记，为构建高密度遗传连锁图谱提供了机会，这有助于非模式物种基因组组装的发展和对其基因的研究。然而，使用高通量测序技术（如测序的基因分型）生成的数据构建遗传图谱会受到测序错误和由于测序深度低而导致的缺失亲本等位基因的基因分型错误的影响。如果不考虑这些错误，它们会导致遗传图谱膨胀。此外，许多物种的图谱构建都是使用来自两个个体杂交的全同胞家系群体进行的，其中未知的亲本相和不同的分离类型进一步增加了构建的复杂性。我们提出了一种新的方法，用于在使用二倍体物种的全同胞群体构建遗传连锁图谱时对低覆盖测序数据进行建模，该方法在一个名为 GUSMap 的软件包中实现。我们的模型基于 Lander-Green 隐马尔可夫模型，但扩展后可以考虑测序数据中的错误。我们能够使用 GUSMap 获得准确的重组分数和总体图谱距离估计，而大多数现有的映射软件包在存在错误的情况下会产生膨胀的遗传图谱。我们的结果表明，在模型中正确考虑相关错误的情况下，使用低覆盖测序数据生成遗传图谱而无需对潜在错误基因型进行广泛过滤是可行的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c50/5937187/956e54575526/65fig1.jpg

相似文献

Accounting for Errors in Low Coverage High-Throughput Sequencing Data When Constructing Genetic Maps Using Biparental Outcrossed Populations.利用双交群体构建遗传图谱时，考虑低覆盖率高通量测序数据中的错误。

Genetics. 2018 May;209(1):65-76. doi: 10.1534/genetics.117.300627. Epub 2018 Feb 27.

Construction of high-quality recombination maps with low-coverage genomic sequencing for joint linkage analysis in maize.利用低覆盖度基因组测序构建高质量重组图谱用于玉米的联合连锁分析

BMC Biol. 2015 Sep 21;13:78. doi: 10.1186/s12915-015-0187-4.

Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps.制定基于测序的基因型分析在连锁图谱构建中的最佳实践。

Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad092. Epub 2023 Oct 27.

Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design.杏仁的测序基因分型：单核苷酸多态性发现、连锁图谱构建及标记设计

G3 (Bethesda). 2018 Jan 4;8(1):161-172. doi: 10.1534/g3.117.300376.

Saturated linkage map construction in Rubus idaeus using genotyping by sequencing and genome-independent imputation.利用测序基因型和与基因组无关的推断构建树莓饱和连锁图谱。

BMC Genomics. 2013 Jan 16;14:2. doi: 10.1186/1471-2164-14-2.

Effect of genotyping errors on linkage map construction based on repeated chip analysis of two recombinant inbred line populations in wheat (Triticum aestivum L.).基因分型错误对基于小麦（Triticum aestivum L.）两个重组自交系群体重复芯片分析构建连锁图谱的影响。

BMC Plant Biol. 2024 Apr 22;24(1):306. doi: 10.1186/s12870-024-05005-8.

QTL mapping using high-throughput sequencing.利用高通量测序进行数量性状基因座定位。

Methods Mol Biol. 2015;1284:257-85. doi: 10.1007/978-1-4939-2444-8_13.

Genetic dissection of maize plant architecture with an ultra-high density bin map based on recombinant inbred lines.基于重组自交系的超高密度bin图谱对玉米株型进行遗传剖析。

BMC Genomics. 2016 Mar 3;17:178. doi: 10.1186/s12864-016-2555-z.

Construction of Genetic Linkage Maps in Multiparental Populations.构建多亲本群体的遗传连锁图谱。

Genetics. 2019 Aug;212(4):1031-1044. doi: 10.1534/genetics.119.302229. Epub 2019 Jun 10.

Probabilistic Multilocus Haplotype Reconstruction in Outcrossing Tetraploids.异交四倍体中概率多位点单倍型重建

Genetics. 2016 May;203(1):119-31. doi: 10.1534/genetics.115.185579. Epub 2016 Feb 26.

引用本文的文献

Assessing population allele frequency differences using low-depth sequencing data.使用低深度测序数据评估群体等位基因频率差异。

J R Soc N Z. 2025 May 21;55(6):2677-2688. doi: 10.1080/03036758.2025.2500999. eCollection 2025.

gene families in rye ( L.) - genome-wide identification, characterization and sequence diversity assessment via DArTreseq.黑麦（L.）中的基因家族——通过DArTreseq进行全基因组鉴定、特征分析和序列多样性评估

Front Plant Sci. 2025 Jun 16;16:1529358. doi: 10.3389/fpls.2025.1529358. eCollection 2025.

Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction.酵母中的单细胞eQTL图谱揭示了生长与繁殖之间的权衡。

Elife. 2025 Mar 12;13:RP95566. doi: 10.7554/eLife.95566.

De novo genome assembly of white clover (Trifolium repens L.) reveals the role of copy number variation in rapid environmental adaptation.白三叶草（Trifolium repens L.）的从头基因组组装揭示了拷贝数变异在快速环境适应中的作用。

BMC Biol. 2024 Aug 7;22(1):165. doi: 10.1186/s12915-024-01962-6.

A high-density linkage map reveals broad- and fine-scale sex differences in recombination in the hihi (stitchbird; Notiomystis cincta).高密度连锁图谱揭示了红额鹦鹉（Notiomystis cincta）在重组方面广泛存在的雌雄性别差异和精细尺度性别差异。

Heredity (Edinb). 2024 Oct;133(4):262-275. doi: 10.1038/s41437-024-00711-3. Epub 2024 Aug 2.

Construction of relatedness matrices in autopolyploid populations using low-depth high-throughput sequencing data.利用低深度高通量测序数据构建同源多倍体群体的亲缘关系矩阵。

Theor Appl Genet. 2024 Mar 2;137(3):64. doi: 10.1007/s00122-024-04568-2.

Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction.酵母中的单细胞eQTL定位揭示了生长与繁殖之间的权衡。

bioRxiv. 2024 Nov 8:2023.12.07.570640. doi: 10.1101/2023.12.07.570640.

Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps.制定基于测序的基因型分析在连锁图谱构建中的最佳实践。

Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad092. Epub 2023 Oct 27.

A multiplexed plant-animal SNP array for selective breeding and species conservation applications.用于选择性育种和物种保护应用的多重植物-动物 SNP 芯片。

G3 (Bethesda). 2023 Sep 30;13(10). doi: 10.1093/g3journal/jkad170.

Smooth Descent: A ploidy-aware algorithm to improve linkage mapping in the presence of genotyping errors.平滑下降法：一种在存在基因分型错误的情况下改进连锁图谱构建的倍性感知算法。

Front Genet. 2023 Mar 1;14:1049988. doi: 10.3389/fgene.2023.1049988. eCollection 2023.

本文引用的文献

De novo SNP discovery and genetic linkage mapping in poplar using restriction site associated DNA and whole-genome sequencing technologies.利用限制性位点相关DNA和全基因组测序技术在杨树中进行从头SNP发现和遗传连锁图谱构建。

BMC Genomics. 2016 Aug 18;17:656. doi: 10.1186/s12864-016-3003-9.

Imputing Genotypes in Biallelic Populations from Low-Coverage Sequence Data.利用低覆盖度序列数据推算双等位基因群体中的基因型

Genetics. 2016 Feb;202(2):487-95. doi: 10.1534/genetics.115.182071. Epub 2015 Dec 29.

Construction of Ultradense Linkage Maps with Lep-MAP2: Stickleback F2 Recombinant Crosses as an Example.利用Lep-MAP2构建超密集连锁图谱：以棘鱼F2重组杂交为例。

Genome Biol Evol. 2015 Dec 14;8(1):78-93. doi: 10.1093/gbe/evv250.

Construction of relatedness matrices using genotyping-by-sequencing data.使用简化基因组测序数据构建亲缘关系矩阵。

BMC Genomics. 2015 Dec 9;16:1047. doi: 10.1186/s12864-015-2252-3.

Heterozygous Mapping Strategy (HetMappS) for High Resolution Genotyping-By-Sequencing Markers: A Case Study in Grapevine.用于高分辨率测序基因分型标记的杂合映射策略（HetMappS）：葡萄中的一个案例研究

PLoS One. 2015 Aug 5;10(8):e0134880. doi: 10.1371/journal.pone.0134880. eCollection 2015.

Flexible and scalable genotyping-by-sequencing strategies for population studies.用于群体研究的灵活且可扩展的测序基因分型策略。

BMC Genomics. 2014 Nov 18;15(1):979. doi: 10.1186/1471-2164-15-979.

Using Mendelian inheritance to improve high-throughput SNP discovery.利用孟德尔遗传规律改进高通量单核苷酸多态性发现。

Genetics. 2014 Nov;198(3):847-57. doi: 10.1534/genetics.114.169052. Epub 2014 Sep 5.

Fast and cost-effective genetic mapping in apple using next-generation sequencing.利用下一代测序技术在苹果中进行快速且经济高效的基因定位

G3 (Bethesda). 2014 Jul 16;4(9):1681-7. doi: 10.1534/g3.114.011023.

Construction and analysis of high-density linkage map using high-throughput sequencing data.利用高通量测序数据构建和分析高密度连锁图谱。

PLoS One. 2014 Jun 6;9(6):e98855. doi: 10.1371/journal.pone.0098855. eCollection 2014.

Efficient imputation of missing markers in low-coverage genotyping-by-sequencing data from multiparental crosses.多亲本杂交低覆盖度测序基因分型数据中缺失标记的高效填充

Genetics. 2014 May;197(1):401-4. doi: 10.1534/genetics.113.158014. Epub 2014 Feb 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用双交群体构建遗传图谱时，考虑低覆盖率高通量测序数据中的错误。

Accounting for Errors in Low Coverage High-Throughput Sequencing Data When Constructing Genetic Maps Using Biparental Outcrossed Populations.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献