• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种通过靶向单倍型而非个体来分配低覆盖度测序资源的方法。

A method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals.

作者信息

Ros-Freixedes Roger, Gonen Serap, Gorjanc Gregor, Hickey John M

机构信息

The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK.

出版信息

Genet Sel Evol. 2017 Oct 25;49(1):78. doi: 10.1186/s12711-017-0353-y.

DOI:10.1186/s12711-017-0353-y
PMID:29070022
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5655873/
Abstract

BACKGROUND

This paper describes a heuristic method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals. Low-coverage sequencing assembles high-coverage sequence information for every individual by accumulating data from the genome segments that they share with many other individuals into consensus haplotypes. Deriving the consensus haplotypes accurately is critical for achieving a high phasing and imputation accuracy. In order to enable accurate phasing and imputation of sequence information for the whole population, we allocate the available sequencing resources among individuals with existing phased genomic data by targeting the sequencing coverage of their haplotypes.

RESULTS

Our method, called AlphaSeqOpt, prioritizes haplotypes using a score function that is based on the frequency of the haplotypes in the sequencing set relative to the target coverage. AlphaSeqOpt has two steps: (1) selection of an initial set of individuals by iteratively choosing the individuals that have the maximum score conditional on the current set, and (2) refinement of the set through several rounds of exchanges of individuals. AlphaSeqOpt is very effective for distributing a fixed amount of sequencing resources evenly across haplotypes, which results in a reduction of the proportion of haplotypes that are sequenced below the target coverage. AlphaSeqOpt can provide a greater proportion of haplotypes sequenced at the target coverage by sequencing less individuals, as compared with other methods that use a score function based on haplotype frequencies in the population. A refinement of the initially selected set can provide a larger more diverse set with more unique individuals, which is beneficial in the context of low-coverage sequencing. We extend the method with an approach for filtering rare haplotypes based on their flanking haplotypes, so that only those that are likely to derive from a recombination event are targeted.

CONCLUSIONS

We present a method for allocating sequencing resources so that a greater proportion of haplotypes are sequenced at a coverage that is sufficiently high for population-based imputation with low-coverage sequencing. The haplotype score function, the refinement step, and the new approach for filtering rare haplotypes make AlphaSeqOpt more effective for that purpose than previously reported methods for reducing sequencing redundancy.

摘要

背景

本文描述了一种通过针对单倍型而非个体来分配低覆盖度测序资源的启发式方法。低覆盖度测序通过将个体与许多其他个体共享的基因组片段的数据累积为一致单倍型,为每个个体组装高覆盖度序列信息。准确推导一致单倍型对于实现高定相和插补准确性至关重要。为了能够对整个人群的序列信息进行准确的定相和插补,我们通过针对个体单倍型的测序覆盖度,在具有现有定相基因组数据的个体之间分配可用的测序资源。

结果

我们的方法称为AlphaSeqOpt,它使用基于测序集中单倍型频率相对于目标覆盖度的评分函数对单倍型进行优先级排序。AlphaSeqOpt有两个步骤:(1) 通过迭代选择在当前集合条件下得分最高的个体来选择一组初始个体,以及(2) 通过几轮个体交换来优化该集合。AlphaSeqOpt对于将固定量的测序资源均匀分布在单倍型上非常有效,这导致低于目标覆盖度测序的单倍型比例降低。与其他使用基于群体中单倍型频率的评分函数的方法相比,AlphaSeqOpt通过对更少的个体进行测序,可以提供更大比例在目标覆盖度下测序的单倍型。对最初选择的集合进行优化可以提供一个更大、更多样化且包含更多独特个体的集合,这在低覆盖度测序的背景下是有益的。我们通过一种基于侧翼单倍型过滤稀有单倍型的方法扩展了该方法,以便仅针对那些可能源自重组事件的单倍型。

结论

我们提出了一种分配测序资源的方法,以便更大比例的单倍型在足够高的覆盖度下进行测序,从而用于基于群体的低覆盖度测序插补。单倍型评分函数、优化步骤以及过滤稀有单倍型的新方法使AlphaSeqOpt在该目的上比先前报道的减少测序冗余的方法更有效。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/e7eb85e45c8c/12711_2017_353_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/3c700bd64b71/12711_2017_353_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/13981b6ac5c2/12711_2017_353_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/2e2dd71f504d/12711_2017_353_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/289662189047/12711_2017_353_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/03a3836c256d/12711_2017_353_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/df6dac6e30cb/12711_2017_353_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/43da7fb98baa/12711_2017_353_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/e7eb85e45c8c/12711_2017_353_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/3c700bd64b71/12711_2017_353_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/13981b6ac5c2/12711_2017_353_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/2e2dd71f504d/12711_2017_353_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/289662189047/12711_2017_353_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/03a3836c256d/12711_2017_353_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/df6dac6e30cb/12711_2017_353_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/43da7fb98baa/12711_2017_353_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d29/5655873/e7eb85e45c8c/12711_2017_353_Fig8_HTML.jpg

相似文献

1
A method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals.一种通过靶向单倍型而非个体来分配低覆盖度测序资源的方法。
Genet Sel Evol. 2017 Oct 25;49(1):78. doi: 10.1186/s12711-017-0353-y.
2
A method for the allocation of sequencing resources in genotyped livestock populations.一种在基因分型家畜群体中分配测序资源的方法。
Genet Sel Evol. 2017 May 18;49(1):47. doi: 10.1186/s12711-017-0322-5.
3
A strategy to improve phasing of whole-genome sequenced individuals through integration of familial information from dense genotype panels.一种通过整合来自密集基因型面板的家族信息来改善全基因组测序个体相位的策略。
Genet Sel Evol. 2017 May 16;49(1):46. doi: 10.1186/s12711-017-0321-6.
4
Extending long-range phasing and haplotype library imputation algorithms to large and heterogeneous datasets.将长程相位推断和单倍型文库推断算法扩展到大型和异质数据集。
Genet Sel Evol. 2020 Jul 8;52(1):38. doi: 10.1186/s12711-020-00558-2.
5
Hybrid peeling for fast and accurate calling, phasing, and imputation with sequence data of any coverage in pedigrees.混合去卷积用于在任何覆盖度的家系序列数据中快速准确地进行调用、定相和插补。
Genet Sel Evol. 2018 Dec 18;50(1):67. doi: 10.1186/s12711-018-0438-2.
6
Benchmarking phasing software with a whole-genome sequenced cattle pedigree.利用全基因组测序的牛系谱对相位软件进行基准测试。
BMC Genomics. 2022 Feb 15;23(1):130. doi: 10.1186/s12864-022-08354-6.
7
Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel.使用基于全基因组测序(WGS)的特定人群高覆盖度插补参考面板提高罕见和低频变异的插补准确性。
Eur J Hum Genet. 2017 Jun;25(7):869-876. doi: 10.1038/ejhg.2017.51. Epub 2017 Apr 12.
8
Phasing for medical sequencing using rare variants and large haplotype reference panels.使用罕见变异和大型单倍型参考面板进行医学测序的定相分析。
Bioinformatics. 2016 Jul 1;32(13):1974-80. doi: 10.1093/bioinformatics/btw065. Epub 2016 Feb 27.
9
The size and composition of haplotype reference panels impact the accuracy of imputation from low-pass sequencing in cattle.单体型参考面板的大小和组成会影响牛低深度测序数据的准确性。
Genet Sel Evol. 2023 May 11;55(1):33. doi: 10.1186/s12711-023-00809-y.
10
Toward genomic prediction from whole-genome sequence data: impact of sequencing design on genotype imputation and accuracy of predictions.从全基因组序列数据进行基因组预测:测序设计对基因型推断和预测准确性的影响。
Heredity (Edinb). 2014 Jan;112(1):39-47. doi: 10.1038/hdy.2013.13. Epub 2013 Apr 3.

引用本文的文献

1
Evaluation of Low-Coverage Sequencing Strategies for Whole-Genome Imputation in Pacific Abalone .太平洋鲍鱼全基因组插补的低覆盖度测序策略评估
Int J Mol Sci. 2025 May 11;26(10):4598. doi: 10.3390/ijms26104598.
2
Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps.制定基于测序的基因型分析在连锁图谱构建中的最佳实践。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad092. Epub 2023 Oct 27.
3
Cost-effectively dissecting the genetic architecture of complex wool traits in rabbits by low-coverage sequencing.

本文引用的文献

1
A method for the allocation of sequencing resources in genotyped livestock populations.一种在基因分型家畜群体中分配测序资源的方法。
Genet Sel Evol. 2017 May 18;49(1):47. doi: 10.1186/s12711-017-0322-5.
2
Identification of key contributors in complex population structures.复杂群体结构中关键贡献因素的识别。
PLoS One. 2017 May 16;12(5):e0177638. doi: 10.1371/journal.pone.0177638. eCollection 2017.
3
AlphaSim: Software for Breeding Program Simulation.AlphaSim:种畜培育程序模拟软件。
通过低覆盖度测序,经济有效地剖析家兔复杂羊毛性状的遗传结构。
Genet Sel Evol. 2022 Nov 18;54(1):75. doi: 10.1186/s12711-022-00766-y.
4
Genomic prediction with whole-genome sequence data in intensely selected pig lines.全基因组序列数据在高度选育猪系中的基因组预测。
Genet Sel Evol. 2022 Sep 24;54(1):65. doi: 10.1186/s12711-022-00756-0.
5
Rare and population-specific functional variation across pig lines.猪种间罕见且具有种群特异性的功能变异。
Genet Sel Evol. 2022 Jun 3;54(1):39. doi: 10.1186/s12711-022-00732-8.
6
Structural variant-based pangenome construction has low sensitivity to variability of haplotype-resolved bovine assemblies.基于结构变异的泛基因组构建对单倍型解析牛组装体的变异性的敏感性较低。
Nat Commun. 2022 May 31;13(1):3012. doi: 10.1038/s41467-022-30680-2.
7
Comparison of the choice of animals for re-sequencing in two maternal pig lines.比较两个母系猪品系中用于重测序的动物选择。
Genet Sel Evol. 2022 Feb 19;54(1):16. doi: 10.1186/s12711-022-00706-w.
8
Mining massive genomic data of two Swiss Braunvieh cattle populations reveals six novel candidate variants that impair reproductive success.从两个瑞士褐牛群体的大量基因组数据中挖掘出六个影响繁殖成功率的新候选变异。
Genet Sel Evol. 2021 Dec 16;53(1):95. doi: 10.1186/s12711-021-00686-3.
9
Towards a Cost-Effective Implementation of Genomic Prediction Based on Low Coverage Whole Genome Sequencing in Dezhou Donkey.基于低覆盖度全基因组测序在德州驴中实现具有成本效益的基因组预测
Front Genet. 2021 Nov 3;12:728764. doi: 10.3389/fgene.2021.728764. eCollection 2021.
10
Optimizing Sequencing Resources in Genotyped Livestock Populations Using Linear Programming.利用线性规划优化基因分型家畜群体中的测序资源
Front Genet. 2021 Oct 22;12:740340. doi: 10.3389/fgene.2021.740340. eCollection 2021.
Plant Genome. 2016 Nov;9(3). doi: 10.3835/plantgenome2016.02.0013.
4
Low-, high-coverage, and two-stage DNA sequencing in the design of the genetic association study.遗传关联研究设计中的低覆盖度、高覆盖度和两阶段DNA测序
Genet Epidemiol. 2017 Apr;41(3):187-197. doi: 10.1002/gepi.22015. Epub 2016 Nov 4.
5
Genome-wide association of multiple complex traits in outbred mice by ultra-low-coverage sequencing.通过超低覆盖度测序对远交系小鼠多种复杂性状进行全基因组关联分析。
Nat Genet. 2016 Aug;48(8):912-8. doi: 10.1038/ng.3595. Epub 2016 Jul 4.
6
Rapid genotype imputation from sequence without reference panels.无需参考面板即可从序列中快速进行基因型推算。
Nat Genet. 2016 Aug;48(8):965-969. doi: 10.1038/ng.3594. Epub 2016 Jul 4.
7
Reducing animal sequencing redundancy by preferentially selecting animals with low-frequency haplotypes.通过优先选择低频单倍型的动物来减少动物测序冗余。
J Dairy Sci. 2016 Jul;99(7):5526-5534. doi: 10.3168/jds.2015-10347. Epub 2016 Apr 13.
8
Fast imputation using medium or low-coverage sequence data.使用中等或低覆盖率序列数据进行快速插补。
BMC Genet. 2015 Jul 14;16:82. doi: 10.1186/s12863-015-0243-7.
9
Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle.利用单一或多品种参考群体对牛全基因组序列进行填充的策略。
BMC Genomics. 2014 Aug 27;15(1):728. doi: 10.1186/1471-2164-15-728.
10
Accuracy of imputation to whole-genome sequence data in Holstein Friesian cattle.荷斯坦奶牛全基因组序列数据插补的准确性
Genet Sel Evol. 2014 Jul 15;46(1):41. doi: 10.1186/1297-9686-46-41.