• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用低覆盖度全基因组测序对结构变异和单核苷酸变异进行高性能插补

High performance imputation of structural and single nucleotide variants using low-coverage whole genome sequencing.

作者信息

Gundappa Manu Kumar, Robledo Diego, Hamilton Alastair, Houston Ross D, Prendergast James G D, Macqueen Daniel J

机构信息

Animal Breeding and Genomics, Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.

The Roslin Institute, University of Edinburgh, Easter Bush Campus, Edinburgh, EH25 9RG, UK.

出版信息

Genet Sel Evol. 2025 Mar 28;57(1):16. doi: 10.1186/s12711-025-00962-6.

DOI:10.1186/s12711-025-00962-6
PMID:40155798
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11951665/
Abstract

BACKGROUND

Whole genome sequencing (WGS), despite its advantages, is yet to replace methods for genotyping single nucleotide variants (SNVs) such as SNP arrays and targeted genotyping assays. Structural variants (SVs) have larger effects on traits than SNVs, but are more challenging to accurately genotype. Using low-coverage WGS with genotype imputation offers a cost-effective strategy to achieve genome-wide variant coverage, but is yet to be tested for SVs.

METHODS

Here, we investigate combined SNV and SV imputation with low-coverage WGS data in Atlantic salmon (Salmo salar). As the reference panel, we used genotypes for high-confidence SVs and SNVs for n = 365 wild individuals sampled from diverse populations. We also generated 15 × WGS data (n = 20 samples) for a commercial population external to the reference panel, and called SVs and SNVs with gold-standard approaches. An imputation method selected for its established performance using low-coverage sequencing data (GLIMPSE) was tested at WGS depths of 1 × , 2 × , 3 × , and 4 × for samples within and external to the reference panel.

RESULTS

SNVs were imputed with high accuracy and recall across all WGS depths, including for samples out-with the reference panel. For SVs, we compared imputation based purely on linkage disequilibrium (LD) with SNVs, to that supplemented with SV genotype likelihoods (GLs) from low-coverage WGS. Including SV GLs increased imputation accuracy, but as a trade-off with recall, requiring 3-4 × depth for best performance. Combining strategies allowed us to capture 84% of the reference panel deletions with 87% accuracy at 1 × depth. We also show that SV length affects imputation performance, with provision of SV GLs greatly enhancing accuracy for the longest SVs in the dataset.

CONCLUSIONS

This study highlights the promise of reference panel imputation using low-coverage WGS, including novel opportunities to enhance the resolution of genome-wide association studies by capturing SVs.

摘要

背景

全基因组测序(WGS)尽管具有诸多优势,但尚未取代单核苷酸变异(SNV)基因分型方法,如SNP阵列和靶向基因分型检测。结构变异(SV)对性状的影响比SNV更大,但准确进行基因分型更具挑战性。使用低覆盖度WGS结合基因型填充提供了一种经济高效的策略来实现全基因组变异覆盖,但尚未针对SV进行测试。

方法

在此,我们研究了大西洋鲑(Salmo salar)低覆盖度WGS数据中SNV和SV的联合填充。作为参考面板,我们使用了从不同种群中采样的n = 365个野生个体的高可信度SV和SNV的基因型。我们还为参考面板之外的一个商业种群生成了15×WGS数据(n = 20个样本),并使用金标准方法对SV和SNV进行了分型。选择一种因其在低覆盖度测序数据中已确立的性能而被采用的填充方法(GLIMPSE),在1×、2×、3×和4×的WGS深度下,对参考面板内外的样本进行测试。

结果

在所有WGS深度下,包括参考面板之外的样本,SNV的填充都具有高精度和召回率。对于SV,我们将仅基于与SNV的连锁不平衡(LD)的填充与补充了低覆盖度WGS的SV基因型似然性(GL)的填充进行了比较。纳入SV GL提高了填充准确性,但作为召回率的权衡,最佳性能需要3 - 4×深度。组合策略使我们能够在1×深度下以87%的准确率捕获参考面板中84%的缺失。我们还表明,SV长度会影响填充性能,提供SV GL极大地提高了数据集中最长SV的准确性。

结论

本研究突出了使用低覆盖度WGS进行参考面板填充的前景,包括通过捕获SV增强全基因组关联研究分辨率的新机会。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/3ea28462eb97/12711_2025_962_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/f856a0cbb938/12711_2025_962_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/702943549e0b/12711_2025_962_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/72ccc21c8b95/12711_2025_962_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/3ec98ff0600c/12711_2025_962_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/05a5dfa88b6a/12711_2025_962_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/b0571b11797d/12711_2025_962_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/ad8e95c66fd1/12711_2025_962_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/3ea28462eb97/12711_2025_962_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/f856a0cbb938/12711_2025_962_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/702943549e0b/12711_2025_962_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/72ccc21c8b95/12711_2025_962_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/3ec98ff0600c/12711_2025_962_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/05a5dfa88b6a/12711_2025_962_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/b0571b11797d/12711_2025_962_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/ad8e95c66fd1/12711_2025_962_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d0d/11951665/3ea28462eb97/12711_2025_962_Fig8_HTML.jpg

相似文献

1
High performance imputation of structural and single nucleotide variants using low-coverage whole genome sequencing.利用低覆盖度全基因组测序对结构变异和单核苷酸变异进行高性能插补
Genet Sel Evol. 2025 Mar 28;57(1):16. doi: 10.1186/s12711-025-00962-6.
2
Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation.利用简化基因组测序(GBS)和填充技术对圈养非人灵长类动物进行全基因组特征分析。
BMC Genomics. 2016 Aug 24;17(1):676. doi: 10.1186/s12864-016-2966-x.
3
A large structural variant collection in Holstein cattle and associated database for variant discovery, characterization, and application.荷斯坦牛大型结构变异组库及相关数据库的建立,用于变异的发现、鉴定和应用。
BMC Genomics. 2024 Sep 30;25(1):903. doi: 10.1186/s12864-024-10812-2.
4
Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs.利用基于插补的全基因组测序数据提高猪的组合群体基因组预测的准确性。
Genet Sel Evol. 2019 Oct 21;51(1):58. doi: 10.1186/s12711-019-0500-8.
5
Comparison of genotype imputation strategies using a combined reference panel for chicken population.利用鸡群体的组合参考面板比较基因型推断策略。
Animal. 2019 Jun;13(6):1119-1126. doi: 10.1017/S1751731118002860. Epub 2018 Oct 29.
6
Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing.基于大型参考面板的基因分型和模拟深度全基因组测序的程度。
Am J Hum Genet. 2022 Sep 1;109(9):1653-1666. doi: 10.1016/j.ajhg.2022.07.012. Epub 2022 Aug 17.
7
From cytogenetics to cytogenomics: whole-genome sequencing as a first-line test comprehensively captures the diverse spectrum of disease-causing genetic variation underlying intellectual disability.从细胞遗传学到细胞基因组学:全基因组测序作为一线检测手段,全面捕捉到导致智力障碍的遗传变异的多样谱。
Genome Med. 2019 Nov 7;11(1):68. doi: 10.1186/s13073-019-0675-1.
8
Performance evaluation of structural variation detection using DNBSEQ whole-genome sequencing.使用DNBSEQ全基因组测序进行结构变异检测的性能评估
BMC Genomics. 2025 Mar 25;26(1):299. doi: 10.1186/s12864-025-11494-0.
9
Assessment of linkage disequilibrium patterns between structural variants and single nucleotide polymorphisms in three commercial chicken populations.评估三个商业鸡群中结构变异与单核苷酸多态性之间的连锁不平衡模式。
BMC Genomics. 2022 Mar 9;23(1):193. doi: 10.1186/s12864-022-08418-7.
10
Strategies for Obtaining and Pruning Imputed Whole-Genome Sequence Data for Genomic Prediction.获取和修剪用于基因组预测的插补全基因组序列数据的策略。
Front Genet. 2019 Jul 17;10:673. doi: 10.3389/fgene.2019.00673. eCollection 2019.

本文引用的文献

1
The generation of the first chromosome-level de novo genome assembly and the development and validation of a 50K SNP array for the St. John River aquaculture strain of North American Atlantic salmon.生成首个染色体水平的从头基因组组装,并开发和验证了用于北美大西洋鲑圣约翰河养殖品系的 50K SNP 芯片。
G3 (Bethesda). 2023 Aug 30;13(9). doi: 10.1093/g3journal/jkad138.
2
ESCRT recruitment to mRNA-encoded SARS-CoV-2 spike induces virus-like particles and enhanced antibody responses.ESCRT募集到由mRNA编码的新冠病毒刺突蛋白上可诱导病毒样颗粒并增强抗体反应。
bioRxiv. 2022 Dec 27:2022.12.26.521940. doi: 10.1101/2022.12.26.521940.
3
Dissecting the loci underlying maturation timing in Atlantic salmon using haplotype and multi-SNP based association methods.
利用单倍型和多 SNP 关联方法剖析大西洋鲑鱼成熟时间的相关基因座。
Heredity (Edinb). 2022 Dec;129(6):356-365. doi: 10.1038/s41437-022-00570-w. Epub 2022 Nov 10.
4
The emergence of supergenes from inversions in Atlantic salmon.超级基因从大西洋鲑鱼的倒位中出现。
Philos Trans R Soc Lond B Biol Sci. 2022 Aug;377(1856):20210195. doi: 10.1098/rstb.2021.0195. Epub 2022 Jun 13.
5
Genome assembly, structural variants, and genetic differentiation between lake whitefish young species pairs (Coregonus sp.) with long and short reads.利用长读长和短读对湖拟鲤幼鱼不同种对(白鲑属)的基因组组装、结构变异和遗传分化进行分析。
Mol Ecol. 2023 Mar;32(6):1458-1477. doi: 10.1111/mec.16468. Epub 2022 Apr 29.
6
Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure.充分利用 SNP 阵列:提取潜在基因组结构的工具的系统评价。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac043.
7
Assessment of the performance of different imputation methods for low-coverage sequencing in Holstein cattle.荷斯坦奶牛低覆盖度测序中不同插补方法性能评估。
J Dairy Sci. 2022 Apr;105(4):3355-3366. doi: 10.3168/jds.2021-21360. Epub 2022 Feb 10.
8
Structural variants are a major source of gene expression differences in humans and often affect multiple nearby genes.结构变异是人类基因表达差异的主要来源,并且常常影响多个邻近基因。
Genome Res. 2021 Dec;31(12):2249-2257. doi: 10.1101/gr.275488.121. Epub 2021 Sep 20.
9
Rapid genotype imputation from sequence with reference panels.基于参考面板的序列快速基因型推断。
Nat Genet. 2021 Jul;53(7):1104-1111. doi: 10.1038/s41588-021-00877-0. Epub 2021 Jun 3.
10
Association of structural variation with cardiometabolic traits in Finns.芬兰人群中心血管代谢特征与结构变异的关联。
Am J Hum Genet. 2021 Apr 1;108(4):583-596. doi: 10.1016/j.ajhg.2021.03.008.