• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从深度测序数据中准确推断未分型变异。

Accurate Imputation of Untyped Variants from Deep Sequencing Data.

机构信息

Département de Phytologie, Université Laval, Québec City, QC, Canada.

Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada.

出版信息

Methods Mol Biol. 2021;2243:271-281. doi: 10.1007/978-1-0716-1103-6_13.

DOI:10.1007/978-1-0716-1103-6_13
PMID:33606262
Abstract

The quality, statistical power, and resolution of genome-wide association studies (GWAS) are largely dependent on the comprehensiveness of genotypic data. Over the last few years, despite the constant decrease in the price of sequencing, whole-genome sequencing (WGS) of association panels comprising a large number of samples remains cost-prohibitive. Therefore, most GWAS populations are still genotyped using low-coverage genotyping methods resulting in incomplete datasets. Imputation of untyped variants is a powerful method to maximize the number of SNPs identified in study samples, it increases the power and resolution of GWAS and allows to integrate genotyping datasets obtained from various sources. Here, we describe the key concepts underlying imputation of untyped variants, including the architecture of reference panels, and review some of the associated challenges and how these can be addressed. We also discuss the need and available methods to rigorously assess the accuracy of imputed data prior to their use in any genetic study.

摘要

全基因组关联研究(GWAS)的质量、统计效力和分辨率在很大程度上取决于基因型数据的全面性。尽管过去几年测序价格持续下降,但包含大量样本的关联面板的全基因组测序(WGS)仍然成本过高。因此,大多数 GWAS 群体仍使用低覆盖率的基因分型方法进行基因分型,导致数据集不完整。未分型变体的推断是一种强大的方法,可以最大限度地增加研究样本中鉴定的 SNP 数量,它提高了 GWAS 的效力和分辨率,并允许整合来自各种来源的基因分型数据集。在这里,我们描述了推断未分型变体的关键概念,包括参考面板的架构,并回顾了一些相关的挑战以及如何解决这些挑战。我们还讨论了在任何遗传研究中使用之前,严格评估推断数据准确性的必要性和可用方法。

相似文献

1
Accurate Imputation of Untyped Variants from Deep Sequencing Data.从深度测序数据中准确推断未分型变异。
Methods Mol Biol. 2021;2243:271-281. doi: 10.1007/978-1-0716-1103-6_13.
2
Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies.未分型标记的全基因组推断准确性及其对关联研究统计效能的影响。
BMC Genet. 2009 Jun 16;10:27. doi: 10.1186/1471-2156-10-27.
3
Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations.GWAS 中的测序和插补:在不同人群中提高效能和基因组覆盖范围的经济有效的策略。
Genet Epidemiol. 2020 Sep;44(6):537-549. doi: 10.1002/gepi.22326. Epub 2020 Jun 9.
4
Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing.基于大型参考面板的基因分型和模拟深度全基因组测序的程度。
Am J Hum Genet. 2022 Sep 1;109(9):1653-1666. doi: 10.1016/j.ajhg.2022.07.012. Epub 2022 Aug 17.
5
Effect of genome-wide genotyping and reference panels on rare variants imputation.全基因组基因分型和参考面板对稀有变异体推断的影响。
J Genet Genomics. 2012 Oct 20;39(10):545-50. doi: 10.1016/j.jgg.2012.07.002. Epub 2012 Jul 24.
6
GWAS on Imputed Whole-Genome Resequencing From Genotyping-by-Sequencing Data for Farrowing Interval of Different Parities in Pigs.基于测序分型数据进行猪不同胎次产仔间隔的全基因组重测序推算的全基因组关联研究
Front Genet. 2019 Oct 18;10:1012. doi: 10.3389/fgene.2019.01012. eCollection 2019.
7
A new strategy for enhancing imputation quality of rare variants from next-generation sequencing data via combining SNP and exome chip data.一种通过结合单核苷酸多态性(SNP)和外显子芯片数据来提高下一代测序数据中罕见变异插补质量的新策略。
BMC Genomics. 2015 Dec 29;16:1109. doi: 10.1186/s12864-015-2192-y.
8
Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation.牛群中的低深度测序基因分型(GBS):最大化高质量基因型选择和归因准确性的策略。
BMC Genet. 2017 Apr 5;18(1):32. doi: 10.1186/s12863-017-0501-y.
9
Scanning and Filling: Ultra-Dense SNP Genotyping Combining Genotyping-By-Sequencing, SNP Array and Whole-Genome Resequencing Data.扫描与填充:结合简化基因组测序、SNP芯片和全基因组重测序数据的超密集SNP基因分型
PLoS One. 2015 Jul 10;10(7):e0131533. doi: 10.1371/journal.pone.0131533. eCollection 2015.
10
Impact of the inaccessible genome on genotype imputation and genome-wide association studies.基因组的不可及性对基因型推断和全基因组关联研究的影响。
Hum Mol Genet. 2024 Jul 6;33(14):1207-1214. doi: 10.1093/hmg/ddae062.

引用本文的文献

1
High performance imputation of structural and single nucleotide variants using low-coverage whole genome sequencing.利用低覆盖度全基因组测序对结构变异和单核苷酸变异进行高性能插补
Genet Sel Evol. 2025 Mar 28;57(1):16. doi: 10.1186/s12711-025-00962-6.
2
STICI: Split-Transformer with integrated convolutions for genotype imputation.STICI:用于基因型填充的集成卷积拆分变压器
Nat Commun. 2025 Jan 31;16(1):1218. doi: 10.1038/s41467-025-56273-3.
3
Machine learning-enhanced multi-trait genomic prediction for optimizing cannabinoid profiles in cannabis.

本文引用的文献

1
Learning the optimal scale for GWAS through hierarchical SNP aggregation.通过层次 SNP 聚合学习 GWAS 的最佳规模。
BMC Bioinformatics. 2018 Nov 29;19(1):459. doi: 10.1186/s12859-018-2475-9.
2
A One-Penny Imputed Genome from Next-Generation Reference Panels.基于新一代参考面板的单分钱估算基因组。
Am J Hum Genet. 2018 Sep 6;103(3):338-348. doi: 10.1016/j.ajhg.2018.07.015. Epub 2018 Aug 9.
3
Genotype imputation performance of three reference panels using African ancestry individuals.三种参考面板在非洲血统个体中的基因型推断性能。
机器学习增强的多性状基因组预测,用于优化大麻中的大麻素谱。
Plant J. 2025 Jan;121(1):e17164. doi: 10.1111/tpj.17164. Epub 2024 Nov 27.
4
Somatic Mutation Accumulations in Micropropagated Cannabis Are Proportional to the Number of Subcultures.微繁殖大麻中体细胞突变的积累与继代培养次数成正比。
Plants (Basel). 2024 Jul 11;13(14):1910. doi: 10.3390/plants13141910.
5
Venous thromboembolic disease genetics: from variants to function.静脉血栓栓塞性疾病遗传学:从变异到功能。
J Thromb Haemost. 2024 Sep;22(9):2393-2403. doi: 10.1016/j.jtha.2024.06.004. Epub 2024 Jun 21.
6
The Genotypic Variability among Short-Season Soybean Cultivars for Nitrogen Fixation under Drought Stress.干旱胁迫下短季大豆品种固氮的基因型变异
Plants (Basel). 2023 Feb 22;12(5):1004. doi: 10.3390/plants12051004.
7
An autoencoder-based deep learning method for genotype imputation.一种基于自动编码器的深度学习基因分型填充方法。
Front Artif Intell. 2022 Nov 3;5:1028978. doi: 10.3389/frai.2022.1028978. eCollection 2022.
8
Identification of quantitative trait loci associated with seed quality traits between Canadian and Ukrainian mega-environments using genome-wide association study.利用全基因组关联研究鉴定加拿大和乌克兰大环境之间与种子质量性状相关的数量性状基因座。
Theor Appl Genet. 2022 Jul;135(7):2515-2530. doi: 10.1007/s00122-022-04134-8. Epub 2022 Jun 18.
Hum Genet. 2018 Apr;137(4):281-292. doi: 10.1007/s00439-018-1881-4. Epub 2018 Apr 10.
4
Efficient genome-wide genotyping strategies and data integration in crop plants.作物中高效的全基因组基因分型策略与数据整合
Theor Appl Genet. 2018 Mar;131(3):499-511. doi: 10.1007/s00122-018-3056-z. Epub 2018 Jan 19.
5
Comprehensive description of genomewide nucleotide and structural variation in short-season soya bean.短季大豆全基因组核苷酸和结构变异的综合描述。
Plant Biotechnol J. 2018 Mar;16(3):749-759. doi: 10.1111/pbi.12825. Epub 2017 Nov 3.
6
10 Years of GWAS Discovery: Biology, Function, and Translation.全基因组关联研究十年发现:生物学、功能与转化
Am J Hum Genet. 2017 Jul 6;101(1):5-22. doi: 10.1016/j.ajhg.2017.06.005.
7
Crop Breeding Chips and Genotyping Platforms: Progress, Challenges, and Perspectives.作物育种芯片与基因分型平台:进展、挑战与展望。
Mol Plant. 2017 Aug 7;10(8):1047-1064. doi: 10.1016/j.molp.2017.06.008. Epub 2017 Jun 29.
8
Increasing mapping precision of genome-wide association studies: to genotype and impute, sequence, or both?提高全基因组关联研究的定位精度:进行基因分型和推算、测序,还是两者兼而有之?
Genome Biol. 2017 Jun 19;18(1):118. doi: 10.1186/s13059-017-1255-6.
9
A high-quality human reference panel reveals the complexity and distribution of genomic structural variants.高质量的人类参考面板揭示了基因组结构变异的复杂性和分布。
Nat Commun. 2016 Oct 6;7:12989. doi: 10.1038/ncomms12989.
10
Reference-based phasing using the Haplotype Reference Consortium panel.使用单倍型参考联盟面板进行基于参考的定相
Nat Genet. 2016 Nov;48(11):1443-1448. doi: 10.1038/ng.3679. Epub 2016 Oct 3.