• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于大型参考面板的基因分型和模拟深度全基因组测序的程度。

Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing.

机构信息

Department of Biostatistics and Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, MI, USA.

Institute of Genetic Epidemiology, Medical University of Innsbruck, Innsbruck, Austria.

出版信息

Am J Hum Genet. 2022 Sep 1;109(9):1653-1666. doi: 10.1016/j.ajhg.2022.07.012. Epub 2022 Aug 17.

DOI:10.1016/j.ajhg.2022.07.012
PMID:35981533
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9502057/
Abstract

Understanding the genetic basis of human diseases and traits is dependent on the identification and accurate genotyping of genetic variants. Deep whole-genome sequencing (WGS), the gold standard technology for SNP and indel identification and genotyping, remains very expensive for most large studies. Here, we quantify the extent to which array genotyping followed by genotype imputation can approximate WGS in studies of individuals of African, Hispanic/Latino, and European ancestry in the US and of Finnish ancestry in Finland (a population isolate). For each study, we performed genotype imputation by using the genetic variants present on the Illumina Core, OmniExpress, MEGA, and Omni 2.5M arrays with the 1000G, HRC, and TOPMed imputation reference panels. Using the Omni 2.5M array and the TOPMed panel, ≥90% of bi-allelic single-nucleotide variants (SNVs) are well imputed (r > 0.8) down to minor-allele frequencies (MAFs) of 0.14% in African, 0.11% in Hispanic/Latino, 0.35% in European, and 0.85% in Finnish ancestries. There was little difference in TOPMed-based imputation quality among the arrays with >700k variants. Individual-level imputation quality varied widely between and within the three US studies. Imputation quality also varied across genomic regions, producing regions where even common (MAF > 5%) variants were consistently not well imputed across ancestries. The extent to which array genotyping and imputation can approximate WGS therefore depends on reference panel, genotype array, sample ancestry, and genomic location. Imputation quality by variant or genomic region can be queried with our new tool, RsqBrowser, now deployed on the Michigan Imputation Server.

摘要

理解人类疾病和特征的遗传基础依赖于遗传变异的识别和准确基因分型。深度全基因组测序(WGS)是 SNP 和 indel 识别和基因分型的金标准技术,但对于大多数大型研究来说仍然非常昂贵。在这里,我们量化了在对美国非裔、西班牙裔/拉丁裔和欧洲血统个体以及芬兰血统个体(一个人口隔离群体)的研究中,通过基因分型阵列和基因型推断来近似 WGS 的程度。对于每个研究,我们使用 Illumina Core、OmniExpress、MEGA 和 Omni 2.5M 阵列上的遗传变异,并使用 1000G、HRC 和 TOPMed 推断参考面板进行基因型推断。使用 Omni 2.5M 阵列和 TOPMed 面板,≥90%的双等位基因单核苷酸变异(SNV)在非洲、西班牙裔/拉丁裔的次要等位基因频率(MAF)低至 0.14%、欧洲的 0.35%和芬兰的 0.85%时,推断质量良好(r>0.8)。在具有 >700k 变体的阵列中,基于 TOPMed 的推断质量之间几乎没有差异。三个美国研究中的个体水平推断质量差异很大。推断质量也在基因组区域之间存在差异,导致即使是常见(MAF>5%)变体在不同血统中也始终不能很好地推断。因此,基因分型阵列和推断可以近似 WGS 的程度取决于参考面板、基因型阵列、样本血统和基因组位置。可以使用我们的新工具 RsqBrowser 按变体或基因组区域查询推断质量,该工具现在已部署在密歇根推断服务器上。

相似文献

1
Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing.基于大型参考面板的基因分型和模拟深度全基因组测序的程度。
Am J Hum Genet. 2022 Sep 1;109(9):1653-1666. doi: 10.1016/j.ajhg.2022.07.012. Epub 2022 Aug 17.
2
Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.超过 10 万 NHLBI 转化医学精准医学(TOPMed)联盟全基因组序列的使用提高了混合非裔和西班牙裔/拉丁裔人群中罕见变异关联的推断质量和检测能力。
PLoS Genet. 2019 Dec 23;15(12):e1008500. doi: 10.1371/journal.pgen.1008500. eCollection 2019 Dec.
3
Genotype imputation performance of three reference panels using African ancestry individuals.三种参考面板在非洲血统个体中的基因型推断性能。
Hum Genet. 2018 Apr;137(4):281-292. doi: 10.1007/s00439-018-1881-4. Epub 2018 Apr 10.
4
Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations.GWAS 中的测序和插补:在不同人群中提高效能和基因组覆盖范围的经济有效的策略。
Genet Epidemiol. 2020 Sep;44(6):537-549. doi: 10.1002/gepi.22326. Epub 2020 Jun 9.
5
The power of TOPMed imputation for the discovery of Latino-enriched rare variants associated with type 2 diabetes.TOPMed 插补在发现与 2 型糖尿病相关的拉丁裔丰富罕见变异中的作用。
Diabetologia. 2023 Jul;66(7):1273-1288. doi: 10.1007/s00125-023-05912-9. Epub 2023 May 6.
6
High performance imputation of structural and single nucleotide variants using low-coverage whole genome sequencing.利用低覆盖度全基因组测序对结构变异和单核苷酸变异进行高性能插补
Genet Sel Evol. 2025 Mar 28;57(1):16. doi: 10.1186/s12711-025-00962-6.
7
Improving imputation quality in Samoans through the integration of population-specific sequences into existing reference panels.通过将特定人群序列整合到现有参考面板中来提高萨摩亚人的插补质量。
medRxiv. 2023 Oct 31:2023.10.31.23297835. doi: 10.1101/2023.10.31.23297835.
8
Rare variant genotype imputation with thousands of study-specific whole-genome sequences: implications for cost-effective study designs.利用数千个特定研究的全基因组序列进行罕见变异基因型填充:对具有成本效益的研究设计的影响。
Eur J Hum Genet. 2015 Jul;23(7):975-83. doi: 10.1038/ejhg.2014.216. Epub 2014 Oct 8.
9
Analyzing the Korean reference genome with meta-imputation increased the imputation accuracy and spectrum of rare variants in the Korean population.使用元填充分析韩国参考基因组可提高韩国人群中罕见变异的填充准确性和范围。
Front Genet. 2022 Nov 24;13:1008646. doi: 10.3389/fgene.2022.1008646. eCollection 2022.
10
Improving power of association tests using multiple sets of imputed genotypes from distributed reference panels.利用来自分布式参考面板的多组推算基因型提高关联检验效能。
Genet Epidemiol. 2017 Dec;41(8):744-755. doi: 10.1002/gepi.22067. Epub 2017 Sep 1.

引用本文的文献

1
Multi-omics integration reveals CYTL1 and H6PD as key regulators of tumor metabolism in mesothelioma.多组学整合揭示CYTL1和H6PD是间皮瘤肿瘤代谢的关键调节因子。
Genes Genomics. 2025 Aug 25. doi: 10.1007/s13258-025-01667-2.
2
Application of multigene panel testing for bleeding, thrombotic, and platelet disorders in patients and the general population in China.多基因检测在中国患者及普通人群出血、血栓形成和血小板疾病中的应用。
Mol Biomed. 2025 Jun 9;6(1):39. doi: 10.1186/s43556-025-00283-6.
3
The PRIMED Consortium: Reducing disparities in polygenic risk assessment.PRIMED联盟:减少多基因风险评估中的差异。
Am J Hum Genet. 2024 Dec 5;111(12):2594-2606. doi: 10.1016/j.ajhg.2024.10.010. Epub 2024 Nov 18.
4
Variants in the β-globin locus are associated with pneumonia in African American children.β-珠蛋白基因座的变异与非裔美国儿童的肺炎有关。
HGG Adv. 2025 Jan 9;6(1):100374. doi: 10.1016/j.xhgg.2024.100374. Epub 2024 Oct 22.
5
Rare variant contribution to the heritability of coronary artery disease.罕见变异对冠心病遗传力的贡献。
Nat Commun. 2024 Oct 9;15(1):8741. doi: 10.1038/s41467-024-52939-6.
6
Yield of genetic association signals from genomes, exomes and imputation in the UK Biobank.英国生物库中基因组、外显子组和导入数据的遗传关联信号的产生。
Nat Genet. 2024 Nov;56(11):2345-2351. doi: 10.1038/s41588-024-01930-4. Epub 2024 Sep 25.
7
Schizophrenia genomics: genetic complexity and functional insights.精神分裂症基因组学:遗传复杂性与功能见解。
Nat Rev Neurosci. 2024 Sep;25(9):611-624. doi: 10.1038/s41583-024-00837-7. Epub 2024 Jul 19.
8
Genome-wide meta-analysis identifies ancestry-specific loci for Alzheimer's disease.全基因组荟萃分析确定了阿尔茨海默病的特定种族遗传位点。
Alzheimers Dement. 2024 Sep;20(9):6243-6256. doi: 10.1002/alz.14121. Epub 2024 Jul 18.
9
Whole genome sequencing based analysis of inflammation biomarkers in the Trans-Omics for Precision Medicine (TOPMed) consortium.基于全基因组测序的精准医学转化研究联盟(TOPMed)炎症生物标志物分析。
Hum Mol Genet. 2024 Aug 6;33(16):1429-1441. doi: 10.1093/hmg/ddae050.
10
The predictive capacity of polygenic risk scores for disease risk is only moderately influenced by imputation panels tailored to the target population.多基因风险评分对疾病风险的预测能力仅受到针对目标人群定制的 imputation 面板的适度影响。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae036.

本文引用的文献

1
Results of genetic analysis of 11 341 participants enrolled in the My Life, Our Future hemophilia genotyping initiative in the United States.美国“我的生活,我们的未来”血友病基因分型计划中 11341 名参与者的基因分析结果。
J Thromb Haemost. 2022 Sep;20(9):2022-2034. doi: 10.1111/jth.15805. Epub 2022 Jul 17.
2
Leveraging TOPMed imputation server and constructing a cohort-specific imputation reference panel to enhance genotype imputation among cystic fibrosis patients.利用TOPMed插补服务器并构建特定队列的插补参考面板,以提高囊性纤维化患者的基因型插补。
HGG Adv. 2022 Jan 11;3(2):100090. doi: 10.1016/j.xhgg.2022.100090. eCollection 2022 Apr 14.
3
Whole-genome association analyses of sleep-disordered breathing phenotypes in the NHLBI TOPMed program.全基因组关联分析 NHLBI TOPMed 计划中睡眠呼吸紊乱表型。
Genome Med. 2021 Aug 26;13(1):136. doi: 10.1186/s13073-021-00917-8.
4
A comparison of genotyping arrays.基因分型芯片比较。
Eur J Hum Genet. 2021 Nov;29(11):1611-1624. doi: 10.1038/s41431-021-00917-7. Epub 2021 Jun 18.
5
Mitochondrial genome copy number measured by DNA sequencing in human blood is strongly associated with metabolic traits via cell-type composition differences.通过 DNA 测序测量的人血液中线粒体基因组拷贝数通过细胞类型组成差异与代谢特征强烈相关。
Hum Genomics. 2021 Jun 7;15(1):34. doi: 10.1186/s40246-021-00335-2.
6
Toward a fine-scale population health monitoring system.迈向精细化的人口健康监测系统。
Cell. 2021 Apr 15;184(8):2068-2083.e11. doi: 10.1016/j.cell.2021.03.034.
7
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.美国国立卫生研究院生物医学高级研究与发展局(NHLBI)TOPMed 项目中对 53831 个不同基因组进行测序。
Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.
8
Exome sequencing and characterization of 49,960 individuals in the UK Biobank.英国生物银行中 49960 人的外显子组测序和特征分析。
Nature. 2020 Oct;586(7831):749-756. doi: 10.1038/s41586-020-2853-0. Epub 2020 Oct 21.
9
Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.超过 10 万 NHLBI 转化医学精准医学(TOPMed)联盟全基因组序列的使用提高了混合非裔和西班牙裔/拉丁裔人群中罕见变异关联的推断质量和检测能力。
PLoS Genet. 2019 Dec 23;15(12):e1008500. doi: 10.1371/journal.pgen.1008500. eCollection 2019 Dec.
10
Contributions of common genetic variants to risk of schizophrenia among individuals of African and Latino ancestry.常见基因变异对非洲和拉丁裔血统个体患精神分裂症风险的影响。
Mol Psychiatry. 2020 Oct;25(10):2455-2467. doi: 10.1038/s41380-019-0517-y. Epub 2019 Oct 7.