• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

低覆盖度测序能有效地检测到代表性不足的人群中的已知和新的变异。

Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations.

机构信息

Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA.

Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA.

出版信息

Am J Hum Genet. 2021 Apr 1;108(4):656-668. doi: 10.1016/j.ajhg.2021.03.012. Epub 2021 Mar 25.

DOI:10.1016/j.ajhg.2021.03.012
PMID:33770507
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8059370/
Abstract

Genetic studies in underrepresented populations identify disproportionate numbers of novel associations. However, most genetic studies use genotyping arrays and sequenced reference panels that best capture variation most common in European ancestry populations. To compare data generation strategies best suited for underrepresented populations, we sequenced the whole genomes of 91 individuals to high coverage as part of the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study with participants from Ethiopia, Kenya, South Africa, and Uganda. We used a downsampling approach to evaluate the quality of two cost-effective data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole-genome sequencing data. We show that low-coverage sequencing at a depth of ≥4× captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1×) performed comparably to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation; 4× sequencing detects 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, effectively identify novel variation particularly in underrepresented populations, and present opportunities to enhance variant discovery at a cost similar to traditional approaches.

摘要

遗传研究在代表性不足的人群中确定了不成比例的新型关联。然而,大多数遗传研究使用的是基因分型阵列和测序参考面板,这些方法最能捕捉到欧洲血统人群中最常见的变异。为了比较最适合代表性不足人群的数据生成策略,我们作为神经精神遗传学非洲人群-精神病学(NeuroGAP-精神病学)研究的一部分,对来自埃塞俄比亚、肯尼亚、南非和乌干达的 91 名个体进行了全基因组测序,覆盖度达到了很高的水平。我们使用了一种抽样方法,通过计算这些技术与深度全基因组测序数据中推断出的变体的一致性,来评估两种具有成本效益的数据生成策略,即全基因组关联研究阵列与低覆盖度测序的质量。我们表明,深度为≥4×的低覆盖度测序比我们研究的所有常用全基因组关联研究阵列更准确地捕获了所有频率的变体,而且成本相当。测序深度较低(0.5-1×)与常用的低密度全基因组关联研究阵列性能相当。低覆盖度测序对新型变异也很敏感;4×测序可检测到在高覆盖度非洲全基因组中发现的 45%的单倍体和 95%的常见变异。低覆盖度测序方法克服了常见基因分型阵列确定带来的问题,有效地识别了新型变异,特别是在代表性不足的人群中,并提供了以类似于传统方法的成本增强变异发现的机会。

相似文献

1
Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations.低覆盖度测序能有效地检测到代表性不足的人群中的已知和新的变异。
Am J Hum Genet. 2021 Apr 1;108(4):656-668. doi: 10.1016/j.ajhg.2021.03.012. Epub 2021 Mar 25.
2
Low coverage whole genome sequencing enables accurate assessment of common variants and calculation of genome-wide polygenic scores.低覆盖度全基因组测序可实现常见变异的精确评估和全基因组多基因评分的计算。
Genome Med. 2019 Nov 26;11(1):74. doi: 10.1186/s13073-019-0682-2.
3
Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations.GWAS 中的测序和插补:在不同人群中提高效能和基因组覆盖范围的经济有效的策略。
Genet Epidemiol. 2020 Sep;44(6):537-549. doi: 10.1002/gepi.22326. Epub 2020 Jun 9.
4
Comparing low-pass sequencing and genotyping for trait mapping in pharmacogenetics.比较低通量测序和基因分型用于药物遗传学中的性状定位
BMC Genomics. 2021 Mar 20;22(1):197. doi: 10.1186/s12864-021-07508-2.
5
Neuropsychiatric Genetics of African Populations-Psychosis (NeuroGAP-Psychosis): a case-control study protocol and GWAS in Ethiopia, Kenya, South Africa and Uganda.非裔人群神经精神遗传学-精神病(NeuroGAP-精神病):在埃塞俄比亚、肯尼亚、南非和乌干达开展的病例对照研究方案和全基因组关联研究。
BMJ Open. 2019 Feb 19;9(2):e025469. doi: 10.1136/bmjopen-2018-025469.
6
Imputation-based assessment of next generation rare exome variant arrays.基于插补法的新一代罕见外显子变异阵列评估
Pac Symp Biocomput. 2014:241-52.
7
Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations.中程全基因组测序能够实现对不同人群的生物医学遗传学研究。
BMC Genomics. 2021 Nov 1;22(1):666. doi: 10.1186/s12864-021-07949-9.
8
Ultra Low-Coverage Whole-Genome Sequencing as an Alternative to Genotyping Arrays in Genome-Wide Association Studies.超低覆盖度全基因组测序作为全基因组关联研究中基因分型阵列的替代方法
Front Genet. 2022 Feb 15;12:790445. doi: 10.3389/fgene.2021.790445. eCollection 2021.
9
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
10
Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.超过 10 万 NHLBI 转化医学精准医学(TOPMed)联盟全基因组序列的使用提高了混合非裔和西班牙裔/拉丁裔人群中罕见变异关联的推断质量和检测能力。
PLoS Genet. 2019 Dec 23;15(12):e1008500. doi: 10.1371/journal.pgen.1008500. eCollection 2019 Dec.

引用本文的文献

1
Benchmarking of low coverage sequencing workflows for precision genotyping in eggplant.茄子中用于精准基因分型的低覆盖度测序工作流程的基准测试
BMC Plant Biol. 2025 Aug 25;25(1):1125. doi: 10.1186/s12870-025-07242-x.
2
Genotype imputation from low-coverage data for medical and population genetic analyses.基于低覆盖度数据的基因型推断用于医学和群体遗传学分析。
Genome Res. 2025 Sep 2;35(9):1929-1941. doi: 10.1101/gr.280175.124.
3
MetaGLIMPSE: Meta Imputation of Low Coverage Sequencing Data for Modern and Ancient Genomes.MetaGLIMPSE:现代和古代基因组低覆盖度测序数据的元填补
bioRxiv. 2025 Jun 24:2025.06.24.660721. doi: 10.1101/2025.06.24.660721.
4
A map of blood regulatory variation in South Africans enables GWAS interpretation.一张南非人血液调节变异图谱有助于全基因组关联研究(GWAS)的解读。
Nat Genet. 2025 Jun 11. doi: 10.1038/s41588-025-02223-0.
5
Body fluid biomarkers and psychosis risk in The Accelerating Medicines Partnership® Schizophrenia Program: design considerations.加速药物合作组织精神分裂症项目中的体液生物标志物与精神病风险:设计考量
Schizophrenia (Heidelb). 2025 May 21;11(1):78. doi: 10.1038/s41537-025-00610-4.
6
Calculating maternal polygenic risk scores from prenatal screening by cell-free DNA data.通过游离DNA数据进行产前筛查来计算母体多基因风险评分。
Front Genet. 2025 Feb 20;16:1495604. doi: 10.3389/fgene.2025.1495604. eCollection 2025.
7
New crops on the block: effective strategies to broaden our food, fibre, and fuel repertoire in the face of increasingly volatile agricultural systems.新作物登场:面对日益不稳定的农业系统,拓宽我们的食物、纤维和燃料种类的有效策略。
J Exp Bot. 2025 May 27;76(8):2043-2063. doi: 10.1093/jxb/eraf023.
8
DPImpute: A Genotype Imputation Framework for Ultra-Low Coverage Whole-Genome Sequencing and its Application in Genomic Selection.DPImpute:一种用于超低覆盖度全基因组测序的基因型填充框架及其在基因组选择中的应用。
Adv Sci (Weinh). 2025 Apr;12(16):e2412482. doi: 10.1002/advs.202412482. Epub 2025 Feb 27.
9
Genomic Landscape and Prediction of Udder Traits in Saanen Dairy Goats.萨能奶山羊乳房性状的基因组格局与预测
Animals (Basel). 2025 Jan 17;15(2):261. doi: 10.3390/ani15020261.
10
Modeling Biases from Low-Pass Genome Sequencing to Enable Accurate Population Genetic Inferences.对低通量基因组测序中的偏差进行建模以实现准确的群体遗传推断。
Mol Biol Evol. 2025 Jan 6;42(1). doi: 10.1093/molbev/msaf002.

本文引用的文献

1
Rare coding variants in ten genes confer substantial risk for schizophrenia.十个基因中的罕见编码变异赋予精神分裂症的显著风险。
Nature. 2022 Apr;604(7906):509-516. doi: 10.1038/s41586-022-04556-w. Epub 2022 Apr 8.
2
Comparing low-pass sequencing and genotyping for trait mapping in pharmacogenetics.比较低通量测序和基因分型用于药物遗传学中的性状定位
BMC Genomics. 2021 Mar 20;22(1):197. doi: 10.1186/s12864-021-07508-2.
3
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.美国国立卫生研究院生物医学高级研究与发展局(NHLBI)TOPMed 项目中对 53831 个不同基因组进行测序。
Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.
4
Efficient phasing and imputation of low-coverage sequencing data using large reference panels.利用大型参考面板实现低覆盖度测序数据的高效相位推断和插补。
Nat Genet. 2021 Jan;53(1):120-126. doi: 10.1038/s41588-020-00756-0. Epub 2021 Jan 7.
5
High-depth African genomes inform human migration and health.高深度的非洲基因组信息揭示了人类的迁徙和健康状况。
Nature. 2020 Oct;586(7831):741-748. doi: 10.1038/s41586-020-2859-7. Epub 2020 Oct 28.
6
Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery.牛种特异性增强参考图谱有助于准确的序列读取映射和无偏的变异发现。
Genome Biol. 2020 Jul 27;21(1):184. doi: 10.1186/s13059-020-02105-0.
7
The mutational constraint spectrum quantified from variation in 141,456 humans.从 141456 名人类个体的变异中量化的突变约束谱。
Nature. 2020 May;581(7809):434-443. doi: 10.1038/s41586-020-2308-7. Epub 2020 May 27.
8
Quality of whole genome sequencing from blood versus saliva derived DNA in cardiac patients.心脏病患者血液与唾液来源DNA的全基因组测序质量
BMC Med Genomics. 2020 Jan 29;13(1):11. doi: 10.1186/s12920-020-0664-7.
9
Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.超过 10 万 NHLBI 转化医学精准医学(TOPMed)联盟全基因组序列的使用提高了混合非裔和西班牙裔/拉丁裔人群中罕见变异关联的推断质量和检测能力。
PLoS Genet. 2019 Dec 23;15(12):e1008500. doi: 10.1371/journal.pgen.1008500. eCollection 2019 Dec.
10
Low coverage whole genome sequencing enables accurate assessment of common variants and calculation of genome-wide polygenic scores.低覆盖度全基因组测序可实现常见变异的精确评估和全基因组多基因评分的计算。
Genome Med. 2019 Nov 26;11(1):74. doi: 10.1186/s13073-019-0682-2.