• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用连锁不平衡在隔离人群中进行全基因组低深度测序和变异推断。

Low-pass genome-wide sequencing and variant inference using identity-by-descent in an isolated human population.

机构信息

Department of Computer Science, Columbia University,New York, New York 10027, USA.

出版信息

Genetics. 2012 Feb;190(2):679-89. doi: 10.1534/genetics.111.134874. Epub 2011 Nov 30.

DOI:10.1534/genetics.111.134874
PMID:22135348
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3276614/
Abstract

Whole-genome sequencing in an isolated population with few founders directly ascertains variants from the population bottleneck that may be rare elsewhere. In such populations, shared haplotypes allow imputation of variants in unsequenced samples without resorting to complex statistical methods as in studies of outbred cohorts. We focus on an isolated population cohort from the Pacific Island of Kosrae, Micronesia, where we previously collected SNP array and rich phenotype data for the majority of the population. We report identification of long regions with haplotypes co-inherited between pairs of individuals and methodology to leverage such shared genetic content for imputation. Our estimates show that sequencing as few as 40 personal genomes allows for inference in up to 60% of the 3000-person cohort at the average locus. We ascertained a pilot data set of whole-genome sequences from seven Kosraean individuals, with average 5× coverage. This assay identified 5,735,306 unique sites of which 1,212,831 were previously unknown. Additionally, these variants are unusually enriched for alleles that are rare in other populations when compared to geographic neighbors (published Korean genome SJK). We used the presence of shared haplotypes between the seven Kosraen individuals to estimate expected imputation accuracy of known and novel homozygous variants at 99.6% and 97.3%, respectively. This study presents whole-genome analysis of a homogenous isolate population with emphasis on optimal rare variant inference.

摘要

在一个仅有少数创始者的隔离人群中进行全基因组测序,可以直接确定人群瓶颈期的变异,而这些变异在其他地方可能很少见。在这样的人群中,共享的单倍型允许在没有像在杂交群体研究中那样使用复杂统计方法的情况下,对未测序样本中的变异进行推断。我们专注于来自密克罗尼西亚太平洋岛屿科斯雷的一个隔离人群队列,我们之前已经为该人群的大多数收集了 SNP 芯片和丰富的表型数据。我们报告了鉴定个体之间共同遗传的长单倍型区域的方法,并利用这种共享的遗传内容进行推断。我们的估计表明,测序少至 40 个人的基因组就可以在平均位点上推断出多达 60%的 3000 人队列中的个体。我们从七个科斯雷人个体中确定了一个全基因组序列的试点数据集,平均覆盖 5×。该检测鉴定了 5735306 个独特的位点,其中 1212831 个是以前未知的。此外,与地理邻居(已发表的韩国基因组 SJK)相比,这些变体异常丰富了其他人群中罕见的等位基因。我们利用七个科斯雷个体之间的共享单倍型来估计已知和新的纯合变体的预期推断准确性,分别为 99.6%和 97.3%。这项研究对一个同质隔离人群进行了全基因组分析,重点是优化稀有变异推断。

相似文献

1
Low-pass genome-wide sequencing and variant inference using identity-by-descent in an isolated human population.利用连锁不平衡在隔离人群中进行全基因组低深度测序和变异推断。
Genetics. 2012 Feb;190(2):679-89. doi: 10.1534/genetics.111.134874. Epub 2011 Nov 30.
2
Genome-wide association studies in an isolated founder population from the Pacific Island of Kosrae.对来自太平洋科斯雷岛的一个隔离奠基人群体进行全基因组关联研究。
PLoS Genet. 2009 Feb;5(2):e1000365. doi: 10.1371/journal.pgen.1000365. Epub 2009 Feb 6.
3
Increased power of mixed models facilitates association mapping of 10 loci for metabolic traits in an isolated population.在一个隔离群体中,增加混合模型的功效有助于对 10 个代谢性状的关联进行定位。
Hum Mol Genet. 2011 Feb 15;20(4):827-39. doi: 10.1093/hmg/ddq510. Epub 2010 Nov 30.
4
Evaluating potential for whole-genome studies in Kosrae, an isolated population in Micronesia.评估密克罗尼西亚偏远岛屿科斯雷岛进行全基因组研究的潜力。
Nat Genet. 2006 Feb;38(2):214-7. doi: 10.1038/ng1712. Epub 2006 Jan 22.
5
Systematic haplotype analysis resolves a complex plasma plant sterol locus on the Micronesian Island of Kosrae.系统单倍型分析解析了密克罗尼西亚科斯雷岛上一个复杂的血浆植物甾醇基因座。
Proc Natl Acad Sci U S A. 2009 Aug 18;106(33):13886-91. doi: 10.1073/pnas.0907336106. Epub 2009 Aug 10.
6
DASH: a method for identical-by-descent haplotype mapping uncovers association with recent variation.DASH:一种基于相同遗传单倍型的关联分析方法,用于发现与近期变异的关联。
Am J Hum Genet. 2011 Jun 10;88(6):706-717. doi: 10.1016/j.ajhg.2011.04.023. Epub 2011 May 27.
7
Interpretation of association signals and identification of causal variants from genome-wide association studies.全基因组关联研究中关联信号的解释和因果变异的识别。
Am J Hum Genet. 2010 May 14;86(5):730-42. doi: 10.1016/j.ajhg.2010.04.003. Epub 2010 Apr 29.
8
Fast imputation using medium or low-coverage sequence data.使用中等或低覆盖率序列数据进行快速插补。
BMC Genet. 2015 Jul 14;16:82. doi: 10.1186/s12863-015-0243-7.
9
Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.整合序列和阵列数据以创建改进的千人基因组计划单倍型参考面板。
Nat Commun. 2014 Jun 13;5:3934. doi: 10.1038/ncomms4934.
10
Comparison among three variant callers and assessment of the accuracy of imputation from SNP array data to whole-genome sequence level in chicken.鸡中三种变异检测工具的比较以及从SNP芯片数据到全基因组序列水平的填充准确性评估。
BMC Genomics. 2015 Oct 21;16:824. doi: 10.1186/s12864-015-2059-2.

引用本文的文献

1
Current Developments in Detection of Identity-by-Descent Methods and Applications.同源性检测方法的当前发展与应用
Front Genet. 2021 Sep 10;12:722602. doi: 10.3389/fgene.2021.722602. eCollection 2021.
2
Characterization of a haplotype-reference panel for genotyping by low-pass sequencing in Swiss Large White pigs.用于瑞士大白猪低通量测序基因分型的单倍型参考面板的特征分析
BMC Genomics. 2021 Apr 21;22(1):290. doi: 10.1186/s12864-021-07610-5.
3
Identity-by-descent detection across 487,409 British samples reveals fine scale population structure and ultra-rare variant associations.对 487409 个英国样本的亲缘关系检测揭示了精细的人口结构和超罕见变异关联。
Nat Commun. 2020 Nov 30;11(1):6130. doi: 10.1038/s41467-020-19588-x.
4
Evolutionary history of modern Samoans.现代萨摩亚人的进化史。
Proc Natl Acad Sci U S A. 2020 Apr 28;117(17):9458-9465. doi: 10.1073/pnas.1913157117. Epub 2020 Apr 14.
5
Evaluation of sequencing strategies for whole-genome imputation with hybrid peeling.杂交剥脱全基因组基因分型策略评估。
Genet Sel Evol. 2020 Apr 6;52(1):18. doi: 10.1186/s12711-020-00537-7.
6
Assessing genomic diversity and signatures of selection in Original Braunvieh cattle using whole-genome sequencing data.利用全基因组测序数据评估原布劳恩维勒牛的基因组多样性和选择特征。
BMC Genomics. 2020 Jan 8;21(1):27. doi: 10.1186/s12864-020-6446-y.
7
Increased ultra-rare variant load in an isolated Scottish population impacts exonic and regulatory regions.在一个孤立的苏格兰人群中,超罕见变异负荷的增加影响外显子和调控区域。
PLoS Genet. 2019 Nov 25;15(11):e1008480. doi: 10.1371/journal.pgen.1008480. eCollection 2019 Nov.
8
Kinpute: using identity by descent to improve genotype imputation.金普亭:利用血缘关系提高基因型推断。
Bioinformatics. 2019 Nov 1;35(21):4321-4326. doi: 10.1093/bioinformatics/btz221.
9
A method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals.一种通过靶向单倍型而非个体来分配低覆盖度测序资源的方法。
Genet Sel Evol. 2017 Oct 25;49(1):78. doi: 10.1186/s12711-017-0353-y.
10
A method for the allocation of sequencing resources in genotyped livestock populations.一种在基因分型家畜群体中分配测序资源的方法。
Genet Sel Evol. 2017 May 18;49(1):47. doi: 10.1186/s12711-017-0322-5.

本文引用的文献

1
Low-coverage sequencing: implications for design of complex trait association studies.低覆盖度测序:对复杂性状关联研究设计的影响。
Genome Res. 2011 Jun;21(6):940-51. doi: 10.1101/gr.117259.110. Epub 2011 Apr 1.
2
Increased power of mixed models facilitates association mapping of 10 loci for metabolic traits in an isolated population.在一个隔离群体中,增加混合模型的功效有助于对 10 个代谢性状的关联进行定位。
Hum Mol Genet. 2011 Feb 15;20(4):827-39. doi: 10.1093/hmg/ddq510. Epub 2010 Nov 30.
3
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
4
Dindel: accurate indel calls from short-read data.Dindel:从短读数据中进行精确的插入缺失突变(Indel)调用。
Genome Res. 2011 Jun;21(6):961-73. doi: 10.1101/gr.112326.110. Epub 2010 Oct 27.
5
Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing.利用大规模平行测序对日本人进行全基因组测序和全面变异分析。
Nat Genet. 2010 Nov;42(11):931-6. doi: 10.1038/ng.691. Epub 2010 Oct 24.
6
Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants.200 个人类外显子组重测序发现低频非同义编码变异过度。
Nat Genet. 2010 Nov;42(11):969-72. doi: 10.1038/ng.680. Epub 2010 Oct 3.
7
Reconciling the analysis of IBD and IBS in complex trait studies.在复杂性状研究中协调 IBD 和 IBS 的分析。
Nat Rev Genet. 2010 Nov;11(11):800-5. doi: 10.1038/nrg2865. Epub 2010 Sep 28.
8
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
9
Genotype imputation for genome-wide association studies.全基因组关联研究中的基因型推断。
Nat Rev Genet. 2010 Jul;11(7):499-511. doi: 10.1038/nrg2796.
10
High-resolution detection of identity by descent in unrelated individuals.高分辨率检测无关个体间的血缘关系。
Am J Hum Genet. 2010 Apr 9;86(4):526-39. doi: 10.1016/j.ajhg.2010.02.021. Epub 2010 Mar 18.