• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于在全谱系亲缘关系中进行单倍型分型的通用方法。

A general approach for haplotype phasing across the full spectrum of relatedness.

作者信息

O'Connell Jared, Gurdasani Deepti, Delaneau Olivier, Pirastu Nicola, Ulivi Sheila, Cocca Massimiliano, Traglia Michela, Huang Jie, Huffman Jennifer E, Rudan Igor, McQuillan Ruth, Fraser Ross M, Campbell Harry, Polasek Ozren, Asiki Gershim, Ekoru Kenneth, Hayward Caroline, Wright Alan F, Vitart Veronique, Navarro Pau, Zagury Jean-Francois, Wilson James F, Toniolo Daniela, Gasparini Paolo, Soranzo Nicole, Sandhu Manjinder S, Marchini Jonathan

机构信息

Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom; Department of Statistics, University of Oxford, Oxford, United Kingdom.

Wellcome Trust Sanger Institute, Hinxton, United Kingdom; Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom.

出版信息

PLoS Genet. 2014 Apr 17;10(4):e1004234. doi: 10.1371/journal.pgen.1004234. eCollection 2014 Apr.

DOI:10.1371/journal.pgen.1004234
PMID:24743097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3990520/
Abstract

Many existing cohorts contain a range of relatedness between genotyped individuals, either by design or by chance. Haplotype estimation in such cohorts is a central step in many downstream analyses. Using genotypes from six cohorts from isolated populations and two cohorts from non-isolated populations, we have investigated the performance of different phasing methods designed for nominally 'unrelated' individuals. We find that SHAPEIT2 produces much lower switch error rates in all cohorts compared to other methods, including those designed specifically for isolated populations. In particular, when large amounts of IBD sharing is present, SHAPEIT2 infers close to perfect haplotypes. Based on these results we have developed a general strategy for phasing cohorts with any level of implicit or explicit relatedness between individuals. First SHAPEIT2 is run ignoring all explicit family information. We then apply a novel HMM method (duoHMM) to combine the SHAPEIT2 haplotypes with any family information to infer the inheritance pattern of each meiosis at all sites across each chromosome. This allows the correction of switch errors, detection of recombination events and genotyping errors. We show that the method detects numbers of recombination events that align very well with expectations based on genetic maps, and that it infers far fewer spurious recombination events than Merlin. The method can also detect genotyping errors and infer recombination events in otherwise uninformative families, such as trios and duos. The detected recombination events can be used in association scans for recombination phenotypes. The method provides a simple and unified approach to haplotype estimation, that will be of interest to researchers in the fields of human, animal and plant genetics.

摘要

许多现有的队列中,基因分型个体之间存在一定范围的亲缘关系,这或是出于设计,或是偶然因素。在此类队列中进行单倍型估计是许多下游分析的核心步骤。我们使用了来自隔离人群的六个队列以及来自非隔离人群的两个队列的基因型,研究了为名义上“无亲缘关系”个体设计的不同定相方法的性能。我们发现,与其他方法相比,包括专门为隔离人群设计的方法,SHAPEIT2在所有队列中产生的交换错误率要低得多。特别是当存在大量同源片段共享时,SHAPEIT2能推断出近乎完美的单倍型。基于这些结果,我们开发了一种通用策略,用于对个体间具有任何隐含或明确亲缘关系水平的队列进行定相。首先运行SHAPEIT2,忽略所有明确的家系信息。然后我们应用一种新颖的隐马尔可夫模型方法(duoHMM),将SHAPEIT2单倍型与任何家系信息相结合,以推断每条染色体上所有位点处每个减数分裂的遗传模式。这使得能够校正交换错误、检测重组事件和基因分型错误。我们表明,该方法检测到的重组事件数量与基于遗传图谱的预期非常吻合,并且它推断出的虚假重组事件比Merlin少得多。该方法还可以检测基因分型错误,并在其他信息不足的家系(如三联体和二元体)中推断重组事件。检测到的重组事件可用于重组表型的关联扫描。该方法为单倍型估计提供了一种简单统一的方法,将对人类、动物和植物遗传学领域的研究人员具有吸引力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/3f8ea0f355bd/pgen.1004234.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/8e730e92065d/pgen.1004234.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/5f5380616c97/pgen.1004234.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/90011115b8b6/pgen.1004234.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/10f05f8c609f/pgen.1004234.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/84a71d6fe19b/pgen.1004234.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/f98db4fc9314/pgen.1004234.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/3f8ea0f355bd/pgen.1004234.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/8e730e92065d/pgen.1004234.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/5f5380616c97/pgen.1004234.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/90011115b8b6/pgen.1004234.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/10f05f8c609f/pgen.1004234.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/84a71d6fe19b/pgen.1004234.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/f98db4fc9314/pgen.1004234.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aca6/3990520/3f8ea0f355bd/pgen.1004234.g007.jpg

相似文献

1
A general approach for haplotype phasing across the full spectrum of relatedness.一种用于在全谱系亲缘关系中进行单倍型分型的通用方法。
PLoS Genet. 2014 Apr 17;10(4):e1004234. doi: 10.1371/journal.pgen.1004234. eCollection 2014 Apr.
2
HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.HAPLORE:一个用于在无重组的一般家系中进行单倍型重建的程序。
Bioinformatics. 2005 Jan 1;21(1):90-103. doi: 10.1093/bioinformatics/bth388. Epub 2004 Jul 1.
3
Detection of recombination events, haplotype reconstruction and imputation of sires using half-sib SNP genotypes.利用半同胞 SNP 基因型检测重组事件、单倍型重建和估算父本。
Genet Sel Evol. 2014 Feb 4;46(1):11. doi: 10.1186/1297-9686-46-11.
4
Recombination locations and rates in beef cattle assessed from parent-offspring pairs.通过亲子对评估肉牛的重组位置和速率。
Genet Sel Evol. 2014 May 29;46(1):34. doi: 10.1186/1297-9686-46-34.
5
Efficient computation of minimum recombination with genotypes (not haplotypes).基于基因型(而非单倍型)的最小重组的高效计算。
J Bioinform Comput Biol. 2007 Apr;5(2a):181-200. doi: 10.1142/s0219720007002631.
6
Fast and Accurate Shared Segment Detection and Relatedness Estimation in Un-phased Genetic Data via TRUFFLE.通过 TRUFFLE 在非相位遗传数据中快速准确地检测共享片段和估计亲缘关系。
Am J Hum Genet. 2019 Jul 3;105(1):78-88. doi: 10.1016/j.ajhg.2019.05.007. Epub 2019 Jun 6.
7
Population-Specific Recombination Maps from Segments of Identity by Descent.基于血缘同一性片段的特定人群重组图谱。
Am J Hum Genet. 2020 Jul 2;107(1):137-148. doi: 10.1016/j.ajhg.2020.05.016. Epub 2020 Jun 12.
8
High-resolution population-specific recombination rates and their effect on phasing and genotype imputation.高分辨率人群特异性重组率及其对相位和基因型推断的影响。
Eur J Hum Genet. 2021 Apr;29(4):615-624. doi: 10.1038/s41431-020-00768-8. Epub 2020 Nov 28.
9
A comparison of different algorithms for phasing haplotypes using Holstein cattle genotypes and pedigree data.使用荷斯坦奶牛基因型和系谱数据对不同单倍型定相算法的比较。
J Dairy Sci. 2017 Apr;100(4):2837-2849. doi: 10.3168/jds.2016-11590. Epub 2017 Feb 1.
10
Multilocus linkage disequilibrium mapping by the decay of haplotype sharing with samples of related individuals.利用与相关个体样本的单倍型共享衰减进行多位点连锁不平衡作图。
Genet Epidemiol. 2005 Sep;29(2):128-40. doi: 10.1002/gepi.20081.

引用本文的文献

1
Genetic Consequences of Tree Planting Versus Natural Colonisation: Implications for Afforestation Programmes in the United Kingdom.植树造林与自然定居的遗传后果:对英国造林计划的启示
Evol Appl. 2025 Aug 27;18(8):e70146. doi: 10.1111/eva.70146. eCollection 2025 Aug.
2
DNA methylation of food sensitization in a French-Canadian population.法裔加拿大人群中食物致敏的DNA甲基化
Clin Epigenetics. 2025 Aug 17;17(1):143. doi: 10.1186/s13148-025-01951-8.
3
Wide-scale geographical analysis of genetic ancestry in the South African Coloured population.

本文引用的文献

1
Common and low-frequency variants associated with genome-wide recombination rate.常见且低频的变异与全基因组重组率相关。
Nat Genet. 2014 Jan;46(1):11-6. doi: 10.1038/ng.2833. Epub 2013 Nov 24.
2
The geography of recent genetic ancestry across Europe.欧洲近期遗传血统的地理分布。
PLoS Biol. 2013;11(5):e1001555. doi: 10.1371/journal.pbio.1001555. Epub 2013 May 7.
3
Mendel: the Swiss army knife of genetic analysis programs.门德尔:遗传分析程序的瑞士军刀。
南非混血人群遗传血统的大规模地理分析。
BMC Biol. 2025 Jul 22;23(1):219. doi: 10.1186/s12915-025-02317-5.
4
Incorporating local ancestry information to predict genetically associated DNA methylation in admixed populations.纳入本地祖先信息以预测混合人群中与基因相关的DNA甲基化。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf325.
5
An Analytic Pipeline to Obtain Reliable Genetic Ancestry Estimates from Tumor-Derived RNA Sequencing Data.一种从肿瘤来源的RNA测序数据中获得可靠遗传血统估计值的分析流程。
Cancer Epidemiol Biomarkers Prev. 2025 Sep 2;34(9):1593-1599. doi: 10.1158/1055-9965.EPI-25-0371.
6
Systematic mediation and interaction analyses of kidney function genetic loci in a general population study.一项普通人群研究中肾功能基因位点的系统中介和交互分析。
PLoS One. 2025 Jun 24;20(6):e0323057. doi: 10.1371/journal.pone.0323057. eCollection 2025.
7
Genomic-based genetic parameters and genome-wide association studies for productive and reproductive traits in Beef-on-Dairy crossbreds.基于基因组的奶牛与肉牛杂交后代生产性能和繁殖性状的遗传参数及全基因组关联研究
Front Genet. 2025 Jun 5;16:1530310. doi: 10.3389/fgene.2025.1530310. eCollection 2025.
8
[Identifying genetic etiology of ischemic stroke based on pleiotropy of obesity related genes: A sibling study].基于肥胖相关基因多效性识别缺血性卒中的遗传病因:一项同胞研究
Beijing Da Xue Xue Bao Yi Xue Ban. 2025 Jun 18;57(3):448-455. doi: 10.19723/j.issn.1671-167X.2025.03.007.
9
Inherited chromosomally integrated human herpesvirus 6: regional variation in prevalence, association with angina, and identification of ancestral viral lineages in two large UK studies.遗传性染色体整合人疱疹病毒6:两项大型英国研究中的患病率区域差异、与心绞痛的关联以及祖先病毒谱系的鉴定
J Virol. 2025 Jul 22;99(7):e0216024. doi: 10.1128/jvi.02160-24. Epub 2025 Jun 5.
10
Bayesian estimation of allele-specific expression in the presence of phasing uncertainty.存在定相不确定性时等位基因特异性表达的贝叶斯估计。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf283.
Bioinformatics. 2013 Jun 15;29(12):1568-70. doi: 10.1093/bioinformatics/btt187. Epub 2013 Apr 22.
4
The general population cohort in rural south-western Uganda: a platform for communicable and non-communicable disease studies.乌干达西南部农村地区的一般人群队列:传染病和非传染性疾病研究的平台。
Int J Epidemiol. 2013 Feb;42(1):129-41. doi: 10.1093/ije/dys234. Epub 2013 Jan 30.
5
Inference of identity by descent in population isolates and optimal sequencing studies.群体隔离和最佳测序研究中的血缘推断。
Eur J Hum Genet. 2013 Oct;21(10):1140-5. doi: 10.1038/ejhg.2012.307. Epub 2013 Jan 30.
6
Improved whole-chromosome phasing for disease and population genetic studies.用于疾病和群体遗传学研究的改进全染色体定相技术。
Nat Methods. 2013 Jan;10(1):5-6. doi: 10.1038/nmeth.2307.
7
Genetic characterization of northeastern Italian population isolates in the context of broader European genetic diversity.东北意大利人群分离株的遗传特征与更广泛的欧洲遗传多样性背景。
Eur J Hum Genet. 2013 Jun;21(6):659-65. doi: 10.1038/ejhg.2012.229. Epub 2012 Dec 19.
8
Length distributions of identity by descent reveal fine-scale demographic history.亲缘关系一致的长度分布揭示了精细的人口历史。
Am J Hum Genet. 2012 Nov 2;91(5):809-22. doi: 10.1016/j.ajhg.2012.08.030. Epub 2012 Oct 25.
9
Detecting identity by descent and homozygosity mapping in whole-exome sequencing data.通过全外显子组测序数据中的血缘关系和纯合性映射来检测身份。
PLoS One. 2012;7(10):e47618. doi: 10.1371/journal.pone.0047618. Epub 2012 Oct 11.
10
Phasing of many thousands of genotyped samples.对数千份基因分型样本进行分相。
Am J Hum Genet. 2012 Aug 10;91(2):238-51. doi: 10.1016/j.ajhg.2012.06.013.