• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

精确、可扩展且综合的单倍型估计。

Accurate, scalable and integrative haplotype estimation.

机构信息

Department of Computational Biology, University of Lausanne, Génopode, 1015, Lausanne, Switzerland.

Swiss Institute of Bioinformatics (SIB), University of Lausanne, Quartier Sorge - Batiment Amphipole, 1015, Lausanne, Switzerland.

出版信息

Nat Commun. 2019 Nov 28;10(1):5436. doi: 10.1038/s41467-019-13225-y.

DOI:10.1038/s41467-019-13225-y
PMID:31780650
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6882857/
Abstract

The number of human genomes being genotyped or sequenced increases exponentially and efficient haplotype estimation methods able to handle this amount of data are now required. Here we present a method, SHAPEIT4, which substantially improves upon other methods to process large genotype and high coverage sequencing datasets. It notably exhibits sub-linear running times with sample size, provides highly accurate haplotypes and allows integrating external phasing information such as large reference panels of haplotypes, collections of pre-phased variants and long sequencing reads. We provide SHAPEIT4 in an open source format and demonstrate its performance in terms of accuracy and running times on two gold standard datasets: the UK Biobank data and the Genome In A Bottle.

摘要

人类基因组的基因分型或测序数量呈指数级增长,现在需要能够处理这种数量数据的高效单倍型估计方法。在这里,我们提出了一种方法 SHAPEIT4,它极大地改进了其他方法来处理大型基因型和高覆盖测序数据集。它显著表现出与样本大小呈次线性的运行时间,提供高度准确的单倍型,并允许整合外部相位信息,如大型参考单倍型面板、预定相变体集合和长测序reads。我们以开源格式提供 SHAPEIT4,并在两个黄金标准数据集(英国生物银行数据和基因组瓶)上展示其在准确性和运行时间方面的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/7c809bfec9fd/41467_2019_13225_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/8c0aae425b19/41467_2019_13225_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/fd2aaf6a9d0e/41467_2019_13225_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/7f0b39ab363f/41467_2019_13225_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/94784bc7a39c/41467_2019_13225_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/7c809bfec9fd/41467_2019_13225_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/8c0aae425b19/41467_2019_13225_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/fd2aaf6a9d0e/41467_2019_13225_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/7f0b39ab363f/41467_2019_13225_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/94784bc7a39c/41467_2019_13225_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac6/6882857/7c809bfec9fd/41467_2019_13225_Fig5_HTML.jpg

相似文献

1
Accurate, scalable and integrative haplotype estimation.精确、可扩展且综合的单倍型估计。
Nat Commun. 2019 Nov 28;10(1):5436. doi: 10.1038/s41467-019-13225-y.
2
Haplotype estimation for biobank-scale data sets.生物样本库规模数据集的单倍型估计
Nat Genet. 2016 Jul;48(7):817-20. doi: 10.1038/ng.3583. Epub 2016 Jun 6.
3
Fast two-stage phasing of large-scale sequence data.大规模序列数据的快速两阶段相位测定。
Am J Hum Genet. 2021 Oct 7;108(10):1880-1890. doi: 10.1016/j.ajhg.2021.08.005. Epub 2021 Sep 2.
4
Integrating read-based and population-based phasing for dense and accurate haplotyping of individual genomes.基于读取和基于群体的相位整合,实现个体基因组的密集和精确单倍型分型。
Bioinformatics. 2019 Jul 15;35(14):i242-i248. doi: 10.1093/bioinformatics/btz329.
5
trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios.trioPhaser:利用孟德尔遗传逻辑提高三体型的基因组相位。
BMC Bioinformatics. 2021 Nov 22;22(1):559. doi: 10.1186/s12859-021-04470-4.
6
Statistical phasing of 150,119 sequenced genomes in the UK Biobank.英国生物库中 150119 个测序基因组的统计相位。
Am J Hum Genet. 2023 Jan 5;110(1):161-165. doi: 10.1016/j.ajhg.2022.11.008. Epub 2022 Nov 29.
7
Leveraging reads that span multiple single nucleotide polymorphisms for haplotype inference from sequencing data.利用跨越多个单核苷酸多态性的读取信息,从测序数据中推断单倍型。
Bioinformatics. 2013 Sep 15;29(18):2245-52. doi: 10.1093/bioinformatics/btt386. Epub 2013 Jul 3.
8
Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank.在英国生物样本库中对全基因组和外显子组测序数据进行准确的罕见变异相位分析。
Nat Genet. 2023 Jul;55(7):1243-1249. doi: 10.1038/s41588-023-01415-w. Epub 2023 Jun 29.
9
Benchmarking phasing software with a whole-genome sequenced cattle pedigree.利用全基因组测序的牛系谱对相位软件进行基准测试。
BMC Genomics. 2022 Feb 15;23(1):130. doi: 10.1186/s12864-022-08354-6.
10
HapCUT2: A Method for Phasing Genomes Using Experimental Sequence Data.HapCUT2:一种使用实验序列数据进行基因组相位分析的方法。
Methods Mol Biol. 2023;2590:139-147. doi: 10.1007/978-1-0716-2819-5_9.

引用本文的文献

1
Ancient genomes provide evidence of demographic shift to Slavic-associated groups in Moravia.古代基因组为摩拉维亚地区向与斯拉夫人相关群体的人口结构转变提供了证据。
Genome Biol. 2025 Sep 3;26(1):259. doi: 10.1186/s13059-025-03700-9.
2
TCR germline diversity reveals evidence of natural selection on variable and joining alpha chain genes.TCR种系多样性揭示了可变区和连接区α链基因存在自然选择的证据。
bioRxiv. 2025 Aug 24:2025.08.20.671277. doi: 10.1101/2025.08.20.671277.
3
Multi-ancestry Genome-wide Association Study of Inpatient Opioid Dosing Following Knee or Hip Arthroplasty.

本文引用的文献

1
The UK Biobank resource with deep phenotyping and genomic data.英国生物银行资源库,具有深度表型和基因组数据。
Nature. 2018 Oct;562(7726):203-209. doi: 10.1038/s41586-018-0579-z. Epub 2018 Oct 10.
2
A One-Penny Imputed Genome from Next-Generation Reference Panels.基于新一代参考面板的单分钱估算基因组。
Am J Hum Genet. 2018 Sep 6;103(3):338-348. doi: 10.1016/j.ajhg.2018.07.015. Epub 2018 Aug 9.
3
Nanopore sequencing and assembly of a human genome with ultra-long reads.纳米孔测序和超长读长组装人类基因组。
膝关节或髋关节置换术后住院患者阿片类药物剂量的多血统全基因组关联研究。
Res Sq. 2025 Aug 21:rs.3.rs-7330342. doi: 10.21203/rs.3.rs-7330342/v1.
4
Carrying APOL1 G1 allele is associated with cardiovascular complications during COVID-19 in an admixed population.在一个混合人群中,携带载脂蛋白L1(APOL1)G1等位基因与新型冠状病毒肺炎(COVID-19)期间的心血管并发症相关。
Hum Genomics. 2025 Aug 31;19(1):105. doi: 10.1186/s40246-025-00790-1.
5
Identification and replication of sex-dimorphic protein quantitative trait loci across multiple ancestries and their associations with diseases.跨多个祖先群体的性别二态性蛋白质数量性状位点的鉴定与验证及其与疾病的关联
Sci Rep. 2025 Aug 28;15(1):31721. doi: 10.1038/s41598-025-10031-z.
6
Molecular profiles of the great obstetrical syndromes reveal common features and dynamic changes in early pregnancy.重大产科综合征的分子特征揭示了早期妊娠的共同特征和动态变化。
Commun Med (Lond). 2025 Aug 25;5(1):369. doi: 10.1038/s43856-025-01103-2.
7
Beyond the genome: the role of functional markers in contemporary plant breeding.超越基因组:功能标记在当代植物育种中的作用
Front Plant Sci. 2025 Aug 5;16:1637299. doi: 10.3389/fpls.2025.1637299. eCollection 2025.
8
Telomere-to-telomere genome assembly uncovers Wolbachia-driven recurrent male bottleneck effect and selection in a sawfly.端粒到端粒的基因组组装揭示了叶蜂中沃尔巴克氏体驱动的反复出现的雄性瓶颈效应和选择。
Commun Biol. 2025 Aug 13;8(1):1211. doi: 10.1038/s42003-025-08629-0.
9
Multi-ancestry Genome-wide Association Study of Inpatient Opioid Dosing Following Knee or Hip Arthroplasty.膝关节或髋关节置换术后住院期间阿片类药物剂量的多血统全基因组关联研究。
medRxiv. 2025 Jul 24:2025.07.23.25331996. doi: 10.1101/2025.07.23.25331996.
10
Identification of gene-sun exposure interactions of GWAS-identified variants in perceived facial aging progression.在感知面部衰老进程中对全基因组关联研究(GWAS)鉴定出的变异进行基因-阳光暴露相互作用的鉴定。
Front Aging. 2025 Jul 23;6:1519799. doi: 10.3389/fragi.2025.1519799. eCollection 2025.
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.
4
Dense and accurate whole-chromosome haplotyping of individual genomes.个体基因组的密集且精确的全染色体单倍型分型。
Nat Commun. 2017 Nov 3;8(1):1293. doi: 10.1038/s41467-017-01389-4.
5
HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies.HapCUT2:适用于多种测序技术的强大且准确的单倍型组装工具。
Genome Res. 2017 May;27(5):801-812. doi: 10.1101/gr.213462.116. Epub 2016 Dec 9.
6
Reference-based phasing using the Haplotype Reference Consortium panel.使用单倍型参考联盟面板进行基于参考的定相
Nat Genet. 2016 Nov;48(11):1443-1448. doi: 10.1038/ng.3679. Epub 2016 Oct 3.
7
Next-generation genotype imputation service and methods.下一代基因型填充服务和方法。
Nat Genet. 2016 Oct;48(10):1284-1287. doi: 10.1038/ng.3656. Epub 2016 Aug 29.
8
A reference panel of 64,976 haplotypes for genotype imputation.用于基因型插补的64976个单倍型参考面板。
Nat Genet. 2016 Oct;48(10):1279-83. doi: 10.1038/ng.3643. Epub 2016 Aug 22.
9
Extensive sequencing of seven human genomes to characterize benchmark reference materials.对七个人类基因组进行广泛测序以表征基准参考材料。
Sci Data. 2016 Jun 7;3:160025. doi: 10.1038/sdata.2016.25.
10
Fast and accurate long-range phasing in a UK Biobank cohort.英国生物银行队列中的快速准确长程定相
Nat Genet. 2016 Jul;48(7):811-6. doi: 10.1038/ng.3571. Epub 2016 Jun 6.