Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.

Authors

Delaneau Olivier, Marchini Jonathan

Affiliations

Department of Statistics, University of Oxford, Oxford OX1 3TG, UK.

[1] Department of Statistics, University of Oxford, Oxford OX1 3TG, UK. [2] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK.

Publication information

Nat Commun. 2014 Jun 13;5:3934. doi: 10.1038/ncomms4934.

DOI:10.1038/ncomms4934
PMID:25653097
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4338501/
Abstract

A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.
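
The abstract evaluates the new haplotypes by genotype discordance against a set of validation genotypes. As a rough illustration only, the sketch below computes a simple discordance rate: the fraction of sites at which a called genotype differs from the validation genotype. The 0/1/2 alternate-allele-count coding, the use of `None` for missing calls, and the function name `genotype_discordance` are assumptions made for this example; the paper's exact definition (for instance, whether it is restricted to non-reference genotypes or stratified by allele frequency) is not given in the abstract.

```python
from typing import Optional, Sequence


def genotype_discordance(
    called: Sequence[Optional[int]],
    truth: Sequence[Optional[int]],
) -> float:
    """Return the fraction of comparable sites where the genotypes disagree.

    Genotypes are coded as alternate-allele counts (0, 1 or 2); None marks a
    missing genotype. This coding is an assumption made for illustration.
    """
    if len(called) != len(truth):
        raise ValueError("called and truth must cover the same sites")

    compared = 0
    mismatches = 0
    for c, t in zip(called, truth):
        # Skip sites where either genotype is missing.
        if c is None or t is None:
            continue
        compared += 1
        if c != t:
            mismatches += 1

    if compared == 0:
        raise ValueError("no sites with both genotypes present")
    return mismatches / compared


# Toy data: six sites, one of which has a missing call.
called = [0, 1, 2, 1, 0, None]
truth = [0, 1, 1, 1, 0, 2]
rate = genotype_discordance(called, truth)
print(f"discordance = {rate:.2f}")
```

In this toy input, five sites are comparable and one of them disagrees, so the rate is 1/5. A lower value for one haplotype set than another, measured against the same validation genotypes, is the kind of comparison the abstract refers to.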

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1888/4338501/a0aff1455726/emss-58243-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1888/4338501/5096422eec41/emss-58243-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1888/4338501/1096faade506/emss-58243-f0003.jpg
