• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在存在亲缘关系的情况下,对群体结构进行稳健推断,以进行血统预测和分层校正。

Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness.

作者信息

Conomos Matthew P, Miller Michael B, Thornton Timothy A

机构信息

Department of Biostatistics, University of Washington, Seattle, Washington, 98195, United States of America.

出版信息

Genet Epidemiol. 2015 May;39(4):276-93. doi: 10.1002/gepi.21896. Epub 2015 Mar 23.

DOI:10.1002/gepi.21896
PMID:25810074
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4836868/
Abstract

Population structure inference with genetic data has been motivated by a variety of applications in population genetics and genetic association studies. Several approaches have been proposed for the identification of genetic ancestry differences in samples where study participants are assumed to be unrelated, including principal components analysis (PCA), multidimensional scaling (MDS), and model-based methods for proportional ancestry estimation. Many genetic studies, however, include individuals with some degree of relatedness, and existing methods for inferring genetic ancestry fail in related samples. We present a method, PC-AiR, for robust population structure inference in the presence of known or cryptic relatedness. PC-AiR utilizes genome-screen data and an efficient algorithm to identify a diverse subset of unrelated individuals that is representative of all ancestries in the sample. The PC-AiR method directly performs PCA on the identified ancestry representative subset and then predicts components of variation for all remaining individuals based on genetic similarities. In simulation studies and in applications to real data from Phase III of the HapMap Project, we demonstrate that PC-AiR provides a substantial improvement over existing approaches for population structure inference in related samples. We also demonstrate significant efficiency gains, where a single axis of variation from PC-AiR provides better prediction of ancestry in a variety of structure settings than using 10 (or more) components of variation from widely used PCA and MDS approaches. Finally, we illustrate that PC-AiR can provide improved population stratification correction over existing methods in genetic association studies with population structure and relatedness.

摘要

利用遗传数据进行群体结构推断,是受群体遗传学和遗传关联研究中的各种应用所推动。对于假定研究参与者无亲缘关系的样本,已经提出了几种方法来识别遗传血统差异,包括主成分分析(PCA)、多维尺度分析(MDS)以及用于比例血统估计的基于模型的方法。然而,许多遗传研究包含有一定亲缘关系的个体,而现有的推断遗传血统的方法在相关样本中会失效。我们提出了一种方法,即PC - AiR,用于在存在已知或潜在亲缘关系的情况下进行稳健的群体结构推断。PC - AiR利用基因组筛选数据和一种高效算法,来识别一个无亲缘关系个体的多样化子集,该子集代表了样本中的所有血统。PC - AiR方法直接对所识别的血统代表性子集进行主成分分析,然后基于遗传相似性预测所有其余个体的变异成分。在模拟研究以及对国际人类基因组单体型图计划(HapMap)项目第三阶段真实数据的应用中,我们证明PC - AiR相对于现有方法,在相关样本的群体结构推断方面有显著改进。我们还证明了显著的效率提升,即在各种结构设置下,PC - AiR的单个变异轴比使用广泛使用的PCA和MDS方法的10个(或更多)变异成分,能更好地预测血统。最后,我们说明在存在群体结构和亲缘关系的遗传关联研究中,PC - AiR相对于现有方法可以提供更好的群体分层校正。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/cc5008b99d45/nihms771658f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/4fb8166951d8/nihms771658f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/25d2cf986028/nihms771658f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/fd6d5656bf36/nihms771658f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/cc5008b99d45/nihms771658f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/4fb8166951d8/nihms771658f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/25d2cf986028/nihms771658f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/fd6d5656bf36/nihms771658f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e49/4836868/cc5008b99d45/nihms771658f4.jpg

相似文献

1
Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness.在存在亲缘关系的情况下,对群体结构进行稳健推断,以进行血统预测和分层校正。
Genet Epidemiol. 2015 May;39(4):276-93. doi: 10.1002/gepi.21896. Epub 2015 Mar 23.
2
Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations.利用多个群体的等位基因频率从DNA序列数据中快速推断个体祖先。
BMC Bioinformatics. 2015 Jan 16;16:4. doi: 10.1186/s12859-014-0418-7.
3
Novel probabilistic models of spatial genetic ancestry with applications to stratification correction in genome-wide association studies.用于全基因组关联研究分层校正的空间遗传血统新型概率模型。
Bioinformatics. 2017 Mar 15;33(6):879-885. doi: 10.1093/bioinformatics/btw720.
4
Sparse principal component analysis for identifying ancestry-informative markers in genome-wide association studies.稀疏主成分分析在全基因组关联研究中识别与祖先相关的标记。
Genet Epidemiol. 2012 May;36(4):293-302. doi: 10.1002/gepi.21621. Epub 2012 Apr 16.
5
FastPop: a rapid principal component derived method to infer intercontinental ancestry using genetic data.FastPop:一种利用遗传数据推断洲际血统的快速主成分衍生方法。
BMC Bioinformatics. 2016 Mar 9;17:122. doi: 10.1186/s12859-016-0965-1.
6
Eigenanalysis of SNP data with an identity by descent interpretation.基于血缘一致性解释的单核苷酸多态性(SNP)数据特征分析。
Theor Popul Biol. 2016 Feb;107:65-76. doi: 10.1016/j.tpb.2015.09.004. Epub 2015 Oct 23.
7
Genetic Ancestry Inference for Pharmacogenomics.遗传祖源推断在药物基因组学中的应用。
Methods Mol Biol. 2022;2547:595-609. doi: 10.1007/978-1-0716-2573-6_21.
8
Model-free Estimation of Recent Genetic Relatedness.近期遗传相关性的无模型估计
Am J Hum Genet. 2016 Jan 7;98(1):127-48. doi: 10.1016/j.ajhg.2015.11.022.
9
GRAF-pop: A Fast Distance-Based Method To Infer Subject Ancestry from Multiple Genotype Datasets Without Principal Components Analysis.GRAF-pop:一种无需主成分分析即可基于距离推断个体祖先的快速方法,适用于多种基因型数据集。
G3 (Bethesda). 2019 Aug 8;9(8):2447-2461. doi: 10.1534/g3.118.200925.
10
Robust relationship inference in genome-wide association studies.全基因组关联研究中的稳健关系推断。
Bioinformatics. 2010 Nov 15;26(22):2867-73. doi: 10.1093/bioinformatics/btq559. Epub 2010 Oct 5.

引用本文的文献

1
Multi-ancestry meta-analysis identifies genetic modifiers of age-at-onset of Alzheimer's disease at known and novel loci.多血统荟萃分析在已知和新发现的基因座上鉴定出阿尔茨海默病发病年龄的基因修饰因子。
Alzheimers Dement. 2025 Sep;21(9):e70489. doi: 10.1002/alz.70489.
2
The impact of polygenic risk for alzheimer's disease on neurotransmitter-related grey matter atrophy in the alzheimer continuum.阿尔茨海默病多基因风险对阿尔茨海默病连续体中神经递质相关灰质萎缩的影响。
Neurol Sci. 2025 Aug 25. doi: 10.1007/s10072-025-08393-3.
3
Pharmacogenomics of steroid-induced ocular hypertension: relationship to high-tension glaucomas and new pathophysiologic insight.

本文引用的文献

1
Local and global ancestry inference and applications to genetic association analysis for admixed populations.混合人群的局部和整体祖先推断及其在基因关联分析中的应用。
Genet Epidemiol. 2014 Sep;38 Suppl 1(0 1):S5-S12. doi: 10.1002/gepi.21819.
2
The genetical structure of populations.种群的遗传结构。
Ann Eugen. 1951 Mar;15(4):323-54. doi: 10.1111/j.1469-1809.1949.tb02451.x.
3
Advantages and pitfalls in the application of mixed-model association methods.混合模型关联方法应用的优缺点。
类固醇性高眼压症的药物基因组学:与高眼压型青光眼的关系及新的病理生理学见解
medRxiv. 2025 Aug 13:2025.08.11.25333245. doi: 10.1101/2025.08.11.25333245.
4
Polygenic Scores of Core-1 Alzheimer's Disease Biomarkers Predict Early Cognitive and Pathological Change.核心1型阿尔茨海默病生物标志物的多基因评分预测早期认知和病理变化。
medRxiv. 2025 Jul 14:2025.07.12.25331438. doi: 10.1101/2025.07.12.25331438.
5
Evaluating polygenic risk score prediction performance for Alzheimer's disease in a population-based Hispanic cohort using single- and multi-ancestry models.在一个基于人群的西班牙裔队列中,使用单祖先和多祖先模型评估阿尔茨海默病的多基因风险评分预测性能。
Lancet Reg Health Am. 2025 Jul 25;49:101198. doi: 10.1016/j.lana.2025.101198. eCollection 2025 Sep.
6
Towards improved fine-mapping of candidate causal variants.迈向对候选因果变异更精细的定位。
Nat Rev Genet. 2025 Jul 28. doi: 10.1038/s41576-025-00869-4.
7
Genes linked to schistosome resistance identified in a genome-wide association study of African snail vectors.在一项针对非洲蜗牛载体的全基因组关联研究中鉴定出与血吸虫抗性相关的基因。
Nat Commun. 2025 Jul 27;16(1):6918. doi: 10.1038/s41467-025-61760-8.
8
Identity by descent and local ancestry mapping of HCV spontaneous clearance in populations of diverse ancestries.不同血统人群中丙型肝炎病毒自发清除的同源性和本地祖先图谱分析
BMC Genomics. 2025 Jul 12;26(1):661. doi: 10.1186/s12864-024-11076-6.
9
Gene-environment interactions contribute to blood pressure variation across global populations.基因与环境的相互作用导致了全球人群血压的差异。
medRxiv. 2025 Jul 3:2025.07.02.25330727. doi: 10.1101/2025.07.02.25330727.
10
Extending Genome-Wide Association Studies to admixed cohorts with high degrees of relatedness.将全基因组关联研究扩展至具有高度亲缘关系的混合队列。
medRxiv. 2025 Jun 9:2025.05.27.25328444. doi: 10.1101/2025.05.27.25328444.
Nat Genet. 2014 Feb;46(2):100-6. doi: 10.1038/ng.2876.
4
Reconstructing Native American migrations from whole-genome and whole-exome data.从全基因组和全外显子组数据重建美洲原住民的迁徙。
PLoS Genet. 2013;9(12):e1004023. doi: 10.1371/journal.pgen.1004023. Epub 2013 Dec 26.
5
RelateAdmix: a software tool for estimating relatedness between admixed individuals.RelateAdmix:用于估计混合个体之间亲缘关系的软件工具。
Bioinformatics. 2014 Apr 1;30(7):1027-8. doi: 10.1093/bioinformatics/btt652. Epub 2013 Nov 8.
6
Improved ancestry inference using weights from external reference panels.利用外部参考面板的权重提高祖先推断。
Bioinformatics. 2013 Jun 1;29(11):1399-406. doi: 10.1093/bioinformatics/btt144. Epub 2013 Mar 28.
7
Utilizing graph theory to select the largest set of unrelated individuals for genetic analysis.利用图论选择最大的一组无关联个体进行遗传分析。
Genet Epidemiol. 2013 Feb;37(2):136-41. doi: 10.1002/gepi.21684. Epub 2012 Sep 19.
8
Principal components analysis of population admixture.群体混合的主成分分析。
PLoS One. 2012;7(7):e40115. doi: 10.1371/journal.pone.0040115. Epub 2012 Jul 9.
9
Estimating kinship in admixed populations.估算混合人群中的亲属关系。
Am J Hum Genet. 2012 Jul 13;91(1):122-38. doi: 10.1016/j.ajhg.2012.05.024. Epub 2012 Jun 28.
10
Genome-wide efficient mixed-model analysis for association studies.全基因组高效混合模型关联分析。
Nat Genet. 2012 Jun 17;44(7):821-4. doi: 10.1038/ng.2310.