• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将主成分判别分析应用于基因型数据的标准化指南。

Guidelines for standardizing the application of discriminant analysis of principal components to genotype data.

作者信息

Thia Joshua A

机构信息

Bio21 Institute, School of BioSciences, The University of Melbourne, Melbourne, Victoria, Australia.

出版信息

Mol Ecol Resour. 2023 Apr;23(3):523-538. doi: 10.1111/1755-0998.13706. Epub 2022 Sep 7.

DOI:10.1111/1755-0998.13706
PMID:36039574
Abstract

Despite the popularity of discriminant analysis of principal components (DAPC) for studying population structure, there has been little discussion of best practice for this method. In this work, I provide guidelines for standardizing the application of DAPC to genotype data sets. An often overlooked fact is that DAPC generates a model describing genetic differences among a set of populations defined by a researcher. Appropriate parameterization of this model is critical for obtaining biologically meaningful results. I show that the number of leading PC axes used as predictors of among-population differences, p , should not exceed the k-1 biologically informative PC axes that are expected for k effective populations in a genotype data set. This k-1 criterion for p specification is more appropriate compared to the widely used proportional variance criterion, which often results in a choice of p  ≫ k-1. DAPC parameterized with no more than the leading k-1 PC axes: (i) is more parsimonious; (ii) captures maximal among-population variation on biologically relevant predictors; (iii) is less sensitive to unintended interpretations of population structure; and (iv) is more generally applicable to independent sample sets. Assessing model fit should be routine practice and aids interpretation of population structure. It is imperative that researchers articulate their study goals, that is, testing a priori expectations vs. studying de novo inferred populations, because this has implications on how their DAPC results should be interpreted. The discussion and practical recommendations in this work provide the molecular ecology community with a roadmap for using DAPC in population genetic investigations.

摘要

尽管主成分判别分析(DAPC)在研究种群结构方面很受欢迎,但对于该方法的最佳实践却鲜有讨论。在这项工作中,我提供了将DAPC应用于基因型数据集的标准化指南。一个经常被忽视的事实是,DAPC生成了一个描述研究人员定义的一组种群之间遗传差异的模型。对该模型进行适当的参数化对于获得具有生物学意义的结果至关重要。我表明,用作种群间差异预测因子的主成分轴数量p不应超过基因型数据集中k个有效种群预期的k - 1个具有生物学信息的主成分轴。与广泛使用的比例方差标准相比,这个用于指定p的k - 1标准更为合适,比例方差标准常常导致选择的p≫k - 1。用不超过前k - 1个主成分轴进行参数化的DAPC:(i)更简洁;(ii)在生物学相关预测因子上捕获最大的种群间变异;(iii)对种群结构的意外解释不太敏感;(iv)更普遍适用于独立样本集。评估模型拟合应该是常规操作,并有助于对种群结构的解释。研究人员必须阐明他们的研究目标,即检验先验预期与研究新推断的种群,因为这对如何解释他们的DAPC结果有影响。这项工作中的讨论和实际建议为分子生态学领域提供了在种群遗传调查中使用DAPC的路线图。

相似文献

1
Guidelines for standardizing the application of discriminant analysis of principal components to genotype data.将主成分判别分析应用于基因型数据的标准化指南。
Mol Ecol Resour. 2023 Apr;23(3):523-538. doi: 10.1111/1755-0998.13706. Epub 2022 Sep 7.
2
The influence of a priori grouping on inference of genetic clusters: simulation study and literature review of the DAPC method.先验分组对遗传聚类推断的影响:DAPC 方法的模拟研究和文献综述。
Heredity (Edinb). 2020 Nov;125(5):269-280. doi: 10.1038/s41437-020-0348-2. Epub 2020 Aug 4.
3
A roadmap to robust discriminant analysis of principal components.主成分稳健判别分析路线图。
Mol Ecol Resour. 2023 Apr;23(3):519-522. doi: 10.1111/1755-0998.13724. Epub 2022 Nov 6.
4
Discriminant analysis of principal components: a new method for the analysis of genetically structured populations.主成分判别分析:一种用于分析遗传结构群体的新方法。
BMC Genet. 2010 Oct 15;11:94. doi: 10.1186/1471-2156-11-94.
5
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
6
Ancestry informative markers derived from discriminant analysis of principal components provide important insights into the composition of crossbred cattle.基于主成分判别分析的祖源信息标记物为杂交牛的组成提供了重要的见解。
Genomics. 2020 Mar;112(2):1726-1733. doi: 10.1016/j.ygeno.2019.10.008. Epub 2019 Oct 31.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Human loci involved in drug biotransformation: worldwide genetic variation, population structure, and pharmacogenetic implications.涉及药物生物转化的人类基因座:全球遗传变异、人群结构和药物遗传学意义。
Hum Genet. 2013 May;132(5):563-77. doi: 10.1007/s00439-013-1268-5. Epub 2013 Jan 26.
9
Analysis of genetic population structure in Acacia caven (Leguminosae, Mimosoideae), comparing one exploratory and two Bayesian-model-based methods.分析金合欢属(豆科,含羞草亚科)中基因种群结构,比较一种探索性方法和两种基于贝叶斯模型的方法。
Genet Mol Biol. 2014 Mar;37(1):64-72. doi: 10.1590/s1415-47572014000100012. Epub 2013 Feb 28.
10
Dog breed affiliation with a forensically validated canine STR set.犬种与经法证验证的犬 STR 检测试剂盒的关联。
Forensic Sci Int Genet. 2018 Nov;37:126-134. doi: 10.1016/j.fsigen.2018.08.005. Epub 2018 Aug 16.

引用本文的文献

1
Global stock structure of the Silky shark (, Carcharhinidae) assessed with high-throughput DNA sequencing.利用高通量DNA测序评估丝鲨(真鲨科)的全球种群结构
PeerJ. 2025 Jul 7;13:e19493. doi: 10.7717/peerj.19493. eCollection 2025.
2
DAPCy: a Python package for the discriminant analysis of principal components method for population genetic analyses.DAPCy:一个用于群体遗传分析的主成分判别分析方法的Python软件包。
Bioinform Adv. 2025 Jun 18;5(1):vbaf143. doi: 10.1093/bioadv/vbaf143. eCollection 2025.
3
Genotyping by Amplicon Sequencing (GBAS) With Newly Developed SSR and EPIC Markers Reveals Structure in Populations of the Green Toad () Across Rural and Urban Environments.
利用新开发的SSR和EPIC标记通过扩增子测序进行基因分型(GBAS)揭示了城乡环境中绿蟾蜍(Bufo viridis)种群的结构。
Ecol Evol. 2025 Jul 8;15(7):e71652. doi: 10.1002/ece3.71652. eCollection 2025 Jul.
4
Urbanization and distance shape population structure in .城市化和距离塑造了……的人口结构。 你提供的原文似乎不完整,最后的“in.”后面缺少具体内容。
J Urban Ecol. 2024;10(1):juae016. doi: 10.1093/jue/juae016. Epub 2024 Aug 19.
5
Rapid Colonisation of Synanthropic Stone Martens in a Highly Urbanised Region: Insights From Temporal and Spatial Analysis.高度城市化地区共生石貂的快速定殖:时空分析的见解
Ecol Evol. 2025 May 26;15(5):e71392. doi: 10.1002/ece3.71392. eCollection 2025 May.
6
ShiNyP: Unlocking SNP-Based Population Genetics-AI-Assisted Platform for Rapid and Interactive Visual Exploration.ShiNyP:解锁基于单核苷酸多态性的群体遗传学——用于快速交互式可视化探索的人工智能辅助平台。
Mol Biol Evol. 2025 Jun 4;42(6). doi: 10.1093/molbev/msaf117.
7
Heritability and genomic basis of age-at-maturity in Chinook Salmon.奇努克鲑鱼成熟年龄的遗传力和基因组基础。
J Hered. 2025 Aug 23;116(5):632-652. doi: 10.1093/jhered/esaf021.
8
Floristic classifications and bioregionalizations are not predictors of intra-specific evolutionary patterns.植物区系分类和生物区域划分并非种内进化模式的预测指标。
Nat Commun. 2024 Dec 30;15(1):10770. doi: 10.1038/s41467-024-54930-7.
9
Population genomics informs the management of harvested snappers across north-western Australia.群体基因组学为澳大利亚西北部捕捞鲷鱼的管理提供了信息。
Sci Rep. 2024 Nov 4;14(1):26598. doi: 10.1038/s41598-024-77424-4.
10
Genetic consequences of improved river connectivity in brown trout ( L.).褐鳟(L.)河流连通性改善的遗传后果
Evol Appl. 2024 Apr 11;17(4):e13660. doi: 10.1111/eva.13660. eCollection 2024 Apr.