• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

pcadapt:一个基于主成分分析进行选择的基因组扫描的R软件包。

pcadapt: an R package to perform genome scans for selection based on principal component analysis.

作者信息

Luu Keurcien, Bazin Eric, Blum Michael G B

机构信息

Laboratoire TIMC-IMAG, UMR 5525, CNRS, Université Grenoble Alpes, Grenoble, France.

Laboratoire d'Ecologie Alpine UMR 5553, CNRS, Université Grenoble Alpes, Grenoble, France.

出版信息

Mol Ecol Resour. 2017 Jan;17(1):67-77. doi: 10.1111/1755-0998.12592. Epub 2016 Sep 7.

DOI:10.1111/1755-0998.12592
PMID:27601374
Abstract

The R package pcadapt performs genome scans to detect genes under selection based on population genomic data. It assumes that candidate markers are outliers with respect to how they are related to population structure. Because population structure is ascertained with principal component analysis, the package is fast and works with large-scale data. It can handle missing data and pooled sequencing data. By contrast to population-based approaches, the package handle admixed individuals and does not require grouping individuals into populations. Since its first release, pcadapt has evolved in terms of both statistical approach and software implementation. We present results obtained with robust Mahalanobis distance, which is a new statistic for genome scans available in the 2.0 and later versions of the package. When hierarchical population structure occurs, Mahalanobis distance is more powerful than the communality statistic that was implemented in the first version of the package. Using simulated data, we compare pcadapt to other computer programs for genome scans (BayeScan, hapflk, OutFLANK, sNMF). We find that the proportion of false discoveries is around a nominal false discovery rate set at 10% with the exception of BayeScan that generates 40% of false discoveries. We also find that the power of BayeScan is severely impacted by the presence of admixed individuals whereas pcadapt is not impacted. Last, we find that pcadapt and hapflk are the most powerful in scenarios of population divergence and range expansion. Because pcadapt handles next-generation sequencing data, it is a valuable tool for data analysis in molecular ecology.

摘要

R软件包pcadapt基于群体基因组数据进行全基因组扫描,以检测受到选择的基因。它假定候选标记在与群体结构的关系方面是异常值。由于群体结构是通过主成分分析确定的,该软件包速度快,可处理大规模数据。它可以处理缺失数据和混合测序数据。与基于群体的方法不同,该软件包可以处理混合个体,并且不需要将个体分组到不同群体中。自首次发布以来,pcadapt在统计方法和软件实现方面都有所发展。我们展示了使用稳健马氏距离获得的结果,这是该软件包2.0及更高版本中可用的一种新的全基因组扫描统计量。当出现分层群体结构时,马氏距离比该软件包第一版中实现的共性统计量更有效。使用模拟数据,我们将pcadapt与其他用于全基因组扫描的计算机程序(BayeScan、hapflk、OutFLANK、sNMF)进行了比较。我们发现,除了产生40%错误发现的BayeScan外,错误发现的比例约为设定的10%的名义错误发现率。我们还发现,BayeScan的功效受到混合个体存在的严重影响,而pcadapt则不受影响。最后,我们发现pcadapt和hapflk在群体分化和范围扩张的情况下最有效。由于pcadapt可以处理下一代测序数据,它是分子生态学数据分析的一个有价值的工具。

相似文献

1
pcadapt: an R package to perform genome scans for selection based on principal component analysis.pcadapt:一个基于主成分分析进行选择的基因组扫描的R软件包。
Mol Ecol Resour. 2017 Jan;17(1):67-77. doi: 10.1111/1755-0998.12592. Epub 2016 Sep 7.
2
minotaur: A platform for the analysis and visualization of multivariate results from genome scans with R Shiny.牛头怪:一个用于使用R Shiny对基因组扫描的多变量结果进行分析和可视化的平台。
Mol Ecol Resour. 2017 Jan;17(1):33-43. doi: 10.1111/1755-0998.12579. Epub 2016 Aug 31.
3
Evaluation of redundancy analysis to identify signatures of local adaptation.冗余分析评价用于识别局部适应特征。
Mol Ecol Resour. 2018 Nov;18(6):1223-1233. doi: 10.1111/1755-0998.12906. Epub 2018 Jun 17.
4
Performing Highly Efficient Genome Scans for Local Adaptation with R Package pcadapt Version 4.使用 R 包 pcadapt 版本 4 进行高效的基因组扫描以检测局部适应
Mol Biol Evol. 2020 Jul 1;37(7):2153-2154. doi: 10.1093/molbev/msaa053.
5
Detecting Genomic Signatures of Natural Selection with Principal Component Analysis: Application to the 1000 Genomes Data.利用主成分分析检测自然选择的基因组特征:应用于千人基因组数据
Mol Biol Evol. 2016 Apr;33(4):1082-93. doi: 10.1093/molbev/msv334. Epub 2015 Dec 29.
6
rehh 2.0: a reimplementation of the R package rehh to detect positive selection from haplotype structure.rehh 2.0:R包rehh的重新实现,用于从单倍型结构中检测正选择。
Mol Ecol Resour. 2017 Jan;17(1):78-90. doi: 10.1111/1755-0998.12634. Epub 2016 Nov 28.
7
Genome scans for detecting footprints of local adaptation using a Bayesian factor model.使用贝叶斯因子模型进行基因组扫描以检测局部适应性印记。
Mol Biol Evol. 2014 Sep;31(9):2483-95. doi: 10.1093/molbev/msu182. Epub 2014 Jun 3.
8
stratag: An r package for manipulating, summarizing and analysing population genetic data.Stratag:一个用于处理、汇总和分析群体遗传数据的R软件包。
Mol Ecol Resour. 2017 Jan;17(1):5-11. doi: 10.1111/1755-0998.12559. Epub 2016 Jul 20.
9
parallelnewhybrid: an R package for the parallelization of hybrid detection using newhybrids.并行新杂种法:一个用于使用新杂种法并行化杂种检测的R包。
Mol Ecol Resour. 2017 Jan;17(1):91-95. doi: 10.1111/1755-0998.12597. Epub 2016 Oct 1.
10
Identifying outlier loci in admixed and in continuous populations using ancestral population differentiation statistics.使用祖先群体分化统计量在混合群体和连续群体中识别异常位点。
Mol Ecol. 2016 Oct;25(20):5029-5042. doi: 10.1111/mec.13822. Epub 2016 Sep 14.

引用本文的文献

1
Local Adaptation Drives Leaf Thermoregulation in Tropical Rainforest Trees.局部适应性驱动热带雨林树木的叶片温度调节。
Glob Chang Biol. 2025 Sep;31(9):e70461. doi: 10.1111/gcb.70461.
2
Aphid herbivory on macrophytes drives adaptive evolution in an aquatic community via indirect effects.蚜虫对大型植物的取食通过间接影响推动水生群落的适应性进化。
Proc Natl Acad Sci U S A. 2025 Aug 26;122(34):e2502742122. doi: 10.1073/pnas.2502742122. Epub 2025 Aug 21.
3
Range-wide climate risk and adaptive potential in a cold-water fish species.
一种冷水鱼类的全分布范围气候风险与适应潜力
Nat Commun. 2025 Aug 13;16(1):7514. doi: 10.1038/s41467-025-62811-w.
4
Over-expression and increased copy numbers of a cytochrome P450 and two UDP-glucuronosyltransferase genes in macrocyclic lactone resistant Psoroptes ovis of cattle.牛大环内酯抗性绵羊痒螨中一种细胞色素P450和两种尿苷二磷酸葡萄糖醛酸基转移酶基因的过表达及拷贝数增加
PLoS Pathog. 2025 Jul 29;21(7):e1012963. doi: 10.1371/journal.ppat.1012963. eCollection 2025 Jul.
5
A novel expectation-maximization approach to infer general diploid selection from time-series genetic data.一种从时间序列遗传数据推断一般二倍体选择的新型期望最大化方法。
PLoS Genet. 2025 Jul 22;21(7):e1011769. doi: 10.1371/journal.pgen.1011769. eCollection 2025 Jul.
6
The Influence of Spatial Distance and Environment on Small-Scale Genetic Variability in Eelgrass and Its Application for Restoration.空间距离和环境对鳗草小尺度遗传变异性的影响及其在恢复中的应用。
Evol Appl. 2025 Jul 11;18(7):e70127. doi: 10.1111/eva.70127. eCollection 2025 Jul.
7
Interplay between large low-recombining regions and pseudo-overdominance in a plant genome.植物基因组中大型低重组区域与假超显性之间的相互作用。
Nat Commun. 2025 Jul 12;16(1):6458. doi: 10.1038/s41467-025-61529-z.
8
'Highly-Informative' Genetic Markers Can Bias Conclusions: Examples and General Solutions.“高信息量”基因标记可能会使结论产生偏差:示例与通用解决方案
Mol Ecol Resour. 2025 Oct;25(7):e70011. doi: 10.1111/1755-0998.70011. Epub 2025 Jul 11.
9
Combining population genomics, larval dispersal modelling and graph theory to reveal connectivity patterns of the black-lipped pearl oyster in French Polynesia.结合种群基因组学、幼体扩散建模和图论来揭示法属波利尼西亚黑唇珍珠牡蛎的连通性模式。
Sci Rep. 2025 Jul 9;15(1):24611. doi: 10.1038/s41598-025-10113-y.
10
Host-Associated Genetic Differentiation in the Face of Ongoing Gene Flow: Ecological Speciation in a Pathogenic Parasite of Freshwater Fish.在持续基因流情况下宿主相关的遗传分化:淡水鱼致病寄生虫中的生态物种形成
Mol Biol Evol. 2025 Jul 1;42(7). doi: 10.1093/molbev/msaf163.