• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SambaR:一个用于快速、轻松且可重复的二倍体 SNP 数据集群体遗传分析的 R 包。

SambaR: An R package for fast, easy and reproducible population-genetic analyses of biallelic SNP data sets.

机构信息

Department of Biosciences, Durham University, Durham, UK.

Biodiversity and Climate Research Centre, Senckenberg Institute, Frankfurt am Main, Germany.

出版信息

Mol Ecol Resour. 2021 May;21(4):1369-1379. doi: 10.1111/1755-0998.13339. Epub 2021 Feb 20.

DOI:10.1111/1755-0998.13339
PMID:33503314
Abstract

SNP data sets can be used to infer a wealth of information about natural populations, including information about their structure, genetic diversity, and the presence of loci under selection. However, SNP data analysis can be a time-consuming and challenging process, not in the least because at present many different software packages are needed to execute and depict the wide variety of mainstream population-genetic analyses. Here, we present SambaR, an integrative and user-friendly R package which automates and simplifies quality control and population-genetic analyses of biallelic SNP data sets. SambaR allows users to perform mainstream population-genetic analyses and to generate a wide variety of ready to publish graphs with a minimum number of commands (less than 10). These wrapper commands call functions of existing packages (including adegenet, ape, LEA, poppr, pcadapt and StAMPP) as well as new tools uniquely implemented in SambaR. We tested SambaR on online available SNP data sets and found that SambaR can process data sets of over 100,000 SNPs and hundreds of individuals within hours, given sufficient computing power. Newly developed tools implemented in SambaR facilitate optimization of filter settings, objective interpretation of ordination analyses, enhance comparability of diversity estimates from reduced representation library SNP data sets, and generate reduced SNP panels and structure-like plots with Bayesian population assignment probabilities. SambaR facilitates rapid population genetic analyses on biallelic SNP data sets by removing three major time sinks: file handling, software learning, and data plotting. In addition, SambaR provides a convenient platform for SNP data storage and management, as well as several new utilities, including guidance in setting appropriate data filters. The SambaR source script, manual and example data set are distributed through GitHub: https://github.com/mennodejong1986/SambaR.

摘要

SNP 数据集可用于推断有关自然种群的大量信息,包括其结构、遗传多样性以及受选择影响的基因座的信息。然而,SNP 数据分析可能是一个耗时且具有挑战性的过程,这主要是因为目前需要许多不同的软件包来执行和描述各种主流的群体遗传学分析。在这里,我们介绍了 SambaR,这是一个集成的、用户友好的 R 包,它可以自动简化二态 SNP 数据集的质量控制和群体遗传学分析。SambaR 允许用户执行主流的群体遗传学分析,并使用最少的命令(少于 10 个)生成各种准备发布的图形。这些包装命令调用现有的包(包括 adegenet、ape、LEA、poppr、pcadapt 和 StAMPP)的功能,以及在 SambaR 中唯一实现的新工具。我们在在线可用的 SNP 数据集上测试了 SambaR,发现只要有足够的计算能力,SambaR 可以在数小时内处理超过 100,000 个 SNP 和数百个个体的数据集。SambaR 中开发的新工具有助于优化过滤设置、对排序分析的目标解释、增强减少代表性文库 SNP 数据集的多样性估计的可比性,并生成带有贝叶斯群体分配概率的简化 SNP 面板和结构样图。SambaR 通过消除三个主要的时间消耗源来促进二态 SNP 数据集的快速群体遗传分析:文件处理、软件学习和数据绘图。此外,SambaR 为 SNP 数据存储和管理提供了一个方便的平台,以及几个新的实用程序,包括适当的数据过滤设置的指导。SambaR 的源代码、手册和示例数据集通过 GitHub 分发:https://github.com/mennodejong1986/SambaR。

相似文献

1
SambaR: An R package for fast, easy and reproducible population-genetic analyses of biallelic SNP data sets.SambaR:一个用于快速、轻松且可重复的二倍体 SNP 数据集群体遗传分析的 R 包。
Mol Ecol Resour. 2021 May;21(4):1369-1379. doi: 10.1111/1755-0998.13339. Epub 2021 Feb 20.
2
dartr: An r package to facilitate analysis of SNP data generated from reduced representation genome sequencing.dartr:一个 r 包,用于简化从简化代表性基因组测序生成的 SNP 数据的分析。
Mol Ecol Resour. 2018 May;18(3):691-699. doi: 10.1111/1755-0998.12745. Epub 2018 Jan 15.
3
adegenet 1.3-1: new tools for the analysis of genome-wide SNP data.adegenet 1.3-1:全基因组 SNP 数据分析的新工具。
Bioinformatics. 2011 Nov 1;27(21):3070-1. doi: 10.1093/bioinformatics/btr521. Epub 2011 Sep 16.
4
angsd-wrapper: utilities for analysing next-generation sequencing data.angsd包装器:用于分析下一代测序数据的实用工具。
Mol Ecol Resour. 2016 Nov;16(6):1449-1454. doi: 10.1111/1755-0998.12578. Epub 2016 Aug 29.
5
Population genetics of Sambar (Rusa unicolor) from the Western Himalayas: preliminary findings.西喜马拉雅山泽鹿(Rusa unicolor)的群体遗传学:初步发现。
Mol Biol Rep. 2022 Jan;49(1):811-816. doi: 10.1007/s11033-021-06845-5. Epub 2021 Oct 19.
6
snpfiltr: An R package for interactive and reproducible SNP filtering.snpfiltr:一个用于交互式和可重复 SNP 过滤的 R 包。
Mol Ecol Resour. 2022 Aug;22(6):2443-2453. doi: 10.1111/1755-0998.13618. Epub 2022 Apr 24.
7
stratag: An r package for manipulating, summarizing and analysing population genetic data.Stratag:一个用于处理、汇总和分析群体遗传数据的R软件包。
Mol Ecol Resour. 2017 Jan;17(1):5-11. doi: 10.1111/1755-0998.12559. Epub 2016 Jul 20.
8
TRES: Identification of Discriminatory and Informative SNPs from Population Genomic Data.TRES:从群体基因组数据中识别具有鉴别力和信息量的单核苷酸多态性
J Hered. 2015 Sep-Oct;106(5):672-6. doi: 10.1093/jhered/esv044. Epub 2015 Jul 2.
9
StAMPP: an R package for calculation of genetic differentiation and structure of mixed-ploidy level populations.StAMPP:用于计算混合倍性水平群体遗传分化和结构的 R 包。
Mol Ecol Resour. 2013 Sep;13(5):946-52. doi: 10.1111/1755-0998.12129. Epub 2013 Jun 6.
10
fcGENE: a versatile tool for processing and transforming SNP datasets.fcGENE:一种用于处理和转换单核苷酸多态性数据集的通用工具。
PLoS One. 2014 Jul 22;9(7):e97589. doi: 10.1371/journal.pone.0097589. eCollection 2014.

引用本文的文献

1
Population structure in a fungal human pathogen is potentially linked to pathogenicity.一种真菌性人类病原体的种群结构可能与致病性有关。
Nat Commun. 2025 Aug 15;16(1):7594. doi: 10.1038/s41467-025-62777-9.
2
Host-Associated Genetic Differentiation in the Face of Ongoing Gene Flow: Ecological Speciation in a Pathogenic Parasite of Freshwater Fish.在持续基因流情况下宿主相关的遗传分化:淡水鱼致病寄生虫中的生态物种形成
Mol Biol Evol. 2025 Jul 1;42(7). doi: 10.1093/molbev/msaf163.
3
Genome-wide association mapping for heat shock tolerance in Mercenaria mercenaria through SNP microarray analysis.
通过SNP微阵列分析对硬壳蛤热休克耐受性进行全基因组关联定位。
BMC Genomics. 2025 May 30;26(1):547. doi: 10.1186/s12864-025-11689-5.
4
Phylogenomic Analysis of Wide-Ranging Least Shrews Refines Conservation Priorities and Supports a Paradigm for Evolution of Biota Spanning Eastern North America and Mesoamerica.广泛分布的伶鼩鼱的系统基因组分析优化了保护重点,并支持了一个跨越北美东部和中美洲生物群进化的范例。
Ecol Evol. 2025 May 12;15(5):e71263. doi: 10.1002/ece3.71263. eCollection 2025 May.
5
Unique demographic history and population substructure among the Coorgs of Southern India.印度南部库格人的独特人口历史和种群亚结构。
Commun Biol. 2025 May 5;8(1):698. doi: 10.1038/s42003-025-08073-0.
6
Fishing for Florida Bass in West Virginia: Genomic Evaluation of Florida Bass Presence and Establishing Baselines of Genetic Structure and Diversity for Native Largemouth Bass.在西弗吉尼亚捕捞佛罗里达鲈鱼:佛罗里达鲈鱼存在情况的基因组评估以及建立本土大口黑鲈遗传结构和多样性基线
Biology (Basel). 2025 Apr 9;14(4):392. doi: 10.3390/biology14040392.
7
Green Turtle Conservation in the Genomic Era-Monitoring an Endangered Mediterranean Population and Its Breeding Habits.基因组时代的绿海龟保护——监测濒危的地中海种群及其繁殖习性
Ecol Evol. 2025 Apr 24;15(4):e71124. doi: 10.1002/ece3.71124. eCollection 2025 Apr.
8
Survival of the fittest: genomic investigations of the bay scallop reveal a shift in population structure through a summer mortality event.适者生存:海湾扇贝的基因组研究揭示了夏季死亡事件导致的种群结构变化。
BMC Genomics. 2025 Feb 15;26(1):146. doi: 10.1186/s12864-025-11337-y.
9
Red Deer Resequencing Reveals the Importance of Sex Chromosomes for Reconstructing Late Quaternary Events.马鹿重测序揭示性染色体在重建晚更新世事件中的重要性。
Mol Biol Evol. 2025 Feb 3;42(2). doi: 10.1093/molbev/msaf031.
10
Genomics Reveal Population Structure and Intergeneric Hybridization in an Endangered South American Bird: Implications for Management and Conservation.基因组学揭示一种濒危南美鸟类的种群结构和属间杂交:对管理和保护的启示
Ecol Evol. 2025 Jan 8;15(1):e70820. doi: 10.1002/ece3.70820. eCollection 2025 Jan.