Suppr超能文献

样本量影响(SaSii):一个用于估计群体遗传学和群体基因组学研究中最优样本量的R脚本。

Sample Size Impact (SaSii): An R script for estimating optimal sample sizes in population genetics and population genomics studies.

作者信息

Scaketti Matheus, Sujii Patricia Sanae, Alves-Pereira Alessandro, Schwarcz Kaiser Dias, Francisconi Ana Flávia, Moro Matheus Sartori, Moreno Martins Kauanne Karolline, de Jesus Thiago Araujo, de Souza Guilherme Brener Ferreira, Zucchi Maria Imaculada

机构信息

Biology Institute, State University of Campinas-UNICAMP, Campinas, São Paulo, Brazil.

Applied Biology Laboratory, Centro de Ensino Unificado do Distrito Federal, Brasília, Distrito Federal, Brazil.

出版信息

PLoS One. 2025 Feb 13;20(2):e0316634. doi: 10.1371/journal.pone.0316634. eCollection 2025.

Abstract

Obtaining large sample sizes for genetic studies can be challenging, time-consuming, and expensive, and small sample sizes may generate biased or imprecise results. Many studies have suggested the minimum sample size necessary to obtain robust and reliable results, but it is not possible to define one ideal minimum sample size that fits all studies. Here, we present SaSii (Sample Size Impact), an R script to help researchers define the minimum sample size. Based on empirical and simulated data analysis using SaSii, we present patterns and suggest minimum sample sizes for experiment design. The patterns were obtained by analyzing previously published genotype datasets with SaSii and can be used as a starting point for the sample design of population genetics and genomic studies. Our results showed that it is possible to estimate an adequate sample size that accurately represents the real population without requiring the scientist to write any program code, extract and sequence samples, or use population genetics programs, thus simplifying the process. We also confirmed that the minimum sample sizes for SNP (single-nucleotide polymorphism) analysis are usually smaller than for SSR (simple sequence repeat) analysis and discussed other patterns observed from empirical plant and animal datasets.

摘要

获取用于基因研究的大样本量可能具有挑战性、耗时且成本高昂,而小样本量可能会产生有偏差或不精确的结果。许多研究都提出了获得稳健可靠结果所需的最小样本量,但不可能定义一个适用于所有研究的理想最小样本量。在此,我们展示了SaSii(样本量影响),这是一个R脚本,可帮助研究人员确定最小样本量。基于使用SaSii进行的实证和模拟数据分析,我们呈现了相关模式并为实验设计建议了最小样本量。这些模式是通过使用SaSii分析先前发表的基因型数据集获得的,可作为群体遗传学和基因组研究样本设计的起点。我们的结果表明,无需科学家编写任何程序代码、提取和测序样本或使用群体遗传学程序,就有可能估计出能准确代表真实群体的足够样本量,从而简化了这一过程。我们还证实,单核苷酸多态性(SNP)分析的最小样本量通常小于简单序列重复(SSR)分析的最小样本量,并讨论了从实证植物和动物数据集中观察到的其他模式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aad/11824989/33400c421787/pone.0316634.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验