• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于质粒模型的模拟评估肥胖的鼠类遗传模型:I 型错误率和常用分析方法的效能。

Murine genetic models of obesity: type I error rates and the power of commonly used analyses as assessed by plasmode-based simulation.

机构信息

Department of Epidemiology and Biostatistics, Indiana University School of Public Health-Bloomington, Bloomington, IN, USA.

Institute of Industrial Science, The University of Tokyo, Tokyo, Japan.

出版信息

Int J Obes (Lond). 2020 Jun;44(6):1440-1449. doi: 10.1038/s41366-020-0554-2. Epub 2020 Feb 25.

DOI:10.1038/s41366-020-0554-2
PMID:32099106
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7261642/
Abstract

BACKGROUND/OBJECTIVES: Genetic contributors to obesity are frequently studied in murine models. However, the sample sizes of these studies are often small, and the data may violate assumptions of common statistical tests, such as normality of distributions. We examined whether, in these cases, type I error rates and power are affected by the choice of statistical test.

SUBJECTS/METHODS: We conducted "plasmode"-based simulation using empirical data on body mass (weight) from murine genetic models of obesity. For the type I error simulation, the weight distributions were adjusted to ensure no difference in means between control and mutant groups. For the power simulation, the distributions of the mutant groups were shifted to ensure specific effect sizes. Three to twenty mice were resampled from the empirical distributions to create a plasmode. We then computed type I error rates and power for five common tests on the plasmodes: Student's t test, Welch's t test, Wilcoxon rank sum test (aka, Mann-Whitney U test), permutation test, and bootstrap test.

RESULTS

We observed type I error inflation for all tests, except the bootstrap test, with small samples (≤5). Type I error inflation decreased as sample size increased (≥8) but remained. The Wilcoxon test should be avoided because of heterogeneity of distributions. For power, a departure from the reference was observed with small samples for all tests. Compared with the other tests, the bootstrap test had less power with small samples.

CONCLUSIONS

Overall, the bootstrap test is recommended for small samples to avoid type I error inflation, but this benefit comes at the cost of lower power. When sample size is large enough, Welch's t test is recommended because of high power with minimal type I error inflation.

摘要

背景/目的:肥胖的遗传因素经常在小鼠模型中进行研究。然而,这些研究的样本量通常较小,并且数据可能违反常见统计检验的假设,例如分布的正态性。我们研究了在这些情况下,选择统计检验是否会影响Ⅰ类错误率和功效。

受试者/方法:我们使用肥胖小鼠遗传模型的体重(重量)的经验数据进行了基于“plasmode”的模拟。对于Ⅰ类错误模拟,调整体重分布以确保对照组和突变组之间的平均值没有差异。对于功效模拟,将突变组的分布移动以确保特定的效应大小。从经验分布中对 3 到 20 只老鼠进行重采样以创建 plasmode。然后,我们计算了五个常见测试的 plasmode 的Ⅰ类错误率和功效:学生 t 检验、Welch t 检验、Wilcoxon 秩和检验(又名 Mann-Whitney U 检验)、置换检验和自举检验。

结果

我们观察到除自举检验外,所有检验的Ⅰ类错误率都有膨胀,尤其是小样本(≤5)。随着样本量的增加(≥8),Ⅰ类错误率的膨胀会降低,但仍会存在。由于分布不均一,应避免使用 Wilcoxon 检验。对于功效,所有检验在小样本时均观察到与参考值的偏差。与其他检验相比,小样本时自举检验的功效较低。

结论

总体而言,对于小样本,建议使用自举检验以避免Ⅰ类错误率膨胀,但这会以降低功效为代价。当样本量足够大时,由于最小的Ⅰ类错误率膨胀和高功效,建议使用 Welch t 检验。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/b6be317e5369/nihms-1560827-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/9950cfc459d9/nihms-1560827-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/44ecfe317441/nihms-1560827-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/e6468aa50454/nihms-1560827-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/8f8313ed4350/nihms-1560827-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/b6be317e5369/nihms-1560827-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/9950cfc459d9/nihms-1560827-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/44ecfe317441/nihms-1560827-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/e6468aa50454/nihms-1560827-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/8f8313ed4350/nihms-1560827-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00de/7261642/b6be317e5369/nihms-1560827-f0005.jpg

相似文献

1
Murine genetic models of obesity: type I error rates and the power of commonly used analyses as assessed by plasmode-based simulation.基于质粒模型的模拟评估肥胖的鼠类遗传模型:I 型错误率和常用分析方法的效能。
Int J Obes (Lond). 2020 Jun;44(6):1440-1449. doi: 10.1038/s41366-020-0554-2. Epub 2020 Feb 25.
2
Analysis of small sample size studies using nonparametric bootstrap test with pooled resampling method.使用合并重采样方法的非参数自助检验对小样本量研究进行分析。
Stat Med. 2017 Jun 30;36(14):2187-2205. doi: 10.1002/sim.7263. Epub 2017 Mar 9.
3
Case for omitting tied observations in the two-sample t-test and the Wilcoxon-Mann-Whitney Test.两样本 t 检验和 Wilcoxon-Mann-Whitney 检验中剔除结的情况。
PLoS One. 2018 Jul 24;13(7):e0200837. doi: 10.1371/journal.pone.0200837. eCollection 2018.
4
Empirical versus theoretical power and type I error (false-positive) rates estimated from real murine aging research data.从真实的小鼠衰老研究数据中估计的经验与理论功效以及 I 型错误(假阳性)率。
Cell Rep. 2021 Aug 17;36(7):109560. doi: 10.1016/j.celrep.2021.109560.
5
To test or not to test: Preliminary assessment of normality when comparing two independent samples.是否进行检验:比较两个独立样本时对正态性的初步评估。
BMC Med Res Methodol. 2012 Jun 19;12:81. doi: 10.1186/1471-2288-12-81.
6
Outlier removal, sum scores, and the inflation of the Type I error rate in independent samples t tests: the power of alternatives and recommendations.异常值剔除、总和得分和独立样本 t 检验中 I 型错误率的膨胀:替代方案的功效和建议。
Psychol Methods. 2014 Sep;19(3):409-27. doi: 10.1037/met0000014. Epub 2014 Apr 28.
7
Persistent confusion in nutrition and obesity research about the validity of classic nonparametric tests in the presence of heteroscedasticity: evidence of the problem and valid alternatives.在存在异方差的情况下,营养和肥胖研究中对经典非参数检验有效性的持续混淆:问题的证据和有效的替代方法。
Am J Clin Nutr. 2021 Mar 11;113(3):517-524. doi: 10.1093/ajcn/nqaa357.
8
A bootstrap test for the analysis of microarray experiments with a very small number of replications.一种用于分析复制次数极少的微阵列实验的自助检验。
Appl Bioinformatics. 2006;5(3):173-9. doi: 10.2165/00822942-200605030-00005.
9
Increasing physicians' awareness of the impact of statistics on research outcomes: comparative power of the t-test and and Wilcoxon Rank-Sum test in small samples applied research.提高医生对统计学对研究结果影响的认识:小样本应用研究中t检验和Wilcoxon秩和检验的比较效能
J Clin Epidemiol. 1999 Mar;52(3):229-35. doi: 10.1016/s0895-4356(98)00168-1.
10
How to Compare the Length of Stay of Two Samples of Inpatients? A Simulation Study to Compare Type I and Type II Errors of 12 Statistical Tests.如何比较两个住院患者样本的住院时间?一项比较12种统计检验的I型和II型错误的模拟研究。
Value Health. 2017 Jul-Aug;20(7):992-998. doi: 10.1016/j.jval.2017.02.009. Epub 2017 Mar 28.

引用本文的文献

1
Bias in nutrition-health associations is not eliminated by excluding extreme reporters in empirical or simulation studies.在实证或模拟研究中,通过排除极端报告者并不能消除营养-健康关联中的偏倚。
Elife. 2023 Apr 5;12:e83616. doi: 10.7554/eLife.83616.
2
From Model Organisms to Humans, the Opportunity for More Rigor in Methodologic and Statistical Analysis, Design, and Interpretation of Aging and Senescence Research.从模式生物到人类,在衰老和衰老研究的方法学和统计学分析、设计和解释方面有机会更加严谨。
J Gerontol A Biol Sci Med Sci. 2022 Nov 21;77(11):2155-2164. doi: 10.1093/gerona/glab382.
3
Empirical versus theoretical power and type I error (false-positive) rates estimated from real murine aging research data.

本文引用的文献

1
Redefine statistical significance.重新定义统计学显著性。
Nat Hum Behav. 2018 Jan;2(1):6-10. doi: 10.1038/s41562-017-0189-z.
2
How sure are you of your result? Put a number on it.你对你的结果有多确定?用一个数字来表示。
Nature. 2018 Dec;564(7734):7. doi: 10.1038/d41586-018-07589-2.
3
Issues with data and analyses: Errors, underlying themes, and potential solutions.数据和分析问题:错误、潜在主题和潜在解决方案。
从真实的小鼠衰老研究数据中估计的经验与理论功效以及 I 型错误(假阳性)率。
Cell Rep. 2021 Aug 17;36(7):109560. doi: 10.1016/j.celrep.2021.109560.
Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2563-2570. doi: 10.1073/pnas.1708279115.
4
PREPARE: guidelines for planning animal research and testing.《准备:动物研究与试验规划指南》
Lab Anim. 2018 Apr;52(2):135-141. doi: 10.1177/0023677217724823. Epub 2017 Aug 3.
5
Reproducibility: A tragedy of errors.可重复性:错误的悲剧。
Nature. 2016 Feb 4;530(7588):27-9. doi: 10.1038/530027a.
6
Reproducibility in science: improving the standard for basic and preclinical research.科学可重复性:提高基础和临床前研究的标准。
Circ Res. 2015 Jan 2;116(1):116-26. doi: 10.1161/CIRCRESAHA.114.303819.
7
Drug development: Raise standards for preclinical cancer research.药物研发:提高临床前癌症研究标准。
Nature. 2012 Mar 28;483(7391):531-3. doi: 10.1038/483531a.
8
Improving bioscience research reporting: the ARRIVE guidelines for reporting animal research.改进生物科学研究报告:动物研究报告的ARRIVE指南
PLoS Biol. 2010 Jun 29;8(6):e1000412. doi: 10.1371/journal.pbio.1000412.
9
The generalisation of student's problems when several different population variances are involved.当涉及几个不同总体方差时学生问题的推广。
Biometrika. 1947;34(1-2):28-35. doi: 10.1093/biomet/34.1-2.28.
10
Evaluating statistical methods using plasmode data sets in the age of massive public databases: an illustration using false discovery rates.在海量公共数据库时代,使用模拟数据集评估统计方法:以错误发现率为例
PLoS Genet. 2008 Jun 20;4(6):e1000098. doi: 10.1371/journal.pgen.1000098.