• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

小样本量:高维数据分析中的大数据问题。

Small sample sizes: A big data problem in high-dimensional data analysis.

机构信息

Charité-Universitätsmedizin Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Institute of Biometry and Clinical Epidemiology, Charitéplatz 1, Berlin, Germany.

Berlin Institute of Health (BIH), Anna-Louisa-Karsch-Straße 2, Berlin, Germany.

出版信息

Stat Methods Med Res. 2021 Mar;30(3):687-701. doi: 10.1177/0962280220970228. Epub 2020 Nov 24.

DOI:10.1177/0962280220970228
PMID:33228480
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8008424/
Abstract

In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of -test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the maximum statistic. Extensive simulation studies confirm that the new method is particularly suitable for analyzing data sets with small sample sizes. A real data set illustrates the application of the methods.

摘要

在许多实验中,尤其是在转化和临床前研究中,样本量非常小。此外,数据设计通常是高维的,即试验的重复比独立复制观察到的更多。本文讨论了在小样本量的高维设计(重复测量或多变量)中应用 t 检验型统计量(多重比较检验)的适用性。开发了一种基于随机化的方法来近似最大统计量的分布。广泛的模拟研究证实,新方法特别适用于分析小样本量数据集。一个真实数据集说明了该方法的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1da/8008424/f73e49fa9882/10.1177_0962280220970228-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1da/8008424/91a989793f42/10.1177_0962280220970228-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1da/8008424/f73e49fa9882/10.1177_0962280220970228-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1da/8008424/91a989793f42/10.1177_0962280220970228-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1da/8008424/f73e49fa9882/10.1177_0962280220970228-fig2.jpg

相似文献

1
Small sample sizes: A big data problem in high-dimensional data analysis.小样本量:高维数据分析中的大数据问题。
Stat Methods Med Res. 2021 Mar;30(3):687-701. doi: 10.1177/0962280220970228. Epub 2020 Nov 24.
2
Analysis of covariance under variance heteroscedasticity in general factorial designs.方差异质性下一般析因设计的协方差分析。
Stat Med. 2021 Sep 20;40(21):4732-4749. doi: 10.1002/sim.9092. Epub 2021 Jun 14.
3
Ranking procedures for repeated measures designs with missing data: Estimation, testing and asymptotic theory.重复测量设计中缺失数据的排名程序:估计、检验和渐近理论。
Stat Methods Med Res. 2022 Jan;31(1):105-118. doi: 10.1177/09622802211046389. Epub 2021 Nov 29.
4
Simultaneous inference for multiple marginal generalized estimating equation models.多个边际广义估计方程模型的同时推断
Stat Methods Med Res. 2020 Jun;29(6):1746-1762. doi: 10.1177/0962280219873005. Epub 2019 Sep 17.
5
High-dimensional multivariate analysis of variance via geometric median and bootstrapping.基于几何中位数和自举法的高维多元方差分析。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae088.
6
Sample size planning for multiple contrast tests.多重对比检验的样本量规划
Biom J. 2023 Dec;65(8):e2200081. doi: 10.1002/bimj.202200081. Epub 2023 Sep 4.
7
Analysis of small sample size studies using nonparametric bootstrap test with pooled resampling method.使用合并重采样方法的非参数自助检验对小样本量研究进行分析。
Stat Med. 2017 Jun 30;36(14):2187-2205. doi: 10.1002/sim.7263. Epub 2017 Mar 9.
8
Sample size planning for rank-based multiple contrast tests.基于等级的多重对照检验的样本量规划。
Biom J. 2024 Apr;66(3):e2300240. doi: 10.1002/bimj.202300240.
9
A bootstrap test for the analysis of microarray experiments with a very small number of replications.一种用于分析复制次数极少的微阵列实验的自助检验。
Appl Bioinformatics. 2006;5(3):173-9. doi: 10.2165/00822942-200605030-00005.
10
Resampling-Based Inference Methods for Comparing Two Coefficients Alpha.基于重采样的比较两个系数α的推断方法
Psychometrika. 2018 Mar;83(1):203-222. doi: 10.1007/s11336-017-9601-x. Epub 2018 Jan 2.

引用本文的文献

1
Current status of next-generation vaccines against mpox virus: a scoping review.抗猴痘病毒的下一代疫苗的现状:一项范围综述
Front Pharmacol. 2025 Apr 28;16:1533533. doi: 10.3389/fphar.2025.1533533. eCollection 2025.
2
Use of the Evidence-Based practice Attitude and utilization SurvEy to determine the use of evidence-based practice by chiropractic students.使用基于证据的实践态度与利用情况调查来确定脊椎按摩疗法专业学生对循证实践的运用。
J Chiropr Educ. 2025 May 31;39. doi: 10.7899/JCE-21-4.
3
A computationally efficient approach to false discovery rate control and power maximisation via randomisation and mirror statistic.

本文引用的文献

1
Simulation-based hypothesis testing of high dimensional means under covariance heterogeneity.协方差异质性下高维均值的基于模拟的假设检验
Biometrics. 2017 Dec;73(4):1300-1310. doi: 10.1111/biom.12695. Epub 2017 Mar 31.
2
A Two-Sample Test for Equality of Means in High Dimension.高维均值相等性的双样本检验
J Am Stat Assoc. 2015 Jun 1;110(510):837-849. doi: 10.1080/01621459.2014.934826.
3
Multivariate multidistance tests for high-dimensional low sample size case-control studies.高维小样本病例对照研究的多变量多距离检验
一种通过随机化和镜像统计进行错误发现率控制和功效最大化的计算高效方法。
Stat Methods Med Res. 2025 Jun;34(6):1233-1253. doi: 10.1177/09622802251329768. Epub 2025 Mar 31.
4
LD-informed deep learning for Alzheimer's gene loci detection using WGS data.基于全基因组测序(WGS)数据,利用LD信息的深度学习进行阿尔茨海默病基因座检测
Alzheimers Dement (N Y). 2025 Jan 16;11(1):e70041. doi: 10.1002/trc2.70041. eCollection 2025 Jan-Mar.
5
LD-informed deep learning for Alzheimer's gene loci detection using WGS data.基于全基因组测序(WGS)数据,利用LD信息的深度学习进行阿尔茨海默病基因座检测
medRxiv. 2024 Dec 12:2024.09.19.24313993. doi: 10.1101/2024.09.19.24313993.
6
High-dimensional multivariate analysis of variance via geometric median and bootstrapping.基于几何中位数和自举法的高维多元方差分析。
Biometrics. 2024 Jul 1;80(3). doi: 10.1093/biomtc/ujae088.
7
Integrated analysis of gut metabolome, microbiome, and exfoliome data in an equine model of intestinal injury.肠道代谢组、微生物组和表皮组数据的综合分析在肠道损伤的马模型中。
Microbiome. 2024 Apr 15;12(1):74. doi: 10.1186/s40168-024-01785-1.
8
Molecular and translational biology of the blood-based VeriStrat® proteomic test used in cancer immunotherapy treatment guidance.用于癌症免疫治疗指导的基于血液的VeriStrat®蛋白质组学检测的分子与转化生物学
J Mass Spectrom Adv Clin Lab. 2023 Nov 20;30:51-60. doi: 10.1016/j.jmsacl.2023.11.001. eCollection 2023 Nov.
9
The hitchhikers' guide to RNA sequencing and functional analysis.RNA 测序和功能分析的搭便车指南。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac529.
10
An ensemble learning approach to identify pastured poultry farm practice variables and soil constituents that promote prevalence.一种用于识别促进患病率的放牧式家禽养殖场实践变量和土壤成分的集成学习方法。
Heliyon. 2022 Nov 7;8(11):e11331. doi: 10.1016/j.heliyon.2022.e11331. eCollection 2022 Nov.
Stat Med. 2015 Apr 30;34(9):1511-26. doi: 10.1002/sim.6418. Epub 2015 Jan 29.
4
Multivariate tests based on interpoint distances with application to magnetic resonance imaging.基于点间距离的多元检验及其在磁共振成像中的应用。
Stat Methods Med Res. 2016 Dec;25(6):2593-2610. doi: 10.1177/0962280214529104. Epub 2014 Apr 16.
5
Multiple contrast tests for multiple endpoints in the presence of heteroscedasticity.存在异方差性时针对多个终点的多重对比检验。
Int J Biostat. 2014;10(1):17-28. doi: 10.1515/ijb-2012-0015.
6
Multiple contrasts for repeated measures.
Int J Biostat. 2013 Jul 27;9(1):/j/ijb.2013.9.issue-1/ijb-2012-0025/ijb-2012-0025.xml. doi: 10.1515/ijb-2012-0025.
7
Estimation of Box's ε for low- and high-dimensional repeated measures designs with unequal covariance matrices.针对具有不等协方差矩阵的低维和高维重复测量设计的Box's ε估计。
Biom J. 2012 May;54(3):301-16. doi: 10.1002/bimj.201100160.
8
Multiple contrast tests in the presence of heteroscedasticity.异方差情况下的多重对比检验。
Biom J. 2008 Oct;50(5):793-800. doi: 10.1002/bimj.200710466.
9
Simultaneous inference in general parametric models.一般参数模型中的同时推断。
Biom J. 2008 Jun;50(3):346-63. doi: 10.1002/bimj.200810425.
10
To permute or not to permute.是否进行置换。
Bioinformatics. 2006 Sep 15;22(18):2244-8. doi: 10.1093/bioinformatics/btl383. Epub 2006 Jul 26.