• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

小样本研究中接受者操作特征曲线下面积的置信区间

Confidence intervals for the receiver operating characteristic area in studies with small samples.

作者信息

Obuchowski N A, Lieber M L

机构信息

Department of Biostatistics and Epidemiology, Cleveland Clinic Foundation, OH 44195-5196, USA.

出版信息

Acad Radiol. 1998 Aug;5(8):561-71. doi: 10.1016/s1076-6332(98)80208-0.

DOI:10.1016/s1076-6332(98)80208-0
PMID:9702267
Abstract

RATIONALE AND OBJECTIVES

The authors performed this study to address two practical questions. First, how large does the sample size need to be for confidence intervals (CIs) based on the usual asymptotic methods to be appropriate? Second, when the sample size is smaller than this threshold, what alternative method of CI construction should be used?

MATERIALS AND METHODS

The authors performed a Monte Carlo simulation study where 95% CIs were constructed for the receiver operating characteristic (ROC) area and for the difference between two ROC areas for rating and continuous test results--for ROC areas of moderate and high accuracy--by using both parametric and nonparametric estimation methods. Alternative methods evaluated included several bootstrap CIs and CIs with the Student t distribution.

RESULTS

For the difference between two ROC areas, CIs based on the asymptotic theory provided adequate coverage even when the sample size was very small (20 patients). In contrast, for a single ROC area, the asymptotic methods do not provide adequate CI coverage for small samples; for ROC areas of high accuracy, the sample size must be large (more than 200 patients) for the asymptotic methods to be applicable. The recommended alternative (bootstrap percentile, bootstrap t, or bootstrap bias-corrected accelerated method) depends on the estimation approach, format of the test results, and ROC area.

CONCLUSION

Currently, there is not a single best alternative for constructing CIs for a single ROC area for small samples.

摘要

原理与目的

作者开展本研究以解决两个实际问题。其一,基于常用渐近方法的置信区间(CI)要适用的话,样本量需要多大?其二,当样本量小于该阈值时,应使用何种替代的CI构建方法?

材料与方法

作者进行了一项蒙特卡洛模拟研究,通过参数估计和非参数估计方法,针对中等和高精度的受试者工作特征(ROC)曲线下面积以及评分和连续检验结果的两个ROC曲线下面积之差构建95%置信区间。评估的替代方法包括几种自助法置信区间和基于学生t分布的置信区间。

结果

对于两个ROC曲线下面积之差,即使样本量非常小(20例患者),基于渐近理论的置信区间也能提供足够的覆盖范围。相比之下,对于单个ROC曲线下面积,渐近方法对于小样本不能提供足够的置信区间覆盖范围;对于高精度的ROC曲线下面积,渐近方法要适用的话样本量必须很大(超过200例患者)。推荐的替代方法(自助百分位数法、自助t法或自助偏差校正加速法)取决于估计方法、检验结果的形式以及ROC曲线下面积。

结论

目前,对于小样本单个ROC曲线下面积构建置信区间,不存在单一的最佳替代方法。

相似文献

1
Confidence intervals for the receiver operating characteristic area in studies with small samples.小样本研究中接受者操作特征曲线下面积的置信区间
Acad Radiol. 1998 Aug;5(8):561-71. doi: 10.1016/s1076-6332(98)80208-0.
2
A comparison of confidence/credible interval methods for the area under the ROC curve for continuous diagnostic tests with small sample size.小样本量连续诊断试验中ROC曲线下面积的置信/可信区间方法比较
Stat Methods Med Res. 2017 Dec;26(6):2603-2621. doi: 10.1177/0962280215602040. Epub 2015 Aug 30.
3
Confidence bounds when the estimated ROC area is 1.01.当估计的ROC面积为1.01时的置信区间。
Acad Radiol. 2002 May;9(5):526-30. doi: 10.1016/s1076-6332(03)80329-x.
4
Bootstrap estimation of diagnostic accuracy with patient-clustered data.使用患者聚类数据进行诊断准确性的自助法估计。
Acad Radiol. 2000 Jun;7(6):413-9. doi: 10.1016/s1076-6332(00)80381-5.
5
Confidence intervals for the length of the receiver-operating characteristic curve based on a smooth estimator.基于平滑估计的受试者工作特征曲线长度的置信区间。
Stat Methods Med Res. 2023 May;32(5):978-993. doi: 10.1177/09622802231160053. Epub 2023 Mar 15.
6
Coverage and precision of confidence intervals for area under the curve using parametric and non-parametric methods in a toxicokinetic experimental design.在毒代动力学实验设计中使用参数法和非参数法时曲线下面积置信区间的覆盖范围和精度。
Pharm Res. 1998 Mar;15(3):405-10. doi: 10.1023/a:1011968129921.
7
Components-of-variance models and multiple-bootstrap experiments: an alternative method for random-effects, receiver operating characteristic analysis.方差成分模型与多重自助法实验:一种用于随机效应、接受者操作特征分析的替代方法。
Acad Radiol. 2000 May;7(5):341-9. doi: 10.1016/s1076-6332(00)80008-2.
8
Applications of Monte Carlo Simulation in Modelling of Biochemical Processes蒙特卡罗模拟在生化过程建模中的应用
9
Monte Carlo validation of a multireader method for receiver operating characteristic discrete rating data: factorial experimental design.用于接收器操作特性离散评分数据的多读者方法的蒙特卡罗验证:析因实验设计
Acad Radiol. 1998 Sep;5(9):591-602. doi: 10.1016/s1076-6332(98)80294-8.
10
Bootstrap confidence intervals for adaptive cluster sampling.自适应整群抽样的自助置信区间。
Biometrics. 2000 Jun;56(2):503-10. doi: 10.1111/j.0006-341x.2000.00503.x.

引用本文的文献

1
Contrast-Enhanced Ultrasonography with Arrival Time Parametric Imaging as a Non-Invasive Diagnostic Tool for Liver Cirrhosis.对比增强超声造影结合到达时间参数成像作为肝硬化的无创诊断工具
Diagnostics (Basel). 2022 Dec 1;12(12):3013. doi: 10.3390/diagnostics12123013.
2
Low lean mass and chemotherapy toxicity risk in the elderly: the Fraction study protocol.低瘦体重与老年人化疗毒性风险:Fraction 研究方案。
BMC Cancer. 2019 Nov 27;19(1):1153. doi: 10.1186/s12885-019-6377-7.
3
A multimethod screening approach for pediatric depression onset: An incremental validity study.
一种用于儿童抑郁发作的多方法筛查方法:一项增量有效性研究。
J Consult Clin Psychol. 2019 Feb;87(2):184-197. doi: 10.1037/ccp0000364. Epub 2018 Dec 20.
4
Usefulness of the Waist Circumference-to-Height Ratio in Screening for Obesity and Metabolic Syndrome among Korean Children and Adolescents: Korea National Health and Nutrition Examination Survey, 2010-2014.腰围身高比在韩国儿童和青少年肥胖及代谢综合征筛查中的应用:2010 - 2014年韩国国家健康与营养检查调查
Nutrients. 2017 Mar 10;9(3):256. doi: 10.3390/nu9030256.
5
Estimating the Area Under ROC Curve When the Fitted Binormal Curves Demonstrate Improper Shape.当拟合的双正态曲线呈现不合适的形状时估计ROC曲线下的面积。
Acad Radiol. 2017 Feb;24(2):209-219. doi: 10.1016/j.acra.2016.09.020. Epub 2016 Nov 21.
6
Combining large number of weak biomarkers based on AUC.基于曲线下面积(AUC)合并大量弱生物标志物。
Stat Med. 2015 Dec 20;34(29):3811-30. doi: 10.1002/sim.6600. Epub 2015 Jul 30.
7
Contemporary Considerations for Constructing a Genetic Risk Score: An Empirical Approach.构建遗传风险评分的当代考量:一种实证方法
Genet Epidemiol. 2015 Sep;39(6):439-45. doi: 10.1002/gepi.21912. Epub 2015 Jul 22.
8
A wild bootstrap approach for the selection of biomarkers in early diagnostic trials.一种用于早期诊断试验中生物标志物选择的野生自助法。
BMC Med Res Methodol. 2015 May 1;15:43. doi: 10.1186/s12874-015-0025-y.
9
Cell signaling-based classifier predicts response to induction therapy in elderly patients with acute myeloid leukemia.基于细胞信号传导的分类器可预测老年急性髓系白血病患者对诱导治疗的反应。
PLoS One. 2015 Apr 17;10(4):e0118485. doi: 10.1371/journal.pone.0118485. eCollection 2015.
10
Visually cued action timing in the primary visual cortex.初级视觉皮层中的视觉提示动作时机
Neuron. 2015 Apr 8;86(1):319-30. doi: 10.1016/j.neuron.2015.02.043. Epub 2015 Mar 26.