• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估基于调查数据估计的二项分类器。

Evaluating Binary Outcome Classifiers Estimated from Survey Data.

机构信息

From the Department of Statistical Science, Duke University, Durham, NC.

出版信息

Epidemiology. 2024 Nov 1;35(6):805-812. doi: 10.1097/EDE.0000000000001776. Epub 2024 Aug 14.

DOI:10.1097/EDE.0000000000001776
PMID:39140488
Abstract

Surveys are commonly used to facilitate research in epidemiology, health, and the social and behavioral sciences. Often, these surveys are not simple random samples, and respondents are given weights reflecting their probability of selection into the survey. We show that using survey weights can be beneficial for evaluating the quality of predictive models when splitting data into training and test sets. In particular, we characterize model assessment statistics, such as sensitivity and specificity, as finite population quantities and compute survey-weighted estimates of these quantities with test data comprising a random subset of the original data. Using simulations with data from the National Survey on Drug Use and Health and the National Comorbidity Survey, we show that unweighted metrics estimated with sample test data can misrepresent population performance, but weighted metrics appropriately adjust for the complex sampling design. We also show that this conclusion holds for models trained using upsampling for mitigating class imbalance. The results suggest that weighted metrics should be used when evaluating performance on test data derived from complex surveys.

摘要

调查通常用于促进流行病学、健康、社会和行为科学领域的研究。通常情况下,这些调查不是简单的随机样本,被调查者会被赋予反映其被选中参与调查的概率的权重。我们表明,在将数据分割为训练集和测试集时,使用调查权重可以有益于评估预测模型的质量。具体来说,我们将模型评估统计量(如灵敏度和特异性)表示为有限总体数量,并使用测试数据(由原始数据的随机子集组成)计算这些数量的调查加权估计值。使用来自国家药物使用和健康调查以及国家共病调查的数据进行模拟,我们表明,使用样本测试数据估计的未加权指标可能会对人口表现产生误导,但加权指标可以适当地调整复杂的抽样设计。我们还表明,对于使用上采样来减轻类别不平衡的模型,这一结论也成立。结果表明,当评估源自复杂调查的测试数据的性能时,应该使用加权指标。

相似文献

1
Evaluating Binary Outcome Classifiers Estimated from Survey Data.评估基于调查数据估计的二项分类器。
Epidemiology. 2024 Nov 1;35(6):805-812. doi: 10.1097/EDE.0000000000001776. Epub 2024 Aug 14.
2
Sexual Harassment and Prevention Training性骚扰与预防培训
3
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
7
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
8
Automated devices for identifying peripheral arterial disease in people with leg ulceration: an evidence synthesis and cost-effectiveness analysis.用于识别下肢溃疡患者外周动脉疾病的自动化设备:证据综合和成本效益分析。
Health Technol Assess. 2024 Aug;28(37):1-158. doi: 10.3310/TWCG3912.
9
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
10
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.

本文引用的文献

1
PROVIDENT: Development and Validation of a Machine Learning Model to Predict Neighborhood-level Overdose Risk in Rhode Island.PROVIDENT:开发和验证一种机器学习模型,以预测罗德岛地区的社区级药物过量风险。
Epidemiology. 2024 Mar 1;35(2):232-240. doi: 10.1097/EDE.0000000000001695. Epub 2024 Jan 2.
2
Implementing machine learning methods with complex survey data: Lessons learned on the impacts of accounting sampling weights in gradient boosting.运用复杂调查数据实施机器学习方法:梯度提升中会计抽样权重影响的经验教训。
PLoS One. 2023 Jan 13;18(1):e0280387. doi: 10.1371/journal.pone.0280387. eCollection 2023.
3
Transporting a Prediction Model for Use in a New Target Population.
将预测模型运用于新目标人群。
Am J Epidemiol. 2023 Feb 1;192(2):296-304. doi: 10.1093/aje/kwac128.
4
The First Year After Military Service: Predictors of U.S. Veterans' Suicidal Ideation.军人服役后第一年:美国退伍军人自杀意念的预测因素。
Am J Prev Med. 2022 Aug;63(2):233-241. doi: 10.1016/j.amepre.2022.03.017. Epub 2022 May 6.
5
Addressing Measurement Error in Random Forests Using Quantitative Bias Analysis.利用定量偏差分析解决随机森林中的测量误差问题。
Am J Epidemiol. 2021 Sep 1;190(9):1830-1840. doi: 10.1093/aje/kwab010.
6
Prediction, Not Association, Paves the Road to Precision Medicine.预测而非关联,为精准医学铺平道路。
JAMA Psychiatry. 2021 Feb 1;78(2):127-128. doi: 10.1001/jamapsychiatry.2020.2549.
7
Supervised Machine Learning: A Brief Primer.监督机器学习:简介。
Behav Ther. 2020 Sep;51(5):675-687. doi: 10.1016/j.beth.2020.05.002. Epub 2020 May 16.
8
Understanding Opioid Use Disorder (OUD) using tree-based classifiers.使用基于树的分类器理解阿片类药物使用障碍(OUD)。
Drug Alcohol Depend. 2020 Mar 1;208:107839. doi: 10.1016/j.drugalcdep.2020.107839. Epub 2020 Jan 15.
9
Innovative Identification of Substance Use Predictors: Machine Learning in a National Sample of Mexican Children.创新识别物质使用预测因子:墨西哥全国儿童样本中的机器学习。
Prev Sci. 2020 Feb;21(2):171-181. doi: 10.1007/s11121-020-01089-4.
10
Machine Learning for Fetal Growth Prediction.机器学习在胎儿生长预测中的应用。
Epidemiology. 2018 Mar;29(2):290-298. doi: 10.1097/EDE.0000000000000788.