• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

运用共识理论在没有“金标准”的情况下评估评分者的表现。

Assessing rater performance without a "gold standard" using consensus theory.

作者信息

Weller S C, Mann N C

机构信息

Department of Preventive Medicine and Community Health, University of Texas Medical Branch, Galveston 77555-1153, USA.

出版信息

Med Decis Making. 1997 Jan-Mar;17(1):71-9. doi: 10.1177/0272989X9701700108.

DOI:10.1177/0272989X9701700108
PMID:8994153
Abstract

This study illustrates the use of consensus theory to assess the diagnostic performances of raters and to estimate case diagnoses in the absence of a criterion or "gold" standard. A description is provided of how consensus theory "pools" information provided by raters, estimating rater competencies and differentially weighting their responses. Although the model assumes that raters respond without bias (i.e., sensitivity = specificity), a Monte Carlo simulation with 1,200 data sets shows that model estimates appear to be robust even with bias. The model is illustrated on a set of elbow radiographs, and consensus-model estimates are compared with those obtained from follow-up data. Results indicate that with high rater competencies, the model retrieves accurate estimates of competency and case diagnoses even when raters' responses are biased.

摘要

本研究阐述了如何运用共识理论评估评分者的诊断表现,并在缺乏标准或“金”标准的情况下估计病例诊断。文中描述了共识理论如何“汇总”评分者提供的信息,评估评分者能力并对他们的回答进行差异化加权。尽管该模型假设评分者的回答无偏差(即敏感度 = 特异度),但一项对1200个数据集的蒙特卡洛模拟表明,即使存在偏差,模型估计似乎也很稳健。该模型在一组肘部X光片上进行了演示,并将共识模型估计与从随访数据中获得的估计进行了比较。结果表明,当评分者能力较高时,即使评分者的回答存在偏差,该模型也能获得准确的能力估计和病例诊断。

相似文献

1
Assessing rater performance without a "gold standard" using consensus theory.运用共识理论在没有“金标准”的情况下评估评分者的表现。
Med Decis Making. 1997 Jan-Mar;17(1):71-9. doi: 10.1177/0272989X9701700108.
2
Modeling rater diagnostic skills in binary classification processes.对二进制分类过程中的评分者诊断技能进行建模。
Stat Med. 2018 Feb 20;37(4):557-571. doi: 10.1002/sim.7530. Epub 2017 Nov 2.
3
Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study.使用个体拟合统计量检测评分者偏差:一项蒙特卡罗模拟研究。
Perspect Med Educ. 2018 Apr;7(2):83-92. doi: 10.1007/s40037-017-0391-8.
4
A crossed random effects modeling approach for estimating diagnostic accuracy from ordinal ratings without a gold standard.一种交叉随机效应模型方法,用于在没有金标准的情况下从等级评分中估计诊断准确性。
Stat Med. 2013 Sep 10;32(20):3472-85. doi: 10.1002/sim.5784. Epub 2013 Mar 26.
5
Reliability and Effectiveness of Smartphone Technology for the Diagnosis and Treatment Planning of Pediatric Elbow Trauma.智能手机技术在小儿肘部创伤诊断与治疗规划中的可靠性和有效性
J Pediatr Orthop. 2016 Jul-Aug;36(5):483-7. doi: 10.1097/BPO.0000000000000477.
6
Accuracy of point-of-care ultrasonography for diagnosis of elbow fractures in children.即时超声诊断儿童肘部骨折的准确性。
Ann Emerg Med. 2013 Jan;61(1):9-17. doi: 10.1016/j.annemergmed.2012.07.112. Epub 2012 Nov 9.
7
Comparison of algorithm-based estimates of occupational diesel exhaust exposure to those of multiple independent raters in a population-based case-control study.在一项基于人群的病例对照研究中,基于算法的职业性柴油尾气暴露估计值与多个独立评估者的估计值的比较。
Ann Occup Hyg. 2013 May;57(4):470-81. doi: 10.1093/annhyg/mes082. Epub 2012 Nov 25.
8
Using latent variable modeling and multiple imputation to calibrate rater bias in diagnosis assessment.使用潜在变量建模和多重插补来校准诊断评估中的评分者偏差。
Stat Med. 2011 Jan 30;30(2):160-74. doi: 10.1002/sim.4109. Epub 2010 Nov 5.
9
Using a Dual-Input Convolutional Neural Network for Automated Detection of Pediatric Supracondylar Fracture on Conventional Radiography.利用双输入卷积神经网络自动检测常规 X 光片中的小儿髁上骨折。
Invest Radiol. 2020 Feb;55(2):101-110. doi: 10.1097/RLI.0000000000000615.
10
Improved diagnostic confidence and accuracy of pediatric elbow fractures with digital tomosynthesis.数字断层融合摄影术提高小儿肘部骨折的诊断信心和准确性。
Pediatr Radiol. 2020 Mar;50(3):363-370. doi: 10.1007/s00247-019-04548-5. Epub 2019 Nov 19.

引用本文的文献

1
Feeding Habits in the Cultural Domains of Child Care: Elements for Health Promotion.儿童保健文化领域的喂养习惯:促进健康的要素。
Front Public Health. 2021 Mar 3;9:536176. doi: 10.3389/fpubh.2021.536176. eCollection 2021.
2
How to detect high-performing individuals and groups: Decision similarity predicts accuracy.如何发现表现优异的个体和群体:决策相似度可预测准确性。
Sci Adv. 2019 Nov 20;5(11):eaaw9011. doi: 10.1126/sciadv.aaw9011. eCollection 2019 Nov.
3
Precision annotation of digital samples in NCBI's gene expression omnibus.
NCBI 基因表达综合数据库中数字样本的精确注释。
Sci Data. 2017 Sep 19;4:170125. doi: 10.1038/sdata.2017.125.
4
Interactive machine learning for health informatics: when do we need the human-in-the-loop?健康信息学中的交互式机器学习:何时需要人工介入?
Brain Inform. 2016 Jun;3(2):119-131. doi: 10.1007/s40708-016-0042-6. Epub 2016 Mar 2.
5
Voting for image scoring and assessment (VISA)--theory and application of a 2 + 1 reader algorithm to improve accuracy of imaging endpoints in clinical trials.影像评分与评估投票法(VISA)——一种2+1阅片者算法在提高临床试验影像终点准确性方面的理论与应用
BMC Med Imaging. 2015 Feb 19;15:6. doi: 10.1186/s12880-015-0049-0.
6
Development of a diagnostic test set to assess agreement in breast pathology: practical application of the Guidelines for Reporting Reliability and Agreement Studies (GRRAS).制定一套用于评估乳腺病理学一致性的诊断测试:《报告可靠性和一致性研究指南(GRRAS)》的实际应用。
BMC Womens Health. 2013 Feb 5;13:3. doi: 10.1186/1472-6874-13-3.
7
Accuracy of symptoms, signs, and C-reactive protein for early chronic obstructive pulmonary disease.症状、体征和 C 反应蛋白对早期慢性阻塞性肺疾病的诊断准确性。
Br J Gen Pract. 2012 Sep;62(602):e632-8. doi: 10.3399/bjgp12X654605.
8
Living with sugar: influence of cultural beliefs on type 2 diabetes self-management of English-speaking Afro-Caribbean women.与糖共存:文化信仰对讲英语的非洲裔加勒比裔女性 2 型糖尿病自我管理的影响。
J Immigr Minor Health. 2012 Aug;14(4):640-7. doi: 10.1007/s10903-011-9513-2.
9
Undetected chronic obstructive pulmonary disease and asthma in people over 50 years with persistent cough.50 岁以上持续性咳嗽人群中未被发现的慢性阻塞性肺疾病和哮喘。
Br J Gen Pract. 2010 Jul;60(576):489-94. doi: 10.3399/bjgp10X514738.
10
Informed decision making changes test preferences for colorectal cancer screening in a diverse population.知情决策改变了不同人群对结直肠癌筛查的检测偏好。
Ann Fam Med. 2010 Mar-Apr;8(2):141-50. doi: 10.1370/afm.1054.