• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Modeling rater diagnostic skills in binary classification processes.对二进制分类过程中的评分者诊断技能进行建模。
Stat Med. 2018 Feb 20;37(4):557-571. doi: 10.1002/sim.7530. Epub 2017 Nov 2.
2
Measuring rater bias in diagnostic tests with ordinal ratings.用等级评定测量诊断测试中的评分者偏倚。
Stat Med. 2021 Jul 30;40(17):4014-4033. doi: 10.1002/sim.9011. Epub 2021 May 9.
3
Bayesian analysis of ROC curves using Markov-chain Monte Carlo methods.使用马尔可夫链蒙特卡罗方法对ROC曲线进行贝叶斯分析。
Med Decis Making. 1996 Oct-Dec;16(4):404-11. doi: 10.1177/0272989X9601600411.
4
Assessing rater performance without a "gold standard" using consensus theory.运用共识理论在没有“金标准”的情况下评估评分者的表现。
Med Decis Making. 1997 Jan-Mar;17(1):71-9. doi: 10.1177/0272989X9701700108.
5
Bayesian hierarchical model for analyzing multiresponse longitudinal pharmacokinetic data.用于分析多响应纵向药代动力学数据的贝叶斯层次模型。
Stat Med. 2017 Dec 30;36(30):4816-4830. doi: 10.1002/sim.7505. Epub 2017 Sep 27.
6
Inference on cancer screening exam accuracy using population-level administrative data.利用人群水平的行政数据推断癌症筛查检查的准确性。
Stat Med. 2016 Jan 15;35(1):130-46. doi: 10.1002/sim.6619. Epub 2015 Aug 16.
7
Assessing the convergence of Markov Chain Monte Carlo methods: an example from evaluation of diagnostic tests in absence of a gold standard.评估马尔可夫链蒙特卡罗方法的收敛性:来自无金标准情况下诊断试验评估的一个例子。
Prev Vet Med. 2007 May 16;79(2-4):244-56. doi: 10.1016/j.prevetmed.2007.01.003. Epub 2007 Feb 9.
8
Assessing the influence of rater and subject characteristics on measures of agreement for ordinal ratings.评估评分者和受试者特征对有序评分一致性测量的影响。
Stat Med. 2017 Sep 10;36(20):3181-3199. doi: 10.1002/sim.7323. Epub 2017 Jun 13.
9
Using latent variable modeling and multiple imputation to calibrate rater bias in diagnosis assessment.使用潜在变量建模和多重插补来校准诊断评估中的评分者偏差。
Stat Med. 2011 Jan 30;30(2):160-74. doi: 10.1002/sim.4109. Epub 2010 Nov 5.
10
A Hierarchical Multi-Unidimensional IRT Approach for Analyzing Sparse, Multi-Group Data for Integrative Data Analysis.一种用于综合数据分析的稀疏多组数据分层多单维项目反应理论方法。
Psychometrika. 2015 Sep;80(3):834-55. doi: 10.1007/s11336-014-9420-2. Epub 2014 Sep 30.

引用本文的文献

1
Measuring rater bias in diagnostic tests with ordinal ratings.用等级评定测量诊断测试中的评分者偏倚。
Stat Med. 2021 Jul 30;40(17):4014-4033. doi: 10.1002/sim.9011. Epub 2021 May 9.
2
Bayesian hierarchical latent class models for estimating diagnostic accuracy.贝叶斯层次潜在类别模型估计诊断准确性。
Stat Methods Med Res. 2020 Apr;29(4):1112-1128. doi: 10.1177/0962280219852649. Epub 2019 May 30.

本文引用的文献

1
Estimating diagnostic accuracy without a gold standard: A continued controversy.在没有金标准的情况下评估诊断准确性:持续存在的争议。
J Biopharm Stat. 2016;26(6):1078-1082. doi: 10.1080/10543406.2016.1226334. Epub 2016 Aug 22.
2
Focused Professional Performance Evaluation of a Radiologist--a Centers for Medicare and Medicaid Services and Joint Commission Requirement.放射科医生的专项专业绩效评估——医疗保险和医疗补助服务中心及联合委员会的要求
Curr Probl Diagn Radiol. 2016 Mar-Apr;45(2):87-93. doi: 10.1067/j.cpradiol.2015.08.006. Epub 2015 Aug 14.
3
Generalized linear mixed models for multi-reader multi-case studies of diagnostic tests.用于诊断试验多读者多病例研究的广义线性混合模型。
Stat Methods Med Res. 2017 Jun;26(3):1373-1388. doi: 10.1177/0962280215579476. Epub 2015 Apr 5.
4
On ROC analysis with nonbinary reference standard.使用非二元参考标准进行ROC分析。
Biom J. 2012 Jul;54(4):457-80. doi: 10.1002/bimj.201100206. Epub 2012 May 29.
5
Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.随机松弛,吉布斯分布,以及贝叶斯图像恢复。
IEEE Trans Pattern Anal Mach Intell. 1984 Jun;6(6):721-41. doi: 10.1109/tpami.1984.4767596.
6
Understanding diagnostic tests 3: Receiver operating characteristic curves.理解诊断测试3:受试者工作特征曲线。
Acta Paediatr. 2007 May;96(5):644-7. doi: 10.1111/j.1651-2227.2006.00178.x. Epub 2007 Mar 21.
7
Hierarchical models for ROC curve summary measures: design and analysis of multi-reader, multi-modality studies of medical tests.用于ROC曲线汇总指标的分层模型:医学检验多阅片者、多模态研究的设计与分析
Stat Med. 2008 Jan 30;27(2):243-56. doi: 10.1002/sim.2828.
8
An ROC-type measure of diagnostic accuracy when the gold standard is continuous-scale.当金标准为连续尺度时诊断准确性的一种ROC类测量方法。
Stat Med. 2006 Feb 15;25(3):481-93. doi: 10.1002/sim.2228.
9
Receiver operating characteristic (ROC) curve: practical review for radiologists.接受者操作特征(ROC)曲线:放射科医生实用综述
Korean J Radiol. 2004 Jan-Mar;5(1):11-8. doi: 10.3348/kjr.2004.5.1.11.
10
Association of volume and volume-independent factors with accuracy in screening mammogram interpretation.乳房X线筛查影像解读中容积及非容积因素与准确性的关联
J Natl Cancer Inst. 2003 Feb 19;95(4):282-90. doi: 10.1093/jnci/95.4.282.

对二进制分类过程中的评分者诊断技能进行建模。

Modeling rater diagnostic skills in binary classification processes.

机构信息

Department of Statistics, University of South Carolina, Columbia, SC, 29208, USA.

Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA.

出版信息

Stat Med. 2018 Feb 20;37(4):557-571. doi: 10.1002/sim.7530. Epub 2017 Nov 2.

DOI:10.1002/sim.7530
PMID:29094378
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5771896/
Abstract

Many disease diagnoses involve subjective judgments by qualified raters. For example, through the inspection of a mammogram, MRI, or ultrasound image, the clinician himself becomes part of the measuring instrument. To reduce diagnostic errors and improve the quality of diagnoses, it is necessary to assess raters' diagnostic skills and to improve their skills over time. This paper focuses on a subjective binary classification process, proposing a hierarchical model linking data on rater opinions with patient true disease-development outcomes. The model allows for the quantification of the effects of rater diagnostic skills (bias and magnifier) and patient latent disease severity on the rating results. A Bayesian Markov chain Monte Carlo (MCMC) algorithm is developed to estimate these parameters. Linking to patient true disease outcomes, the rater-specific sensitivity and specificity can be estimated using MCMC samples. Cost theory is used to identify poor- and strong-performing raters and to guide adjustment of rater bias and diagnostic magnifier to improve the rating performance. Furthermore, diagnostic magnifier is shown as a key parameter to present a rater's diagnostic ability because a rater with a larger diagnostic magnifier has a uniformly better receiver operating characteristic (ROC) curve when varying the value of diagnostic bias. A simulation study is conducted to evaluate the proposed methods, and the methods are illustrated with a mammography example.

摘要

许多疾病诊断都涉及到有资质的评估者进行主观判断。例如,通过对乳房 X 光片、MRI 或超声图像的检查,临床医生自身就成为了测量仪器的一部分。为了减少诊断错误,提高诊断质量,有必要评估评估者的诊断技能,并随着时间的推移提高他们的技能。本文专注于主观的二分类过程,提出了一个层次模型,将评估者意见的数据与患者真实的疾病发展结果联系起来。该模型允许量化评估者诊断技能(偏差和放大率)和患者潜在疾病严重程度对评分结果的影响。开发了一种贝叶斯马尔可夫链蒙特卡罗(MCMC)算法来估计这些参数。通过与患者真实的疾病结果相联系,可以使用 MCMC 样本估计评估者特定的敏感性和特异性。成本理论用于识别表现不佳和表现良好的评估者,并指导调整评估者偏差和诊断放大率,以提高评分性能。此外,诊断放大率被视为一个关键参数来展示评估者的诊断能力,因为当诊断偏差值变化时,具有较大诊断放大率的评估者具有更均匀的更好的接收器操作特征(ROC)曲线。进行了模拟研究来评估所提出的方法,并通过乳房 X 光摄影示例说明了这些方法。