Improving the reliability of diagnostic tests in population-based agreement studies.

Author affiliation

Massachusetts General Hospital and Harvard Medical School, Biostatistics Center, 50 Staniford Street, Suite 560, Boston, MA 02114, USA.

Publication information

Stat Med. 2010 Mar 15;29(6):617-26. doi: 10.1002/sim.3819.

DOI:10.1002/sim.3819
PMID:20128018
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5079112/
Abstract

Many large-scale studies have recently been carried out to assess the reliability of diagnostic procedures, such as mammography for the detection of breast cancer. The large numbers of raters and subjects involved raise new challenges in how to measure agreement in these types of studies. An important motivator of these studies is the identification of factors that contribute to the often wide discrepancies observed between raters' classifications, such as a rater's experience, in order to improve the reliability of the diagnostic process of interest. Incorporating covariate information into the agreement model is a key component in addressing these questions. Few agreement models are currently available that jointly model larger numbers of raters and subjects and incorporate covariate information. In this paper, we extend a recently developed population-based model and measure of agreement for binary ratings to incorporate covariate information using the class of generalized linear mixed models with a probit link function. Important information on factors related to the subjects and raters can be included as fixed and/or random effects in the model. We demonstrate how agreement can be assessed between subgroups of the raters and/or subjects, for example, comparing agreement between experienced and less experienced raters. Simulation studies are carried out to test the performance of the proposed models and measures of agreement. Application to a large-scale breast cancer study is presented.
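
To make the modelling idea concrete, here is a minimal simulation sketch, not the authors' implementation. It assumes a latent-variable form of a probit mixed model with crossed random effects for subjects and raters and one binary rater covariate (experience). All function names, parameter names, and numeric values are hypothetical. The summary computed is the raw proportion of agreeing rater pairs, which is not chance-corrected and is not the agreement measure proposed in the paper.

```python
import numpy as np


def simulate_ratings(n_subjects=200, n_raters=20, intercept=-0.5,
                     beta_experience=0.4, sd_subject=1.5, sd_rater=0.5,
                     seed=0):
    """Simulate binary ratings from a probit model with crossed random effects.

    Assumed model (illustrative only): rating[i, j] = 1 when
    intercept + beta_experience * x[j] + u[i] + v[j] + e[i, j] > 0,
    where e[i, j] ~ N(0, 1) gives the probit link.
    """
    rng = np.random.default_rng(seed)
    # Hypothetical rater covariate: raters alternate less experienced (0) / experienced (1).
    experienced = np.arange(n_raters) % 2
    u = rng.normal(0.0, sd_subject, size=(n_subjects, 1))   # subject random effects
    v = rng.normal(0.0, sd_rater, size=(1, n_raters))       # rater random effects
    e = rng.normal(0.0, 1.0, size=(n_subjects, n_raters))   # probit error term
    latent = intercept + beta_experience * experienced[None, :] + u + v + e
    return (latent > 0).astype(int), experienced


def pairwise_agreement(ratings):
    """Proportion of rater pairs giving the same rating, pooled over subjects."""
    ratings = np.asarray(ratings)
    n_subjects, n_raters = ratings.shape
    positives = ratings.sum(axis=1)
    negatives = n_raters - positives
    # Pairs that agree on 1 plus pairs that agree on 0, per subject.
    agreeing = positives * (positives - 1) / 2 + negatives * (negatives - 1) / 2
    total = n_subjects * n_raters * (n_raters - 1) / 2
    return float(agreeing.sum() / total)


if __name__ == "__main__":
    ratings, experienced = simulate_ratings()
    print("experienced raters:", pairwise_agreement(ratings[:, experienced == 1]))
    print("less experienced raters:", pairwise_agreement(ratings[:, experienced == 0]))
```

Splitting the rating matrix by the covariate mirrors the subgroup comparison described in the abstract. In a real analysis the model parameters would be estimated from data with mixed-model software rather than fixed in advance as they are here.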
