• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

你需要了解的统计学知识,第二部分:诊断和筛查测试的可靠性。

What you need to know about statistics, part II: reliability of diagnostic and screening tests.

作者信息

Zidan Marwan, Thomas Ronald L, Slovis Thomas L

机构信息

Children's Research Center of Michigan, Department of Pediatrics, Wayne State University School of Medicine, 3901 Beaubien Blvd., Detroit, MI, 48201, USA,

出版信息

Pediatr Radiol. 2015 Mar;45(3):317-28. doi: 10.1007/s00247-014-2944-x. Epub 2015 Mar 1.

DOI:10.1007/s00247-014-2944-x
PMID:25726014
Abstract

The foundation for the usefulness of any diagnostic test should be that it is both reliable and accurate in its clinical diagnosis. In this article we present the second of a two-part series on validity and reliability, discussing the assessment of reliability among raters of diagnostic tests and between diagnostics tests themselves. To examine reproducibility (reliability) among raters of diagnostic tests we present the calculation of two statistical procedures: (1) the kappa coefficient statistic when presented with categorical data for the presence or absence of a clinical diagnosis and (2) the intraclass correlation coefficient (ICC) for continuously scaled data among raters. The accuracy among diagnostic tests (i.e. their interchangeability) can be evaluated by application of (1) a Bland-Altman plot procedure (with its 95% limits of agreement) and (2) the Passing-Bablok regression procedure (for the identification and evaluation of systematic and proportional differences). When deciding whether to select a diagnostic test one must evaluate its ability to provide more precise information than a gold standard test, and whether in clinical practice it would be more beneficial for patients to adopt it.

摘要

任何诊断测试有用性的基础都应该是其在临床诊断中既可靠又准确。在本文中,我们呈现关于效度和信度的系列文章的第二篇,讨论诊断测试评分者之间以及诊断测试本身之间的信度评估。为了检验诊断测试评分者之间的可重复性(信度),我们介绍两种统计方法的计算:(1)当呈现临床诊断存在或不存在的分类数据时的kappa系数统计量,以及(2)评分者之间连续尺度数据的组内相关系数(ICC)。诊断测试之间的准确性(即它们的互换性)可以通过应用(1)Bland-Altman图程序(及其95%一致性界限)和(2)Passing-Bablok回归程序(用于识别和评估系统差异和比例差异)来评估。在决定是否选择一种诊断测试时,必须评估其提供比金标准测试更精确信息的能力,以及在临床实践中采用它对患者是否更有益。

相似文献

1
What you need to know about statistics, part II: reliability of diagnostic and screening tests.你需要了解的统计学知识,第二部分:诊断和筛查测试的可靠性。
Pediatr Radiol. 2015 Mar;45(3):317-28. doi: 10.1007/s00247-014-2944-x. Epub 2015 Mar 1.
2
What you need to know about statistics Part I: validity of diagnostic and screening tests.你需要了解的统计学知识 第一部分:诊断性试验和筛查试验的效度
Pediatr Radiol. 2015 Feb;45(2):146-52. doi: 10.1007/s00247-014-2882-7. Epub 2015 Jan 31.
3
Agreement Analysis: What He Said, She Said Versus You Said.一致性分析:他说、她说与你说的。
Anesth Analg. 2018 Jun;126(6):2123-2128. doi: 10.1213/ANE.0000000000002924.
4
Asymptotic distributions of kappa statistics and their differences with many raters, many rating categories and two conditions.kappa统计量的渐近分布及其在多评分者、多评分类别和两种条件下的差异。
Biom J. 2018 Jan;60(1):146-154. doi: 10.1002/bimj.201700016. Epub 2017 Nov 7.
5
[Design and analysis of a study on reliability and validity of diagnostic tests in urological clinical research].[泌尿外科临床研究中诊断试验可靠性和有效性研究的设计与分析]
Arch Esp Urol. 2003 Jul-Aug;56(6):645-56.
6
A spreadsheet for the calculation of comprehensive statistics for the assessment of diagnostic tests and inter-rater agreement.一个用于计算诊断试验评估和评分者间一致性综合统计数据的电子表格。
Comput Biol Med. 2000 May;30(3):127-34. doi: 10.1016/s0010-4825(00)00006-8.
7
Reliability of assessment tools in rehabilitation: an illustration of appropriate statistical analyses.康复评估工具的可靠性:适当统计分析示例
Clin Rehabil. 1998 Jun;12(3):187-99. doi: 10.1191/026921598672178340.
8
Comparing two diagnostic tests against the same "gold standard" in the same sample.在同一样本中,将两种诊断测试与同一“金标准”进行比较。
Biometrics. 1997 Mar;53(1):73-85.
9
When do latent class models overstate accuracy for diagnostic and other classifiers in the absence of a gold standard?在没有金标准的情况下,潜在类别模型何时会高估诊断及其他分类器的准确性?
Biometrics. 2012 Jun;68(2):559-66. doi: 10.1111/j.1541-0420.2011.01694.x. Epub 2011 Oct 21.
10
Evaluation of a wearable body monitoring device during treadmill walking and jogging in patients with fibromyalgia syndrome.评估可穿戴身体监测设备在纤维肌痛综合征患者进行跑步机行走和慢跑时的表现。
Arch Phys Med Rehabil. 2012 Jan;93(1):115-22. doi: 10.1016/j.apmr.2011.08.021.

引用本文的文献

1
Persistent microbial infections and idiopathic pulmonary fibrosis - an insight into pathogenesis.持续性微生物感染与特发性肺纤维化——发病机制的深入探讨
Front Cell Infect Microbiol. 2024 Dec 20;14:1479801. doi: 10.3389/fcimb.2024.1479801. eCollection 2024.
2
Comparable analysis of six immunoassays for carcinoembryonic antigen detection.六种癌胚抗原检测免疫分析方法的比较分析。
Heliyon. 2024 Jan 29;10(3):e25158. doi: 10.1016/j.heliyon.2024.e25158. eCollection 2024 Feb 15.
3
Location In Vivo of the Innervation Zone in the Human Medial Gastrocnemius Using Imposed Contractions: A Comparison of the Usefulness of the M-Wave and H-Reflex.

本文引用的文献

1
Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit.加权kappa系数:用于衡量名义尺度上的一致性,并考虑了尺度不一致或部分得分的情况。
Psychol Bull. 1968 Oct;70(4):213-20. doi: 10.1037/h0026256.
2
Analysis of agreement between measurements of continuous variables: general principles and lessons from studies of imaging of carotid stenosis.
J Neurol. 2000 Nov;247(11):825-34. doi: 10.1007/s004150070068.
3
Measurement reliability and agreement in psychiatry.精神病学中的测量可靠性与一致性
利用强制收缩确定人体内侧腓肠肌神经支配区的体内位置:M波与H反射有用性的比较
J Funct Morphol Kinesiol. 2022 Nov 28;7(4):107. doi: 10.3390/jfmk7040107.
4
Validation of low anterior resection syndrome score in Brazil with Portuguese.巴西用葡萄牙语对低位前切除综合征评分进行的验证。
Ann Coloproctol. 2023 Oct;39(5):402-409. doi: 10.3393/ac.2022.00136.0019. Epub 2022 May 13.
5
Clinical and laboratory parameters associated with li-rads as diagnostic of liver nodule in patients with cirrhosis.与肝脏影像报告和数据系统(LI-RADS)相关的临床和实验室参数对肝硬化患者肝脏结节的诊断价值
Transl Gastroenterol Hepatol. 2021 Oct 25;6:55. doi: 10.21037/tgh.2020.01.05. eCollection 2021.
6
Ferumoxytol magnetic resonance imaging detects joint and pleural infiltration of bone sarcomas in pediatric and young adult patients.铁磁共振成像检测儿科和青年成骨肉瘤的关节和胸膜浸润。
Pediatr Radiol. 2021 Dec;51(13):2521-2529. doi: 10.1007/s00247-021-05156-y. Epub 2021 Aug 19.
7
The value of bi-exponential and non-Gaussian distribution diffusion-weighted imaging in the differentiation of recurrent soft tissue neoplasms and post-surgical changes.双指数和非高斯分布扩散加权成像在复发性软组织肿瘤与术后改变鉴别诊断中的价值
Ann Transl Med. 2020 Nov;8(21):1357. doi: 10.21037/atm-20-2025.
8
Utility of radiographic measurements to predict echocardiographic left heart enlargement in dogs with preclinical myxomatous mitral valve disease.影像学测量对预测临床前期黏液瘤样二尖瓣疾病犬超声心动图左心扩大的效用。
J Vet Intern Med. 2020 Sep;34(5):1728-1733. doi: 10.1111/jvim.15854. Epub 2020 Jul 20.
9
Performance of LI-RADS version 2018 CT treatment response algorithm in tumor response evaluation and survival prediction of patients with single hepatocellular carcinoma after radiofrequency ablation.2018版LI-RADS CT治疗反应算法在单发性肝细胞癌患者射频消融术后肿瘤反应评估及生存预测中的性能
Ann Transl Med. 2020 Mar;8(6):388. doi: 10.21037/atm.2020.03.120.
10
Paediatric vision screening by non-healthcare volunteers: evidence based practices.儿科视力筛查由非医护志愿者进行:循证实践。
BMC Med Educ. 2019 Feb 28;19(1):65. doi: 10.1186/s12909-019-1498-x.
Stat Methods Med Res. 1998 Sep;7(3):301-17. doi: 10.1177/096228029800700306.
4
Bias, prevalence and kappa.偏倚、患病率及kappa值
J Clin Epidemiol. 1993 May;46(5):423-9. doi: 10.1016/0895-4356(93)90018-v.
5
How to read clinical journals: II. To learn about a diagnostic test.如何阅读临床期刊:II. 了解一项诊断性检查。
Can Med Assoc J. 1981 Mar 15;124(6):703-10.
6
The intraclass correlation coefficient as a measure of reliability.组内相关系数作为可靠性的一种度量。
Psychol Rep. 1966 Aug;19(1):3-11. doi: 10.2466/pr0.1966.19.1.3.
7
Statistical methods for assessing agreement between two methods of clinical measurement.评估两种临床测量方法之间一致性的统计方法。
Lancet. 1986 Feb 8;1(8476):307-10.
8
A note on the use of the intraclass correlation coefficient in the evaluation of agreement between two methods of measurement.关于组内相关系数在评估两种测量方法之间一致性时的应用说明。
Comput Biol Med. 1990;20(5):337-40. doi: 10.1016/0010-4825(90)90013-f.
9
The measurement of observer agreement for categorical data.分类数据观察者一致性的测量。
Biometrics. 1977 Mar;33(1):159-74.