On understanding reliability for diagnostic tests.

Author information

Bogduk Nikolai

Affiliation

The University of Newcastle, PO Box 431, East Maitland, NSW, 2323, Australia.

Publication information

Interv Pain Med. 2022 Aug 15;1(Suppl 2):100124. doi: 10.1016/j.inpm.2022.100124. eCollection 2022.

DOI: 10.1016/j.inpm.2022.100124
PMID: 39239130
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11372993/

Abstract

For professional practice to be responsible, any diagnostic tests used must be reliable. Therefore, the reliability of any diagnostic test needs to have been measured. The classical statistic for quantifying reliability is Kappa. Although Kappa can be promptly determined using a programmed calculator, using an algorithm to derive Kappa provides greater insight into what it is actually measuring and why. Kappa scores can be graded, with verbal descriptors applied to different grades. However, those descriptors do not necessarily reflect the degree of skill required to achieve different grades of Kappa. High levels of skill attract high Kappa scores, but Kappa scores described as fair or moderate are not necessarily flattering because they can be achieved with questionable levels of skill. Various corrections can be applied to the calculation of Kappa scores in order to raise their value, and to improve the verbal descriptors of their grade, but these may not be legitimate or necessary. Low Kappa scores do not condemn tests, but they serve to raise questions about their reliability.
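
The abstract refers to deriving Kappa step by step and to grading it with verbal descriptors, without giving the formulas. As a minimal sketch, assuming two raters who each give a positive or negative result for the same patients, the standard Cohen's Kappa can be computed from the four cells of their agreement table. The function names, the cell labels a, b, c, d, and the use of the Landis and Koch descriptor bands are illustrative assumptions here, not taken from the paper itself.

```python
from typing import Tuple


def cohen_kappa_2x2(a: int, b: int, c: int, d: int) -> Tuple[float, float, float]:
    """Return (observed agreement, chance agreement, kappa) for two raters.

    Cell labels are an assumption of this sketch:
      a = both raters positive      b = rater 1 positive, rater 2 negative
      c = rater 1 negative, rater 2 positive      d = both raters negative
    """
    n = a + b + c + d
    if n == 0:
        raise ValueError("the agreement table is empty")

    # Observed agreement: proportion of cases on which the raters agree.
    p_o = (a + d) / n

    # Chance agreement: product of each rater's marginal rates,
    # summed over the positive and negative categories.
    p_pos = ((a + b) / n) * ((a + c) / n)
    p_neg = ((c + d) / n) * ((b + d) / n)
    p_e = p_pos + p_neg

    if p_e == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")

    # Kappa: agreement achieved beyond chance, as a fraction of the
    # agreement that was available beyond chance.
    kappa = (p_o - p_e) / (1.0 - p_e)
    return p_o, p_e, kappa


def landis_koch_descriptor(kappa: float) -> str:
    """Map kappa to the Landis and Koch (1977) verbal bands (assumed scale)."""
    if kappa < 0:
        return "poor"
    for upper, label in [(0.20, "slight"), (0.40, "fair"),
                         (0.60, "moderate"), (0.80, "substantial")]:
        if kappa <= upper:
            return label
    return "almost perfect"


if __name__ == "__main__":
    # Hypothetical counts for 100 patients, chosen only for illustration.
    p_o, p_e, kappa = cohen_kappa_2x2(45, 5, 10, 40)
    print(f"observed={p_o:.2f} chance={p_e:.2f} kappa={kappa:.2f} "
          f"({landis_koch_descriptor(kappa)})")
```

With the hypothetical counts above, the raters agree on 85% of cases, chance alone would account for 50%, and Kappa is about 0.70. Writing the calculation out this way shows that Kappa measures only the agreement gained beyond chance, which is why a high raw agreement does not guarantee a high Kappa.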

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34cc/11372993/37abad332d31/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34cc/11372993/ebef69813a5b/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34cc/11372993/6f662569b7a6/gr2.jpg
