• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

如何分析新检测方法的诊断性能?图文并茂解释说明。

How to Analyze the Diagnostic Performance of a New Test? Explained with Illustrations.

作者信息

Dhamnetiya Deepak, Jha Ravi Prakash, Shalini Shalini, Bhattacharyya Krittika

机构信息

Department of Community Medicine, Dr Baba Saheb Ambedkar Medical College and Hospital, Rohini, Delhi, India.

Lady Hardinge Medical College, Delhi, India.

出版信息

J Lab Physicians. 2021 Sep 8;14(1):90-98. doi: 10.1055/s-0041-1734019. eCollection 2022 Mar.

DOI:10.1055/s-0041-1734019
PMID:36186253
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9519267/
Abstract

Diagnostic tests are pivotal in modern medicine due to their applications in statistical decision-making regarding confirming or ruling out the presence of a disease in patients. In this regard, sensitivity and specificity are two most important and widely utilized components that measure the inherent validity of a diagnostic test for dichotomous outcomes against a gold standard test. Other diagnostic indices like positive predictive value, negative predictive value, positive likelihood ratio, negative likelihood ratio, accuracy of a diagnostic test, and the effect of prevalence on various diagnostic indices have also been discussed. We have tried to present the performance of a classification model at all classification thresholds by reviewing the receiver operating characteristic (ROC) curve and the depiction of the tradeoff between sensitivity and (1-specificity) across a series of cutoff points when the diagnostic test is on a continuous scale. The area under the ROC (AUROC) and comparison of AUROCs of different tests have also been discussed. Reliability of a test is defined in terms of the repeatability of the test such that the test gives consistent results when repeated more than once on the same individual or material, under the same conditions. In this article, we have presented the calculation of kappa coefficient, which is the simplest way of finding the agreement between two observers by calculating the overall percentage of agreement. When the prevalence of disease in the population is low, prospective study becomes increasingly difficult to handle through the conventional design. Hence, we chose to describe three more designs along with the conventional one and presented the sensitivity and specificity calculations for those designs. We tried to offer some guidance in choosing the best possible design among these four designs, depending on a number of factors. The ultimate aim of this article is to provide the basic conceptual framework and interpretation of various diagnostic test indices, ROC analysis, comparison of diagnostic accuracy of different tests, and the reliability of a test so that the clinicians can use it effectively. Several R packages, as mentioned in this article, can prove handy during quantitative synthesis of clinical data related to diagnostic tests.

摘要

诊断测试在现代医学中至关重要,因为它们在关于确认或排除患者疾病存在的统计决策中发挥着作用。在这方面,敏感性和特异性是衡量针对金标准测试的二分结果诊断测试固有有效性的两个最重要且广泛使用的组成部分。还讨论了其他诊断指标,如阳性预测值、阴性预测值、阳性似然比、阴性似然比、诊断测试的准确性以及患病率对各种诊断指标的影响。我们试图通过回顾接收者操作特征(ROC)曲线以及当诊断测试为连续尺度时在一系列截断点上敏感性与(1 - 特异性)之间权衡的描述,来展示分类模型在所有分类阈值下的性能。还讨论了ROC曲线下面积(AUROC)以及不同测试的AUROC比较。测试的可靠性是根据测试的可重复性来定义的,即当在相同条件下对同一个体或材料重复进行多次测试时,测试能给出一致的结果。在本文中,我们介绍了kappa系数的计算方法,这是通过计算总体一致百分比来找出两个观察者之间一致性的最简单方法。当人群中疾病患病率较低时,通过传统设计进行前瞻性研究变得越来越困难。因此,我们选择除了传统设计之外再描述三种设计,并给出这些设计的敏感性和特异性计算方法。我们试图根据一些因素,为在这四种设计中选择最佳可能设计提供一些指导。本文的最终目的是提供各种诊断测试指标、ROC分析、不同测试诊断准确性比较以及测试可靠性的基本概念框架和解释,以便临床医生能够有效地使用它。本文中提到的几个R包在与诊断测试相关的临床数据定量综合过程中可能会很有用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8d/9519267/95f076627f5b/10-1055-s-0041-1734019-i2110521-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8d/9519267/ed7d0a4cd14a/10-1055-s-0041-1734019-i2110521-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8d/9519267/89b66388afa4/10-1055-s-0041-1734019-i2110521-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8d/9519267/95f076627f5b/10-1055-s-0041-1734019-i2110521-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8d/9519267/ed7d0a4cd14a/10-1055-s-0041-1734019-i2110521-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8d/9519267/89b66388afa4/10-1055-s-0041-1734019-i2110521-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8d/9519267/95f076627f5b/10-1055-s-0041-1734019-i2110521-3.jpg

相似文献

1
How to Analyze the Diagnostic Performance of a New Test? Explained with Illustrations.如何分析新检测方法的诊断性能?图文并茂解释说明。
J Lab Physicians. 2021 Sep 8;14(1):90-98. doi: 10.1055/s-0041-1734019. eCollection 2022 Mar.
2
Receiver operating characteristic (ROC) curve for medical researchers.医学研究者的受试者工作特征 (ROC) 曲线。
Indian Pediatr. 2011 Apr;48(4):277-87. doi: 10.1007/s13312-011-0055-4.
3
Biostatistics Series Module 7: The Statistics of Diagnostic Tests.生物统计学系列模块7:诊断测试的统计学
Indian J Dermatol. 2017 Jan-Feb;62(1):18-24. doi: 10.4103/0019-5154.198047.
4
Use of likelihood ratios for comparisons of binary diagnostic tests: underlying ROC curves.使用似然比进行二项诊断测试的比较:ROC 曲线的基础。
Med Phys. 2010 Nov;37(11):5821-30. doi: 10.1118/1.3503849.
5
Diagnostic test accuracy of nutritional tools used to identify undernutrition in patients with colorectal cancer: a systematic review.用于识别结直肠癌患者营养不良的营养评估工具的诊断测试准确性:一项系统综述
JBI Database System Rev Implement Rep. 2015 May 15;13(4):141-87. doi: 10.11124/jbisrir-2015-1673.
6
Reliability and diagnostic accuracy of 5 physical examination tests and combination of tests for subacromial impingement.5种体格检查测试及测试组合对肩峰下撞击症的可靠性和诊断准确性
Arch Phys Med Rehabil. 2009 Nov;90(11):1898-903. doi: 10.1016/j.apmr.2009.05.015.
7
Congenital Uterine Malformation by Experts (CUME): diagnostic criteria for T-shaped uterus.先天性子宫畸形专家共识(CUME):T 型子宫的诊断标准。
Ultrasound Obstet Gynecol. 2020 Jun;55(6):815-829. doi: 10.1002/uog.20845. Epub 2020 May 15.
8
Diagnostic accuracy of the A-test and cutoff points for assessing outcomes and planning acute and post-acute rehabilitation of patients surgically treated for hip fractures and osteoarthritis.A测试在评估接受髋关节骨折和骨关节炎手术治疗患者的预后及规划急性和亚急性康复方面的诊断准确性及截断点。
Vojnosanit Pregl. 2016 Dec;73(12):1139-48. doi: 10.2298/VSP150819056V.
9
Diagnostic Testing and Decision-Making: Beauty Is Not Just in the Eye of the Beholder.诊断检测与决策:美不只是在观察者眼中。
Anesth Analg. 2018 Oct;127(4):1085-1091. doi: 10.1213/ANE.0000000000003698.
10
Methodology of diagnostic tests in hepatology.诊断检测在肝脏病学中的应用方法。
Ann Hepatol. 2009 Jul-Sep;8(3):177-83.

引用本文的文献

1
Applying a Modified Version of the Prediction of Alcohol Withdrawal Severity Scale in a Canadian Community Withdrawal Management Setting.在加拿大社区戒酒管理环境中应用酒精戒断严重程度量表预测的修订版。
Drug Alcohol Rev. 2025 Jul;44(5):1365-1373. doi: 10.1111/dar.14075. Epub 2025 May 6.
2
Use of activated partial thrombin time and prothrombin time for quality assessment of fresh frozen plasma.使用活化部分凝血活酶时间和凝血酶原时间评估新鲜冰冻血浆的质量。
Int J Hematol. 2025 Apr 12. doi: 10.1007/s12185-025-03984-4.
3
Identification of Borderline Personality Disorder in Adolescents: Psychometric Properties and Diagnostic Efficiency of a Juvenile Version of the Impulsivity and Emotion Dysregulation Scale (IES-27-J).

本文引用的文献

1
Diagnostic test accuracy: application and practice using R software.诊断测试准确性:使用 R 软件的应用与实践。
Epidemiol Health. 2019;41:e2019007. doi: 10.4178/epih.e2019007. Epub 2019 Mar 28.
2
Cochrane diagnostic test accuracy reviews.考科蓝诊断试验准确性综述。
Syst Rev. 2013 Oct 7;2:82. doi: 10.1186/2046-4053-2-82.
3
Estimating the agreement and diagnostic accuracy of two diagnostic tests when one test is conducted on only a subsample of specimens.当仅对部分样本进行一项检测时,估计两项检测的一致性和诊断准确性。
青少年边缘型人格障碍的识别:冲动与情绪失调量表青少年版(IES-27-J)的心理测量特性及诊断效能
J Clin Psychol. 2025 Jul;81(7):567-576. doi: 10.1002/jclp.23792. Epub 2025 Mar 25.
4
The interaction of post-activation potentiation and fatigue on skeletal muscle twitch torque and displacement.激活后增强与疲劳对骨骼肌抽搐扭矩和位移的相互作用。
Front Physiol. 2025 Jan 30;15:1527523. doi: 10.3389/fphys.2024.1527523. eCollection 2024.
5
The Three-Class Annotation Method Improves the AI Detection of Early-Stage Osteosarcoma on Plain Radiographs: A Novel Approach for Rare Cancer Diagnosis.三级标注方法提高了平面X线片上早期骨肉瘤的人工智能检测:一种罕见癌症诊断的新方法。
Cancers (Basel). 2024 Dec 25;17(1):29. doi: 10.3390/cancers17010029.
6
Targeted syndromic next-generation sequencing panel for simultaneous detection of pathogens associated with bovine reproductive failure.用于同时检测与牛繁殖障碍相关病原体的靶向综合征下一代测序 panel
J Clin Microbiol. 2025 Jan 31;63(1):e0143324. doi: 10.1128/jcm.01433-24. Epub 2024 Dec 10.
7
The potential of circulating microRNAs as novel diagnostic biomarkers of COVID-19: a systematic review and meta-analysis.循环 microRNAs 作为 COVID-19 新型诊断生物标志物的潜力:系统评价和荟萃分析。
BMC Infect Dis. 2024 Sep 19;24(1):1011. doi: 10.1186/s12879-024-09915-8.
8
Diagnostic performance of an albuminuria point-of-care test in screening for chronic kidney disease among young people living with HIV in Uganda: a cross-sectional study.在乌干达,一项针对 HIV 感染者青少年人群的横断面研究显示,即时白蛋白尿检测在慢性肾脏病筛查中的诊断性能。
BMJ Open. 2024 Aug 17;14(8):e083221. doi: 10.1136/bmjopen-2023-083221.
9
Protein Biomarkers in Lung Cancer Screening: Technical Considerations and Feasibility Assessment.肺癌筛查中的蛋白质生物标志物:技术考虑因素和可行性评估。
Arch Bronconeumol. 2024 Oct;60 Suppl 2:S67-S76. doi: 10.1016/j.arbres.2024.07.007. Epub 2024 Jul 17.
10
Performance of existing diagnostic criteria for palindromic rheumatism.现有反射性风湿症诊断标准的性能。
Clin Rheumatol. 2024 Jul;43(7):2337-2342. doi: 10.1007/s10067-024-07010-6. Epub 2024 May 22.
Stat Med. 2012 Feb 28;31(5):436-48. doi: 10.1002/sim.4422. Epub 2011 Dec 4.
4
Receiver operating characteristic (ROC) curve for medical researchers.医学研究者的受试者工作特征 (ROC) 曲线。
Indian Pediatr. 2011 Apr;48(4):277-87. doi: 10.1007/s13312-011-0055-4.
5
Confidence intervals for predictive values with an emphasis to case-control studies.重点针对病例对照研究的预测值置信区间。
Stat Med. 2007 May 10;26(10):2170-83. doi: 10.1002/sim.2677.
6
The kappa statistic in reliability studies: use, interpretation, and sample size requirements.可靠性研究中的kappa统计量:用途、解释及样本量要求。
Phys Ther. 2005 Mar;85(3):257-68.
7
Index for rating diagnostic tests.诊断试验评级指数。
Cancer. 1950 Jan;3(1):32-5. doi: 10.1002/1097-0142(1950)3:1<32::aid-cncr2820030106>3.0.co;2-3.
8
The diagnostic odds ratio: a single indicator of test performance.诊断比值比:测试性能的单一指标。
J Clin Epidemiol. 2003 Nov;56(11):1129-35. doi: 10.1016/s0895-4356(03)00177-x.
9
Prospective studies of diagnostic test accuracy when disease prevalence is low.疾病患病率较低时诊断试验准确性的前瞻性研究。
Biostatistics. 2002 Dec;3(4):477-92. doi: 10.1093/biostatistics/3.4.477.
10
Systematic reviews in health care: Systematic reviews of evaluations of diagnostic and screening tests.医疗保健中的系统评价:诊断和筛查试验评估的系统评价。
BMJ. 2001 Jul 21;323(7305):157-62. doi: 10.1136/bmj.323.7305.157.