A systematic review of statistical methods used to test for reliability of medical instruments measuring continuous variables.

Affiliation

Julius Centre University of Malaya, Department of Social & Preventive Medicine, Faculty of Medicine, University of Malaya, 50603, Kuala Lumpur, Malaysia.

Publication information

Iran J Basic Med Sci. 2013 Jun;16(6):803-7.

PMID:23997908
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3758037/
Abstract

OBJECTIVE(S)

Reliability measures precision, that is, the extent to which test results can be replicated. This is the first systematic review to identify the statistical methods used to measure the reliability of equipment measuring continuous variables. This study also aims to highlight inappropriate statistical methods used in reliability analysis and their implications for medical practice.

MATERIALS AND METHODS

In 2010, five electronic databases were searched for reliability studies published between 2007 and 2009. A total of 5,795 titles were initially identified. Only 282 titles were potentially related, and 42 ultimately met the inclusion criteria.

RESULTS

The Intra-class Correlation Coefficient (ICC) was the most popular method, used in 25 (60%) studies, followed by comparison of means (8 studies, 19%). Of the 25 studies using the ICC, only 7 (28%) reported the confidence intervals and the type of ICC used. Most studies (71%) also tested the agreement of instruments.

CONCLUSION

This study finds that the Intra-class Correlation Coefficient is the most popular method used to assess the reliability of medical instruments measuring continuous outcomes. There are also inappropriate applications and interpretations of statistical methods in some studies. It is important for medical researchers to be aware of this issue, and be able to correctly perform analysis in reliability studies.
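Because the abstract notes that few studies stated which type of ICC they used, the sketch below illustrates why the type matters. It computes three single-measure forms, labelled here ICC(1,1), ICC(2,1) and ICC(3,1) following the commonly used Shrout and Fleiss (1979) naming, from the mean squares of a two-way ANOVA. The function name `icc_single_measure` and the ratings table are illustrative assumptions, not data or code from the reviewed studies, and confidence intervals are not computed here.

```python
def icc_single_measure(ratings):
    """Return single-measure ICCs for an n-subjects x k-raters table.

    ratings: list of rows, one row per subject, one column per rater
    (or per repeated measurement). Hypothetical helper for illustration.
    """
    n = len(ratings)          # number of subjects
    k = len(ratings[0])       # number of raters / repeated measurements
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Sums of squares for the two-way layout without replication
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between raters
    ss_err = ss_total - ss_rows - ss_cols                    # residual

    # Mean squares
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    ms_within = (ss_cols + ss_err) / (n * (k - 1))           # within subjects

    return {
        # One-way random effects
        "ICC(1,1)": (ms_rows - ms_within) / (ms_rows + (k - 1) * ms_within),
        # Two-way random effects, absolute agreement
        "ICC(2,1)": (ms_rows - ms_err)
        / (ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n),
        # Two-way mixed effects, consistency
        "ICC(3,1)": (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err),
    }


# Illustrative table: 6 subjects (rows) measured by 4 raters (columns)
example = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]

icc = icc_single_measure(example)
for name, value in icc.items():
    print(f"{name} = {value:.2f}")
# ICC(1,1) = 0.17
# ICC(2,1) = 0.29
# ICC(3,1) = 0.71
```

On this table the same measurements give values from roughly 0.17 to 0.71 depending on the form chosen, which is one reason a reported ICC is hard to interpret unless its type is stated.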

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af94/3758037/c20367abd40b/ijbms-16-803-g001.jpg