Suppr超能文献

龋病患病状况调查中检查者可靠性的测量、分析和解释:一些方法学思考。

Measurement, analysis and interpretation of examiner reliability in caries experience surveys: some methodological thoughts.

机构信息

School of Dentistry, Oral Pathology and Maxillofacial Surgery, Catholic University Leuven, Kapucijnenvoer 7 blok a bus 7001, 3000, Leuven, Belgium.

出版信息

Clin Oral Investig. 2012 Feb;16(1):117-27. doi: 10.1007/s00784-010-0475-x. Epub 2010 Oct 13.

Abstract

Data obtained from calibration exercises are used to assess the level of agreement between examiners (and the benchmark examiner) and/or between repeated examinations by the same examiner in epidemiological surveys or large-scale clinical studies. Agreement can be measured using different techniques: kappa statistic, percentage agreement, dice coefficient, sensitivity and specificity. Each of these methods shows specific characteristics and has its own shortcomings. The aim of this contribution is to critically review techniques for the measurement and analysis of examiner agreement and to illustrate this using data from a recent survey in young children, the Smile for Life project. The above-mentioned agreement measures are influenced (in differing ways and extents) by the unit of analysis (subject, tooth, surface level) and the disease level in the validation sample. These effects are more pronounced for percentage agreement and kappa than for sensitivity and specificity. It is, therefore, important to include information on unit of analysis and disease level (in validation sample) when reporting agreement measures. Also, confidence intervals need to be included since they indicate the reliability of the estimate. When dependency among observations is present [as is the case in caries experience data sets with typical hierarchical structure (surface-tooth-subject)], this will influence the width of the confidence interval and should therefore not be ignored. In this situation, the use of multilevel modelling is necessary. This review clearly shows that there is a need for the development of guidelines for the measurement, interpretation and reporting of examiner reliability in caries experience surveys.

摘要

从校准练习中获得的数据可用于评估检查者之间(以及基准检查者)的一致性,或者用于评估同一位检查者在流行病学调查或大规模临床研究中对重复检查的一致性。一致性可以使用不同的技术进行衡量:kappa 统计量、百分比一致性、骰子系数、灵敏度和特异性。这些方法中的每一种都具有特定的特征,并且都有其自身的缺点。本研究的目的是批判性地回顾评估检查者一致性的技术,并使用 Smile for Life 项目中最近对幼儿进行的调查数据来说明这些技术。上述一致性衡量指标会受到分析单位(个体、牙齿、表面)和验证样本中疾病水平的影响(以不同的方式和程度)。对于百分比一致性和 kappa 来说,这种影响比灵敏度和特异性更为明显。因此,在报告一致性衡量指标时,重要的是要包含分析单位和疾病水平(在验证样本中)的信息。还需要包含置信区间,因为它们可以指示估计的可靠性。当观察值之间存在依赖性时(在具有典型层次结构的龋齿经验数据集(表面-牙齿-个体)中就是如此),这将影响置信区间的宽度,因此不应忽略。在这种情况下,需要使用多层模型。本综述清楚地表明,需要制定有关龋齿经验调查中检查者可靠性的测量、解释和报告的指南。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验