Discipline of Medical Radiation Sciences, Faculty of Health Sciences, University of Sydney, PO Box 170, Lidcombe, NSW 1825, Australia.
Eur J Radiol. 2010 Jun;74(3):e122-31. doi: 10.1016/j.ejrad.2009.04.069. Epub 2009 May 30.
Consistency in evaluation of mammography images in research and clinical practice is dependent on a standardised clinical image quality evaluation system. Currently two such systems are available-one developed by the American College of Radiology (ACR) and the other by the European Commission (EU guidelines). The purpose of this study was to review mammography clinical evaluation methods in research studies and their adherence to these systems.
A total of 23 research articles were reviewed from the period 2000-2006, 11 of these studies used digital images. The focus of the review was the criteria and rating scales used.
Only 5 studies used either the ACR (3) or EU guidelines (2). The remainder included aspects of these systems together with a range of other criteria and rating scales. Variation was found in the categories of criteria used, number of criteria, the descriptors of the criteria and the instructions used to evaluate the criteria. Instructions were frequently not specific and open to individual interpretation. Although breast density is an important criterion of image quality and contributes to perception of breast lesions, inclusion of this criterion was not universal, and even when used the area of breast density to be evaluated was not identified, thus enhancing inter-observer variability. Scales that were absolute or relative were used for evaluation, all of which incorporated inconsistent numbers of steps.
Low adherence to ACR and EU Guidelines has resulted in considerable variation in the evaluation methods used in research studies. The implications of this variability are considerable both for evaluation of image quality in research outcomes and clinical practice.
在研究和临床实践中,评估乳腺 X 光图像的一致性依赖于标准化的临床图像质量评估系统。目前有两种这样的系统——一种由美国放射学院(ACR)开发,另一种由欧盟委员会(EU 指南)开发。本研究的目的是回顾研究中乳腺 X 光摄影临床评估方法及其对这些系统的遵循情况。
对 2000 年至 2006 年期间的 23 篇研究论文进行了综述,其中 11 篇研究使用了数字图像。审查的重点是使用的标准和评分量表。
只有 5 项研究使用了 ACR(3 项)或 EU 指南(2 项)。其余的研究包括这些系统的各个方面,以及一系列其他的标准和评分量表。所使用的标准类别、标准数量、标准描述符以及评估标准的说明存在差异。说明往往不具体,容易引起个人的解读。尽管乳腺密度是图像质量的一个重要标准,并且会影响对乳腺病变的感知,但并非所有研究都包含这一标准,即使使用了这一标准,也没有明确需要评估的乳腺密度区域,从而增加了观察者之间的差异。评估时使用的是绝对或相对的量表,所有这些量表都包含了不一致的步骤数。
对 ACR 和 EU 指南的低遵从性导致了研究中使用的评估方法存在很大的差异。这种变异性对研究结果和临床实践中的图像质量评估都有很大的影响。