Hertzman P A, Clauw D J, Duffy J, Medsger T A, Feinstein A R
Los Alamos Medical Center, Suite 130, West Road, Los Alamos, NM 87544, USA.
Arch Intern Med. 2001 Oct 22;161(19):2301-6. doi: 10.1001/archinte.161.19.2301.
Constructing diagnostic criteria, a common problem in clinical medicine, is particularly difficult for diseases that lack a pathognomonic "gold standard." To develop an improved strategy for constructing such criteria, we used the eosinophilia-myalgia syndrome as an example. The goal, for research classifications, was to construct validated clinically sensible criteria and to develop improved methods that can be used for other disorders.
Using a "pattern-based" approach with data from several separate sources, a committee of investigators first prepared and informally tested criteria for the diagnosis of eosinophilia-myalgia syndrome. A gold standard challenge set of reports of cases and noncases was independently generated and separately validated by an external panel of clinical experts. The criteria were then tested using the gold standard set, and interobserver variability and diagnostic accuracy were determined.
Interobserver variability showed the following mean proportionate agreements: 98.7% for the presence of specific criteria elements, 99% to 100% for diagnosis, and 97% to 98% for diagnostic pattern. kappa Values were correspondingly high. Diagnostic accuracy showed sensitivity at 88%, specificity at 97%, and overall accuracy at 92%.
The proposed criteria are accurate and reproducible, and can be used in future clinical investigations of the eosinophilia-myalgia syndrome. The new strategy and methods developed for this challenge can be valuable for solving analogous problems in constructing criteria for other clinical disorders.
构建诊断标准是临床医学中的常见问题,对于缺乏特征性“金标准”的疾病而言尤其困难。为了开发一种改进的构建此类标准的策略,我们以嗜酸性粒细胞增多性肌痛综合征为例。对于研究分类而言,目标是构建经过验证的、临床上合理的标准,并开发可用于其他疾病的改进方法。
采用一种“基于模式”的方法,利用来自多个独立来源的数据,一个研究委员会首先制定并对嗜酸性粒细胞增多性肌痛综合征的诊断标准进行了非正式测试。一个由病例和非病例报告组成的金标准挑战集由外部临床专家小组独立生成并分别进行验证。然后使用该金标准集对标准进行测试,并确定观察者间的变异性和诊断准确性。
观察者间变异性显示出以下平均比例一致性:特定标准要素存在情况为98.7%,诊断为99%至100%,诊断模式为97%至98%。kappa值相应较高。诊断准确性显示敏感性为88%,特异性为97%,总体准确性为92%。
所提出的标准准确且可重复,可用于未来嗜酸性粒细胞增多性肌痛综合征的临床研究。为应对这一挑战而开发的新策略和方法对于解决构建其他临床疾病标准中的类似问题可能具有重要价值。