Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania.
Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee.
Ann Am Thorac Soc. 2023 Oct;20(10):1491-1498. doi: 10.1513/AnnalsATS.202302-182OC.
Studies of bronchoscopy have reported diagnostic yield (DY) using different calculation methods, which has hindered comparisons across studies. To quantify the effect of the variability of four methods on DY estimates of bronchoscopy. We performed a simulation-based analysis of patients undergoing bronchoscopy using variations around base case assumptions for cancer prevalence (60%), distribution of nonmalignant findings, and degree of follow-up information at a fixed sensitivity of bronchoscopy for malignancy (80%). We calculated DY, the rate of true positives and true negatives (TNs), using four methods. Method 1 considered malignant and specific benign findings at index bronchoscopy as true positives and TNs, respectively. Method 2 included nonspecific benign findings as TNs. Method 3 considered nonspecific benign findings cases as TNs only if follow-up confirmed benign disease. Method 4 counted all cases with a nonmalignant diagnosis as TNs if follow-up confirmed benign disease. A scenario analysis and probabilistic sensitivity analysis were conducted to demonstrate the effect of parameter estimates on DY. A change in DY of >10% was considered clinically meaningful. Across all pairwise comparisons of the four methods, a DY difference of >10% was observed in 76.7% of cases (45,992 of 60,000 comparisons). Method 4 resulted in DY estimates that were >10% higher than estimates made with other methods in >90% of scenarios. Variation in cancer prevalence had a large effect on DY. Across a wide range of clinical scenarios, the categorization of nonmalignant findings at index bronchoscopy and cancer prevalence had the largest impact on DY. The large variability in DY estimates across the four methods limits the interpretation of bronchoscopy studies and warrants standardization.
对支气管镜检查的研究报告了使用不同计算方法的诊断产量 (DY),这阻碍了研究之间的比较。为了量化四种方法的可变性对支气管镜检查 DY 估计的影响。我们使用癌症患病率(60%)、非恶性发现分布和在固定支气管镜检查恶性肿瘤敏感性(80%)下随访信息程度的基本案例假设的变化,对接受支气管镜检查的患者进行了基于模拟的分析。我们使用四种方法计算了 DY、真阳性和真阴性 (TN) 的比率。方法 1 将指数支气管镜检查中的恶性和特定良性发现视为真阳性和 TNs。方法 2 将非特异性良性发现包括为 TNs。方法 3 仅在随访证实良性疾病的情况下,将非特异性良性发现病例视为 TNs。方法 4 如果随访证实为良性疾病,则将所有诊断为非恶性的病例视为 TNs。进行了方案分析和概率敏感性分析,以证明参数估计对 DY 的影响。DY 变化超过 10%被认为具有临床意义。在四种方法的所有两两比较中,76.7%的情况下观察到 DY 差异超过 10%(60000 次比较中的 45992 次)。在>90%的情况下,方法 4 导致的 DY 估计值比其他方法高>10%。癌症患病率的变化对 DY 有很大影响。在广泛的临床情况下,指数支气管镜检查中非恶性发现和癌症患病率的分类对 DY 有最大影响。四种方法的 DY 估计值的巨大差异限制了对支气管镜检查研究的解释,并需要标准化。