Roth Dominik, Pace Nathan L, Lee Anna, Hovhannisyan Karen, Warenits Alexandra-Maria, Arrich Jasmin, Herkner Harald
Department of Emergency Medicine, Medical University of Vienna, Währinger Gürtel 18-20, Vienna, Austria, A-1090.
Cochrane Database Syst Rev. 2018 May 15;5(5):CD008874. doi: 10.1002/14651858.CD008874.pub2.
The unanticipated difficult airway is a potentially life-threatening event during anaesthesia or acute conditions. An unsuccessfully managed upper airway is associated with serious morbidity and mortality. Several bedside screening tests are used in clinical practice to identify those at high risk of difficult airway. Their accuracy and benefit however, remains unclear.
The objective of this review was to characterize and compare the diagnostic accuracy of the Mallampati classification and other commonly used airway examination tests for assessing the physical status of the airway in adult patients with no apparent anatomical airway abnormalities. We performed this individually for each of the four descriptors of the difficult airway: difficult face mask ventilation, difficult laryngoscopy, difficult tracheal intubation, and failed intubation.
We searched major electronic databases including CENTRAL, MEDLINE, Embase, ISI Web of Science, CINAHL, as well as regional, subject specific, and dissertation and theses databases from inception to 16 December 2016, without language restrictions. In addition, we searched the Science Citation Index and checked the references of all the relevant studies. We also handsearched selected journals, conference proceedings, and relevant guidelines. We updated this search in March 2018, but we have not yet incorporated these results.
We considered full-text diagnostic test accuracy studies of any individual index test, or a combination of tests, against a reference standard. Participants were adults without obvious airway abnormalities, who were having laryngoscopy performed with a standard laryngoscope and the trachea intubated with a standard tracheal tube. Index tests included the Mallampati test, modified Mallampati test, Wilson risk score, thyromental distance, sternomental distance, mouth opening test, upper lip bite test, or any combination of these. The target condition was difficult airway, with one of the following reference standards: difficult face mask ventilation, difficult laryngoscopy, difficult tracheal intubation, and failed intubation.
We performed screening and selection of the studies, data extraction and assessment of methodological quality (using QUADAS-2) independently and in duplicate. We designed a Microsoft Access database for data collection and used Review Manager 5 and R for data analysis. For each index test and each reference standard, we assessed sensitivity and specificity. We produced forest plots and summary receiver operating characteristic (ROC) plots to summarize the data. Where possible, we performed meta-analyses to calculate pooled estimates and compare test accuracy indirectly using bivariate models. We investigated heterogeneity and performed sensitivity analyses.
We included 133 (127 cohort type and 6 case-control) studies involving 844,206 participants. We evaluated a total of seven different prespecified index tests in the 133 studies, as well as 69 non-prespecified, and 32 combinations. For the prespecified index tests, we found six studies for the Mallampati test, 105 for the modified Mallampati test, six for the Wilson risk score, 52 for thyromental distance, 18 for sternomental distance, 34 for the mouth opening test, and 30 for the upper lip bite test. Difficult face mask ventilation was the reference standard in seven studies, difficult laryngoscopy in 92 studies, difficult tracheal intubation in 50 studies, and failed intubation in two studies. Across all studies, we judged the risk of bias to be variable for the different domains; we mostly observed low risk of bias for patient selection, flow and timing, and unclear risk of bias for reference standard and index test. Applicability concerns were generally low for all domains. For difficult laryngoscopy, the summary sensitivity ranged from 0.22 (95% confidence interval (CI) 0.13 to 0.33; mouth opening test) to 0.67 (95% CI 0.45 to 0.83; upper lip bite test) and the summary specificity ranged from 0.80 (95% CI 0.74 to 0.85; modified Mallampati test) to 0.95 (95% CI 0.88 to 0.98; Wilson risk score). The upper lip bite test for diagnosing difficult laryngoscopy provided the highest sensitivity compared to the other tests (P < 0.001). For difficult tracheal intubation, summary sensitivity ranged from 0.24 (95% CI 0.12 to 0.43; thyromental distance) to 0.51 (95% CI 0.40 to 0.61; modified Mallampati test) and the summary specificity ranged from 0.87 (95% CI 0.82 to 0.91; modified Mallampati test) to 0.93 (0.87 to 0.96; mouth opening test). The modified Mallampati test had the highest sensitivity for diagnosing difficult tracheal intubation compared to the other tests (P < 0.001). For difficult face mask ventilation, we could only estimate summary sensitivity (0.17, 95% CI 0.06 to 0.39) and specificity (0.90, 95% CI 0.81 to 0.95) for the modified Mallampati test.
AUTHORS' CONCLUSIONS: Bedside airway examination tests, for assessing the physical status of the airway in adults with no apparent anatomical airway abnormalities, are designed as screening tests. Screening tests are expected to have high sensitivities. We found that all investigated index tests had relatively low sensitivities with high variability. In contrast, specificities were consistently and markedly higher than sensitivities across all tests. The standard bedside airway examination tests should be interpreted with caution, as they do not appear to be good screening tests. Among the tests we examined, the upper lip bite test showed the most favourable diagnostic test accuracy properties. Given the paucity of available data, future research is needed to develop tests with high sensitivities to make them useful, and to consider their use for screening difficult face mask ventilation and failed intubation. The 27 studies in 'Studies awaiting classification' may alter the conclusions of the review, once we have assessed them.
意外困难气道是麻醉或急性病症期间潜在的危及生命的事件。上气道管理失败与严重的发病率和死亡率相关。临床实践中使用了几种床旁筛查试验来识别气道困难的高危人群。然而,它们的准确性和益处仍不明确。
本综述的目的是描述和比较Mallampati分级以及其他常用气道检查试验在评估无明显气道解剖异常的成年患者气道状况时的诊断准确性。我们针对困难气道的四个描述指标分别进行了此项研究:面罩通气困难、喉镜检查困难、气管插管困难和插管失败。
我们检索了主要电子数据库,包括Cochrane系统评价数据库、MEDLINE、Embase、科学引文索引(ISI Web of Science)、护理学与健康领域数据库(CINAHL),以及地区性、特定学科、学位论文数据库,检索时间从建库至2016年12月16日,无语言限制。此外,我们检索了科学引文索引,并查阅了所有相关研究的参考文献。我们还手工检索了选定的期刊、会议论文集和相关指南。我们在2018年3月更新了此检索,但尚未纳入这些结果。
我们纳入了针对任何单项指标试验或试验组合与参考标准进行比较的全文诊断试验准确性研究。参与者为无明显气道异常的成年人,使用标准喉镜进行喉镜检查,并使用标准气管导管进行气管插管。指标试验包括Mallampati试验、改良Mallampati试验、Wilson风险评分、甲颏距离、胸骨颏距离、张口试验、上唇咬合试验,或这些试验的任何组合。目标状况为困难气道,参考标准如下之一:面罩通气困难、喉镜检查困难、气管插管困难和插管失败。
我们独立且重复地进行研究的筛选和选择、数据提取以及方法学质量评估(使用QUADAS-2)。我们设计了一个Microsoft Access数据库用于数据收集,并使用Review Manager 5和R进行数据分析。对于每个指标试验和每个参考标准,我们评估了敏感性和特异性。我们绘制森林图和汇总受试者工作特征(ROC)图来汇总数据。在可能的情况下,我们进行Meta分析以计算合并估计值,并使用双变量模型间接比较试验准确性。我们调查了异质性并进行了敏感性分析。
我们纳入了133项研究(127项队列研究类型和6项病例对照研究),涉及844,206名参与者。在这133项研究中,我们总共评估了7种不同的预先指定的指标试验,以及69种非预先指定的试验和32种试验组合。对于预先指定的指标试验,我们发现有6项研究涉及Mallampati试验,105项涉及改良Mallampati试验,6项涉及Wilson风险评分,52项涉及甲颏距离,18项涉及胸骨颏距离,34项涉及张口试验,30项涉及上唇咬合试验。7项研究将面罩通气困难作为参考标准,92项研究将喉镜检查困难作为参考标准,50项研究将气管插管困难作为参考标准,2项研究将插管失败作为参考标准。在所有研究中,我们判断不同领域的偏倚风险各不相同;我们大多观察到患者选择、流程和时间方面的偏倚风险较低,而参考标准和指标试验的偏倚风险不明确。所有领域的适用性问题普遍较低。对于喉镜检查困难,汇总敏感性范围为0.22(95%置信区间(CI)0.13至0.33;张口试验)至0.67(95%CI 0.45至0.83;上唇咬合试验),汇总特异性范围为0.80(95%CI 0.74至0.85;改良Mallampati试验)至0.95(95%CI 0.88至0.98;Wilson风险评分)。与其他试验相比,上唇咬合试验诊断喉镜检查困难的敏感性最高(P<0.001)。对于气管插管困难,汇总敏感性范围为0.24(95%CI 0.12至0.43;甲颏距离)至0.51(95%CI 0.40至0.61;改良Mallampati试验),汇总特异性范围为0.87(95%CI 0.82至0.91;改良Mallampati试验)至0.93(95%CI 0.87至0.96;张口试验)。与其他试验相比,改良Mallampati试验诊断气管插管困难的敏感性最高(P<0.001)。对于面罩通气困难,我们仅能估计改良Mallampati试验的汇总敏感性(0.17,95%CI 0.06至