Test Evaluation Research Group, Institute of Applied Health Research, University of Birmingham, Birmingham , UK.
NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust and University of Birmingham, Birmingham, UK.
Cochrane Database Syst Rev. 2021 Mar 24;3(3):CD013705. doi: 10.1002/14651858.CD013705.pub2.
Accurate rapid diagnostic tests for SARS-CoV-2 infection could contribute to clinical and public health strategies to manage the COVID-19 pandemic. Point-of-care antigen and molecular tests to detect current infection could increase access to testing and early confirmation of cases, and expediate clinical and public health management decisions that may reduce transmission.
To assess the diagnostic accuracy of point-of-care antigen and molecular-based tests for diagnosis of SARS-CoV-2 infection. We consider accuracy separately in symptomatic and asymptomatic population groups.
Electronic searches of the Cochrane COVID-19 Study Register and the COVID-19 Living Evidence Database from the University of Bern (which includes daily updates from PubMed and Embase and preprints from medRxiv and bioRxiv) were undertaken on 30 Sept 2020. We checked repositories of COVID-19 publications and included independent evaluations from national reference laboratories, the Foundation for Innovative New Diagnostics and the Diagnostics Global Health website to 16 Nov 2020. We did not apply language restrictions.
We included studies of people with either suspected SARS-CoV-2 infection, known SARS-CoV-2 infection or known absence of infection, or those who were being screened for infection. We included test accuracy studies of any design that evaluated commercially produced, rapid antigen or molecular tests suitable for a point-of-care setting (minimal equipment, sample preparation, and biosafety requirements, with results within two hours of sample collection). We included all reference standards that define the presence or absence of SARS-CoV-2 (including reverse transcription polymerase chain reaction (RT-PCR) tests and established diagnostic criteria).
Studies were screened independently in duplicate with disagreements resolved by discussion with a third author. Study characteristics were extracted by one author and checked by a second; extraction of study results and assessments of risk of bias and applicability (made using the QUADAS-2 tool) were undertaken independently in duplicate. We present sensitivity and specificity with 95% confidence intervals (CIs) for each test and pooled data using the bivariate model separately for antigen and molecular-based tests. We tabulated results by test manufacturer and compliance with manufacturer instructions for use and according to symptom status.
Seventy-eight study cohorts were included (described in 64 study reports, including 20 pre-prints), reporting results for 24,087 samples (7,415 with confirmed SARS-CoV-2). Studies were mainly from Europe (n = 39) or North America (n = 20), and evaluated 16 antigen and five molecular assays. We considered risk of bias to be high in 29 (50%) studies because of participant selection; in 66 (85%) because of weaknesses in the reference standard for absence of infection; and in 29 (45%) for participant flow and timing. Studies of antigen tests were of a higher methodological quality compared to studies of molecular tests, particularly regarding the risk of bias for participant selection and the index test. Characteristics of participants in 35 (45%) studies differed from those in whom the test was intended to be used and the delivery of the index test in 39 (50%) studies differed from the way in which the test was intended to be used. Nearly all studies (97%) defined the presence or absence of SARS-CoV-2 based on a single RT-PCR result, and none included participants meeting case definitions for probable COVID-19. Antigen tests Forty-eight studies reported 58 evaluations of antigen tests. Estimates of sensitivity varied considerably between studies. There were differences between symptomatic (72.0%, 95% CI 63.7% to 79.0%; 37 evaluations; 15530 samples, 4410 cases) and asymptomatic participants (58.1%, 95% CI 40.2% to 74.1%; 12 evaluations; 1581 samples, 295 cases). Average sensitivity was higher in the first week after symptom onset (78.3%, 95% CI 71.1% to 84.1%; 26 evaluations; 5769 samples, 2320 cases) than in the second week of symptoms (51.0%, 95% CI 40.8% to 61.0%; 22 evaluations; 935 samples, 692 cases). Sensitivity was high in those with cycle threshold (Ct) values on PCR ≤25 (94.5%, 95% CI 91.0% to 96.7%; 36 evaluations; 2613 cases) compared to those with Ct values >25 (40.7%, 95% CI 31.8% to 50.3%; 36 evaluations; 2632 cases). Sensitivity varied between brands. Using data from instructions for use (IFU) compliant evaluations in symptomatic participants, summary sensitivities ranged from 34.1% (95% CI 29.7% to 38.8%; Coris Bioconcept) to 88.1% (95% CI 84.2% to 91.1%; SD Biosensor STANDARD Q). Average specificities were high in symptomatic and asymptomatic participants, and for most brands (overall summary specificity 99.6%, 95% CI 99.0% to 99.8%). At 5% prevalence using data for the most sensitive assays in symptomatic people (SD Biosensor STANDARD Q and Abbott Panbio), positive predictive values (PPVs) of 84% to 90% mean that between 1 in 10 and 1 in 6 positive results will be a false positive, and between 1 in 4 and 1 in 8 cases will be missed. At 0.5% prevalence applying the same tests in asymptomatic people would result in PPVs of 11% to 28% meaning that between 7 in 10 and 9 in 10 positive results will be false positives, and between 1 in 2 and 1 in 3 cases will be missed. No studies assessed the accuracy of repeated lateral flow testing or self-testing. Rapid molecular assays Thirty studies reported 33 evaluations of five different rapid molecular tests. Sensitivities varied according to test brand. Most of the data relate to the ID NOW and Xpert Xpress assays. Using data from evaluations following the manufacturer's instructions for use, the average sensitivity of ID NOW was 73.0% (95% CI 66.8% to 78.4%) and average specificity 99.7% (95% CI 98.7% to 99.9%; 4 evaluations; 812 samples, 222 cases). For Xpert Xpress, the average sensitivity was 100% (95% CI 88.1% to 100%) and average specificity 97.2% (95% CI 89.4% to 99.3%; 2 evaluations; 100 samples, 29 cases). Insufficient data were available to investigate the effect of symptom status or time after symptom onset.
AUTHORS' CONCLUSIONS: Antigen tests vary in sensitivity. In people with signs and symptoms of COVID-19, sensitivities are highest in the first week of illness when viral loads are higher. The assays shown to meet appropriate criteria, such as WHO's priority target product profiles for COVID-19 diagnostics ('acceptable' sensitivity ≥ 80% and specificity ≥ 97%), can be considered as a replacement for laboratory-based RT-PCR when immediate decisions about patient care must be made, or where RT-PCR cannot be delivered in a timely manner. Positive predictive values suggest that confirmatory testing of those with positive results may be considered in low prevalence settings. Due to the variable sensitivity of antigen tests, people who test negative may still be infected. Evidence for testing in asymptomatic cohorts was limited. Test accuracy studies cannot adequately assess the ability of antigen tests to differentiate those who are infectious and require isolation from those who pose no risk, as there is no reference standard for infectiousness. A small number of molecular tests showed high accuracy and may be suitable alternatives to RT-PCR. However, further evaluations of the tests in settings as they are intended to be used are required to fully establish performance in practice. Several important studies in asymptomatic individuals have been reported since the close of our search and will be incorporated at the next update of this review. Comparative studies of antigen tests in their intended use settings and according to test operator (including self-testing) are required.
准确快速的 SARS-CoV-2 感染诊断测试有助于临床和公共卫生策略管理 COVID-19 大流行。即时检测抗原和分子检测可提高检测的可及性,及早确认病例,并加速临床和公共卫生管理决策,从而减少传播。
评估用于 SARS-CoV-2 感染诊断的即时检测抗原和分子检测的诊断准确性。我们分别考虑在有症状和无症状人群中的准确性。
我们于 2020 年 9 月 30 日在 Cochrane COVID-19 研究注册库和伯尔尼 COVID-19 生活证据数据库(该数据库包含来自 PubMed 和 Embase 的每日更新以及 medRxiv 和 bioRxiv 的预印本)中进行了电子检索。我们检查了 COVID-19 出版物的存储库,并纳入了来自国家参考实验室、创新型新诊断基金会和诊断全球健康网站的独立评估,截至 2020 年 11 月 16 日。我们没有应用语言限制。
我们纳入了疑似 SARS-CoV-2 感染、已知 SARS-CoV-2 感染或已知无感染或正在接受感染筛查的人群的研究。我们纳入了任何设计的测试准确性研究,评估了适合即时检测的商业化生产的快速抗原或分子检测(设备要求最小、样本准备和生物安全要求低,结果可在样本采集后两小时内获得)。我们纳入了所有定义 SARS-CoV-2 存在或不存在的参考标准(包括逆转录聚合酶链反应(RT-PCR)检测和既定诊断标准)。
研究由两位作者独立筛选,分歧通过与第三位作者讨论解决。由一位作者提取研究特征,由另一位作者检查;使用双变量模型独立提取研究结果,并使用 QUADAS-2 工具评估偏倚风险和适用性(适用于每个测试)。我们以敏感性和特异性(95%置信区间(CI))的形式报告了每种检测方法的结果,并根据制造商和制造商使用说明的合规性以及症状状态进行了汇总数据的分组。
共纳入 78 项研究队列(包括 64 项研究报告,其中 20 项为预印本),共报告了 24087 份样本(7415 份有确诊的 SARS-CoV-2)。这些研究主要来自欧洲(n=39)或北美(n=20),评估了 16 种抗原检测和 5 种分子检测。由于参与者选择(29 项研究,50%)、用于确定无感染的参考标准的弱点(66 项研究,85%)以及参与者流动和时间安排(29 项研究,45%)等原因,我们认为这些研究的偏倚风险很高。与分子检测相比,抗原检测的研究方法质量更高,尤其是在参与者选择和索引检测方面的偏倚风险方面。35 项研究(45%)的参与者特征与预期使用该检测的人群不同,39 项研究(50%)的索引检测实施方式与预期使用方式不同。几乎所有的研究(97%)都基于单一的 RT-PCR 结果来定义 SARS-CoV-2 的存在或不存在,没有一项研究包括符合可能的 COVID-19 病例定义的参与者。
48 项研究报告了 58 项抗原检测评估。研究之间的敏感性估计差异很大。在有症状(敏感性 72.0%,95%CI 63.7%至 79.0%;37 项评估;15530 份样本,4410 例)和无症状参与者(敏感性 58.1%,95%CI 40.2%至 74.1%;12 项评估;1581 份样本,295 例)中存在差异。在症状出现的第一周内,敏感性较高(敏感性 78.3%,95%CI 71.1%至 84.1%;26 项评估;5769 份样本,2320 例),而在第二周内敏感性较低(敏感性 51.0%,95%CI 40.8%至 61.0%;22 项评估;935 份样本,692 例)。那些 Ct 值(PCR)值≤25 的患者的敏感性较高(敏感性 94.5%,95%CI 91.0%至 96.7%;36 项评估;2613 例),而那些 Ct 值>25 的患者敏感性较低(敏感性 40.7%,95%CI 31.8%至 50.3%;36 项评估;2632 例)。品牌之间的敏感性存在差异。在对符合使用说明(IFU)的有症状参与者进行的评估中,汇总敏感性范围从 34.1%(95%CI 29.7%至 38.8%;Coris Bioconcept)到 88.1%(95%CI 84.2%至 91.1%;SD Biosensor STANDARD Q)。在有症状和无症状参与者中,特异性均较高,且大多数品牌的特异性均较高(总体汇总特异性 99.6%,95%CI 99.0%至 99.8%)。使用最敏感的检测方法(SD Biosensor STANDARD Q 和 Abbott Panbio)在有症状人群中进行 5%的患病率预测时,阳性预测值(PPV)为 84%至 90%,这意味着每 10 至 18 个阳性结果中就有 1 个是假阳性,每 4 至 8 个病例中就有 1 个被漏诊。在无症状人群中进行 0.5%的患病率预测时,PPV 为 11%至 28%,这意味着每 10 至 30 个阳性结果中就有 7 个是假阳性,每 2 至 3 个病例中就有 1 个被漏诊。没有研究评估重复的侧向流动检测或自我检测的准确性。
30 项研究报告了 33 项对五种不同快速分子检测的评估。敏感性因检测品牌而异。大多数数据与 ID NOW 和 Xpert Xpress 检测有关。根据制造商使用说明进行评估,ID NOW 的平均敏感性为 73.0%(95%CI 66.8%至 78.4%),平均特异性为 99.7%(95%CI 98.7%至 99.9%;4 项评估;812 份样本,222 例)。对于 Xpert Xpress,平均敏感性为 100%(95%CI 88.1%至 100%),平均特异性为 97.2%(95%CI 89.4%至 99.3%;2 项评估;100 份样本,29 例)。由于数据有限,无法调查症状状态或症状出现后时间对敏感性的影响。
抗原检测的敏感性不同。在有 COVID-19 症状和体征的人群中,当病毒载量较高时,第一周的敏感性最高。符合世界卫生组织优先目标产品概况(SARS-CoV-2 诊断)的检测方法(“可接受”的敏感性≥80%,特异性≥97%),当必须立即做出患者护理决策,或无法及时进行实验室基于 RT-PCR 检测时,可以替代基于实验室的 RT-PCR,当 RT-PCR 不能及时进行时。阳性预测值表明,对于检测结果呈阳性的人,可能需要考虑确认性检测。由于抗原检测的敏感性有限,检测结果呈阴性的人仍可能感染。关于无症状人群的证据有限。由于没有适当的参考标准来区分有传染性和需要隔离的感染者与没有传染性的人,因此,检测准确性研究不能充分评估抗原检测区分那些具有传染性并需要隔离的人与不会传播的人的能力。少数几种分子检测显示出较高的准确性,可能是 RT-PCR 的替代品。然而,需要对这些检测在其预期用途的设置中进行进一步评估,以充分确定其实践中的性能。自我们的搜索截止以来,已经有一些关于无症状个体的重要研究报告,这些研究将在下一次更新时纳入。需要在其预期使用环境中并根据检测操作人员(包括自我检测)进行抗原检测的比较研究。