Population Health Sciences, University of Bristol, Bristol, UK.
Radcliffe Department of Medicine, University of Oxford , Oxford, UK.
Cochrane Database Syst Rev. 2022 Jun 16;6(6):CD012558. doi: 10.1002/14651858.CD012558.pub2.
In primary care, general practitioners (GPs) unavoidably reach a clinical judgement about a patient as part of their encounter with patients, and so clinical judgement can be an important part of the diagnostic evaluation. Typically clinical decision making about what to do next for a patient incorporates clinical judgement about the diagnosis with severity of symptoms and patient factors, such as their ideas and expectations for treatment. When evaluating patients for dementia, many GPs report using their own judgement to evaluate cognition, using information that is immediately available at the point of care, to decide whether someone has or does not have dementia, rather than more formal tests.
To determine the diagnostic accuracy of GPs' clinical judgement for diagnosing cognitive impairment and dementia in symptomatic people presenting to primary care. To investigate the heterogeneity of test accuracy in the included studies.
We searched MEDLINE (Ovid SP), Embase (Ovid SP), PsycINFO (Ovid SP), Web of Science Core Collection (ISI Web of Science), and LILACs (BIREME) on 16 September 2021.
We selected cross-sectional and cohort studies from primary care where clinical judgement was determined by a GP either prospectively (after consulting with a patient who has presented to a specific encounter with the doctor) or retrospectively (based on knowledge of the patient and review of the medical notes, but not relating to a specific encounter with the patient). The target conditions were dementia and cognitive impairment (mild cognitive impairment and dementia) and we included studies with any appropriate reference standard such as the Diagnostic and Statistical Manual of Mental Disorders (DSM), International Classification of Diseases (ICD), aetiological definitions, or expert clinical diagnosis.
Two review authors screened titles and abstracts for relevant articles and extracted data separately with differences resolved by consensus discussion. We used QUADAS-2 to evaluate the risk of bias and concerns about applicability in each study using anchoring statements. We performed meta-analysis using the bivariate method.
We identified 18,202 potentially relevant articles, of which 12,427 remained after de-duplication. We assessed 57 full-text articles and extracted data on 11 studies (17 papers), of which 10 studies had quantitative data. We included eight studies in the meta-analysis for the target condition dementia and four studies for the target condition cognitive impairment. Most studies were at low risk of bias as assessed with the QUADAS-2 tool, except for the flow and timing domain where four studies were at high risk of bias, and the reference standard domain where two studies were at high risk of bias. Most studies had low concern about applicability to the review question in all QUADAS-2 domains. Average age ranged from 73 years to 83 years (weighted average 77 years). The percentage of female participants in studies ranged from 47% to 100%. The percentage of people with a final diagnosis of dementia was between 2% and 56% across studies (a weighted average of 21%). For the target condition dementia, in individual studies sensitivity ranged from 34% to 91% and specificity ranged from 58% to 99%. In the meta-analysis for dementia as the target condition, in eight studies in which a total of 826 of 2790 participants had dementia, the summary diagnostic accuracy of clinical judgement of general practitioners was sensitivity 58% (95% confidence interval (CI) 43% to 72%), specificity 89% (95% CI 79% to 95%), positive likelihood ratio 5.3 (95% CI 2.4 to 8.2), and negative likelihood ratio 0.47 (95% CI 0.33 to 0.61). For the target condition cognitive impairment, in individual studies sensitivity ranged from 58% to 97% and specificity ranged from 40% to 88%. The summary diagnostic accuracy of clinical judgement of general practitioners in four studies in which a total of 594 of 1497 participants had cognitive impairment was sensitivity 84% (95% CI 60% to 95%), specificity 73% (95% CI 50% to 88%), positive likelihood ratio 3.1 (95% CI 1.4 to 4.7), and negative likelihood ratio 0.23 (95% CI 0.06 to 0.40). It was impossible to draw firm conclusions in the analysis of heterogeneity because there were small numbers of studies. For specificity we found the data were compatible with studies that used ICD-10, or applied retrospective judgement, had higher reported specificity compared to studies with DSM definitions or using prospective judgement. In contrast for sensitivity, we found studies that used a prospective index test may have had higher sensitivity than studies that used a retrospective index test.
AUTHORS' CONCLUSIONS: Clinical judgement of GPs is more specific than sensitive for the diagnosis of dementia. It would be necessary to use additional tests to confirm the diagnosis for either target condition, or to confirm the absence of the target conditions, but clinical judgement may inform the choice of further testing. Many people who a GP judges as having dementia will have the condition. People with false negative diagnoses are likely to have less severe disease and some could be identified by using more formal testing in people who GPs judge as not having dementia. Some false positives may require similar practical support to those with dementia, but some - such as some people with depression - may suffer delayed intervention for an alternative treatable pathology.
在初级保健中,全科医生(GP)在与患者的接触中不可避免地对患者做出临床判断,因此临床判断可以成为诊断评估的重要组成部分。通常,关于下一步治疗患者的临床决策包括对诊断的严重程度和患者因素(如他们对治疗的想法和期望)的临床判断。在评估痴呆症患者时,许多全科医生报告说,他们使用自己的判断来评估认知,使用在护理点立即获得的信息,来决定某人是否患有痴呆症,而不是使用更正式的测试。
确定全科医生对有症状的患者进行认知障碍和痴呆症的临床判断的诊断准确性,这些患者就诊于初级保健。调查纳入研究中测试准确性的异质性。
我们于 2021 年 9 月 16 日在 MEDLINE(Ovid SP)、Embase(Ovid SP)、PsycINFO(Ovid SP)、Web of Science 核心合集(ISI Web of Science)和 LILACs(BIREME)上进行了检索。
我们选择了来自初级保健的前瞻性(在与有特定就诊经历的患者咨询后)或回顾性(基于对患者的了解和对病历的审查,但与患者的特定就诊经历无关)的横断面和队列研究,其中临床判断由全科医生确定。目标病症为痴呆症和认知障碍(轻度认知障碍和痴呆症),我们纳入了任何适当的参考标准,如《精神障碍诊断与统计手册》(DSM)、《国际疾病分类》(ICD)、病因学定义或专家临床诊断的研究。
两名综述作者筛选标题和摘要,以确定相关文章,并分别独立提取数据,通过共识讨论解决分歧。我们使用 QUADAS-2 来评估每项研究的偏倚风险和对适用问题的关注程度,使用锚定语句。我们使用双变量方法进行荟萃分析。
我们确定了 18202 篇潜在相关文章,其中 12427 篇在去重后仍然存在。我们评估了 57 篇全文文章,并提取了 11 项研究(17 篇论文)的数据,其中 10 项研究具有定量数据。我们将 10 项研究纳入了痴呆症的荟萃分析,4 项研究纳入了认知障碍的荟萃分析。大多数研究的 QUADAS-2 工具评估的偏倚风险较低,除了流程和时间域的四项研究和参考标准域的两项研究存在较高的偏倚风险。大多数研究在 QUADAS-2 的所有领域都对审查问题的适用性有较低的关注。平均年龄范围从 73 岁到 83 岁(加权平均 77 岁)。研究中女性参与者的比例从 47%到 100%不等。研究中痴呆症的最终诊断比例在 2%到 56%之间(加权平均值为 21%)。对于痴呆症的目标病症,在个别研究中,敏感性范围从 34%到 91%,特异性范围从 58%到 99%。在作为目标病症的痴呆症的荟萃分析中,在 8 项共纳入 2790 名参与者中有痴呆症的研究中,全科医生临床判断的汇总诊断准确性为敏感性 58%(95%置信区间[CI]为 43%至 72%),特异性 89%(95%CI 为 79%至 95%),阳性似然比 5.3(95%CI 为 2.4 至 8.2),阴性似然比 0.47(95%CI 为 0.33 至 0.61)。对于认知障碍的目标病症,在敏感性范围从 58%到 97%,特异性范围从 40%到 88%的个别研究中。在纳入 1497 名参与者中有认知障碍的 4 项研究中,全科医生临床判断的汇总诊断准确性为敏感性 84%(95%CI 为 60%至 95%),特异性 73%(95%CI 为 50%至 88%),阳性似然比 3.1(95%CI 为 1.4 至 4.7),阴性似然比 0.23(95%CI 为 0.06 至 0.40)。由于研究数量较少,无法对异质性进行明确的结论分析。在特异性方面,我们发现数据与使用 ICD-10 的研究或应用回顾性判断相兼容,与使用 DSM 定义或采用前瞻性判断的研究相比,报告的特异性更高。相反,在敏感性方面,我们发现使用前瞻性索引测试的研究可能比使用回顾性索引测试的研究具有更高的敏感性。
全科医生的临床判断对痴呆症的诊断更特异而非敏感。对于任何一种目标病症,或者为了确认不存在目标病症,都需要使用额外的测试来确认诊断,但临床判断可能会影响对进一步测试的选择。许多被全科医生判断为患有痴呆症的人实际上确实患有这种疾病。一些假阴性诊断的患者可能疾病程度较轻,一些人可以通过对那些被全科医生判断为没有痴呆症的患者使用更正式的测试来识别。一些假阳性可能需要与患有痴呆症的患者类似的实际支持,但也有一些,如一些患有抑郁症的患者,可能会因另一种可治疗的疾病而延迟干预。