Madanay Farrah, O'Donohue Laura S, Zikmund-Fisher Brian J
Center for Bioethics and Social Sciences in Medicine, University of Michigan-Ann Arbor, Ann Arbor, MI, United States.
Department of Radiology, University of Michigan Medicine, University of Michigan-Ann Arbor, Ann Arbor, MI, United States.
J Med Internet Res. 2025 May 22;27:e68823. doi: 10.2196/68823.
As the US Food and Drug Administration (FDA)-approved use of artificial intelligence (AI) for medical imaging rises, radiologists are increasingly integrating AI into their clinical practices. In lung cancer screening, diagnostic AI offers a second set of eyes with the potential to detect cancer earlier than human radiologists. Despite AI's promise, a potential problem with its integration is the erosion of patient confidence in clinician expertise when there is a discrepancy between the radiologist's and the AI's interpretation of the imaging findings.
We examined how discrepancies between AI-derived recommendations and radiologists' recommendations affect patients' agreement with radiologists' recommendations and satisfaction with their radiologists. We also analyzed how patients' medical maximizing-minimizing preferences moderate these relationships.
We conducted a randomized, between-subjects experiment with 1606 US adult participants. Assuming the role of patients, participants imagined undergoing a low-dose computerized tomography scan for lung cancer screening and receiving results and recommendations from (1) a radiologist only, (2) AI and a radiologist in agreement, (3) a radiologist who recommended more testing than AI (ie, radiologist overcalled AI), or (4) a radiologist who recommended less testing than AI (ie, radiologist undercalled AI). Participants rated the radiologist on three criteria: agreement with the radiologist's recommendation, how likely they would be to recommend the radiologist to family and friends, and how good of a provider they perceived the radiologist to be. We measured medical maximizing-minimizing preferences and categorized participants as maximizers (ie, those who seek aggressive intervention), minimizers (ie, those who prefer no or passive intervention), and neutrals (ie, those in the middle).
Participants' agreement with the radiologist's recommendation was significantly lower when the radiologist undercalled AI (mean 4.01, SE 0.07, P<.001) than in the other 3 conditions, with no significant differences among them (radiologist overcalled AI [mean 4.63, SE 0.06], agreed with AI [mean 4.55, SE 0.07], or had no AI [mean 4.57, SE 0.06]). Similarly, participants were least likely to recommend (P<.001) and positively rate (P<.001) the radiologist who undercalled AI, with no significant differences among the other conditions. Maximizers agreed with the radiologist who overcalled AI (β=0.82, SE 0.14; P<.001) and disagreed with the radiologist who undercalled AI (β=-0.47, SE 0.14; P=.001). However, whereas minimizers disagreed with the radiologist who overcalled AI (β=-0.43, SE 0.18, P=.02), they did not significantly agree with the radiologist who undercalled AI (β=0.14, SE 0.17, P=.41).
Radiologists who recommend less testing than AI may face decreased patient confidence in their expertise, but they may not face this same penalty for giving more aggressive recommendations than AI. Patients' reactions may depend in part on whether their general preferences to maximize or minimize align with the radiologists' recommendations. Future research should test communication strategies for radiologists' disclosure of AI discrepancies to patients.
随着美国食品药品监督管理局(FDA)批准人工智能(AI)在医学成像中的应用不断增加,放射科医生越来越多地将AI融入其临床实践。在肺癌筛查中,诊断性AI提供了另一双眼睛,有可能比人类放射科医生更早地检测出癌症。尽管AI前景广阔,但在将其整合过程中存在一个潜在问题,即当放射科医生与AI对影像结果的解读存在差异时,患者对临床医生专业知识的信心可能会受到削弱。
我们研究了AI得出的建议与放射科医生的建议之间的差异如何影响患者对放射科医生建议的认同以及对放射科医生的满意度。我们还分析了患者的医疗最大化 - 最小化偏好如何调节这些关系。
我们对1606名美国成年参与者进行了一项随机的组间实验。参与者扮演患者的角色,想象自己接受低剂量计算机断层扫描进行肺癌筛查,并收到来自以下人员的结果和建议:(1)仅一名放射科医生;(2)AI与放射科医生意见一致;(3)一名建议进行比AI更多检查的放射科医生(即放射科医生高估了AI);或(4)一名建议进行比AI更少检查的放射科医生(即放射科医生低估了AI)。参与者根据三个标准对放射科医生进行评分:对放射科医生建议的认同度、向家人和朋友推荐该放射科医生的可能性,以及他们认为该放射科医生作为医疗服务提供者的水平。我们测量了医疗最大化 - 最小化偏好,并将参与者分为最大化者(即寻求积极干预的人)、最小化者(即倾向于不进行或进行被动干预的人)和中立者(即处于中间状态的人)。
当放射科医生低估AI时,参与者对放射科医生建议的认同度(均值4.01,标准误0.07,P <.001)显著低于其他三种情况,其他三种情况之间无显著差异(放射科医生高估AI [均值4.63,标准误0.06]、与AI意见一致 [均值4.55,标准误0.07] 或没有AI参与 [均值4.57,标准误0.06])。同样,参与者最不可能推荐(P <.001)并给予积极评价(P <.001)低估AI的放射科医生,其他情况之间无显著差异。最大化者认同高估AI的放射科医生(β = 0.82,标准误0.14;P <.001),不认同低估AI的放射科医生(β = -0.47,标准误0.14;P =.001)。然而,最小化者不认同高估AI的放射科医生(β = -0.43,标准误0.18,P =.02),但他们对低估AI的放射科医生的认同度没有显著差异(β = 0.14,标准误0.17,P =.41)。
建议进行比AI更少检查的放射科医生可能会面临患者对其专业知识信心下降的情况,但对于给出比AI更积极建议的放射科医生,可能不会面临同样的惩罚。患者的反应可能部分取决于他们最大化或最小化的总体偏好是否与放射科医生的建议一致。未来的研究应该测试放射科医生向患者披露AI差异的沟通策略。