Zhang Xianfa, Wang Jing
Department of Orthopedics, Wenshang People's Hospital, Wenshang, Ji'ning, Shandong, China.
Eur Spine J. 2025 May 17. doi: 10.1007/s00586-025-08932-8.
We analyzed in depth the comparative study of AI and spinal surgeons presented by Mehmet et al. While we recognize the significant contributions of this work, we also identify certain limitations, including a restricted sample size, long intervals between data collection points, and inadequate validation of AI outputs. To address these issues, we propose increasing sample diversity, shortening the interval between data collection points, implementing double-blind evaluations of AI responses, fostering interdisciplinary collaboration, and establishing a long-term monitoring framework. These measures aim to enhance the scientific rigor, generalizability, and reliability of AI applications in clinical settings.