Lasko Thomas A, Strobl Eric V, Stead William W
Vanderbilt University Medical Center, Nashville, TN, USA.
NPJ Digit Med. 2024 Mar 1;7(1):53. doi: 10.1038/s41746-024-01037-4.
The rising popularity of artificial intelligence in healthcare is highlighting the problem that a computational model achieving super-human clinical performance at its training sites may perform substantially worse at new sites. In this perspective, we argue that we should typically expect this failure to transport, and we present common sources for it, divided into those under the control of the experimenter and those inherent to the clinical data-generating process. Of the inherent sources we look a little deeper into site-specific clinical practices that can affect the data distribution, and propose a potential solution intended to isolate the imprint of those practices on the data from the patterns of disease cause and effect that are the usual target of probabilistic clinical models.
人工智能在医疗保健领域日益普及,凸显了一个问题:一个在其训练地点实现超人临床性能的计算模型,在新地点的表现可能会大幅下降。从这个角度来看,我们认为通常应该预期这种迁移失败的情况,并指出其常见原因,分为实验者可控的原因和临床数据生成过程中固有的原因。对于固有的原因,我们更深入地研究了可能影响数据分布的特定地点临床实践,并提出了一种潜在的解决方案,旨在将这些实践对数据的影响与概率临床模型通常关注的疾病因果模式区分开来。