Shi Jingpu, Norgeot Beau
Anthem, Inc., Point of Care AI, Palo Alto, CA, United States.
Front Med (Lausanne). 2022 Jul 7;9:864882. doi: 10.3389/fmed.2022.864882. eCollection 2022.
Causal inference is a broad field that seeks to build and apply models that learn the effect of interventions on outcomes using many data types. While the field has existed for decades, its potential to impact healthcare outcomes has increased dramatically recently due to both advancements in machine learning and the unprecedented amounts of observational data resulting from electronic capture of patient claims data by medical insurance companies and widespread adoption of electronic health records (EHR) worldwide. However, there are many different schools of learning causality coming from different fields of statistics, some of them strongly conflicting. While the recent advances in machine learning greatly enhanced causal inference from a modeling perspective, it further exacerbated the fractured state in this field. This fractured state has limited research at the intersection of causal inference, modern machine learning, and EHRs that could potentially transform healthcare. In this paper we unify the classical causal inference approaches with new machine learning developments into a straightforward framework based on whether the researcher is most interested in finding the best intervention for an individual, a group of similar people, or an entire population. Through this lens, we then provide a timely review of the applications of causal inference in healthcare from the literature. As expected, we found that applications of causal inference in medicine were mostly limited to just a few technique types and lag behind other domains. In light of this gap, we offer a helpful schematic to guide data scientists and healthcare stakeholders in selecting appropriate causal methods and reviewing the findings generated by them.
因果推断是一个广泛的领域,旨在构建和应用模型,利用多种数据类型来了解干预措施对结果的影响。尽管该领域已经存在了数十年,但由于机器学习的进步以及医疗保险机构对患者理赔数据的电子采集和全球范围内电子健康记录(EHR)的广泛采用所产生的前所未有的大量观测数据,其对医疗保健结果产生影响的潜力最近已大幅增加。然而,来自不同统计领域的有许多不同的因果关系学派,其中一些存在强烈冲突。虽然机器学习的最新进展从建模角度极大地增强了因果推断,但它进一步加剧了该领域的分裂状态。这种分裂状态限制了因果推断、现代机器学习和电子健康记录交叉领域的研究,而这些研究有可能改变医疗保健。在本文中,我们将经典的因果推断方法与新的机器学习发展统一到一个简单的框架中,该框架基于研究人员最感兴趣的是为个体、一组相似的人还是整个人口找到最佳干预措施。通过这个视角,我们随后对文献中因果推断在医疗保健中的应用进行了及时的综述。不出所料,我们发现因果推断在医学中的应用大多仅限于少数几种技术类型,并且落后于其他领域。鉴于这一差距,我们提供了一个有用的示意图,以指导数据科学家和医疗保健利益相关者选择合适的因果方法并审查它们所产生的结果。