Eli Lilly and Company, Sydney, NSW, Australia.
Eli Lilly and Company, Indianapolis, IN, USA.
BMC Med Inform Decis Mak. 2021 Feb 15;21(1):54. doi: 10.1186/s12911-021-01403-2.
Machine learning is a broad term encompassing a number of methods that allow the investigator to learn from the data. These methods may permit large real-world databases to be more rapidly translated to applications to inform patient-provider decision making.
This systematic literature review was conducted to identify published observational research of employed machine learning to inform decision making at the patient-provider level. The search strategy was implemented and studies meeting eligibility criteria were evaluated by two independent reviewers. Relevant data related to study design, statistical methods and strengths and limitations were identified; study quality was assessed using a modified version of the Luo checklist.
A total of 34 publications from January 2014 to September 2020 were identified and evaluated for this review. There were diverse methods, statistical packages and approaches used across identified studies. The most common methods included decision tree and random forest approaches. Most studies applied internal validation but only two conducted external validation. Most studies utilized one algorithm, and only eight studies applied multiple machine learning algorithms to the data. Seven items on the Luo checklist failed to be met by more than 50% of published studies.
A wide variety of approaches, algorithms, statistical software, and validation strategies were employed in the application of machine learning methods to inform patient-provider decision making. There is a need to ensure that multiple machine learning approaches are used, the model selection strategy is clearly defined, and both internal and external validation are necessary to be sure that decisions for patient care are being made with the highest quality evidence. Future work should routinely employ ensemble methods incorporating multiple machine learning algorithms.
机器学习是一个涵盖许多方法的术语,这些方法允许研究人员从数据中学习。这些方法可以使大型真实世界数据库更快地转化为应用程序,为患者与医生的决策提供信息。
本系统文献回顾旨在确定已发表的观察性研究,这些研究使用机器学习来为患者与医生层面的决策提供信息。实施了搜索策略,并由两名独立评审员评估符合入选标准的研究。确定了与研究设计、统计方法以及优缺点相关的相关数据;使用洛氏清单的修改版本评估了研究质量。
从 2014 年 1 月至 2020 年 9 月共确定了 34 篇出版物,并对其进行了评估。在已确定的研究中,使用了各种方法、统计包和方法。最常见的方法包括决策树和随机森林方法。大多数研究都进行了内部验证,但只有两项研究进行了外部验证。大多数研究仅使用一种算法,只有八项研究将多种机器学习算法应用于数据。洛氏清单的七个项目未能达到 50%以上发表研究的要求。
在将机器学习方法应用于为患者与医生决策提供信息的过程中,采用了广泛的方法、算法、统计软件和验证策略。有必要确保使用多种机器学习方法,明确定义模型选择策略,并进行内部和外部验证,以确保患者护理决策是基于最高质量的证据做出的。未来的工作应定期采用包含多种机器学习算法的集成方法。