Department of Radiation Oncology, University of California, San Francisco, CA 94143;
Department of Statistics, Stanford University, Stanford, CA 94305.
Proc Natl Acad Sci U S A. 2020 Mar 3;117(9):4571-4577. doi: 10.1073/pnas.1906831117. Epub 2020 Feb 18.
Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may involve combining the complementary strengths of humans and machines. Here, we present expert-augmented machine learning (EAML), an automated method that guides the extraction of expert knowledge and its integration into machine-learned models. We used a large dataset of intensive-care patient data to derive 126 decision rules that predict hospital mortality. Using an online platform, we asked 15 clinicians to assess the relative risk of the subpopulation defined by each rule compared to the total sample. We compared the clinician-assessed risk to the empirical risk and found that, while clinicians agreed with the data in most cases, there were notable exceptions where they overestimated or underestimated the true risk. Studying the rules with greatest disagreement, we identified problems with the training data, including one miscoded variable and one hidden confounder. Filtering the rules based on the extent of disagreement between clinician-assessed risk and empirical risk, we improved performance on out-of-sample data and were able to train with less data. EAML provides a platform for automated creation of problem-specific priors, which help build robust and dependable machine-learning models in critical applications.
机器学习在各学科中都被证明是非常有价值的。然而,它的成功往往受到可用数据的质量和数量的限制,而其采用则受到给定模型所获得的信任程度的限制。人们通常通过经验比较人机性能来决定某项任务是应该由计算机还是专家来执行。实际上,最优的学习策略可能涉及到结合人类和机器的互补优势。在这里,我们提出了专家增强机器学习(EAML),这是一种自动化的方法,可以指导专家知识的提取及其整合到机器学习模型中。我们使用了一个大型的重症监护患者数据集,得出了 126 条预测医院死亡率的决策规则。我们使用一个在线平台,让 15 名临床医生评估每条规则定义的子群体与总样本相比的相对风险。我们将临床医生评估的风险与经验风险进行了比较,发现尽管临床医生在大多数情况下与数据一致,但也有一些明显的例外,他们高估或低估了真实风险。通过研究分歧最大的规则,我们发现了训练数据中的问题,包括一个错误编码的变量和一个隐藏的混杂因素。根据临床医生评估的风险与经验风险之间的分歧程度过滤规则,我们提高了对样本外数据的性能,并且能够用更少的数据进行训练。EAML 提供了一个自动创建特定于问题的先验的平台,有助于在关键应用中构建稳健和可靠的机器学习模型。