Department of Neurosurgery, NYU Langone Health, New York, NY, USA.
Center for Data Science, New York University, New York, NY, USA.
Nature. 2023 Jul;619(7969):357-362. doi: 10.1038/s41586-023-06160-y. Epub 2023 Jun 7.
Physicians make critical time-constrained decisions every day. Clinical predictive models can help physicians and administrators make decisions by forecasting clinical and operational events. Existing structured data-based clinical predictive models have limited use in everyday practice owing to complexity in data processing, as well as model development and deployment. Here we show that unstructured clinical notes from the electronic health record can enable the training of clinical language models, which can be used as all-purpose clinical predictive engines with low-resistance development and deployment. Our approach leverages recent advances in natural language processing to train a large language model for medical language (NYUTron) and subsequently fine-tune it across a wide range of clinical and operational predictive tasks. We evaluated our approach within our health system for five such tasks: 30-day all-cause readmission prediction, in-hospital mortality prediction, comorbidity index prediction, length of stay prediction, and insurance denial prediction. We show that NYUTron has an area under the curve (AUC) of 78.7-94.9%, with an improvement of 5.36-14.7% in the AUC compared with traditional models. We additionally demonstrate the benefits of pretraining with clinical text, the potential for increasing generalizability to different sites through fine-tuning and the full deployment of our system in a prospective, single-arm trial. These results show the potential for using clinical language models in medicine to read alongside physicians and provide guidance at the point of care.
医生每天都要做出关键的限时决策。临床预测模型可以帮助医生和管理人员通过预测临床和运营事件来做出决策。由于数据处理、模型开发和部署的复杂性,现有的基于结构化数据的临床预测模型在日常实践中的应用有限。在这里,我们展示了来自电子健康记录的非结构化临床记录可以用于训练临床语言模型,这些模型可以作为通用的临床预测引擎,具有低阻力的开发和部署。我们的方法利用了自然语言处理的最新进展,为医疗语言训练了一个大型语言模型(NYUTron),并在广泛的临床和运营预测任务中对其进行了微调。我们在我们的医疗系统中评估了我们的方法,用于五个这样的任务:30 天全因再入院预测、住院死亡率预测、合并症指数预测、住院时间预测和保险拒绝预测。我们发现 NYUTron 的曲线下面积(AUC)为 78.7-94.9%,与传统模型相比,AUC 提高了 5.36-14.7%。我们还展示了临床文本预训练的好处、通过微调提高到不同地点的通用性的潜力,以及我们的系统在前瞻性、单臂试验中的全面部署。这些结果表明,在医学中使用临床语言模型可以与医生一起阅读并在护理点提供指导。