Suppr超能文献

电子病历作为预测库欣病术后即刻缓解的输入:词嵌入的应用

Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing's Disease: Application of Word Embedding.

作者信息

Zhang Wentai, Li Dongfang, Feng Ming, Hu Baotian, Fan Yanghua, Chen Qingcai, Wang Renzhi

机构信息

Department of Neurosurgery, Chinese Academy of Medical Sciences and Peking Union Medical College, Peking Union Medical College Hospital, Beijing, China.

School of Computer Science, and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, China.

出版信息

Front Oncol. 2021 Oct 13;11:754882. doi: 10.3389/fonc.2021.754882. eCollection 2021.

Abstract

BACKGROUND

No existing machine learning (ML)-based models use free text from electronic medical records (EMR) as input to predict immediate remission (IR) of Cushing's disease (CD) after transsphenoidal surgery.

PURPOSE

The aim of the present study is to develop an ML-based model that uses EMR that include both structured features and free text as input to preoperatively predict IR after transsphenoidal surgery.

METHODS

A total of 419 patients with CD from Peking Union Medical College Hospital were enrolled between January 2014 and August 2020. The EMR of the patients were embedded and transformed into low-dimensional dense vectors that can be included in four ML-based models together with structured features. The area under the curve (AUC) of receiver operating characteristic curves was used to evaluate the performance of the models.

RESULTS

The overall remission rate of the 419 patients was 75.7%. From the results of logistic multivariate analysis, operation ( < 0.001), invasion of cavernous sinus from MRI ( = 0.046), and ACTH ( = 0.024) were strongly correlated with IR. The AUC values for the four ML-based models ranged from 0.686 to 0.793. The highest AUC value (0.793) was for logistic regression when 11 structured features and "individual conclusions of the case by doctor" were included.

CONCLUSION

An ML-based model was developed using both structured and unstructured features (after being processed using a word embedding method) as input to preoperatively predict postoperative IR.

摘要

背景

现有的基于机器学习(ML)的模型均未使用电子病历(EMR)中的自由文本作为输入来预测经蝶窦手术后库欣病(CD)的即刻缓解(IR)情况。

目的

本研究旨在开发一种基于ML的模型,该模型使用包含结构化特征和自由文本的EMR作为输入,以术前预测经蝶窦手术后的IR情况。

方法

2014年1月至2020年8月期间,共纳入了419例来自北京协和医院的CD患者。将患者的EMR进行嵌入并转换为低维密集向量,这些向量可与结构化特征一起纳入四个基于ML的模型中。采用受试者工作特征曲线下面积(AUC)来评估模型的性能。

结果

419例患者的总体缓解率为75.7%。根据多因素逻辑回归分析结果,手术(<0.001)、MRI显示海绵窦侵犯(=0.046)和促肾上腺皮质激素(ACTH,=0.024)与IR密切相关。四个基于ML的模型的AUC值范围为0.686至0.793。当纳入11个结构化特征和“医生对病例的个人结论”时,逻辑回归模型的AUC值最高(0.793)。

结论

开发了一种基于ML的模型,该模型使用结构化和非结构化特征(经词嵌入方法处理后)作为输入,以术前预测术后IR情况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e159/8548651/626ee1c58f05/fonc-11-754882-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验