• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发和评估新型眼科领域特定的神经词汇向量以预测视觉预后。

Development and evaluation of novel ophthalmology domain-specific neural word embeddings to predict visual prognosis.

机构信息

Byers Eye Institute, Department of Ophthalmology, Stanford University, 2370 Watson Court, Palo Alto, CA, 94303, United States.

Center for Biomedical Informatics Research, School of Medicine, Stanford University, 1265 Welch Road, Stanford, CA, 94305, United States.

出版信息

Int J Med Inform. 2021 Jun;150:104464. doi: 10.1016/j.ijmedinf.2021.104464. Epub 2021 Apr 16.

DOI:10.1016/j.ijmedinf.2021.104464
PMID:33892445
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8183292/
Abstract

OBJECTIVE

To develop and evaluate novel word embeddings (WEs) specific to ophthalmology, using text corpora from published literature and electronic health records (EHR).

MATERIALS AND METHODS

We trained ophthalmology-specific WEs using 121,740 PubMed abstracts and 89,282 EHR notes using word2vec continuous bag-of-words architecture. PubMed and EHR WEs were compared to general domain GloVe WEs and general biomedical domain BioWordVec embeddings using a novel ophthalmology-domain-specific 200-question analogy test and prediction of prognosis in 5547 low vision patients using EHR notes as inputs to a deep learning model.

RESULTS

We found that many words representing important ophthalmic concepts in the EHR were missing from the general domain GloVe vocabulary, but covered in the ophthalmology abstract corpus. On ophthalmology analogy testing, PubMed WEs scored 95.0 %, outperforming EHR (86.0 %) and GloVe (91.0 %) but less than BioWordVec (99.5 %). On predicting low vision prognosis, PubMed and EHR WEs resulted in similar AUROC (0.830; 0.826), outperforming GloVe (0.778) and BioWordVec (0.784).

CONCLUSION

We found that using ophthalmology domain-specific WEs improved performance in ophthalmology-related clinical prediction compared to general WEs. Deep learning models using clinical notes as inputs can predict the prognosis of visually impaired patients. This work provides a framework to improve predictive models using domain-specific WEs.

摘要

目的

使用来自已发表文献和电子健康记录 (EHR) 的文本语料库,开发和评估特定于眼科的新型词嵌入 (WE)。

材料与方法

我们使用 word2vec 连续词袋架构,通过 121740 篇 PubMed 摘要和 89282 篇 EHR 笔记训练眼科特定的 WE。通过一项新颖的眼科特定领域的 200 个问题类比测试和使用 EHR 笔记作为输入的深度学习模型对 5547 名低视力患者预后的预测,将 PubMed 和 EHR WE 与一般领域 GloVe WE 和一般生物医学领域 BioWordVec 嵌入进行比较。

结果

我们发现,EHR 中代表重要眼科概念的许多词在一般领域 GloVe 词汇中缺失,但在眼科摘要语料库中有所涵盖。在眼科类比测试中,PubMed WE 得分为 95.0%,优于 EHR(86.0%)和 GloVe(91.0%),但低于 BioWordVec(99.5%)。在预测低视力预后方面,PubMed 和 EHR WE 的 AUROC 相似(0.830;0.826),优于 GloVe(0.778)和 BioWordVec(0.784)。

结论

我们发现,与一般 WE 相比,使用眼科领域特定的 WE 可提高与眼科相关的临床预测性能。使用临床笔记作为输入的深度学习模型可以预测视力受损患者的预后。这项工作为使用领域特定的 WE 改进预测模型提供了框架。

相似文献

1
Development and evaluation of novel ophthalmology domain-specific neural word embeddings to predict visual prognosis.开发和评估新型眼科领域特定的神经词汇向量以预测视觉预后。
Int J Med Inform. 2021 Jun;150:104464. doi: 10.1016/j.ijmedinf.2021.104464. Epub 2021 Apr 16.
2
Looking for low vision: Predicting visual prognosis by fusing structured and free-text data from electronic health records.寻找低视力:通过融合电子健康记录中的结构化和自由文本数据来预测视觉预后。
Int J Med Inform. 2022 Mar;159:104678. doi: 10.1016/j.ijmedinf.2021.104678. Epub 2021 Dec 30.
3
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
4
Deep Learning Approaches for Predicting Glaucoma Progression Using Electronic Health Records and Natural Language Processing.使用电子健康记录和自然语言处理的深度学习方法预测青光眼进展
Ophthalmol Sci. 2022 Feb 12;2(2):100127. doi: 10.1016/j.xops.2022.100127. eCollection 2022 Jun.
5
Projection Word Embedding Model With Hybrid Sampling Training for Classifying ICD-10-CM Codes: Longitudinal Observational Study.用于对ICD-10-CM编码进行分类的混合采样训练投影词嵌入模型:纵向观察研究
JMIR Med Inform. 2019 Jul 23;7(3):e14499. doi: 10.2196/14499.
6
Fall Risk Prediction in Older Adults Using Free-Text Nursing Notes and Medications in Electronic Health Records.利用电子健康记录中的自由文本护理记录和药物预测老年人跌倒风险。
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341127.
7
Word embeddings trained on published case reports are lightweight, effective for clinical tasks, and free of protected health information.基于已发表病例报告训练的词嵌入模型轻巧、适用于临床任务且不包含受保护的健康信息。
J Biomed Inform. 2022 Jan;125:103971. doi: 10.1016/j.jbi.2021.103971. Epub 2021 Dec 14.
8
A study of deep learning methods for de-identification of clinical notes in cross-institute settings.深度学习方法在跨机构环境下对临床记录进行去识别的研究。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):232. doi: 10.1186/s12911-019-0935-4.
9
Evaluating semantic relations in neural word embeddings with biomedical and general domain knowledge bases.利用生物医学和一般领域知识库评估神经词汇嵌入中的语义关系。
BMC Med Inform Decis Mak. 2018 Jul 23;18(Suppl 2):65. doi: 10.1186/s12911-018-0630-x.
10
EHR-HGCN: An Enhanced Hybrid Approach for Text Classification Using Heterogeneous Graph Convolutional Networks in Electronic Health Records.EHR-HGCN:一种在电子健康记录中使用异构图卷积网络的增强型混合文本分类方法。
IEEE J Biomed Health Inform. 2024 Mar;28(3):1668-1679. doi: 10.1109/JBHI.2023.3346210. Epub 2024 Mar 6.

引用本文的文献

1
Identifying Transportation Needs in Ophthalmology Clinic Notes Using Natural Language Processing: Retrospective, Cross-Sectional Study.使用自然语言处理识别眼科临床记录中的交通需求:回顾性横断面研究。
JMIR Med Inform. 2025 Sep 5;13:e69216. doi: 10.2196/69216.
2
Visual acuity prediction on real-life patient data using a machine learning based multistage system.基于机器学习的多阶段系统对现实患者数据的视力预测。
Sci Rep. 2024 Mar 6;14(1):5532. doi: 10.1038/s41598-024-54482-2.
3
Use of artificial intelligence in forecasting glaucoma progression.人工智能在预测青光眼进展中的应用。
Taiwan J Ophthalmol. 2023 May 23;13(2):168-183. doi: 10.4103/tjo.TJO-D-23-00022. eCollection 2023 Apr-Jun.
4
Predicting near-term glaucoma progression: An artificial intelligence approach using clinical free-text notes and data from electronic health records.预测近期青光眼进展:一种使用临床自由文本记录和电子健康记录数据的人工智能方法。
Front Med (Lausanne). 2023 Apr 13;10:1157016. doi: 10.3389/fmed.2023.1157016. eCollection 2023.
5
Impact of word embedding models on text analytics in deep learning environment: a review.词嵌入模型对深度学习环境下文本分析的影响:综述
Artif Intell Rev. 2023 Feb 22:1-81. doi: 10.1007/s10462-023-10419-1.
6
Deep Learning Approaches for Predicting Glaucoma Progression Using Electronic Health Records and Natural Language Processing.使用电子健康记录和自然语言处理的深度学习方法预测青光眼进展
Ophthalmol Sci. 2022 Feb 12;2(2):100127. doi: 10.1016/j.xops.2022.100127. eCollection 2022 Jun.
7
Applications of natural language processing in ophthalmology: present and future.自然语言处理在眼科中的应用:现状与未来。
Front Med (Lausanne). 2022 Aug 8;9:906554. doi: 10.3389/fmed.2022.906554. eCollection 2022.
8
Predicting Glaucoma Progression Requiring Surgery Using Clinical Free-Text Notes and Transfer Learning With Transformers.使用临床自由文本记录和基于转换器的迁移学习预测需要手术的青光眼进展。
Transl Vis Sci Technol. 2022 Mar 2;11(3):37. doi: 10.1167/tvst.11.3.37.
9
Looking for low vision: Predicting visual prognosis by fusing structured and free-text data from electronic health records.寻找低视力:通过融合电子健康记录中的结构化和自由文本数据来预测视觉预后。
Int J Med Inform. 2022 Mar;159:104678. doi: 10.1016/j.ijmedinf.2021.104678. Epub 2021 Dec 30.

本文引用的文献

1
Estimating Rates of Progression and Predicting Future Visual Fields in Glaucoma Using a Deep Variational Autoencoder.使用深度变分自动编码器估计青光眼的进展速度并预测未来的视野。
Sci Rep. 2019 Dec 2;9(1):18113. doi: 10.1038/s41598-019-54653-6.
2
Automated extraction of ophthalmic surgery outcomes from the electronic health record.从电子健康记录中自动提取眼科手术结果。
Int J Med Inform. 2020 Jan;133:104007. doi: 10.1016/j.ijmedinf.2019.104007. Epub 2019 Oct 17.
3
Natural Language Processing Approaches to Detect the Timeline of Metastatic Recurrence of Breast Cancer.用于检测乳腺癌转移复发时间线的自然语言处理方法
JCO Clin Cancer Inform. 2019 Oct;3:1-12. doi: 10.1200/CCI.19.00034.
4
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
5
Machine Learning-Based Predictive Modeling of Surgical Intervention in Glaucoma Using Systemic Data From Electronic Health Records.基于机器学习的电子健康记录系统数据预测青光眼手术干预
Am J Ophthalmol. 2019 Dec;208:30-40. doi: 10.1016/j.ajo.2019.07.005. Epub 2019 Jul 16.
6
BioWordVec, improving biomedical word embeddings with subword information and MeSH.BioWordVec,利用子词信息和 MeSH 改进生物医学词向量。
Sci Data. 2019 May 10;6(1):52. doi: 10.1038/s41597-019-0055-0.
7
Forecasting future Humphrey Visual Fields using deep learning.利用深度学习预测未来 Humphrey 视野。
PLoS One. 2019 Apr 5;14(4):e0214875. doi: 10.1371/journal.pone.0214875. eCollection 2019.
8
Optimizing Corpus Creation for Training Word Embedding in Low Resource Domains: A Case Study in Autism Spectrum Disorder (ASD).优化低资源领域中训练词嵌入的语料库创建:以自闭症谱系障碍(ASD)为例
AMIA Annu Symp Proc. 2018 Dec 5;2018:508-517. eCollection 2018.
9
An Artificial Intelligence Approach to Detect Visual Field Progression in Glaucoma Based on Spatial Pattern Analysis.基于空间模式分析的人工智能在青光眼视野进展检测中的应用。
Invest Ophthalmol Vis Sci. 2019 Jan 2;60(1):365-375. doi: 10.1167/iovs.18-25568.
10
Using Kalman Filtering to Forecast Disease Trajectory for Patients With Normal Tension Glaucoma.使用卡尔曼滤波预测正常眼压型青光眼患者的疾病轨迹。
Am J Ophthalmol. 2019 Mar;199:111-119. doi: 10.1016/j.ajo.2018.10.012. Epub 2018 Oct 16.