• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习和代谢组学数据从健康对照中对干眼疾病患者进行分类。

Classifying Dry Eye Disease Patients from Healthy Controls Using Machine Learning and Metabolomics Data.

作者信息

Amouei Sheshkal Sajad, Gundersen Morten, Alexander Riegler Michael, Aass Utheim Øygunn, Gunnar Gundersen Kjell, Rootwelt Helge, Prestø Elgstøen Katja Benedikte, Lewi Hammer Hugo

机构信息

Department of Computer Science, Oslo Metropolitan University, 0166 Oslo, Norway.

Department of Holistic Systems, SimulaMet, 0167 Oslo, Norway.

出版信息

Diagnostics (Basel). 2024 Nov 29;14(23):2696. doi: 10.3390/diagnostics14232696.

DOI:10.3390/diagnostics14232696
PMID:39682603
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11640104/
Abstract

Dry eye disease is a common disorder of the ocular surface, leading patients to seek eye care. Clinical signs and symptoms are currently used to diagnose dry eye disease. Metabolomics, a method for analyzing biological systems, has been found helpful in identifying distinct metabolites in patients and in detecting metabolic profiles that may indicate dry eye disease at early stages. In this study, we explored the use of machine learning and metabolomics data to identify cataract patients who suffer from dry eye disease, a topic that, to our knowledge, has not been previously explored. As there is no one-size-fits-all machine learning model for metabolomics data, choosing the most suitable model can significantly affect the quality of predictions and subsequent metabolomics analyses. To address this challenge, we conducted a comparative analysis of eight machine learning models on two metabolomics data sets from cataract patients with and without dry eye disease. The models were evaluated and optimized using nested k-fold cross-validation. To assess the performance of these models, we selected a set of suitable evaluation metrics tailored to the data set's challenges. The logistic regression model overall performed the best, achieving the highest area under the curve score of 0.8378, balanced accuracy of 0.735, Matthew's correlation coefficient of 0.5147, an F1-score of 0.8513, and a specificity of 0.5667. Additionally, following the logistic regression, the XGBoost and Random Forest models also demonstrated good performance. The results show that the logistic regression model with L2 regularization can outperform more complex models on an imbalanced data set with a small sample size and a high number of features, while also avoiding overfitting and delivering consistent performance across cross-validation folds. Additionally, the results demonstrate that it is possible to identify dry eye in cataract patients from tear film metabolomics data using machine learning models.

摘要

干眼症是一种常见的眼表疾病,会导致患者寻求眼部护理。目前临床症状和体征用于诊断干眼症。代谢组学作为一种分析生物系统的方法,已被证明有助于识别患者体内独特的代谢物,并检测可能在早期阶段指示干眼症的代谢谱。在本研究中,我们探索了使用机器学习和代谢组学数据来识别患有干眼症的白内障患者,据我们所知,这一主题此前尚未被探讨过。由于对于代谢组学数据不存在通用的机器学习模型,选择最合适的模型会显著影响预测质量和后续的代谢组学分析。为应对这一挑战,我们对来自患有和未患有干眼症的白内障患者的两个代谢组学数据集上的八个机器学习模型进行了比较分析。使用嵌套k折交叉验证对模型进行评估和优化。为评估这些模型的性能,我们选择了一组适合该数据集挑战的评估指标。逻辑回归模型总体表现最佳,曲线下面积得分最高,为0.8378,平衡准确率为0.735,马修斯相关系数为0.5147,F1分数为0.8513,特异性为0.5667。此外,在逻辑回归之后,XGBoost和随机森林模型也表现出良好的性能。结果表明,具有L2正则化的逻辑回归模型在样本量小、特征数量多的不平衡数据集上可以优于更复杂的模型,同时还能避免过拟合,并在交叉验证折中提供一致的性能。此外,结果表明使用机器学习模型从泪膜代谢组学数据中识别白内障患者的干眼症是可行的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/949b/11640104/bff65cfe3ce8/diagnostics-14-02696-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/949b/11640104/d539f47f6a93/diagnostics-14-02696-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/949b/11640104/260654a42e6a/diagnostics-14-02696-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/949b/11640104/bff65cfe3ce8/diagnostics-14-02696-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/949b/11640104/d539f47f6a93/diagnostics-14-02696-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/949b/11640104/260654a42e6a/diagnostics-14-02696-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/949b/11640104/bff65cfe3ce8/diagnostics-14-02696-g003.jpg

相似文献

1
Classifying Dry Eye Disease Patients from Healthy Controls Using Machine Learning and Metabolomics Data.利用机器学习和代谢组学数据从健康对照中对干眼疾病患者进行分类。
Diagnostics (Basel). 2024 Nov 29;14(23):2696. doi: 10.3390/diagnostics14232696.
2
A machine learning approach for identifying anatomical biomarkers of early mild cognitive impairment.一种用于识别早期轻度认知障碍解剖生物标志物的机器学习方法。
PeerJ. 2024 Dec 13;12:e18490. doi: 10.7717/peerj.18490. eCollection 2024.
3
Development and validation of a prediction model for coronary heart disease risk in depressed patients aged 20 years and older using machine learning algorithms.使用机器学习算法开发并验证针对20岁及以上抑郁症患者冠心病风险的预测模型。
Front Cardiovasc Med. 2025 Jan 9;11:1504957. doi: 10.3389/fcvm.2024.1504957. eCollection 2024.
4
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
5
Addressing Imbalanced Classification Problems in Drug Discovery and Development Using Random Forest, Support Vector Machine, AutoGluon-Tabular, and H2O AutoML.使用随机森林、支持向量机、AutoGluon-Tabular和H2O自动机器学习解决药物发现与开发中的不平衡分类问题。
J Chem Inf Model. 2025 Apr 28;65(8):3976-3989. doi: 10.1021/acs.jcim.5c00023. Epub 2025 Apr 15.
6
A Novel Approach to Identifying Hibernating Myocardium Using Radiomics-Based Machine Learning.一种基于放射组学的机器学习识别冬眠心肌的新方法。
Cureus. 2024 Sep 16;16(9):e69532. doi: 10.7759/cureus.69532. eCollection 2024 Sep.
7
Machine learning-based prediction of tear osmolarity for contact lens practice.基于机器学习的隐形眼镜佩戴中泪液渗透压预测
Ophthalmic Physiol Opt. 2024 Jun;44(4):727-736. doi: 10.1111/opo.13302. Epub 2024 Mar 25.
8
Explore the factors related to the death of offspring under age five and appraise the hazard of child mortality using machine learning techniques in Bangladesh.在孟加拉国,利用机器学习技术探究与五岁以下儿童死亡相关的因素,并评估儿童死亡风险。
BMC Public Health. 2025 Jan 29;25(1):360. doi: 10.1186/s12889-025-21460-w.
9
Development of Machine Learning-based Algorithms to Predict the 2- and 5-year Risk of TKA After Tibial Plateau Fracture Treatment.基于机器学习的算法用于预测胫骨平台骨折治疗后2年和5年全膝关节置换风险的研究进展
Clin Orthop Relat Res. 2025 Mar 12. doi: 10.1097/CORR.0000000000003442.
10
Tear Metabolomics in Dry Eye Disease: A Review.干眼疾病中的泪液代谢组学:综述。
Int J Mol Sci. 2019 Aug 1;20(15):3755. doi: 10.3390/ijms20153755.

本文引用的文献

1
Method Development for Omics Analyses using Schirmer Strips.基于 Schirmer 条的组学分析方法开发。
Curr Eye Res. 2024 Jul;49(7):708-716. doi: 10.1080/02713683.2024.2335271. Epub 2024 Apr 3.
2
A Preservative-Free Approach - Effects on Dry Eye Signs and Symptoms After Cataract Surgery.一种无防腐剂方法——对白内障手术后干眼体征和症状的影响。
Clin Ophthalmol. 2024 Feb 26;18:591-604. doi: 10.2147/OPTH.S446804. eCollection 2024.
3
Pretreating and normalizing metabolomics data for statistical analysis.预处理和标准化代谢组学数据以进行统计分析。
Genes Dis. 2023 Jul 7;11(3):100979. doi: 10.1016/j.gendis.2023.04.018. eCollection 2024 May.
4
The Significance of Dry Eye Signs on Preoperative Keratometry Measurements in Patients Scheduled for Cataract Surgery.干眼体征对白内障手术患者术前角膜曲率测量的意义
Clin Ophthalmol. 2024 Jan 16;18:151-161. doi: 10.2147/OPTH.S448168. eCollection 2024.
5
Discrimination of missing data types in metabolomics data based on particle swarm optimization algorithm and XGBoost model.基于粒子群优化算法和 XGBoost 模型的代谢组学数据缺失类型判别。
Sci Rep. 2024 Jan 2;14(1):152. doi: 10.1038/s41598-023-50646-8.
6
An Explainable Artificial Intelligence Model Proposed for the Prediction of Myalgic Encephalomyelitis/Chronic Fatigue Syndrome and the Identification of Distinctive Metabolites.一种用于预测肌痛性脑脊髓炎/慢性疲劳综合征及识别特征性代谢物的可解释人工智能模型
Diagnostics (Basel). 2023 Nov 21;13(23):3495. doi: 10.3390/diagnostics13233495.
7
Potential of Negative-Ion-Mode Proteomics: An MS1-Only Approach.负离子模式蛋白质组学的潜力:一种仅 MS1 的方法。
J Proteome Res. 2023 Aug 4;22(8):2734-2742. doi: 10.1021/acs.jproteome.3c00307. Epub 2023 Jul 3.
8
Prevalence of Dry Eye Disease Among Individuals Scheduled for Cataract Surgery in a Norwegian Cataract Clinic.挪威一家白内障诊所中计划进行白内障手术的患者干眼疾病患病率。
Clin Ophthalmol. 2023 Apr 27;17:1233-1243. doi: 10.2147/OPTH.S407805. eCollection 2023.
9
The Significance of Inter-Eye Osmolarity Difference in Dry Eye Diagnostics.双眼渗透压差异在干眼诊断中的意义
Clin Ophthalmol. 2023 Mar 11;17:829-835. doi: 10.2147/OPTH.S402556. eCollection 2023.
10
Applications of machine learning in metabolomics: Disease modeling and classification.机器学习在代谢组学中的应用:疾病建模与分类。
Front Genet. 2022 Nov 24;13:1017340. doi: 10.3389/fgene.2022.1017340. eCollection 2022.