• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测大动脉粥样硬化的生物标志物发现的机器学习方法。

Machine learning approaches for biomarker discovery to predict large-artery atherosclerosis.

机构信息

Artificial Intelligence Center, China Medical University Hospital, Taichung, Taiwan.

Department of Neurology, China Medical University Hospital, Taichung, Taiwan.

出版信息

Sci Rep. 2023 Sep 13;13(1):15139. doi: 10.1038/s41598-023-42338-0.

DOI:10.1038/s41598-023-42338-0
PMID:37704672
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10499778/
Abstract

Large-artery atherosclerosis (LAA) is a leading cause of cerebrovascular disease. However, LAA diagnosis is costly and needs professional identification. Many metabolites have been identified as biomarkers of specific traits. However, there are inconsistent findings regarding suitable biomarkers for the prediction of LAA. In this study, we propose a new method integrates multiple machine learning algorithms and feature selection method to handle multidimensional data. Among the six machine learning models, logistic regression (LR) model exhibited the best prediction performance. The value of area under the receiver operating characteristic curve (AUC) was 0.92 when 62 features were incorporated in the external validation set for the LR model. In this model, LAA could be well predicted by clinical risk factors including body mass index, smoking, and medications for controlling diabetes, hypertension, and hyperlipidemia as well as metabolites involved in aminoacyl-tRNA biosynthesis and lipid metabolism. In addition, we found that 27 features were present among the five adopted models that could provide good results. If these 27 features were used in the LR model, an AUC value of 0.93 could be achieved. Our study has demonstrated the effectiveness of combining machine learning algorithms with recursive feature elimination and cross-validation methods for biomarker identification. Moreover, we have shown that using shared features can yield more reliable correlations than either model, which can be valuable for future identification of LAA.

摘要

大动脉粥样硬化(LAA)是脑血管病的主要原因。然而,LAA 的诊断成本高,需要专业识别。许多代谢物已被确定为特定特征的生物标志物。然而,对于预测 LAA 的合适生物标志物,存在不一致的发现。在这项研究中,我们提出了一种新的方法,该方法结合了多种机器学习算法和特征选择方法来处理多维数据。在这六种机器学习模型中,逻辑回归(LR)模型表现出最佳的预测性能。当将 62 个特征纳入外部验证集中时,LR 模型的接收者操作特征曲线(ROC)下面积(AUC)值为 0.92。在该模型中,LAA 可以通过包括体重指数、吸烟和控制糖尿病、高血压和高脂血症的药物在内的临床危险因素以及涉及氨基酸-tRNA 生物合成和脂质代谢的代谢物来很好地预测。此外,我们发现 27 个特征存在于 5 种采用的模型中,可以提供良好的结果。如果将这 27 个特征用于 LR 模型,则可以达到 AUC 值 0.93。我们的研究表明,结合机器学习算法和递归特征消除以及交叉验证方法进行生物标志物识别是有效的。此外,我们还表明,使用共享特征可以产生比任何一种模型更可靠的相关性,这对于未来 LAA 的识别可能具有重要价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/2f2edffa367b/41598_2023_42338_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/cc1bbbd4cd1d/41598_2023_42338_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/318d5214ff15/41598_2023_42338_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/7e7d3d61bf66/41598_2023_42338_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/974e1881e1c0/41598_2023_42338_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/aca56d8dedf3/41598_2023_42338_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/f6e63122b361/41598_2023_42338_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/2f2edffa367b/41598_2023_42338_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/cc1bbbd4cd1d/41598_2023_42338_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/318d5214ff15/41598_2023_42338_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/7e7d3d61bf66/41598_2023_42338_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/974e1881e1c0/41598_2023_42338_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/aca56d8dedf3/41598_2023_42338_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/f6e63122b361/41598_2023_42338_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b9f/10499778/2f2edffa367b/41598_2023_42338_Fig7_HTML.jpg

相似文献

1
Machine learning approaches for biomarker discovery to predict large-artery atherosclerosis.用于预测大动脉粥样硬化的生物标志物发现的机器学习方法。
Sci Rep. 2023 Sep 13;13(1):15139. doi: 10.1038/s41598-023-42338-0.
2
Machine learning-based prediction of postpartum hemorrhage after vaginal delivery: combining bleeding high risk factors and uterine contraction curve.基于机器学习的阴道分娩后产后出血预测:结合出血高危因素和子宫收缩曲线。
Arch Gynecol Obstet. 2022 Oct;306(4):1015-1025. doi: 10.1007/s00404-021-06377-0. Epub 2022 Feb 16.
3
Prediction of atherosclerosis using machine learning based on operations research.基于运筹学的机器学习预测动脉粥样硬化。
Math Biosci Eng. 2022 Mar 14;19(5):4892-4910. doi: 10.3934/mbe.2022229.
4
Diagnostic Performance of 2D and 3D T2WI-Based Radiomics Features With Machine Learning Algorithms to Distinguish Solid Solitary Pulmonary Lesion.基于二维和三维T2加权成像的影像组学特征联合机器学习算法鉴别实性孤立性肺结节的诊断效能
Front Oncol. 2021 Nov 18;11:683587. doi: 10.3389/fonc.2021.683587. eCollection 2021.
5
The prediction of in-hospital mortality in chronic kidney disease patients with coronary artery disease using machine learning models.应用机器学习模型预测伴有冠状动脉疾病的慢性肾脏病患者的院内死亡率。
Eur J Med Res. 2023 Jan 18;28(1):33. doi: 10.1186/s40001-023-00995-x.
6
Next-Generation Radiogenomics Sequencing for Prediction of EGFR and KRAS Mutation Status in NSCLC Patients Using Multimodal Imaging and Machine Learning Algorithms.使用多模态成像和机器学习算法的下一代放射基因组学测序预测非小细胞肺癌患者的EGFR和KRAS突变状态
Mol Imaging Biol. 2020 Aug;22(4):1132-1148. doi: 10.1007/s11307-020-01487-8.
7
Use of Multiprognostic Index Domain Scores, Clinical Data, and Machine Learning to Improve 12-Month Mortality Risk Prediction in Older Hospitalized Patients: Prospective Cohort Study.使用多预后指标领域评分、临床数据和机器学习提高老年住院患者 12 个月死亡率风险预测:前瞻性队列研究。
J Med Internet Res. 2021 Jun 21;23(6):e26139. doi: 10.2196/26139.
8
Can Machine-learning Algorithms Predict Early Revision TKA in the Danish Knee Arthroplasty Registry?机器学习算法能否预测丹麦膝关节置换登记处的早期翻修 TKA?
Clin Orthop Relat Res. 2020 Sep;478(9):2088-2101. doi: 10.1097/CORR.0000000000001343.
9
Application of machine learning model to predict osteoporosis based on abdominal computed tomography images of the psoas muscle: a retrospective study.基于腰部竖脊肌 CT 图像的机器学习模型预测骨质疏松症的应用:一项回顾性研究。
BMC Geriatr. 2022 Oct 13;22(1):796. doi: 10.1186/s12877-022-03502-9.
10
Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms.基于多种机器学习算法的孕妇孕前暴露于农药与胎儿生长受限的预测模型。
Front Public Health. 2022 Aug 8;10:940182. doi: 10.3389/fpubh.2022.940182. eCollection 2022.

引用本文的文献

1
Using Machine Learning to Detect Factors That Affect Homocysteine in Healthy Elderly Taiwanese Men.利用机器学习检测影响台湾健康老年男性同型半胱氨酸的因素。
Biomedicines. 2025 Jul 24;13(8):1816. doi: 10.3390/biomedicines13081816.
2
Machine Learning Approach and Bioinformatics Analysis Discovered Key Genomic Signatures for Hepatitis B Virus-Associated Hepatocyte Remodeling and Hepatocellular Carcinoma.机器学习方法与生物信息学分析发现了乙型肝炎病毒相关肝细胞重塑和肝细胞癌的关键基因组特征。
Cancer Inform. 2025 Apr 16;24:11769351251333847. doi: 10.1177/11769351251333847. eCollection 2025.
3
Chemical Sensors and Biosensors for Point-of-Care Testing of Pets: Opportunities for Individualized Diagnostics of Companion Animals.

本文引用的文献

1
Development and validation of a carotid atherosclerosis risk prediction model based on a Chinese population.基于中国人群的颈动脉粥样硬化风险预测模型的开发与验证
Front Cardiovasc Med. 2022 Aug 2;9:946063. doi: 10.3389/fcvm.2022.946063. eCollection 2022.
2
Causes of ischemic stroke in young adults versus non-young adults: A multicenter hospital-based observational study.中青年与非中青年缺血性脑卒中病因分析:一项多中心医院观察性研究。
PLoS One. 2022 Jul 13;17(7):e0268481. doi: 10.1371/journal.pone.0268481. eCollection 2022.
3
Prediction of atherosclerosis using machine learning based on operations research.
用于宠物即时检测的化学传感器和生物传感器:伴侣动物个性化诊断的机遇
ACS Sens. 2025 May 23;10(5):3222-3238. doi: 10.1021/acssensors.4c03664. Epub 2025 Apr 22.
4
Identification of lipid metabolism related immune markers in atherosclerosis through machine learning and experimental analysis.通过机器学习和实验分析鉴定动脉粥样硬化中与脂质代谢相关的免疫标志物
Front Immunol. 2025 Feb 25;16:1549150. doi: 10.3389/fimmu.2025.1549150. eCollection 2025.
5
Deciphering the epigenetic role of long non-coding RNAs in mood disorders: Focus on human brain studies.解读长链非编码RNA在情绪障碍中的表观遗传作用:聚焦于人类大脑研究。
Clin Transl Med. 2025 Mar;15(3):e70135. doi: 10.1002/ctm2.70135.
6
Integrated Proteomics and Machine Learning Approach Reveals PYCR1 as a Novel Biomarker to Predict Prognosis of Sinonasal Squamous Cell Carcinoma.整合蛋白质组学与机器学习方法揭示PYCR1作为预测鼻窦鳞状细胞癌预后的新型生物标志物。
Int J Mol Sci. 2024 Dec 10;25(24):13234. doi: 10.3390/ijms252413234.
7
Artificial intelligence and bioinformatics: a journey from traditional techniques to smart approaches.人工智能与生物信息学:从传统技术到智能方法的历程。
Gastroenterol Hepatol Bed Bench. 2024;17(3):241-252. doi: 10.22037/ghfbb.v17i3.2977.
8
Predictive etiological classification of acute ischemic stroke through interpretable machine learning algorithms: a multicenter, prospective cohort study.通过可解释的机器学习算法对急性缺血性脑卒中进行预测病因分类:一项多中心前瞻性队列研究。
BMC Med Res Methodol. 2024 Sep 10;24(1):199. doi: 10.1186/s12874-024-02331-1.
9
Exploration of effective biomarkers for venous thrombosis embolism in Behçet's disease based on comprehensive bioinformatics analysis.基于综合生物信息学分析探讨白塞病静脉血栓栓塞的有效生物标志物。
Sci Rep. 2024 Jul 10;14(1):15884. doi: 10.1038/s41598-024-66973-3.
10
Development of a Real-time Force-based Algorithm for Infusion Failure Detection.基于实时力的输液失败检测算法的开发。
J Diabetes Sci Technol. 2024 Nov;18(6):1313-1323. doi: 10.1177/19322968241247530. Epub 2024 Apr 23.
基于运筹学的机器学习预测动脉粥样硬化。
Math Biosci Eng. 2022 Mar 14;19(5):4892-4910. doi: 10.3934/mbe.2022229.
4
Machine learning on small size samples: A synthetic knowledge synthesis.基于小样本的机器学习:综合知识合成。
Sci Prog. 2022 Jan-Mar;105(1):368504211029777. doi: 10.1177/00368504211029777.
5
Performance Analysis of Machine Learning and Deep Learning Architectures on Early Stroke Detection Using Carotid Artery Ultrasound Images.基于颈动脉超声图像的机器学习和深度学习架构在早期中风检测中的性能分析
Front Aging Neurosci. 2022 Jan 27;13:828214. doi: 10.3389/fnagi.2021.828214. eCollection 2021.
6
Machine Learning-Based Model for Predicting Incidence and Severity of Acute Ischemic Stroke in Anterior Circulation Large Vessel Occlusion.基于机器学习的前循环大血管闭塞性急性缺血性卒中发病率和严重程度预测模型
Front Neurol. 2021 Dec 2;12:749599. doi: 10.3389/fneur.2021.749599. eCollection 2021.
7
Predicting Risk of Stroke From Lab Tests Using Machine Learning Algorithms: Development and Evaluation of Prediction Models.使用机器学习算法通过实验室检测预测中风风险:预测模型的开发与评估
JMIR Form Res. 2021 Dec 2;5(12):e23440. doi: 10.2196/23440.
8
Prediction of large vessel occlusion for ischaemic stroke by using the machine learning model random forests.利用机器学习模型随机森林预测缺血性脑卒中的大血管闭塞。
Stroke Vasc Neurol. 2022 Apr;7(2):94-100. doi: 10.1136/svn-2021-001096. Epub 2021 Oct 26.
9
Cardiovascular Disease Prediction by Machine Learning Algorithms Based on Cytokines in Kazakhs of China.基于细胞因子的机器学习算法对中国哈萨克族心血管疾病的预测
Clin Epidemiol. 2021 Jun 9;13:417-428. doi: 10.2147/CLEP.S313343. eCollection 2021.
10
Machine Learning Identifies Metabolic Signatures that Predict the Risk of Recurrent Angina in Remitted Patients after Percutaneous Coronary Intervention: A Multicenter Prospective Cohort Study.机器学习识别出代谢特征,可预测经皮冠状动脉介入治疗后缓解患者复发性心绞痛的风险:一项多中心前瞻性队列研究。
Adv Sci (Weinh). 2021 Mar 8;8(10):2003893. doi: 10.1002/advs.202003893. eCollection 2021 May.