• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的他汀类药物疗效和安全性预测模型。

Machine learning-based prediction model for the efficacy and safety of statins.

作者信息

Xiong Yu, Liu Xiaoyang, Wang Qing, Zhao Li, Kong Xudong, Da Chunhe, Meng Zuohuan, Qu Leilei, Xia Qinfang, Liu Lihong, Li Pengmei

机构信息

Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.

Department of Pharmacy, China-Japan Friendship Hospital, Beijing, China.

出版信息

Front Pharmacol. 2024 Jul 29;15:1334929. doi: 10.3389/fphar.2024.1334929. eCollection 2024.

DOI:10.3389/fphar.2024.1334929
PMID:39135800
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11317424/
Abstract

OBJECTIVE

The appropriate use of statins plays a vital role in reducing the risk of atherosclerotic cardiovascular disease (ASCVD). However, due to changes in diet and lifestyle, there has been a significant increase in the number of individuals with high cholesterol levels. Therefore, it is crucial to ensure the rational use of statins. Adverse reactions associated with statins, including liver enzyme abnormalities and statin-associated muscle symptoms (SAMS), have impacted their widespread utilization. In this study, we aimed to develop a predictive model for statin efficacy and safety based on real-world clinical data using machine learning techniques.

METHODS

We employed various data preprocessing techniques, such as improved random forest imputation and Borderline SMOTE oversampling, to handle the dataset. Boruta method was utilized for feature selection, and the dataset was divided into training and testing sets in a 7:3 ratio. Five algorithms, including logistic regression, naive Bayes, decision tree, random forest, and gradient boosting decision tree, were used to construct the predictive models. Ten-fold cross-validation and bootstrapping sampling were performed for internal and external validation. Additionally, SHAP (SHapley Additive exPlanations) was employed for feature interpretability. Ultimately, an accessible web-based platform for predicting statin efficacy and safety was established based on the optimal predictive model.

RESULTS

The random forest algorithm exhibited the best performance among the five algorithms. The predictive models for LDL-C target attainment (AUC = 0.883, Accuracy = 0.868, Precision = 0.858, Recall = 0.863, F1 = 0.860, AUPRC = 0.906, MCC = 0.761), liver enzyme abnormalities (AUC = 0.964, Accuracy = 0.964, Precision = 0.967, Recall = 0.963, F1 = 0.965, AUPRC = 0.978, MCC = 0.938), and muscle pain/Creatine kinase (CK) abnormalities (AUC = 0.981, Accuracy = 0.980, Precision = 0.987, Recall = 0.975, F1 = 0.981, AUPRC = 0.987, MCC = 0.965) demonstrated favorable performance. The most important features of LDL-C target attainment prediction model was cerebral infarction, TG, PLT and HDL. The most important features of liver enzyme abnormalities model was CRP, CK and number of oral medications. Similarly, AST, ALT, PLT and number of oral medications were found to be important features for muscle pain/CK abnormalities. Based on the best-performing predictive model, a user-friendly web application was designed and implemented.

CONCLUSION

This study presented a machine learning-based predictive model for statin efficacy and safety. The platform developed can assist in guiding statin therapy decisions and optimizing treatment strategies. Further research and application of the model are warranted to improve the utilization of statin therapy.

摘要

目的

合理使用他汀类药物在降低动脉粥样硬化性心血管疾病(ASCVD)风险方面起着至关重要的作用。然而,由于饮食和生活方式的改变,高胆固醇水平个体的数量显著增加。因此,确保他汀类药物的合理使用至关重要。与他汀类药物相关的不良反应,包括肝酶异常和他汀类药物相关肌肉症状(SAMS),影响了它们的广泛应用。在本研究中,我们旨在利用机器学习技术基于真实世界临床数据开发一种他汀类药物疗效和安全性的预测模型。

方法

我们采用了各种数据预处理技术,如改进的随机森林插补和边界合成少数类过采样(Borderline SMOTE)来处理数据集。使用Boruta方法进行特征选择,并将数据集按7:3的比例分为训练集和测试集。使用逻辑回归、朴素贝叶斯、决策树、随机森林和梯度提升决策树这五种算法构建预测模型。进行十折交叉验证和自助抽样以进行内部和外部验证。此外,使用SHAP(SHapley值相加解释)进行特征解释。最终,基于最优预测模型建立了一个可访问的基于网络的他汀类药物疗效和安全性预测平台。

结果

随机森林算法在五种算法中表现最佳。低密度脂蛋白胆固醇(LDL-C)达标预测模型(AUC = 0.883,准确率 = 0.868,精确率 = 0.858,召回率 = 0.863,F1 = 0.860,AUPRC = 0.906,MCC = 0.761)、肝酶异常预测模型(AUC = 0.964,准确率 = 0.964,精确率 = 0.967,召回率 = 0.963,F1 = 0.965,AUPRC = 0.978,MCC = 0.938)和肌肉疼痛/肌酸激酶(CK)异常预测模型(AUC = 0.981,准确率 = 0.980,精确率 = 0.987,召回率 = 0.975,F1 = 0.981,AUPRC = 0.987,MCC = 0.965)表现良好。LDL-C达标预测模型的最重要特征是脑梗死、甘油三酯(TG)、血小板(PLT)和高密度脂蛋白(HDL)。肝酶异常模型的最重要特征是C反应蛋白(CRP)、CK和口服药物数量。同样,天门冬氨酸氨基转移酶(AST)、丙氨酸氨基转移酶(ALT)、PLT和口服药物数量被发现是肌肉疼痛/CK异常的重要特征。基于性能最佳的预测模型,设计并实现了一个用户友好的网络应用程序。

结论

本研究提出了一种基于机器学习的他汀类药物疗效和安全性预测模型。所开发的平台可协助指导他汀类药物治疗决策并优化治疗策略。该模型的进一步研究和应用对于提高他汀类药物治疗的利用率是必要的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/a0ec3cc54c21/fphar-15-1334929-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/edaec1ca9b16/fphar-15-1334929-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/137a97b562c0/fphar-15-1334929-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/a75b9acf4666/fphar-15-1334929-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/e72babda9950/fphar-15-1334929-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/a0ec3cc54c21/fphar-15-1334929-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/edaec1ca9b16/fphar-15-1334929-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/137a97b562c0/fphar-15-1334929-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/a75b9acf4666/fphar-15-1334929-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/e72babda9950/fphar-15-1334929-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c6b/11317424/a0ec3cc54c21/fphar-15-1334929-g005.jpg

相似文献

1
Machine learning-based prediction model for the efficacy and safety of statins.基于机器学习的他汀类药物疗效和安全性预测模型。
Front Pharmacol. 2024 Jul 29;15:1334929. doi: 10.3389/fphar.2024.1334929. eCollection 2024.
2
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
3
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
4
Explainable Machine Learning Techniques To Predict Amiodarone-Induced Thyroid Dysfunction Risk: Multicenter, Retrospective Study With External Validation.可解释机器学习技术预测胺碘酮诱导甲状腺功能障碍风险:多中心回顾性研究及外部验证。
J Med Internet Res. 2023 Feb 7;25:e43734. doi: 10.2196/43734.
5
Interpretable machine learning model for early prediction of 28-day mortality in ICU patients with sepsis-induced coagulopathy: development and validation.用于脓毒症诱导性凝血病 ICU 患者 28 天死亡率早期预测的可解释机器学习模型:开发与验证。
Eur J Med Res. 2024 Jan 3;29(1):14. doi: 10.1186/s40001-023-01593-7.
6
Predicting cerebral edema in patients with spontaneous intracerebral hemorrhage using machine learning.使用机器学习预测自发性脑出血患者的脑水肿
Front Neurol. 2024 Oct 3;15:1419608. doi: 10.3389/fneur.2024.1419608. eCollection 2024.
7
Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules.将机器学习病理诊断算法纳入甲状腺超声成像数据可提高甲状腺恶性结节的诊断风险。
Front Oncol. 2022 Dec 8;12:968784. doi: 10.3389/fonc.2022.968784. eCollection 2022.
8
Prediction model of atrial fibrillation recurrence after Cox-Maze IV procedure in patients with chronic valvular disease and atrial fibrillation based on machine learning algorithm.基于机器学习算法的慢性瓣膜病合并心房颤动患者 Cox-Maze IV 术后心房颤动复发预测模型。
Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2023 Jul 28;48(7):995-1007. doi: 10.11817/j.issn.1672-7347.2023.230018.
9
Application of information theoretic feature selection and machine learning methods for the development of genetic risk prediction models.信息论特征选择和机器学习方法在遗传风险预测模型开发中的应用。
Sci Rep. 2021 Dec 2;11(1):23335. doi: 10.1038/s41598-021-00854-x.
10
Development and validation of a machine learning-based predictive model for assessing the 90-day prognostic outcome of patients with spontaneous intracerebral hemorrhage.基于机器学习的预测模型评估自发性脑出血患者 90 天预后结局的开发与验证。
J Transl Med. 2024 Mar 4;22(1):236. doi: 10.1186/s12967-024-04896-3.

引用本文的文献

1
Harnessing Statistical and Machine Learning Approaches to Analyze Oxidized LDL in Clinical Research.利用统计和机器学习方法在临床研究中分析氧化型低密度脂蛋白。
Cell Biochem Biophys. 2025 Aug 30. doi: 10.1007/s12013-025-01837-9.

本文引用的文献

1
Development and application of pharmacological statin-associated muscle symptoms phenotyping algorithms using structured and unstructured electronic health records data.使用结构化和非结构化电子健康记录数据开发和应用药理学他汀类药物相关肌肉症状表型分析算法。
JAMIA Open. 2023 Oct 24;6(4):ooad087. doi: 10.1093/jamiaopen/ooad087. eCollection 2023 Dec.
2
Development and validation of predictive model based on deep learning method for classification of dyslipidemia in Chinese medicine.基于深度学习方法的中医血脂异常分类预测模型的构建与验证
Health Inf Sci Syst. 2023 Apr 6;11(1):21. doi: 10.1007/s13755-023-00215-0. eCollection 2023 Dec.
3
Machine-learning-based analysis of the sensitivity and specificity on lipid-lowering effect of one-month-administered statins.
基于机器学习的他汀类药物一个月降脂效果的敏感性和特异性分析。
Medicine (Baltimore). 2023 Mar 3;102(9):e33139. doi: 10.1097/MD.0000000000033139.
4
A review on longitudinal data analysis with random forest.随机森林的纵向数据分析综述。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad002.
5
Long-term prognostic value of LDL-C, HDL-C, lp(a) and TG levels on cardiovascular disease incidence, by body weight status, dietary habits and lipid-lowering treatment: the ATTICA epidemiological cohort study (2002-2012).按体重状况、饮食习惯和降脂治疗划分的 LDL-C、HDL-C、lp(a) 和 TG 水平对心血管疾病发病的长期预后价值:ATTICA 流行病学队列研究(2002-2012 年)。
Lipids Health Dis. 2022 Dec 19;21(1):141. doi: 10.1186/s12944-022-01747-2.
6
A machine learning-based risk warning platform for potentially inappropriate prescriptions for elderly patients with cardiovascular disease.一个基于机器学习的针对老年心血管疾病患者潜在不适当处方的风险预警平台。
Front Pharmacol. 2022 Aug 11;13:804566. doi: 10.3389/fphar.2022.804566. eCollection 2022.
7
Statin Usage in Primary Prevention-Comparing the USPSTF Recommendations With the AHA/ACC/Multisociety Guidelines.一级预防中他汀类药物的使用——比较美国预防服务工作组(USPSTF)建议与美国心脏协会(AHA)/美国心脏病学会(ACC)/多学会指南
JAMA Cardiol. 2022 Oct 1;7(10):997-999. doi: 10.1001/jamacardio.2022.2851.
8
Association Between High-Density Lipoprotein Cholesterol Levels and Adverse Cardiovascular Outcomes in High-risk Populations.高密度脂蛋白胆固醇水平与高危人群不良心血管结局的关系。
JAMA Cardiol. 2022 Jul 1;7(7):672-680. doi: 10.1001/jamacardio.2022.0912.
9
Temporal Trends in the Remaining Lifetime Risk of Cardiovascular Disease Among Middle-Aged Adults Across 6 Decades: The Framingham Study.六十年间中年人群心血管疾病剩余寿命风险的时间趋势:弗雷明汉研究。
Circulation. 2022 Apr 26;145(17):1324-1338. doi: 10.1161/CIRCULATIONAHA.121.057889. Epub 2022 Apr 18.
10
Machine learning in medical applications: A review of state-of-the-art methods.机器学习在医学应用中的应用:最新方法综述。
Comput Biol Med. 2022 Jun;145:105458. doi: 10.1016/j.compbiomed.2022.105458. Epub 2022 Mar 28.