Suppr超能文献

潜在功能性单核苷酸多态性在预测阿托伐他汀诱导性肌痛的机器学习模型中的稳健性能

Robust Performance of Potentially Functional SNPs in Machine Learning Models for the Prediction of Atorvastatin-Induced Myalgia.

作者信息

Ooi Brandon N S, Ying Ariel F, Koh Yong Zher, Jin Yu, Yee Sherman W L, Lee Justin H S, Chong Samuel S, Tan Jack W C, Liu Jianjun, Lee Caroline G, Drum Chester L

机构信息

Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Dundee, Singapore.

Duke-NUS Graduate School, Singapore, Singapore.

出版信息

Front Pharmacol. 2021 Apr 22;12:605764. doi: 10.3389/fphar.2021.605764. eCollection 2021.

Abstract

Statins can cause muscle symptoms resulting in poor adherence to therapy and increased cardiovascular risk. We hypothesize that combinations of potentially functional SNPs (pfSNPs), rather than individual SNPs, better predict myalgia in patients on atorvastatin. This study assesses the value of potentially functional single nucleotide polymorphisms (pfSNPs) and employs six machine learning algorithms to identify the combination of SNPs that best predict myalgia. Whole genome sequencing of 183 Chinese, Malay and Indian patients from Singapore was conducted to identify genetic variants associated with atorvastatin induced myalgia. To adjust for confounding factors, demographic and clinical characteristics were also examined for their association with myalgia. The top factor, sex, was then used as a covariate in the whole genome association analyses. Variants that were highly associated with myalgia from this and previous studies were extracted, assessed for potential functionality (pfSNPs) and incorporated into six machine learning models. Predictive performance of a combination of different models and inputs were compared using the average cross validation area under ROC curve (AUC). The minimum combination of SNPs to achieve maximum sensitivity and specificity as determined by AUC, that predict atorvastatin-induced myalgia in most, if not all the six machine learning models was determined. Through whole genome association analyses using sex as a covariate, a larger proportion of pfSNPs compared to non-pf SNPs were found to be highly associated with myalgia. Although none of the individual SNPs achieved genome wide significance in univariate analyses, machine learning models identified a combination of 15 SNPs that predict myalgia with good predictive performance (AUC >0.9). SNPs within genes identified in this study significantly outperformed SNPs within genes previously reported to be associated with myalgia. pfSNPs were found to be more robust in predicting myalgia, outperforming non-pf SNPs in the majority of machine learning models tested. Combinations of pfSNPs that were consistently identified by different machine learning models to have high predictive performance have good potential to be clinically useful for predicting atorvastatin-induced myalgia once validated against an independent cohort of patients.

摘要

他汀类药物可引起肌肉症状,导致治疗依从性差并增加心血管疾病风险。我们推测,潜在功能性单核苷酸多态性(pfSNP)的组合而非单个SNP,能更好地预测阿托伐他汀治疗患者的肌痛。本研究评估了潜在功能性单核苷酸多态性(pfSNP)的价值,并采用六种机器学习算法来识别最能预测肌痛的SNP组合。对来自新加坡的183名中国、马来和印度患者进行了全基因组测序,以确定与阿托伐他汀诱导的肌痛相关的基因变异。为了调整混杂因素,还检查了人口统计学和临床特征与肌痛的关联。然后,将首要因素性别用作全基因组关联分析的协变量。从本研究及先前研究中提取与肌痛高度相关的变异,评估其潜在功能(pfSNP),并纳入六个机器学习模型。使用ROC曲线下的平均交叉验证面积(AUC)比较不同模型和输入组合的预测性能。确定了在六个机器学习模型中(即使不是全部)大多数能预测阿托伐他汀诱导肌痛的、达到最大敏感性和特异性所需的SNP的最小组合。通过以性别为协变量的全基因组关联分析,发现与非pfSNP相比,更大比例的pfSNP与肌痛高度相关。虽然在单变量分析中没有单个SNP达到全基因组显著性,但机器学习模型识别出一组15个SNP的组合,其预测肌痛的性能良好(AUC>0.9)。本研究中鉴定出的基因内SNP显著优于先前报道的与肌痛相关基因内的SNP。发现pfSNP在预测肌痛方面更具稳健性,在大多数测试的机器学习模型中优于非pfSNP。一旦在独立患者队列中得到验证,不同机器学习模型一致鉴定出的具有高预测性能的pfSNP组合在临床上很有可能用于预测阿托伐他汀诱导的肌痛。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53af/8100589/a89012bfe978/fphar-12-605764-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验