• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估惩罚和机器学习方法在韩国基因组与流行病学研究(KoGES)中对哮喘病的预测作用。

Evaluation of penalized and machine learning methods for asthma disease prediction in the Korean Genome and Epidemiology Study (KoGES).

机构信息

Department of Applied Artificial Intelligence, College of Computing, Hanyang University, 55 Hanyang-daehak-ro, Sangnok-gu, Ansan, 15588, South Korea.

Department of Mathematical Data Science, College of Science and Convergence Technology, Hanyang University, 55 Hanyang-daehak-ro, Sangnok-gu, Ansan, 15588, South Korea.

出版信息

BMC Bioinformatics. 2024 Feb 2;25(1):56. doi: 10.1186/s12859-024-05677-x.

DOI:10.1186/s12859-024-05677-x
PMID:38308205
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10837879/
Abstract

BACKGROUND

Genome-wide association studies have successfully identified genetic variants associated with human disease. Various statistical approaches based on penalized and machine learning methods have recently been proposed for disease prediction. In this study, we evaluated the performance of several such methods for predicting asthma using the Korean Chip (KORV1.1) from the Korean Genome and Epidemiology Study (KoGES).

RESULTS

First, single-nucleotide polymorphisms were selected via single-variant tests using logistic regression with the adjustment of several epidemiological factors. Next, we evaluated the following methods for disease prediction: ridge, least absolute shrinkage and selection operator, elastic net, smoothly clipped absolute deviation, support vector machine, random forest, boosting, bagging, naïve Bayes, and k-nearest neighbor. Finally, we compared their predictive performance based on the area under the curve of the receiver operating characteristic curves, precision, recall, F1-score, Cohen's Kappa, balanced accuracy, error rate, Matthews correlation coefficient, and area under the precision-recall curve. Additionally, three oversampling algorithms are used to deal with imbalance problems.

CONCLUSIONS

Our results show that penalized methods exhibit better predictive performance for asthma than that achieved via machine learning methods. On the other hand, in the oversampling study, randomforest and boosting methods overall showed better prediction performance than penalized methods.

摘要

背景

全基因组关联研究已成功鉴定出与人类疾病相关的遗传变异。基于惩罚和机器学习方法的各种统计方法最近已被提出用于疾病预测。在这项研究中,我们使用来自韩国基因组和流行病学研究(KoGES)的韩国芯片(KORV1.1)评估了几种此类方法预测哮喘的性能。

结果

首先,通过使用逻辑回归进行单变量测试,在调整了几个流行病学因素后,选择了单核苷酸多态性。接下来,我们评估了以下疾病预测方法:岭回归、最小绝对收缩和选择算子、弹性网络、平滑裁剪绝对偏差、支持向量机、随机森林、提升、袋装、朴素贝叶斯和 K-最近邻。最后,我们根据接收者操作特征曲线的曲线下面积、精度、召回率、F1 分数、科恩氏 Kappa、平衡准确性、错误率、马修斯相关系数和精度召回曲线下面积来比较它们的预测性能。此外,还使用了三种过采样算法来解决不平衡问题。

结论

我们的结果表明,惩罚方法在预测哮喘方面表现出比机器学习方法更好的预测性能。另一方面,在过采样研究中,随机森林和提升方法总体上表现出比惩罚方法更好的预测性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/671cf4e9ab23/12859_2024_5677_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/f6e5c5bbd10f/12859_2024_5677_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/2e23e7701db4/12859_2024_5677_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/3de42de8d890/12859_2024_5677_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/671cf4e9ab23/12859_2024_5677_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/f6e5c5bbd10f/12859_2024_5677_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/2e23e7701db4/12859_2024_5677_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/3de42de8d890/12859_2024_5677_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/10837879/671cf4e9ab23/12859_2024_5677_Fig4_HTML.jpg

相似文献

1
Evaluation of penalized and machine learning methods for asthma disease prediction in the Korean Genome and Epidemiology Study (KoGES).评估惩罚和机器学习方法在韩国基因组与流行病学研究(KoGES)中对哮喘病的预测作用。
BMC Bioinformatics. 2024 Feb 2;25(1):56. doi: 10.1186/s12859-024-05677-x.
2
Machine-learning model predicting quality of life using multifaceted lifestyles in middle-aged South Korean adults: a cross-sectional study.利用中年韩国成年人多方面生活方式预测生活质量的机器学习模型:一项横断面研究。
BMC Public Health. 2024 Jan 11;24(1):159. doi: 10.1186/s12889-023-17457-y.
3
Prediction of metabolic and pre-metabolic syndromes using machine learning models with anthropometric, lifestyle, and biochemical factors from a middle-aged population in Korea.使用来自韩国中年人群的人体测量学、生活方式和生化因素的机器学习模型预测代谢和前代谢综合征。
BMC Public Health. 2022 Apr 6;22(1):664. doi: 10.1186/s12889-022-13131-x.
4
Explainable Machine Learning Techniques To Predict Amiodarone-Induced Thyroid Dysfunction Risk: Multicenter, Retrospective Study With External Validation.可解释机器学习技术预测胺碘酮诱导甲状腺功能障碍风险:多中心回顾性研究及外部验证。
J Med Internet Res. 2023 Feb 7;25:e43734. doi: 10.2196/43734.
5
Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction.比较多种机器学习算法和指标进行雌激素受体结合预测。
Mol Pharm. 2018 Oct 1;15(10):4361-4370. doi: 10.1021/acs.molpharmaceut.8b00546. Epub 2018 Aug 28.
6
Machine Learning-Based Identification of Diagnostic Biomarkers for Korean Male Sarcopenia Through Integrative DNA Methylation and Methylation Risk Score: From the Korean Genomic Epidemiology Study (KoGES).基于机器学习的韩国男性肌肉减少症诊断生物标志物的鉴定:来自韩国基因组流行病学研究(KoGES)的整合 DNA 甲基化和甲基化风险评分。
J Korean Med Sci. 2024 Jul 8;39(26):e200. doi: 10.3346/jkms.2024.39.e200.
7
Prediction of metabolic syndrome using machine learning approaches based on genetic and nutritional factors: a 14-year prospective-based cohort study.基于遗传和营养因素的机器学习方法预测代谢综合征:一项基于 14 年前瞻性队列研究。
BMC Med Genomics. 2024 Sep 4;17(1):224. doi: 10.1186/s12920-024-01998-1.
8
Machine learning-based prediction of hospital prolonged length of stay admission at emergency department: a Gradient Boosting algorithm analysis.基于机器学习的急诊科住院时间延长预测:梯度提升算法分析
Front Artif Intell. 2023 Jul 28;6:1179226. doi: 10.3389/frai.2023.1179226. eCollection 2023.
9
Multimodality radiomics prediction of radiotherapy-induced the early proctitis and cystitis in rectal cancer patients: a machine learning study.多模态放射组学预测直肠癌患者放疗诱导的早期直肠炎和膀胱炎:一项机器学习研究。
Biomed Phys Eng Express. 2023 Dec 20;10(1). doi: 10.1088/2057-1976/ad0f3e.
10
Application of supervised machine learning algorithms for classification and prediction of type-2 diabetes disease status in Afar regional state, Northeastern Ethiopia 2021.2021 年,埃塞俄比亚东北部阿法尔地区使用监督机器学习算法对 2 型糖尿病疾病状况进行分类和预测。
Sci Rep. 2023 May 13;13(1):7779. doi: 10.1038/s41598-023-34906-1.

本文引用的文献

1
Deep learning-based identification of genetic variants: application to Alzheimer's disease classification.基于深度学习的基因突变识别:在阿尔茨海默病分类中的应用。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac022.
2
Large-scale plasma proteomics can reveal distinct endotypes in chronic obstructive pulmonary disease and severe asthma.大规模血浆蛋白质组学可揭示慢性阻塞性肺疾病和重度哮喘的不同内型。
Clin Transl Allergy. 2021 Dec;11(10):e12091. doi: 10.1002/clt2.12091.
3
Bronchopulmonary dysplasia and wnt pathway-associated single nucleotide polymorphisms.
支气管肺发育不良与 Wnt 通路相关的单核苷酸多态性。
Pediatr Res. 2022 Sep;92(3):888-898. doi: 10.1038/s41390-021-01851-6. Epub 2021 Dec 1.
4
A genome-wide association study of quantitative computed tomographic emphysema in Korean populations.一项针对韩国人群定量 CT 肺气肿的全基因组关联研究。
Sci Rep. 2021 Aug 17;11(1):16692. doi: 10.1038/s41598-021-95887-7.
5
expression in asthmatic epithelium depends on intronic methylation and ZNF263 binding.在哮喘上皮细胞中的表达取决于内含子甲基化和ZNF263结合。
ERJ Open Res. 2021 Jun 7;7(2). doi: 10.1183/23120541.00802-2020. eCollection 2021 Apr.
6
Genetics and Epigenetics in Asthma.哮喘的遗传学和表观遗传学。
Int J Mol Sci. 2021 Feb 27;22(5):2412. doi: 10.3390/ijms22052412.
7
CADD-Splice-improving genome-wide variant effect prediction using deep learning-derived splice scores.使用深度学习衍生的剪接分数提高 CADD-Splice 全基因组变异效应预测。
Genome Med. 2021 Feb 22;13(1):31. doi: 10.1186/s13073-021-00835-9.
8
An Overview of the Application of Systems Biology in an Understanding of Chronic Rhinosinusitis (CRS) Development.系统生物学在理解慢性鼻-鼻窦炎(CRS)发展中的应用概述
J Pers Med. 2020 Nov 26;10(4):245. doi: 10.3390/jpm10040245.
9
Investigation of circulating lncRNAs as potential biomarkers in chronic respiratory diseases.循环长链非编码RNA作为慢性呼吸道疾病潜在生物标志物的研究
J Transl Med. 2020 Nov 10;18(1):422. doi: 10.1186/s12967-020-02581-9.
10
Acute 4,4'-Methylene Diphenyl Diisocyanate Exposure-Mediated Downregulation of miR-206-3p and miR-381-3p Activates Inducible Nitric Oxide Synthase Transcription by Targeting Calcineurin/NFAT Signaling in Macrophages.急性 4,4'-亚甲基二苯基二异氰酸酯暴露通过靶向巨噬细胞中的钙调神经磷酸酶/NFAT 信号下调 miR-206-3p 和 miR-381-3p 激活诱导型一氧化氮合酶转录。
Toxicol Sci. 2020 Jan 1;173(1):100-113. doi: 10.1093/toxsci/kfz215.