Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care.

Authors

Akyea Ralph K, Qureshi Nadeem, Kai Joe, Weng Stephen F

Affiliation

Primary Care Stratified Medicine, Division of Primary Care, University of Nottingham, Nottingham, UK.

Publication details

NPJ Digit Med. 2020 Oct 30;3:142. doi: 10.1038/s41746-020-00349-5. eCollection 2020.

DOI:10.1038/s41746-020-00349-5

PMID:33145438

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7603302/

Abstract

Familial hypercholesterolaemia (FH) is a common inherited disorder, causing lifelong elevated low-density lipoprotein cholesterol (LDL-C). Most individuals with FH remain undiagnosed, precluding opportunities to prevent premature heart disease and death. Some machine-learning approaches improve detection of FH in electronic health records, though their clinical impact is under-explored. We assessed the performance of an array of machine-learning approaches for enhancing detection of FH, and their clinical utility, within a large primary care population. A retrospective cohort study was conducted using routine primary care clinical records of 4,027,775 individuals from the United Kingdom with total cholesterol measured from 1 January 1999 to 25 June 2019. The predictive accuracy of five common machine-learning algorithms (logistic regression, random forest, gradient boosting machines, neural networks and ensemble learning) was assessed for detecting FH. Predictive accuracy was assessed by area under the receiver operating characteristic curve (AUC) and expected vs observed calibration slope, with clinical utility assessed by expected case-review workload and likelihood ratios. There were 7,928 incident diagnoses of FH. In addition to known clinical features of FH (raised total cholesterol or LDL-C and family history of premature coronary heart disease), machine-learning (ML) algorithms identified features, such as raised triglycerides, that reduced the likelihood of FH. Apart from logistic regression (AUC, 0.81), all four other ML approaches had similarly high predictive accuracy (AUC > 0.89). Calibration slope ranged from 0.997 for gradient boosting machines to 1.857 for logistic regression. Among those screened, high-probability cases requiring clinical review varied from 0.73% using ensemble learning to 10.16% using deep learning (neural networks), with positive predictive values of 15.5% and 2.8% respectively. Ensemble learning exhibited a dominant positive likelihood ratio (45.5) compared to all other ML models (7.0-14.4). Machine-learning models show similar high accuracy in detecting FH, offering opportunities to increase diagnosis. However, the clinical case-finding workload required for yield of cases will differ substantially between models.
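The clinical-utility measures named in the abstract (case-review workload, positive predictive value and likelihood ratios) can all be derived from the four cells of a confusion matrix. The sketch below is a minimal illustration under stated assumptions, not the authors' analysis code: the class `ConfusionCounts`, the function `clinical_utility` and the counts are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Hypothetical confusion-matrix cells for one model at one threshold."""
    tp: int  # flagged as high probability, FH diagnosed
    fp: int  # flagged as high probability, no FH diagnosis
    fn: int  # not flagged, FH diagnosed
    tn: int  # not flagged, no FH diagnosis


def clinical_utility(c: ConfusionCounts) -> dict[str, float]:
    """Derive workload, PPV and likelihood ratios from the counts."""
    total = c.tp + c.fp + c.fn + c.tn
    sensitivity = c.tp / (c.tp + c.fn)
    false_positive_rate = c.fp / (c.fp + c.tn)  # equals 1 - specificity
    specificity = c.tn / (c.fp + c.tn)
    return {
        # Share of the screened population flagged for clinical review.
        "review_workload": (c.tp + c.fp) / total,
        # Share of flagged individuals who actually have FH.
        "ppv": c.tp / (c.tp + c.fp),
        # How much a positive flag raises the odds of FH.
        "lr_positive": sensitivity / false_positive_rate,
        # How much a negative result lowers the odds of FH.
        "lr_negative": (1 - sensitivity) / specificity,
    }


# Illustrative counts only (assumed, not from the paper): 100,000 screened.
counts = ConfusionCounts(tp=120, fp=880, fn=80, tn=98_920)
metrics = clinical_utility(counts)

for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Because workload and PPV depend on how common FH is in the screened population while likelihood ratios do not, two models with similar AUC can still differ widely in how many records clinicians must review per case found, which is the contrast the abstract draws.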

Figure (image from the original article): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97f/7603302/9c4cd2b29cd6/41746_2020_349_Fig2_HTML.jpg
