Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes.

Affiliation

National Office of Public Health Genomics, Coordinating Center for Health Promotion, Centers for Disease Control and Prevention, Atlanta, GA, USA.

Publication information

BMC Med Inform Decis Mak. 2010 Mar 22;10:16. doi: 10.1186/1472-6947-10-16.

PMID:20307319

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2850872/

Abstract

BACKGROUND

We present a potentially useful alternative approach, based on support vector machine (SVM) techniques, for classifying persons with and without common diseases. We illustrate the method by detecting persons with diabetes and pre-diabetes in a cross-sectional, representative sample of the U.S. population.

METHODS

We used data from the 1999-2004 National Health and Nutrition Examination Survey (NHANES) to develop and validate SVM models for two classification schemes: Classification Scheme I (diagnosed or undiagnosed diabetes vs. pre-diabetes or no diabetes) and Classification Scheme II (undiagnosed diabetes or pre-diabetes vs. no diabetes). The SVM models were used to select sets of variables that would yield the best classification of individuals into these diabetes categories.
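As a rough illustration of the kind of classifier described above, the following is a minimal sketch of a linear soft-margin SVM trained by subgradient descent. It is not the authors' implementation: the abstract does not state the kernel, software, or tuning used, and the data here are synthetic (two made-up standardized features, not NHANES variables). The function names `train_linear_svm`, `decision_function`, and `make_synthetic` are hypothetical.

```python
import random

def train_linear_svm(X, y, lam=0.01, lr=0.01, epochs=50, seed=0):
    """Subgradient descent on hinge loss plus an L2 penalty.
    X: list of feature lists; y: labels in {-1, +1}."""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    b = 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            score = sum(wj * xj for wj, xj in zip(w, X[i])) + b
            # The L2 penalty shrinks the weights on every step.
            w = [wj - lr * lam * wj for wj in w]
            # The hinge loss contributes only when the margin is violated.
            if y[i] * score < 1:
                w = [wj + lr * y[i] * xj for wj, xj in zip(w, X[i])]
                b += lr * y[i]
    return w, b

def decision_function(w, b, x):
    """Signed score; positive values fall on the 'case' side of the boundary."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b

def make_synthetic(n_per_class=100, seed=1):
    """Two Gaussian clusters standing in for cases (+1) and non-cases (-1).
    Purely synthetic; no real survey data are used."""
    rng = random.Random(seed)
    X, y = [], []
    for label, center in ((1, 2.0), (-1, -2.0)):
        for _ in range(n_per_class):
            X.append([rng.gauss(center, 1.0), rng.gauss(center, 1.0)])
            y.append(label)
    return X, y

X, y = make_synthetic()
w, b = train_linear_svm(X, y)
predictions = [1 if decision_function(w, b, x) >= 0 else -1 for x in X]
accuracy = sum(p == t for p, t in zip(predictions, y)) / len(y)
print(f"training accuracy: {accuracy:.3f}")
```

In practice, an established library would normally be used instead of a hand-written optimizer, and performance would be assessed on held-out data rather than on the training set as in this sketch.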

RESULTS

For Classification Scheme I, the set of diabetes-related variables with the best classification performance included family history, age, race and ethnicity, weight, height, waist circumference, body mass index (BMI), and hypertension. For Classification Scheme II, two additional variables, sex and physical activity, were included. The discriminative abilities of the SVM models for Classification Schemes I and II, measured by the area under the receiver operating characteristic (ROC) curve, were 83.5% and 73.2%, respectively. A web-based tool, Diabetes Classifier, was developed to demonstrate a user-friendly application that supports individual or group assessment with a configurable, user-defined threshold.
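To make the two evaluation ideas above concrete, here is a small sketch of how an area under the ROC curve can be computed from classifier scores, and how a user-defined threshold trades sensitivity against specificity. The scores and labels are invented for illustration and have no connection to the study's data; the helper names `roc_auc`, `classify`, and `sensitivity_specificity` are hypothetical and are not taken from the Diabetes Classifier tool.

```python
def roc_auc(scores, labels):
    """AUC as the probability that a randomly chosen case (label 1)
    scores higher than a randomly chosen non-case (label 0).
    Ties count as one half."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one non-case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def classify(scores, threshold):
    """Flag a person as a predicted case when the score reaches the threshold."""
    return [1 if s >= threshold else 0 for s in scores]

def sensitivity_specificity(predicted, labels):
    """Return (sensitivity, specificity) for binary predictions."""
    tp = sum(1 for p, l in zip(predicted, labels) if p == 1 and l == 1)
    fn = sum(1 for p, l in zip(predicted, labels) if p == 0 and l == 1)
    tn = sum(1 for p, l in zip(predicted, labels) if p == 0 and l == 0)
    fp = sum(1 for p, l in zip(predicted, labels) if p == 1 and l == 0)
    return tp / (tp + fn), tn / (tn + fp)

# Invented scores for eight people; 1 = case, 0 = non-case.
scores = [0.9, 0.8, 0.35, 0.6, 0.4, 0.2, 0.7, 0.1]
labels = [1, 1, 1, 0, 0, 0, 1, 0]

auc = roc_auc(scores, labels)
strict = sensitivity_specificity(classify(scores, 0.5), labels)
lenient = sensitivity_specificity(classify(scores, 0.3), labels)
print(auc, strict, lenient)
```

Lowering the threshold from 0.5 to 0.3 in this toy example flags more people, which raises sensitivity and lowers specificity. A configurable threshold exposes this kind of trade-off to the user.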

CONCLUSIONS

Support vector machine modeling is a promising classification approach for detecting persons with common diseases such as diabetes and pre-diabetes in the population. This approach should be further explored in other complex diseases using common variables.
