• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

运用可视化和特征选择方法,将不规则和不平衡数据应用于预测糖尿病肾病。

Application of irregular and unbalanced data to predict diabetic nephropathy using visualization and feature selection methods.

作者信息

Cho Baek Hwan, Yu Hwanjo, Kim Kwang-Won, Kim Tae Hyun, Kim In Young, Kim Sun I

机构信息

Department of Biomedical Engineering, Hanyang University, Seoul, Republic of Korea.

出版信息

Artif Intell Med. 2008 Jan;42(1):37-53. doi: 10.1016/j.artmed.2007.09.005. Epub 2007 Nov 7.

DOI:10.1016/j.artmed.2007.09.005
PMID:17997291
Abstract

OBJECTIVE

Diabetic nephropathy is damage to the kidney caused by diabetes mellitus. It is a common complication and a leading cause of death in people with diabetes. However, the decline in kidney function varies considerably between patients and the determinants of diabetic nephropathy have not been clearly identified. Therefore, it is very difficult to predict the onset of diabetic nephropathy accurately with simple statistical approaches such as t-test or chi(2)-test. To accurately predict the onset of diabetic nephropathy, we applied various machine learning techniques to irregular and unbalanced diabetes dataset, such as support vector machine (SVM) classification and feature selection methods. Visualization of the risk factors was another important objective to give physicians intuitive information on each patient's clinical pattern.

METHODS AND MATERIALS

We collected medical data from 292 patients with diabetes and performed preprocessing to extract 184 features from the irregular data. To predict the onset of diabetic nephropathy, we compared several classification methods such as logistic regression, SVM, and SVM with a cost sensitive learning method. We also applied several feature selection methods to remove redundant features and improve the classification performance. For risk factor analysis with SVM classifiers, we have developed a new visualization system which uses a nomogram approach.

RESULTS

Linear SVM classifiers combined with wrapper or embedded feature selection methods showed the best results. Among the 184 features, the classifiers selected the same 39 features and gave 0.969 of the area under the curve by receiver operating characteristics analysis. The visualization tool was able to present the effect of each feature on the decision via graphical output.

CONCLUSIONS

Our proposed method can predict the onset of diabetic nephropathy about 2-3 months before the actual diagnosis with high prediction performance from an irregular and unbalanced dataset, which statistical methods such as t-test and logistic regression could not achieve. Additionally, the visualization system provides physicians with intuitive information for risk factor analysis. Therefore, physicians can benefit from the automatic early warning of each patient and visualize risk factors, which facilitate planning of effective and proper treatment strategies.

摘要

目的

糖尿病肾病是由糖尿病引起的肾脏损害。它是糖尿病患者常见的并发症和主要死因。然而,患者之间的肾功能下降差异很大,糖尿病肾病的决定因素尚未明确确定。因此,使用t检验或卡方检验等简单统计方法很难准确预测糖尿病肾病的发病情况。为了准确预测糖尿病肾病的发病,我们将各种机器学习技术应用于不规则且不平衡的糖尿病数据集,如支持向量机(SVM)分类和特征选择方法。危险因素的可视化是另一个重要目标,以便为医生提供有关每个患者临床模式的直观信息。

方法和材料

我们收集了292例糖尿病患者的医疗数据,并进行预处理以从不规则数据中提取184个特征。为了预测糖尿病肾病的发病,我们比较了几种分类方法,如逻辑回归、支持向量机以及采用成本敏感学习方法的支持向量机。我们还应用了几种特征选择方法来去除冗余特征并提高分类性能。对于使用支持向量机分类器进行危险因素分析,我们开发了一种使用列线图方法的新可视化系统。

结果

结合包装器或嵌入式特征选择方法应用的线性支持向量机分类器显示出最佳结果。在184个特征中,分类器选择了相同的39个特征,通过接受者操作特征分析得出曲线下面积为0.969。可视化工具能够通过图形输出呈现每个特征对决策的影响。

结论

我们提出的方法能够在实际诊断前约2至3个月预测糖尿病肾病的发病,具有较高的预测性能,这是t检验和逻辑回归等统计方法无法实现的,因为数据集不规则且不平衡。此外,可视化系统为医生提供了用于危险因素分析的直观信息。因此,医生可以从对每个患者的自动早期预警中受益,并可视化危险因素,这有助于制定有效且适当的治疗策略。

相似文献

1
Application of irregular and unbalanced data to predict diabetic nephropathy using visualization and feature selection methods.运用可视化和特征选择方法,将不规则和不平衡数据应用于预测糖尿病肾病。
Artif Intell Med. 2008 Jan;42(1):37-53. doi: 10.1016/j.artmed.2007.09.005. Epub 2007 Nov 7.
2
A decision support system to facilitate management of patients with acute gastrointestinal bleeding.一个有助于急性胃肠道出血患者管理的决策支持系统。
Artif Intell Med. 2008 Mar;42(3):247-59. doi: 10.1016/j.artmed.2007.10.003. Epub 2007 Dec 11.
3
A novel feature selection approach for biomedical data classification.一种用于生物医学数据分类的新特征选择方法。
J Biomed Inform. 2010 Feb;43(1):15-23. doi: 10.1016/j.jbi.2009.07.008. Epub 2009 Jul 30.
4
Comparison of machine learning and traditional classifiers in glaucoma diagnosis.机器学习与传统分类器在青光眼诊断中的比较
IEEE Trans Biomed Eng. 2002 Sep;49(9):963-74. doi: 10.1109/TBME.2002.802012.
5
Feature selection and classification model construction on type 2 diabetic patients' data.基于2型糖尿病患者数据的特征选择与分类模型构建
Artif Intell Med. 2007 Nov;41(3):251-62. doi: 10.1016/j.artmed.2007.07.002. Epub 2007 Aug 17.
6
GenSo-FDSS: a neural-fuzzy decision support system for pediatric ALL cancer subtype identification using gene expression data.GenSo-FDSS:一种基于基因表达数据的用于小儿急性淋巴细胞白血病癌症亚型识别的神经模糊决策支持系统。
Artif Intell Med. 2005 Jan;33(1):61-88. doi: 10.1016/j.artmed.2004.03.009.
7
Nonlinear support vector machine visualization for risk factor analysis using nomograms and localized radial basis function kernels.使用列线图和局部径向基函数核进行风险因素分析的非线性支持向量机可视化
IEEE Trans Inf Technol Biomed. 2008 Mar;12(2):247-56. doi: 10.1109/TITB.2007.902300.
8
Predicting variations of perceptual performance across individuals from neural activity using pattern classifiers.使用模式分类器从神经活动预测个体间感知性能的变化。
Neuroimage. 2010 Jul 15;51(4):1425-37. doi: 10.1016/j.neuroimage.2010.03.030. Epub 2010 Mar 17.
9
Analysis of EEG signals by combining eigenvector methods and multiclass support vector machines.结合特征向量方法和多类支持向量机对脑电图信号进行分析。
Comput Biol Med. 2008 Jan;38(1):14-22. doi: 10.1016/j.compbiomed.2007.06.002. Epub 2007 Jul 24.
10
Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers.基于统计几何学,使用随机森林和神经模糊分类器预测非同义单核苷酸多态性的功能效应
Proteins. 2008 Jun;71(4):1930-9. doi: 10.1002/prot.21838.

引用本文的文献

1
Development and validation of a transformer model-based early warning score for real-time prediction of adverse outcomes in the emergency department.基于变压器模型的急诊科不良结局实时预测预警评分的开发与验证
Sci Rep. 2025 Jul 2;15(1):23021. doi: 10.1038/s41598-025-07511-7.
2
Application of Machine Learning in Chronic Kidney Disease: Current Status and Future Prospects.机器学习在慢性肾脏病中的应用:现状与未来展望
Biomedicines. 2024 Mar 3;12(3):568. doi: 10.3390/biomedicines12030568.
3
Machine learning prediction models for diabetic kidney disease: systematic review and meta-analysis.
机器学习预测糖尿病肾病模型:系统评价和荟萃分析。
Endocrine. 2024 Jun;84(3):890-902. doi: 10.1007/s12020-023-03637-8. Epub 2023 Dec 23.
4
The automatic detection of diabetic kidney disease from retinal vascular parameters combined with clinical variables using artificial intelligence in type-2 diabetes patients.基于人工智能的 2 型糖尿病患者视网膜血管参数与临床变量联合自动检测糖尿病肾病。
BMC Med Inform Decis Mak. 2023 Oct 30;23(1):241. doi: 10.1186/s12911-023-02343-9.
5
Application of Machine Learning Algorithm in Predicting Axillary Lymph Node Metastasis from Breast Cancer on Preoperative Chest CT.机器学习算法在术前胸部CT预测乳腺癌腋窝淋巴结转移中的应用
Diagnostics (Basel). 2023 Sep 14;13(18):2953. doi: 10.3390/diagnostics13182953.
6
Detection of Cancer Recurrence Using Systemic Inflammatory Markers and Machine Learning after Concurrent Chemoradiotherapy for Head and Neck Cancers.头颈部癌症同步放化疗后使用全身炎症标志物和机器学习检测癌症复发
Cancers (Basel). 2023 Jul 8;15(14):3540. doi: 10.3390/cancers15143540.
7
Predicting diabetic nephropathy in type 2 diabetic patients using machine learning algorithms.使用机器学习算法预测2型糖尿病患者的糖尿病肾病
J Diabetes Metab Disord. 2022 Jul 26;21(2):1433-1441. doi: 10.1007/s40200-022-01076-2. eCollection 2022 Dec.
8
Effectiveness of Artificial Intelligence Models for Cardiovascular Disease Prediction: Network Meta-Analysis.人工智能模型在心血管疾病预测中的有效性:网络荟萃分析。
Comput Intell Neurosci. 2022 Feb 24;2022:5849995. doi: 10.1155/2022/5849995. eCollection 2022.
9
A Novel Extra Tree Ensemble Optimized DL Framework (ETEODL) for Early Detection of Diabetes.一种用于糖尿病早期检测的新型 Extra Tree 集成优化深度学习框架(ETEODL)。
Front Public Health. 2022 Feb 15;9:797877. doi: 10.3389/fpubh.2021.797877. eCollection 2021.
10
Machine Learning for Screening Microvascular Complications in Type 2 Diabetic Patients Using Demographic, Clinical, and Laboratory Profiles.利用人口统计学、临床和实验室资料,通过机器学习筛查2型糖尿病患者的微血管并发症
J Clin Med. 2022 Feb 9;11(4):903. doi: 10.3390/jcm11040903.