• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于事件自相似性的不平衡超声心动图数据自动特征选择

Automatic Feature Selection for Imbalanced Echocardiogram Data Using Event-Based Self-Similarity.

作者信息

Huang Huang-Nan, Chen Hong-Min, Lin Wei-Wen, Wiryasaputra Rita, Chen Yung-Cheng, Wang Yu-Huei, Yang Chao-Tung

机构信息

Department of Smart Computing and Applied Mathematics, Tunghai University, Taichung 407224, Taiwan.

Cardiovascular Center, Taichung Veterans General Hospital, Taichung 407219, Taiwan.

出版信息

Diagnostics (Basel). 2025 Apr 11;15(8):976. doi: 10.3390/diagnostics15080976.

DOI:10.3390/diagnostics15080976
PMID:40310367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12025907/
Abstract

Using echocardiogram data for cardiovascular disease (CVD) can lead to difficulties due to imbalanced datasets, leading to biased predictions. Machine learning models can enhance prognosis accuracy, but their effectiveness is influenced by optimal feature selection and robust classification techniques. This study introduces an event-based self-similarity approach to enhance automatic feature selection approach for imbalanced echocardiogram data. Critical features correlated with disease progression were identified by leveraging self-similarity patterns. This study used an echocardiogram dataset, visual presentations of high-frequency sound wave signals, and data of patients with heart disease who are treated using three treatment methods: catheter ablation, ventricular defibrillator, and drug control-over the course of three years. The dataset was classified into nine categories and Recursive Feature Elimination (RFE) was applied to identify the most relevant features, reducing model complexity while maintaining diagnostic accuracy. Machine learning classification models, including XGBoost and CATBoost, were trained and evaluated. Both models achieved comparable accuracy values, 84.3% and 88.4%, respectively, under different normalization techniques. To further optimize performance, the models were combined into a voting ensemble, improving feature selection and predictive accuracy. Four essential features-age, aorta (AO), left ventricular (LV), and left atrium (LA)-were identified as critical for prognosis and were found in Random Forest (RF)-voting ensemble classifier. The results underscore the importance of feature selection techniques in handling imbalanced datasets, improving classification robustness, and reducing bias in automated prognosis systems. Our findings highlight the potential of machine learning-driven echocardiogram analysis to enhance patient care by providing accurate, data-driven assessments.

摘要

由于数据集不平衡,使用超声心动图数据诊断心血管疾病(CVD)可能会遇到困难,从而导致预测偏差。机器学习模型可以提高预后准确性,但其有效性受最佳特征选择和强大分类技术的影响。本研究引入了一种基于事件的自相似性方法,以增强对不平衡超声心动图数据的自动特征选择方法。通过利用自相似性模式识别与疾病进展相关的关键特征。本研究使用了一个超声心动图数据集、高频声波信号的可视化呈现,以及采用三种治疗方法(导管消融、植入式心脏复律除颤器和药物控制)治疗三年的心脏病患者数据。该数据集被分为九类,并应用递归特征消除(RFE)来识别最相关的特征,在保持诊断准确性的同时降低模型复杂性。对包括XGBoost和CATBoost在内的机器学习分类模型进行了训练和评估。在不同的归一化技术下,这两种模型分别达到了84.3%和88.4%的可比准确率。为了进一步优化性能,将这些模型组合成一个投票集成模型,提高了特征选择和预测准确性。四个重要特征——年龄、主动脉(AO)、左心室(LV)和左心房(LA)——被确定为预后的关键因素,并在随机森林(RF)投票集成分类器中被发现。结果强调了特征选择技术在处理不平衡数据集、提高分类稳健性以及减少自动预后系统偏差方面的重要性。我们的研究结果突出了机器学习驱动的超声心动图分析通过提供准确的数据驱动评估来改善患者护理的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/ec1b2d04f478/diagnostics-15-00976-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/e8b02824d87c/diagnostics-15-00976-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/062bf72557e4/diagnostics-15-00976-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/f9c4b3a3ffef/diagnostics-15-00976-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/174ca1f26f64/diagnostics-15-00976-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/d2ed7bf3538f/diagnostics-15-00976-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/db0c6661b8bc/diagnostics-15-00976-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/4c8f739d5111/diagnostics-15-00976-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/a77233af7e05/diagnostics-15-00976-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/ec1b2d04f478/diagnostics-15-00976-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/e8b02824d87c/diagnostics-15-00976-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/062bf72557e4/diagnostics-15-00976-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/f9c4b3a3ffef/diagnostics-15-00976-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/174ca1f26f64/diagnostics-15-00976-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/d2ed7bf3538f/diagnostics-15-00976-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/db0c6661b8bc/diagnostics-15-00976-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/4c8f739d5111/diagnostics-15-00976-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/a77233af7e05/diagnostics-15-00976-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf82/12025907/ec1b2d04f478/diagnostics-15-00976-g008.jpg

相似文献

1
Automatic Feature Selection for Imbalanced Echocardiogram Data Using Event-Based Self-Similarity.基于事件自相似性的不平衡超声心动图数据自动特征选择
Diagnostics (Basel). 2025 Apr 11;15(8):976. doi: 10.3390/diagnostics15080976.
2
Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization.基于硬集合投票优化的冠状动脉疾病诊断。
Medicina (Kaunas). 2022 Nov 28;58(12):1745. doi: 10.3390/medicina58121745.
3
Employing feature engineering strategies to improve the performance of machine learning algorithms on echocardiogram dataset.采用特征工程策略提高机器学习算法在超声心动图数据集上的性能。
Digit Health. 2023 Oct 29;9:20552076231207589. doi: 10.1177/20552076231207589. eCollection 2023 Jan-Dec.
4
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models.使用机器学习多分类器集成模型预测糖尿病疾病。
BMC Bioinformatics. 2023 Sep 12;24(1):337. doi: 10.1186/s12859-023-05465-z.
5
Ensemble Feature Learning of Genomic Data Using Support Vector Machine.使用支持向量机的基因组数据集成特征学习
PLoS One. 2016 Jun 15;11(6):e0157330. doi: 10.1371/journal.pone.0157330. eCollection 2016.
6
Development of an efficient novel method for coronary artery disease prediction using machine learning and deep learning techniques.利用机器学习和深度学习技术开发一种用于冠心病预测的高效新方法。
Technol Health Care. 2024;32(6):4545-4569. doi: 10.3233/THC-240740.
7
An efficient computer vision-based approach for acute lymphoblastic leukemia prediction.一种基于计算机视觉的急性淋巴细胞白血病高效预测方法。
Front Comput Neurosci. 2022 Nov 24;16:1083649. doi: 10.3389/fncom.2022.1083649. eCollection 2022.
8
TabNet and TabTransformer: Novel Deep Learning Models for Chemical Toxicity Prediction in Comparison With Machine Learning.TabNet和TabTransformer:与机器学习相比用于化学毒性预测的新型深度学习模型。
J Appl Toxicol. 2025 May 1. doi: 10.1002/jat.4803.
9
Effective hybrid feature selection using different bootstrap enhances cancers classification performance.使用不同的自助法进行有效的混合特征选择可提高癌症分类性能。
BioData Min. 2022 Sep 30;15(1):24. doi: 10.1186/s13040-022-00304-y.
10
Prediction of Early Diagnosis in Ovarian Cancer Patients Using Machine Learning Approaches with Boruta and Advanced Feature Selection.使用带有Boruta和高级特征选择的机器学习方法预测卵巢癌患者的早期诊断
Life (Basel). 2025 Apr 3;15(4):594. doi: 10.3390/life15040594.

本文引用的文献

1
XAI-Augmented Voting Ensemble Models for Heart Disease Prediction: A SHAP and LIME-Based Approach.用于心脏病预测的XAI增强投票集成模型:一种基于SHAP和LIME的方法。
Bioengineering (Basel). 2024 Oct 12;11(10):1016. doi: 10.3390/bioengineering11101016.
2
Meta-analysis of deep learning approaches for automated coronary artery calcium scoring: Performance and clinical utility AI in CAC scoring: A meta-analysis: AI in CAC scoring: A meta-analysis.深度学习方法在自动冠状动脉钙评分中的应用的荟萃分析:性能和临床效用 AI 在 CAC 评分中的应用:荟萃分析:AI 在 CAC 评分中的应用:荟萃分析。
Comput Biol Med. 2024 Dec;183:109295. doi: 10.1016/j.compbiomed.2024.109295. Epub 2024 Oct 23.
3
Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases.
基于机器学习的心血管疾病检测预测模型
Diagnostics (Basel). 2024 Jan 8;14(2):144. doi: 10.3390/diagnostics14020144.
4
Machine learning using the extreme gradient boosting (XGBoost) algorithm predicts 5-day delta of SOFA score at ICU admission in COVID-19 patients.使用极端梯度提升(XGBoost)算法的机器学习可预测新冠肺炎患者入住重症监护病房(ICU)时序贯器官衰竭评估(SOFA)评分的5天变化值。
J Intensive Med. 2021 Oct 22;1(2):110-116. doi: 10.1016/j.jointm.2021.09.002. eCollection 2021 Oct.
5
Imbalanced ECG signal-based heart disease classification using ensemble machine learning technique.基于不平衡心电图信号的心脏病分类:使用集成机器学习技术
Front Big Data. 2022 Oct 10;5:1021518. doi: 10.3389/fdata.2022.1021518. eCollection 2022.
6
Interrater reliability estimators tested against true interrater reliabilities.评估者间信度估计值与真实评估者间信度进行比较。
BMC Med Res Methodol. 2022 Aug 29;22(1):232. doi: 10.1186/s12874-022-01707-5.
7
Prediction of acute kidney injury in patients with femoral neck fracture utilizing machine learning.利用机器学习预测股骨颈骨折患者的急性肾损伤
Front Surg. 2022 Jul 26;9:928750. doi: 10.3389/fsurg.2022.928750. eCollection 2022.
8
Predicting the Prognosis of Patients in the Coronary Care Unit: A Novel Multi-Category Machine Learning Model Using XGBoost.预测冠心病监护病房患者的预后:一种使用XGBoost的新型多类别机器学习模型。
Front Cardiovasc Med. 2022 May 12;9:764629. doi: 10.3389/fcvm.2022.764629. eCollection 2022.
9
Feature Selection in High Dimensional Biomedical Data Based on BF-SFLA.基于布谷鸟搜索-正弦余弦算法的高维生物医学数据特征选择
Front Neurosci. 2022 Apr 18;16:854685. doi: 10.3389/fnins.2022.854685. eCollection 2022.
10
A Joint Model of Random Forest and Artificial Neural Network for the Diagnosis of Endometriosis.一种用于子宫内膜异位症诊断的随机森林与人工神经网络联合模型
Front Genet. 2022 Mar 8;13:848116. doi: 10.3389/fgene.2022.848116. eCollection 2022.