• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积神经网络的光谱成像心音分类中数据增强方法的分析。

On the analysis of data augmentation methods for spectral imaged based heart sound classification using convolutional neural networks.

机构信息

Weill Cornell Medicine, New York, NY, 10021, USA.

出版信息

BMC Med Inform Decis Mak. 2022 Aug 29;22(1):226. doi: 10.1186/s12911-022-01942-2.

DOI:10.1186/s12911-022-01942-2
PMID:36038901
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9421122/
Abstract

BACKGROUND

The application of machine learning to cardiac auscultation has the potential to improve the accuracy and efficiency of both routine and point-of-care screenings. The use of convolutional neural networks (CNN) on heart sound spectrograms in particular has defined state-of-the-art performance. However, the relative paucity of patient data remains a significant barrier to creating models that can adapt to a wide range of potential variability. To that end, we examined a CNN model's performance on automated heart sound classification, before and after various forms of data augmentation, and aimed to identify the most optimal augmentation methods for cardiac spectrogram analysis.

RESULTS

We built a standard CNN model to classify cardiac sound recordings as either normal or abnormal. The baseline control model achieved a PR AUC of 0.763 ± 0.047. Among the single data augmentation techniques explored, horizontal flipping of the spectrogram image improved the model performance the most, with a PR AUC of 0.819 ± 0.044. Principal component analysis color augmentation (PCA) and perturbations of saturation-value (SV) of the hue-saturation-value (HSV) color scale achieved a PR AUC of 0.779 ± 045 and 0.784 ± 0.037, respectively. Time and frequency masking resulted in a PR AUC of 0.772 ± 0.050. Pitch shifting, time stretching and compressing, noise injection, vertical flipping, and applying random color filters negatively impacted model performance. Concatenating the best performing data augmentation technique (horizontal flip) with PCA and SV perturbations improved model performance.

CONCLUSION

Data augmentation can improve classification accuracy by expanding and diversifying the dataset, which protects against overfitting to random variance. However, data augmentation is necessarily domain specific. For example, methods like noise injection have found success in other areas of automated sound classification, but in the context of cardiac sound analysis, noise injection can mimic the presence of murmurs and worsen model performance. Thus, care should be taken to ensure clinically appropriate forms of data augmentation to avoid negatively impacting model performance.

摘要

背景

机器学习在心脏听诊中的应用有可能提高常规和即时筛查的准确性和效率。特别是使用卷积神经网络 (CNN) 对心音声谱图的应用,已经达到了最先进的性能水平。然而,患者数据的相对缺乏仍然是创建能够适应广泛潜在变异性的模型的一个重大障碍。为此,我们检查了一个 CNN 模型在自动心音分类中的性能,在进行各种形式的数据增强前后,并旨在确定最适合心脏声谱分析的数据增强方法。

结果

我们构建了一个标准的 CNN 模型来对心脏声音记录进行正常或异常分类。基线控制模型的 PR AUC 为 0.763±0.047。在探索的单一数据增强技术中,声谱图图像的水平翻转最能提高模型性能,PR AUC 为 0.819±0.044。主成分分析颜色增强 (PCA) 和色调-饱和度-值 (HSV) 颜色通道饱和度-值 (SV) 的扰动分别达到了 0.779±0.045 和 0.784±0.037 的 PR AUC。时间和频率掩蔽的 PR AUC 为 0.772±0.050。音高移动、时间拉伸和压缩、噪声注入、垂直翻转和应用随机颜色滤波器对模型性能产生了负面影响。将性能最佳的数据增强技术(水平翻转)与 PCA 和 SV 扰动相结合,提高了模型性能。

结论

数据增强可以通过扩展和多样化数据集来提高分类准确性,从而防止过度拟合随机方差。然而,数据增强是特定于领域的。例如,噪声注入等方法在自动化声音分类的其他领域取得了成功,但在心脏声音分析的背景下,噪声注入可以模拟杂音的存在,并降低模型性能。因此,应该小心谨慎地进行数据增强,以确保采用临床适当的数据增强形式,避免对模型性能产生负面影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/6df7e02a038d/12911_2022_1942_Fig11a_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/742d90766b95/12911_2022_1942_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/50c6533a0557/12911_2022_1942_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/642671bbbabc/12911_2022_1942_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/0f2c54c55a76/12911_2022_1942_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/51b3de5b7931/12911_2022_1942_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/6e4b61c8dc89/12911_2022_1942_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/70d997eeb573/12911_2022_1942_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/e5d9769b83c8/12911_2022_1942_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/b12e86eeebe4/12911_2022_1942_Fig9a_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/ffc976de13ea/12911_2022_1942_Fig10a_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/6df7e02a038d/12911_2022_1942_Fig11a_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/742d90766b95/12911_2022_1942_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/50c6533a0557/12911_2022_1942_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/642671bbbabc/12911_2022_1942_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/0f2c54c55a76/12911_2022_1942_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/51b3de5b7931/12911_2022_1942_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/6e4b61c8dc89/12911_2022_1942_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/70d997eeb573/12911_2022_1942_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/e5d9769b83c8/12911_2022_1942_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/b12e86eeebe4/12911_2022_1942_Fig9a_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/ffc976de13ea/12911_2022_1942_Fig10a_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc65/9422104/6df7e02a038d/12911_2022_1942_Fig11a_HTML.jpg

相似文献

1
On the analysis of data augmentation methods for spectral imaged based heart sound classification using convolutional neural networks.基于卷积神经网络的光谱成像心音分类中数据增强方法的分析。
BMC Med Inform Decis Mak. 2022 Aug 29;22(1):226. doi: 10.1186/s12911-022-01942-2.
2
Cardiac anomaly detection considering an additive noise and convolutional distortion model of heart sound recordings.考虑心音记录的附加噪声和卷积失真模型的心脏异常检测。
Artif Intell Med. 2022 Nov;133:102417. doi: 10.1016/j.artmed.2022.102417. Epub 2022 Oct 7.
3
Feature-Based Fusion Using CNN for Lung and Heart Sound Classification.基于特征融合的 CNN 用于心肺音分类。
Sensors (Basel). 2022 Feb 16;22(4):1521. doi: 10.3390/s22041521.
4
Design of ear-contactless stethoscope and improvement in the performance of deep learning based on CNN to classify the heart sound.非接触式听诊器的设计及基于 CNN 的深度学习性能改进以用于心音分类。
Med Biol Eng Comput. 2023 Sep;61(9):2417-2439. doi: 10.1007/s11517-023-02827-w. Epub 2023 Apr 27.
5
Heart sound classification based on improved MFCC features and convolutional recurrent neural networks.基于改进 MFCC 特征和卷积循环神经网络的心音分类。
Neural Netw. 2020 Oct;130:22-32. doi: 10.1016/j.neunet.2020.06.015. Epub 2020 Jun 23.
6
Improving skin cancer detection by Raman spectroscopy using convolutional neural networks and data augmentation.利用卷积神经网络和数据增强技术通过拉曼光谱改善皮肤癌检测
Front Oncol. 2024 Jun 19;14:1320220. doi: 10.3389/fonc.2024.1320220. eCollection 2024.
7
Recurrent vs Non-Recurrent Convolutional Neural Networks for Heart Sound Classification.递归与非递归卷积神经网络在心脏音分类中的应用。
Stud Health Technol Inform. 2023 Jun 29;305:436-439. doi: 10.3233/SHTI230525.
8
A CNN Sound Classification Mechanism Using Data Augmentation.一种使用数据增强的卷积神经网络声音分类机制。
Sensors (Basel). 2023 Aug 5;23(15):6972. doi: 10.3390/s23156972.
9
Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks.基于卷积神经网络快照集成的肺音分类
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:760-763. doi: 10.1109/EMBC44109.2020.9176076.
10
Fundamental Heart Sound Classification using the Continuous Wavelet Transform and Convolutional Neural Networks.基于连续小波变换和卷积神经网络的基本心音分类
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:409-412. doi: 10.1109/EMBC.2018.8512284.

引用本文的文献

1
Deep learning-based evaluation of the severity of mitral regurgitation in canine myxomatous mitral valve disease patients using digital stethoscope recordings.基于深度学习,利用数字听诊器记录评估犬黏液瘤性二尖瓣疾病患者二尖瓣反流的严重程度
BMC Vet Res. 2025 May 8;21(1):326. doi: 10.1186/s12917-025-04802-z.
2
Real-Time PPG-Based Biometric Identification: Advancing Security with 2D Gram Matrices and Deep Learning Models.基于实时PPG的生物特征识别:利用二维Gram矩阵和深度学习模型提升安全性
Sensors (Basel). 2024 Dec 25;25(1):40. doi: 10.3390/s25010040.
3
Phonocardiogram (PCG) Murmur Detection Based on the Mean Teacher Method.

本文引用的文献

1
Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强
J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.
2
Heartbeat Sound Signal Classification Using Deep Learning.基于深度学习的心跳声信号分类。
Sensors (Basel). 2019 Nov 5;19(21):4819. doi: 10.3390/s19214819.
3
Classifying Heart Sounds Using Images of Motifs, MFCC and Temporal Features.基于图案图像、MFCC 和时频特征的心脏音分类。
基于均值教师法的心音图(PCG)杂音检测。
Sensors (Basel). 2024 Oct 15;24(20):6646. doi: 10.3390/s24206646.
4
Deep Learning in Heart Sound Analysis: From Techniques to Clinical Applications.心音分析中的深度学习:从技术到临床应用
Health Data Sci. 2024 Oct 9;4:0182. doi: 10.34133/hds.0182. eCollection 2024.
5
[Research on bark-frequency spectral coefficients heart sound classification algorithm based on multiple window time-frequency reassignment].基于多窗口时频重分配的树皮频率谱系数心音分类算法研究
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Feb 25;41(1):51-59. doi: 10.7507/1001-5515.202212037.
J Med Syst. 2019 May 6;43(6):168. doi: 10.1007/s10916-019-1286-5.
4
A study of time-frequency features for CNN-based automatic heart sound classification for pathology detection.基于 CNN 的自动心音分类的时频特征研究用于病理检测。
Comput Biol Med. 2018 Sep 1;100:132-143. doi: 10.1016/j.compbiomed.2018.06.026. Epub 2018 Jun 25.
5
Recognition of normal-abnormal phonocardiographic signals using deep convolutional neural networks and mel-frequency spectral coefficients.使用深度卷积神经网络和梅尔频率频谱系数识别正常和异常心音信号。
Physiol Meas. 2017 Jul 31;38(8):1671-1684. doi: 10.1088/1361-6579/aa7841.
6
Combining sparse coding and time-domain features for heart sound classification.基于稀疏编码和时域特征的心音分类。
Physiol Meas. 2017 Jul 31;38(8):1701-1713. doi: 10.1088/1361-6579/aa7623.
7
Automated Diagnosis of Heart Sounds Using Rule-Based Classification Tree.基于规则的分类树的心音自动诊断
J Med Syst. 2017 Apr;41(4):60. doi: 10.1007/s10916-017-0704-9. Epub 2017 Mar 1.
8
An open access database for the evaluation of heart sound algorithms.一个用于评估心音算法的开放获取数据库。
Physiol Meas. 2016 Dec;37(12):2181-2213. doi: 10.1088/0967-3334/37/12/2181. Epub 2016 Nov 21.
9
Automated Identification of Innocent Still's Murmur in Children.儿童无害性斯蒂尔杂音的自动识别
IEEE Trans Biomed Eng. 2017 Jun;64(6):1326-1334. doi: 10.1109/TBME.2016.2603787. Epub 2016 Aug 26.
10
Additive white Gaussian noise level estimation in SVD domain for images.在 SVD 域中对图像进行加性高斯白噪声水平估计。
IEEE Trans Image Process. 2013 Mar;22(3):872-83. doi: 10.1109/TIP.2012.2219544. Epub 2012 Sep 18.