Yu Yansuo, Zhang Mingwu, Xie Zhennian, Liu Qiang
Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing, China.
Xiyuan Hospital, Chinese Academy of Traditional Chinese Medicine, Beijing, China.
PLoS One. 2024 Dec 31;19(12):e0311503. doi: 10.1371/journal.pone.0311503. eCollection 2024.
Bowel sounds, a reflection of the gastrointestinal tract's peristalsis, are essential for diagnosing and monitoring gastrointestinal conditions. However, the absence of an effective, non-invasive method for assessing digestion through auscultation has resulted in a reliance on time-consuming and laborious manual analysis by clinicians. This study introduces an innovative deep learning-based method designed to automate and enhance the recognition of bowel sounds. Our approach integrates the Branchformer architecture, which leverages the power of self-attention and convolutional gating for robust feature extraction, with a self-supervised pre-training strategy. Specifically, the Branchformer model employs parallel processing of self-attention and convolutional gated Multi-layer Perceptron branches to capture both global and local dependencies in audio signals, thereby enabling effective characterization of complex bowel sound patterns. Furthermore, a self-supervised pre-training strategy is employed, leveraging a large corpus of unlabeled audio data to learn general sound wave representations, followed by fine-tuning on a limited set of bowel sound data to optimize the model's recognition performance for specific tasks. Experimental results on public bowel sound datasets demonstrate the superior recognition performance of the proposed method compared to existing baseline models, particularly under data-limited conditions, thereby confirming the effectiveness of the self-supervised pre-training strategy. This work provides an efficient and automated solution for clinical bowel sound monitoring, facilitating early diagnosis and treatment of gastrointestinal disorders.
肠鸣音是胃肠道蠕动的反映,对于诊断和监测胃肠道疾病至关重要。然而,由于缺乏一种通过听诊评估消化的有效、非侵入性方法,临床医生不得不依赖耗时费力的手动分析。本研究引入了一种基于深度学习的创新方法,旨在实现肠鸣音识别的自动化并提高其准确性。我们的方法将Branchformer架构与自监督预训练策略相结合,该架构利用自注意力和卷积门控的力量进行强大的特征提取。具体而言,Branchformer模型采用自注意力和卷积门控多层感知器分支的并行处理,以捕捉音频信号中的全局和局部依赖性,从而能够有效地表征复杂的肠鸣音模式。此外,采用了自监督预训练策略,利用大量未标记的音频数据来学习一般的声波表示,然后在有限的肠鸣音数据集中进行微调,以优化模型在特定任务中的识别性能。在公共肠鸣音数据集上的实验结果表明,与现有的基线模型相比,该方法具有卓越的识别性能,特别是在数据有限的条件下,从而证实了自监督预训练策略的有效性。这项工作为临床肠鸣音监测提供了一种高效、自动化的解决方案,有助于胃肠道疾病的早期诊断和治疗。