Transformer-based CNNs: Mining Temporal Context Information for Multi-sound COVID-19 Diagnosis.

Publication information

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2335-2338. doi: 10.1109/EMBC46164.2021.9629552.

DOI: 10.1109/EMBC46164.2021.9629552
PMID: 34891751
Abstract

Due to the COronaVIrus Disease 2019 (COVID-19) pandemic, early screening of COVID-19 is essential to prevent its transmission. Detecting COVID-19 with computer audition techniques has in recent studies shown the potential to achieve a fast, cheap, and ecologically friendly diagnosis. Respiratory sounds and speech may contain rich and complementary information about COVID-19 clinical conditions. Therefore, we propose training three deep neural networks on three types of sounds (breathing/counting/vowel) and assembling these models to improve the performance. More specifically, we employ Convolutional Neural Networks (CNNs) to extract spatial representations from log Mel spectrograms and a multi-head attention mechanism in the transformer to mine temporal context information from the CNNs' outputs. The experimental results demonstrate that the transformer-based CNNs can effectively detect COVID-19 on the DiCOVA Track-2 database (AUC: 70.0%) and outperform simple CNNs and hybrid CNN-RNNs.
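
The abstract does not say how the three per-sound models are combined or how the AUC is computed. The sketch below assumes simple late fusion (averaging each model's predicted COVID-19 probability per subject) and the standard rank-based definition of AUC. The function names `fuse_scores` and `roc_auc` and all scores are hypothetical, made-up illustrations, not values or code from the paper.

```python
from statistics import mean


def fuse_scores(per_model_scores):
    """Average the probabilities that each model assigns to the same subjects.

    Assumption: late fusion by unweighted mean; the paper may combine models differently.
    """
    return [mean(subject_scores) for subject_scores in zip(*per_model_scores)]


def roc_auc(labels, scores):
    """AUC as the chance that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Toy, made-up scores for four subjects (1 = COVID-19 positive, 0 = negative).
labels = [1, 0, 1, 0]
breathing = [0.9, 0.4, 0.3, 0.2]
counting = [0.6, 0.65, 0.7, 0.1]
vowel = [0.7, 0.2, 0.4, 0.6]

fused = fuse_scores([breathing, counting, vowel])

for name, scores in [("breathing", breathing), ("counting", counting),
                     ("vowel", vowel), ("fused", fused)]:
    print(f"{name}: AUC = {roc_auc(labels, scores):.2f}")
```

In this toy data each single-sound model misranks a different subject, so averaging recovers a better ranking than any one model alone. That is the intuition behind combining complementary sound types, though real gains depend on the data.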

Similar articles

1. Transformer-based CNNs: Mining Temporal Context Information for Multi-sound COVID-19 Diagnosis.
   Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2335-2338. doi: 10.1109/EMBC46164.2021.9629552.
2. A hybrid model based on neural networks for biomedical relation extraction.
   J Biomed Inform. 2018 May;81:83-92. doi: 10.1016/j.jbi.2018.03.011. Epub 2018 Mar 27.
3. Deep Learning Algorithm for COVID-19 Classification Using Chest X-Ray Images.
   Comput Math Methods Med. 2021 Nov 9;2021:9269173. doi: 10.1155/2021/9269173. eCollection 2021.
4. Speech Emotion Recognition Using Convolution Neural Networks and Multi-Head Convolutional Transformer.
   Sensors (Basel). 2023 Jul 7;23(13):6212. doi: 10.3390/s23136212.
5. Triplet Loss-Based Models for COVID-19 Detection from Vocal Sounds.
   Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:998-1001. doi: 10.1109/EMBC48229.2022.9871125.
6. Fruit-CoV: An efficient vision-based framework for speedy detection and diagnosis of SARS-CoV-2 infections through recorded cough sounds.
   Expert Syst Appl. 2023 Mar 1;213:119212. doi: 10.1016/j.eswa.2022.119212. Epub 2022 Nov 7.
7. Capturing Time Dynamics From Speech Using Neural Networks for Surgical Mask Detection.
   IEEE J Biomed Health Inform. 2022 Aug;26(8):4291-4302. doi: 10.1109/JBHI.2022.3173128. Epub 2022 Aug 11.
8. CovNet: A Transfer Learning Framework for Automatic COVID-19 Detection From Crowd-Sourced Cough Sounds.
   Front Digit Health. 2022 Jan 3;3:799067. doi: 10.3389/fdgth.2021.799067. eCollection 2021.
9. Vision Transformer-based recognition of diabetic retinopathy grade.
   Med Phys. 2021 Dec;48(12):7850-7863. doi: 10.1002/mp.15312. Epub 2021 Nov 16.
10. Towards robust diagnosis of COVID-19 using vision self-attention transformer.
    Sci Rep. 2022 May 26;12(1):8922. doi: 10.1038/s41598-022-13039-x.

Cited by

1. Exploring machine learning for audio-based respiratory condition screening: A concise review of databases, methods, and open issues.
   Exp Biol Med (Maywood). 2022 Nov;247(22):2053-2061. doi: 10.1177/15353702221115428. Epub 2022 Aug 16.