• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RAFF-Net:一种基于残差注意力网络和多尺度特征融合的改进型舌部分割算法。

RAFF-Net: An improved tongue segmentation algorithm based on residual attention network and multiscale feature fusion.

作者信息

Song Haibei, Huang Zonghai, Feng Li, Zhong Yanmei, Wen Chuanbiao, Guo Jinhong

机构信息

School of Intelligent Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, China.

School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China.

出版信息

Digit Health. 2022 Nov 3;8:20552076221136362. doi: 10.1177/20552076221136362. eCollection 2022 Jan-Dec.

DOI:10.1177/20552076221136362
PMID:36339902
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9634193/
Abstract

OBJECTIVE

Due to the complexity of face images, tongue segmentation is susceptible to interference from uneven tongue texture, lips and face, resulting in traditional methods failing to segment the tongue accurately. To address this problem, RAFF-Net, an automatic tongue region segmentation network based on residual attention network and multiscale feature fusion, was proposed. It aims to improve tongue segmentation accuracy and achieve end-to-end automated segmentation.

METHODS

Based on the UNet backbone network, different numbers of ResBlocks combined with the Squeeze-and-Excitation (SE) block was used as an encoder to extract image layered features. The decoder structure of UNet was simplified and the number of parameters of the network model was reduced. Meanwhile, the multiscale feature fusion module was designed to optimize the network parameters by combining a custom loss function instead of the common cross-entropy loss function to further improve the detection accuracy.

RESULTS

The RAFF-Net network structure achieved Mean Intersection over Union (MIoU) and F1-score of 97.85% and 97.73%, respectively, which improved 0.56% and 0.46%, respectively, compared with the original UNet; ablation experiments demonstrated that the improved algorithm could contribute to the enhancement of tongue segmentation effect.

CONCLUSION

This study combined the residual attention network with multiscale feature fusion to effectively improve the segmentation accuracy of the tongue region, and optimized the input and output of the UNet network using different numbers of ResBlocks, SE block, multiscale feature fusion and weighted loss function, increased the stability of the network and improved the overall effect of the network.

摘要

目的

由于面部图像的复杂性,舌部分割容易受到不均匀舌纹理、嘴唇和面部的干扰,导致传统方法无法准确分割舌头。为了解决这个问题,提出了基于残差注意力网络和多尺度特征融合的自动舌区分割网络RAFF-Net。其目的是提高舌部分割精度并实现端到端自动分割。

方法

基于UNet骨干网络,使用不同数量的与挤压激励(SE)块相结合的残差块作为编码器来提取图像分层特征。简化了UNet的解码器结构,减少了网络模型的参数数量。同时,设计了多尺度特征融合模块,通过结合自定义损失函数而非常见的交叉熵损失函数来优化网络参数,以进一步提高检测精度。

结果

RAFF-Net网络结构的平均交并比(MIoU)和F1分数分别达到97.85%和97.73%,与原始UNet相比分别提高了0.56%和0.46%;消融实验表明改进算法有助于增强舌部分割效果。

结论

本研究将残差注意力网络与多尺度特征融合相结合,有效提高了舌区的分割精度,并使用不同数量的残差块、SE块、多尺度特征融合和加权损失函数对UNet网络的输入和输出进行优化,增加了网络的稳定性,提高了网络的整体效果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/c410c5ff492a/10.1177_20552076221136362-fig13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/7ac132932a56/10.1177_20552076221136362-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/93418be75488/10.1177_20552076221136362-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/7107346e2c4c/10.1177_20552076221136362-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/1c272988115b/10.1177_20552076221136362-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/b45bf33e379e/10.1177_20552076221136362-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/83a3131b578e/10.1177_20552076221136362-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/55c36fa63ef3/10.1177_20552076221136362-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/169274ffe7a9/10.1177_20552076221136362-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/1c1f781a10ce/10.1177_20552076221136362-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/5810ef871ed4/10.1177_20552076221136362-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/b8657648054e/10.1177_20552076221136362-fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/8de89b203adc/10.1177_20552076221136362-fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/c410c5ff492a/10.1177_20552076221136362-fig13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/7ac132932a56/10.1177_20552076221136362-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/93418be75488/10.1177_20552076221136362-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/7107346e2c4c/10.1177_20552076221136362-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/1c272988115b/10.1177_20552076221136362-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/b45bf33e379e/10.1177_20552076221136362-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/83a3131b578e/10.1177_20552076221136362-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/55c36fa63ef3/10.1177_20552076221136362-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/169274ffe7a9/10.1177_20552076221136362-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/1c1f781a10ce/10.1177_20552076221136362-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/5810ef871ed4/10.1177_20552076221136362-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/b8657648054e/10.1177_20552076221136362-fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/8de89b203adc/10.1177_20552076221136362-fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d79/9634193/c410c5ff492a/10.1177_20552076221136362-fig13.jpg

相似文献

1
RAFF-Net: An improved tongue segmentation algorithm based on residual attention network and multiscale feature fusion.RAFF-Net:一种基于残差注意力网络和多尺度特征融合的改进型舌部分割算法。
Digit Health. 2022 Nov 3;8:20552076221136362. doi: 10.1177/20552076221136362. eCollection 2022 Jan-Dec.
2
RTC_TongueNet: An improved tongue image segmentation model based on DeepLabV3.RTC_TongueNet:一种基于DeepLabV3的改进型舌图像分割模型。
Digit Health. 2024 Mar 28;10:20552076241242773. doi: 10.1177/20552076241242773. eCollection 2024 Jan-Dec.
3
Multiple myeloma segmentation net (MMNet): an encoder-decoder-based deep multiscale feature fusion model for multiple myeloma segmentation in magnetic resonance imaging.多发性骨髓瘤分割网络(MMNet):一种基于编码器-解码器的深度多尺度特征融合模型,用于磁共振成像中的多发性骨髓瘤分割。
Quant Imaging Med Surg. 2024 Oct 1;14(10):7176-7199. doi: 10.21037/qims-24-683. Epub 2024 Sep 24.
4
FMD-UNet: fine-grained feature squeeze and multiscale cascade dilated semantic aggregation dual-decoder UNet for COVID-19 lung infection segmentation from CT images.FMD-UNet:用于从 CT 图像中 COVID-19 肺部感染分割的细粒度特征挤压和多尺度级联扩张语义聚合双解码器 UNet。
Biomed Phys Eng Express. 2024 Aug 27;10(5). doi: 10.1088/2057-1976/ad6f12.
5
RAD-UNet: Research on an improved lung nodule semantic segmentation algorithm based on deep learning.RAD-UNet:基于深度学习的改进型肺结节语义分割算法研究
Front Oncol. 2023 Mar 23;13:1084096. doi: 10.3389/fonc.2023.1084096. eCollection 2023.
6
SegR-Net: A deep learning framework with multi-scale feature fusion for robust retinal vessel segmentation.SegR-Net:一种具有多尺度特征融合的深度学习框架,用于稳健的视网膜血管分割。
Comput Biol Med. 2023 Sep;163:107132. doi: 10.1016/j.compbiomed.2023.107132. Epub 2023 Jun 10.
7
SK-Unet++: An improved Unet++ network with adaptive receptive fields for automatic segmentation of ultrasound thyroid nodule images.SK-Unet++:一种具有自适应感受野的改进型Unet++网络,用于超声甲状腺结节图像的自动分割。
Med Phys. 2024 Mar;51(3):1798-1811. doi: 10.1002/mp.16672. Epub 2023 Aug 22.
8
[Fully Automatic Glioma Segmentation Algorithm of Magnetic Resonance Imaging Based on 3D-UNet With More Global Contextual Feature Extraction: An Improvement on Insufficient Extraction of Global Features].基于具有更多全局上下文特征提取的3D-UNet的磁共振成像全自动胶质瘤分割算法:对全局特征提取不足的改进
Sichuan Da Xue Xue Bao Yi Xue Ban. 2024 Mar 20;55(2):447-454. doi: 10.12182/20240360208.
9
DFA-UNet: dual-stream feature-fusion attention U-Net for lymph node segmentation in lung cancer diagnosis.DFA-UNet:用于肺癌诊断中淋巴结分割的双流特征融合注意力U-Net
Front Neurosci. 2024 Jul 15;18:1448294. doi: 10.3389/fnins.2024.1448294. eCollection 2024.
10
Medical lesion segmentation by combining multimodal images with modality weighted UNet.基于模态加权 UNet 融合多模态图像的医学病灶分割。
Med Phys. 2022 Jun;49(6):3692-3704. doi: 10.1002/mp.15610. Epub 2022 Apr 7.

引用本文的文献

1
TonguExpert: A Deep Learning-Based Algorithm Platform for Fine-Grained Extraction and Classification of Tongue Phenotypes.舌象专家:基于深度学习的舌象细粒度提取与分类算法平台
Phenomics. 2025 Mar 26;5(2):109-122. doi: 10.1007/s43657-024-00210-9. eCollection 2025 Apr.
2
Tongue shape classification based on IF-RCNet.基于IF-RCNet的舌形分类
Sci Rep. 2025 Mar 1;15(1):7301. doi: 10.1038/s41598-025-91823-1.
3
TUMamba: A novel tongue segment methods based on Mamba and U-Net.TUMamba:一种基于曼巴和U-Net的新型舌段分割方法。

本文引用的文献

1
Weakly Supervised Deep Learning for Tooth-Marked Tongue Recognition.用于齿痕舌识别的弱监督深度学习
Front Physiol. 2022 Apr 12;13:847267. doi: 10.3389/fphys.2022.847267. eCollection 2022.
2
LSM-SEC: Tongue Segmentation by the Level Set Model with Symmetry and Edge Constraints.LSM-SEC:基于水平集模型的对称性和边缘约束的舌分割。
Comput Intell Neurosci. 2021 Jul 29;2021:6370526. doi: 10.1155/2021/6370526. eCollection 2021.
3
Sharp U-Net: Depthwise convolutional network for biomedical image segmentation.Sharp U-Net:用于生物医学图像分割的深度卷积网络。
Digit Health. 2024 Oct 14;10:20552076241289007. doi: 10.1177/20552076241289007. eCollection 2024 Jan-Dec.
4
S5Utis: Structured State-Space Sequence SegNeXt UNet-like Tongue Image Segmentation in Traditional Chinese Medicine.S5Utis:基于结构状态空间序列的 SegNeXt UNet 样中医舌象分割。
Sensors (Basel). 2024 Jun 21;24(13):4046. doi: 10.3390/s24134046.
5
RTC_TongueNet: An improved tongue image segmentation model based on DeepLabV3.RTC_TongueNet:一种基于DeepLabV3的改进型舌图像分割模型。
Digit Health. 2024 Mar 28;10:20552076241242773. doi: 10.1177/20552076241242773. eCollection 2024 Jan-Dec.
Comput Biol Med. 2021 Sep;136:104699. doi: 10.1016/j.compbiomed.2021.104699. Epub 2021 Jul 29.
4
An enhanced loss function simplifies the deep learning model for characterizing the 3D organoid models.增强的损失函数简化了用于描述 3D 类器官模型的深度学习模型。
Bioinformatics. 2021 Sep 29;37(18):3084-3085. doi: 10.1093/bioinformatics/btab120.
5
Anam-Net: Anamorphic Depth Embedding-Based Lightweight CNN for Segmentation of Anomalies in COVID-19 Chest CT Images.Anam-Net:基于变形深度嵌入的轻量级 COVID-19 胸部 CT 图像异常分割 CNN
IEEE Trans Neural Netw Learn Syst. 2021 Mar;32(3):932-946. doi: 10.1109/TNNLS.2021.3054746. Epub 2021 Mar 1.
6
TISNet-Enhanced Fully Convolutional Network with Encoder-Decoder Structure for Tongue Image Segmentation in Traditional Chinese Medicine.基于编解码器结构的 TISNet 增强全卷积网络在中医舌象分割中的应用。
Comput Math Methods Med. 2020 Aug 7;2020:6029258. doi: 10.1155/2020/6029258. eCollection 2020.
7
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network.MRA-Net:基于多模态关系注意力网络的视觉问答任务改进。
IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):318-329. doi: 10.1109/TPAMI.2020.3004830. Epub 2021 Dec 7.
8
D-UNet: A Dimension-Fusion U Shape Network for Chronic Stroke Lesion Segmentation.D-UNet:一种用于慢性中风病灶分割的维度融合U型网络。
IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):940-950. doi: 10.1109/TCBB.2019.2939522. Epub 2021 Jun 3.
9
Fully Dense UNet for 2-D Sparse Photoacoustic Tomography Artifact Removal.二维稀疏光声断层成像伪影去除的全密集 UNet。
IEEE J Biomed Health Inform. 2020 Feb;24(2):568-576. doi: 10.1109/JBHI.2019.2912935. Epub 2019 Apr 23.