• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种具有可解释适应性的改进型手语翻译模型,用于处理长手语句子。

An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences.

作者信息

Zheng Jiangbin, Zhao Zheng, Chen Min, Chen Jing, Wu Chong, Chen Yidong, Shi Xiaodong, Tong Yiqi

机构信息

Department of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, China.

China Mobile (Suzhou) Software Technology Co., LTD, Suzhou 215000, China.

出版信息

Comput Intell Neurosci. 2020 Oct 23;2020:8816125. doi: 10.1155/2020/8816125. eCollection 2020.

DOI:10.1155/2020/8816125
PMID:33163072
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7604584/
Abstract

(SLT) is an important application to bridge the communication gap between deaf and hearing people. In recent years, the research on the SLT based on neural translation frameworks has attracted wide attention. Despite the progress, current SLT research is still in the initial stage. In fact, current systems perform poorly in processing long sign sentences, which often involve long-distance dependencies and require large resource consumption. To tackle this problem, we propose two explainable adaptations to the traditional neural SLT models using optimized tokenization-related modules. First, we introduce a (FSDC) algorithm for detecting and reducing the redundant similar frames, which effectively shortens the long sign sentences without losing information. Then, we replace the traditional encoder in a (NMT) module with an improved architecture, which incorporates a (T-Conv) unit and a (DH-BiGRU) unit sequentially. The improved component takes the temporal tokenization information into consideration to extract deeper information with reasonable resource consumption. Our experiments on the dataset show that the proposed model outperforms the state-of-the-art baseline up to about 1.5+ BLEU-4 score gains.

摘要

手语翻译(SLT)是弥合聋人和听力正常人之间沟通鸿沟的一项重要应用。近年来,基于神经翻译框架的手语翻译研究受到了广泛关注。尽管取得了进展,但当前的手语翻译研究仍处于初始阶段。事实上,当前系统在处理长手语句子时表现不佳,长手语句子往往涉及长距离依赖关系且需要大量资源消耗。为解决这一问题,我们提出了两种使用优化的分词相关模块对传统神经手语翻译模型进行的可解释性改编。首先,我们引入一种用于检测和减少冗余相似帧的快速相似帧检测与压缩(FSDC)算法,该算法能有效缩短长手语句子且不丢失信息。然后,我们在神经机器翻译(NMT)模块中用一种改进架构替换传统编码器,该改进架构依次包含一个时间卷积(T-Conv)单元和一个双向门控循环单元(DH-BiGRU)。改进后的组件考虑了时间分词信息,以合理的资源消耗提取更深入的信息。我们在[具体数据集名称]数据集上的实验表明,所提出的模型比最先进的基线模型性能提升高达约1.5个以上的BLEU-4分数增益。

相似文献

1
An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences.一种具有可解释适应性的改进型手语翻译模型,用于处理长手语句子。
Comput Intell Neurosci. 2020 Oct 23;2020:8816125. doi: 10.1155/2020/8816125. eCollection 2020.
2
Ambient intelligence framework for real-time speech-to-sign translation.用于实时语音到手语翻译的环境智能框架。
Assist Technol. 2018;30(3):119-132. doi: 10.1080/10400435.2016.1268218. Epub 2017 Feb 2.
3
Biomechanical and perceptual constraints on the bandwidth requirements of sign language.手语带宽要求的生物力学和感知限制
IEEE Trans Neural Syst Rehabil Eng. 2004 Mar;12(1):65-72. doi: 10.1109/TNSRE.2003.821371.
4
Skeleton-based Chinese sign language recognition and generation for bidirectional communication between deaf and hearing people.基于骨架的中文手语识别与生成,实现聋听人群的双向交流。
Neural Netw. 2020 May;125:41-55. doi: 10.1016/j.neunet.2020.01.030. Epub 2020 Feb 6.
5
Text generation from Taiwanese Sign Language using a PST-based language model for augmentative communication.使用基于音素子序列转录(PST)的语言模型从台湾手语生成文本以辅助交流。
IEEE Trans Neural Syst Rehabil Eng. 2004 Dec;12(4):441-54. doi: 10.1109/TNSRE.2003.819930.
6
An Attention-Enhanced Multi-Scale and Dual Sign Language Recognition Network Based on a Graph Convolution Network.基于图卷积网络的注意力增强多尺度双通道手语识别网络。
Sensors (Basel). 2021 Feb 5;21(4):1120. doi: 10.3390/s21041120.
7
Synthetic Corpus Generation for Deep Learning-Based Translation of Spanish Sign Language.用于基于深度学习的西班牙语手语翻译的合成语料库生成
Sensors (Basel). 2024 Feb 24;24(5):1472. doi: 10.3390/s24051472.
8
Continuous Sign Language Recognition through a Context-Aware Generative Adversarial Network.基于上下文感知生成对抗网络的连续手语识别。
Sensors (Basel). 2021 Apr 1;21(7):2437. doi: 10.3390/s21072437.
9
Robust region of interest coding for improved sign language telecommunication.
IEEE Trans Inf Technol Biomed. 2002 Dec;6(4):310-6. doi: 10.1109/titb.2002.806094.
10
A Pyramid Semi-Autoregressive Transformer with Rich Semantics for Sign Language Production.用于手语生成的具有丰富语义的 Pyramid Semi-Autoregressive Transformer。
Sensors (Basel). 2022 Dec 8;22(24):9606. doi: 10.3390/s22249606.

引用本文的文献

1
Artificial Intelligence Technologies for Sign Language.手语的人工智能技术。
Sensors (Basel). 2021 Aug 30;21(17):5843. doi: 10.3390/s21175843.

本文引用的文献

1
Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos.基于多流 CNN-LSTM-HMM 的弱监督学习发现手语视频中的序列并行性。
IEEE Trans Pattern Anal Mach Intell. 2020 Sep;42(9):2306-2320. doi: 10.1109/TPAMI.2019.2911077. Epub 2019 Apr 15.
2
Hierarchical LSTMs with Adaptive Attention for Visual Captioning.基于自适应注意力机制的分层长短时记忆网络的视觉描述生成
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1112-1131. doi: 10.1109/TPAMI.2019.2894139. Epub 2019 Jan 21.
3
3D convolutional neural networks for human action recognition.
三维卷积神经网络的人体动作识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31. doi: 10.1109/TPAMI.2012.59.
4
Image quality assessment: from error visibility to structural similarity.图像质量评估:从误差可见性到结构相似性。
IEEE Trans Image Process. 2004 Apr;13(4):600-12. doi: 10.1109/tip.2003.819861.
5
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.