• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过跨模态学习与同步优化提升英语口译水平

Enhancing english oral translation through cross-modal learning and synchronous optimization.

作者信息

Wang Yan

机构信息

Anhui Wenda University of Information Engineering, Hefei, Anhui, China.

出版信息

PLoS One. 2025 Aug 18;20(8):e0329381. doi: 10.1371/journal.pone.0329381. eCollection 2025.

DOI:10.1371/journal.pone.0329381
PMID:40824960
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12360554/
Abstract

Oral translation in English serves as a critical conduit for international communication and cultural exchange. However, the prevalent variations in pronunciation and the rapid pace of spoken language currently impede the efficacy of synchronous translation methods. To improve the quality and efficiency of synchronous oral translation, this paper explores the integration of cross-modal semantic understanding and synchronous enhancement specifically for English oral translation. This exploration commences with the implementation of a cross-modal translation scenario. Subsequently, the text sequence derived from this process is amalgamated with the original speech features via Bidirectional Encoder Representations from Transformers (BERT). The cross-information between modalities is explored, and linear transformation optimization is performed on the self-attention mechanism in Transformer to achieve context-awareness and understanding of oral-transcribed text. In conclusion, the integration of dynamic time warping (DTW) enhances real-time synchronization between speech and text, thereby improving translation fluency. Experimental results reveal that, when compared to the existing bilingual attention neural machine translation (NMT) model and the context-aware NMT model, the model proposed in this study yields an average bilingual evaluation understudy (BLEU) score that is 9.3% and 26.9% higher, respectively. Furthermore, its synchronization speed surpasses that of the other two models by 17.9% and 16.8%, respectively. These findings suggest that the fusion model, which incorporates context-awareness and an attention mechanism in cross-modal translation, can significantly elevate the quality and efficiency of English oral translation, offering a novel approach to the synchronous translation of spoken English.

摘要

英语口译是国际交流和文化交流的重要渠道。然而,目前普遍存在的发音差异和口语的快速节奏阻碍了同步翻译方法的效果。为了提高同步口译的质量和效率,本文专门针对英语口译探索跨模态语义理解与同步增强的整合。这种探索从实施跨模态翻译场景开始。随后,通过来自变换器的双向编码器表示(BERT)将此过程中得到的文本序列与原始语音特征合并。探索模态间的交叉信息,并对Transformer中的自注意力机制进行线性变换优化,以实现对口译转录文本的上下文感知和理解。总之,动态时间规整(DTW)的整合增强了语音与文本之间的实时同步,从而提高了翻译流畅性。实验结果表明,与现有的双语注意力神经机器翻译(NMT)模型和上下文感知NMT模型相比,本研究提出的模型的平均双语评估替补(BLEU)得分分别高出9.3%和26.9%。此外,其同步速度分别比其他两个模型快17.9%和16.8%。这些发现表明,在跨模态翻译中融入上下文感知和注意力机制的融合模型可以显著提高英语口译的质量和效率,为英语口语的同步翻译提供了一种新方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/e6a3dd538c1d/pone.0329381.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/eacb794ef04c/pone.0329381.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/1226645cdc23/pone.0329381.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/7dcc7860e15e/pone.0329381.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/73278505a8d7/pone.0329381.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/9843dc6cc136/pone.0329381.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/995300d008d7/pone.0329381.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/a7c846b8884c/pone.0329381.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/9ad8ab6f9ab8/pone.0329381.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/be5685347e3d/pone.0329381.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/9ea69be6ca3a/pone.0329381.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/66d6155fcc5e/pone.0329381.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/e6a3dd538c1d/pone.0329381.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/eacb794ef04c/pone.0329381.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/1226645cdc23/pone.0329381.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/7dcc7860e15e/pone.0329381.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/73278505a8d7/pone.0329381.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/9843dc6cc136/pone.0329381.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/995300d008d7/pone.0329381.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/a7c846b8884c/pone.0329381.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/9ad8ab6f9ab8/pone.0329381.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/be5685347e3d/pone.0329381.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/9ea69be6ca3a/pone.0329381.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/66d6155fcc5e/pone.0329381.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5f1/12360554/e6a3dd538c1d/pone.0329381.g012.jpg

相似文献

1
Enhancing english oral translation through cross-modal learning and synchronous optimization.通过跨模态学习与同步优化提升英语口译水平
PLoS One. 2025 Aug 18;20(8):e0329381. doi: 10.1371/journal.pone.0329381. eCollection 2025.
2
Cognitive decline assessment using semantic linguistic content and transformer deep learning architecture.使用语义语言内容和变压器深度学习架构评估认知能力下降。
Int J Lang Commun Disord. 2024 May-Jun;59(3):1110-1127. doi: 10.1111/1460-6984.12973. Epub 2023 Nov 16.
3
Short-Term Memory Impairment短期记忆障碍
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Text intelligent correction in English translation: A study on integrating models with dependency attention mechanism.英文翻译中的文本智能校正:一项关于集成具有依存注意力机制模型的研究。
PLoS One. 2025 Jun 24;20(6):e0319690. doi: 10.1371/journal.pone.0319690. eCollection 2025.
6
The intelligent evaluation model of the English humanistic landscape in agricultural industrial parks by the SPEAKING model: From the perspective of fish-vegetable symbiosis in new agriculture.基于SPEAKING模型的农业产业园区英文人文景观智能评价模型:以新型农业中的鱼菜共生为视角
PLoS One. 2025 Jul 16;20(7):e0325332. doi: 10.1371/journal.pone.0325332. eCollection 2025.
7
Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标:模型开发与评估研究
JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.
8
Age-related differences in the interplay of fluency and complexity in Chinese-speaking seniors' oral narratives.老年人汉语口语叙事流畅性与复杂性的交互作用随年龄的变化。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1672-1690. doi: 10.1111/1460-6984.13023. Epub 2024 Feb 26.
9
The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.儿科言语和语言治疗师转写语音样本的音标转录的一致性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.
10
Analysis of the fusion of multimodal sentiment perception and physiological signals in Chinese-English cross-cultural communication: Transformer approach incorporating self-attention enhancement.汉英跨文化交际中多模态情感感知与生理信号融合分析:融入自注意力增强的Transformer方法
PeerJ Comput Sci. 2025 May 23;11:e2890. doi: 10.7717/peerj-cs.2890. eCollection 2025.

本文引用的文献

1
Attention Mechanisms and Their Applications to Complex Systems.注意力机制及其在复杂系统中的应用。
Entropy (Basel). 2021 Feb 26;23(3):283. doi: 10.3390/e23030283.
2
Attention in Natural Language Processing.自然语言处理中的注意力机制。
IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4291-4308. doi: 10.1109/TNNLS.2020.3019893. Epub 2021 Oct 5.