• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

神经自动后编辑路线图:一种实证方法。

A roadmap to neural automatic post-editing: an empirical approach.

作者信息

Shterionov Dimitar, Carmo Félix do, Moorkens Joss, Hossari Murhaf, Wagner Joachim, Paquin Eric, Schmidtke Dag, Groves Declan, Way Andy

机构信息

Department of Cognitive Science and Artificial Intelligence, Tilburg University, Tilburg, The Netherlands.

ADAPT Centre, School of Computing, Dublin City University, Dublin, Ireland.

出版信息

Mach Transl. 2020;34(2):67-96. doi: 10.1007/s10590-020-09249-7. Epub 2020 Sep 3.

DOI:10.1007/s10590-020-09249-7
PMID:33012986
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7501121/
Abstract

In a translation workflow, machine translation (MT) is almost always followed by a human post-editing step, where the raw MT output is corrected to meet required quality standards. To reduce the number of errors human translators need to correct, automatic post-editing (APE) methods have been developed and deployed in such workflows. With the advances in deep learning, neural APE (NPE) systems have outranked more traditional, statistical, ones. However, the plethora of options, variables and settings, as well as the relation between NPE performance and train/test data makes it difficult to select the most suitable approach for a given use case. In this article, we systematically analyse these different parameters with respect to NPE performance. We build an NPE "roadmap" to trace the different decision points and train a set of systems selecting different options through the roadmap. We also propose a novel approach for APE with data augmentation. We then analyse the performance of 15 of these systems and identify the best ones. In fact, the best systems are the ones that follow the newly-proposed method. The work presented in this article follows from a collaborative project between Microsoft and the ADAPT centre. The data provided by Microsoft originates from phrase-based statistical MT (PBSMT) systems employed in production. All tested NPE systems significantly increase the translation quality, proving the effectiveness of neural post-editing in the context of a commercial translation workflow that leverages PBSMT.

摘要

在翻译工作流程中,机器翻译(MT)之后几乎总会有一个人工后编辑步骤,即将原始的机器翻译输出进行校正,以达到所需的质量标准。为了减少人工翻译需要校正的错误数量,自动后编辑(APE)方法已被开发并应用于此类工作流程中。随着深度学习的发展,神经自动后编辑(NPE)系统已经超越了更为传统的统计型系统。然而,大量的选项、变量和设置,以及NPE性能与训练/测试数据之间的关系,使得为特定用例选择最合适的方法变得困难。在本文中,我们针对NPE性能系统地分析了这些不同参数。我们构建了一个NPE“路线图”来追踪不同的决策点,并通过该路线图训练一组选择不同选项的系统。我们还提出了一种带有数据增强的新型APE方法。然后,我们分析了其中15个系统的性能,并找出了最佳系统。事实上,最佳系统是那些采用新提出方法的系统。本文所介绍的工作源自微软与ADAPT中心的一个合作项目。微软提供的数据源自生产中使用的基于短语的统计机器翻译(PBSMT)系统。所有经过测试的NPE系统都显著提高了翻译质量,证明了神经后编辑在利用PBSMT的商业翻译工作流程中的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60be/7501121/4372a1c9f232/10590_2020_9249_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60be/7501121/653b67026272/10590_2020_9249_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60be/7501121/4372a1c9f232/10590_2020_9249_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60be/7501121/653b67026272/10590_2020_9249_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60be/7501121/4372a1c9f232/10590_2020_9249_Fig2_HTML.jpg

相似文献

1
A roadmap to neural automatic post-editing: an empirical approach.神经自动后编辑路线图:一种实证方法。
Mach Transl. 2020;34(2):67-96. doi: 10.1007/s10590-020-09249-7. Epub 2020 Sep 3.
2
Identifying the Machine Translation Error Types with the Greatest Impact on Post-editing Effort.识别对译后编辑工作影响最大的机器翻译错误类型。
Front Psychol. 2017 Aug 2;8:1282. doi: 10.3389/fpsyg.2017.01282. eCollection 2017.
3
A review of the state-of-the-art in automatic post-editing.自动后编辑技术的最新进展综述。
Mach Transl. 2021;35(2):101-143. doi: 10.1007/s10590-020-09252-y. Epub 2020 Dec 24.
4
Development of machine translation technology for assisting health communication: A systematic review.机器翻译技术在辅助健康传播中的发展:系统综述。
J Biomed Inform. 2018 Sep;85:56-67. doi: 10.1016/j.jbi.2018.07.018. Epub 2018 Jul 19.
5
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
6
Statistical-based system combination approach to gain advantages over different machine translation systems.基于统计的系统组合方法,以获得优于不同机器翻译系统的优势。
Heliyon. 2019 Sep 30;5(9):e02504. doi: 10.1016/j.heliyon.2019.e02504. eCollection 2019 Sep.
7
The use of automation in the rendition of certain articles of the Saudi Commercial Law into English: a post-editing-based comparison of five machine translation systems.将《沙特商业法》某些条款翻译成英文时自动化工具的应用:基于译后编辑的五种机器翻译系统比较
Front Artif Intell. 2024 Jan 12;6:1282020. doi: 10.3389/frai.2023.1282020. eCollection 2023.
8
Adaptation of machine translation for multilingual information retrieval in the medical domain.医学领域中用于多语言信息检索的机器翻译适配
Artif Intell Med. 2014 Jul;61(3):165-85. doi: 10.1016/j.artmed.2014.01.004. Epub 2014 Feb 5.
9
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
10
The role of automated evaluation techniques in online professional translator training.自动化评估技术在在线专业翻译人员培训中的作用。
PeerJ Comput Sci. 2021 Oct 4;7:e706. doi: 10.7717/peerj-cs.706. eCollection 2021.

引用本文的文献

1
A review of the state-of-the-art in automatic post-editing.自动后编辑技术的最新进展综述。
Mach Transl. 2021;35(2):101-143. doi: 10.1007/s10590-020-09252-y. Epub 2020 Dec 24.

本文引用的文献

1
A review of the state-of-the-art in automatic post-editing.自动后编辑技术的最新进展综述。
Mach Transl. 2021;35(2):101-143. doi: 10.1007/s10590-020-09252-y. Epub 2020 Dec 24.