探索ChatGPT在多个领域增强机器翻译后编辑方面的潜力：挑战与机遇。

Exploring ChatGPT's potential for augmenting post-editing in machine translation across multiple domains: challenges and opportunities.

作者信息

Algaraady Jeehaan, Mahyoob Mohammad

机构信息

Department of Languages and Translation, Taiz University, Taiz, Yemen.

Department of Languages and Translation, Taibah University, Madina, Saudi Arabia.

出版信息

Front Artif Intell. 2025 May 1;8:1526293. doi: 10.3389/frai.2025.1526293. eCollection 2025.

DOI:10.3389/frai.2025.1526293

PMID:40376281

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12078335/

Abstract

INTRODUCTION

Post-editing plays a crucial role in enhancing the quality of machine-generated translation (MGT) by correcting errors and ensuring cohesion and coherence. With advancements in artificial intelligence, Large Language Models (LLMs) like ChatGPT-4o offer promising capabilities for post-editing tasks. This study investigates the effectiveness of ChatGPT-4o as a natural language processing tool in post-editing Arabic translations across various domains, aiming to evaluate its performance in improving productivity, accuracy, consistency, and overall translation quality.

METHODS

The study involved a comparative analysis of Arabic translations generated by Google Translate. These texts, drawn from multiple domains, were post-edited by two professional human translators and ChatGPT-4o. Subsequently, three additional professional human post-editors evaluated both sets of post-edited outputs. To statistically assess the differences in quality between humans and ChatGPT-4o post-edits, a paired -test was employed, focusing on metrics such as fluency, accuracy, coherence, and efficiency.

RESULTS

The findings indicated that human post-editors outperformed ChatGPT-4o in most quality metrics. However, ChatGPT-4o demonstrated superior efficiency, yielding a positive -statistic of 8.00 and a -value of 0.015, indicating a statistically significant difference. Regarding fluency, no significant difference was observed between the two methods (-statistic = -3.5, -value = 0.074), suggesting comparable performance in ensuring the natural flow of text.

DISCUSSION

ChatGPT-4o showed competitive performance in English-to-Arabic post-editing, particularly in producing fluent, coherent, and stylistically consistent text. Its conversational design enables efficient and consistent editing across various domains. Nonetheless, the model faced challenges in handling grammatical and syntactic nuances, domain-specific idioms, and complex terminology, especially in medical and sports contexts. Overall, the study highlights the potential of ChatGPT-4o as a supportive tool in translation post-editing workflows, complementing human translators by enhancing productivity and maintaining acceptable quality standards.

摘要

引言

译后编辑在通过纠正错误以及确保衔接与连贯来提高机器生成翻译（MGT）的质量方面发挥着关键作用。随着人工智能的发展，像ChatGPT-4o这样的大语言模型在译后编辑任务中展现出了颇具前景的能力。本研究调查了ChatGPT-4o作为一种自然语言处理工具在跨领域译后编辑阿拉伯语翻译中的有效性，旨在评估其在提高生产力、准确性、一致性和整体翻译质量方面的表现。

方法

该研究涉及对谷歌翻译生成的阿拉伯语翻译进行对比分析。这些来自多个领域的文本由两名专业人工翻译人员和ChatGPT-4o进行译后编辑。随后，另外三名专业人工译后编辑人员对两组译后编辑输出进行了评估。为了从统计学角度评估人工和ChatGPT-4o译后编辑在质量上的差异，采用了配对检验，重点关注流畅性、准确性、连贯性和效率等指标。

结果

研究结果表明，在大多数质量指标上，人工译后编辑人员的表现优于ChatGPT-4o。然而，ChatGPT-4o展现出了更高的效率，其t统计量为8.00，p值为0.015，表明存在统计学上的显著差异。在流畅性方面，两种方法之间未观察到显著差异（t统计量 = -3.5，p值 = 0.074），这表明在确保文本自然流畅方面表现相当。

讨论

ChatGPT-4o在英阿译后编辑中表现出了具有竞争力的性能，尤其是在生成流畅、连贯且文体一致的文本方面。其对话式设计能够在各个领域实现高效且一致的编辑。尽管如此，该模型在处理语法和句法细微差别、特定领域习语以及复杂术语方面面临挑战，尤其是在医学和体育领域。总体而言，该研究凸显了ChatGPT-4o作为翻译译后编辑工作流程中的一种辅助工具的潜力，通过提高生产力和维持可接受的质量标准来补充人工翻译人员的工作。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

探索ChatGPT在多个领域增强机器翻译后编辑方面的潜力：挑战与机遇。

Exploring ChatGPT's potential for augmenting post-editing in machine translation across multiple domains: challenges and opportunities.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

DISCUSSION

引言

方法

结果

讨论

相似文献

探索ChatGPT在多个领域增强机器翻译后编辑方面的潜力：挑战与机遇。

Exploring ChatGPT's potential for augmenting post-editing in machine translation across multiple domains: challenges and opportunities.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

DISCUSSION

引言

方法

结果

讨论

相似文献