• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

沙特学习者翻译语料库:英语-阿拉伯语学习者翻译语料库的设计与编制。

Saudi Learner Translation Corpus: The design and compilation of an English-Arabic learner translation corpus.

机构信息

Department of Applied Linguistics, College of Languages, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

College of Computer and Information Sciences, Al-Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia.

出版信息

PLoS One. 2024 Oct 23;19(10):e0303729. doi: 10.1371/journal.pone.0303729. eCollection 2024.

DOI:10.1371/journal.pone.0303729
PMID:39441857
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11498738/
Abstract

This article introduces the Saudi Learner Translation Corpus (SauLTC), an innovative multi-version English-Arabic parallel corpus featuring part-of-speech tagging. We describe the corpus parameters and compilation process and explain how textual processing and sentence alignment are conducted. The participants include 366 student translators, 48 instructors, and 23 alignment verifiers. The corpus provides access to two target versions of every ST to allow the detection of the changes in the translation and revision processes from the initial to the final draft. The translations were collected over three years, yielding 5,160,386 tokens. The metadata of 23 sentence alignment verifiers were added to the analysis as a unique variable to investigate individual differences in the manual verification process. This unidirectional corpus can be used to identify student translators' strategies and errors in translation and analyze the efficacy of instructors' feedback. Furthermore, it is accessible via an application and a website. It provides translation teachers and researchers with a database that can help develop corpus-based and corpus-driven teaching materials.

摘要

本文介绍了沙特学习者翻译语料库(SauLTC),这是一个创新性的多语言英语-阿拉伯语平行语料库,具有词性标注功能。我们描述了语料库的参数和编译过程,并解释了如何进行文本处理和句子对齐。参与者包括 366 名学生翻译、48 名教师和 23 名对齐验证员。该语料库为每个源语文本(ST)提供了两个目标语版本,以便从初稿到终稿检测翻译和修订过程中的变化。翻译是在三年内收集的,共产生了 5160386 个标记。23 名句子对齐验证员的元数据被添加到分析中,作为一个独特的变量,以调查手动验证过程中的个体差异。这个单向语料库可用于识别学生翻译在翻译过程中的策略和错误,并分析教师反馈的效果。此外,它还可以通过一个应用程序和一个网站访问。它为翻译教师和研究人员提供了一个数据库,有助于开发基于语料库和语料库驱动的教材。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/7f6f5d6dc475/pone.0303729.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/25d90dfc79a9/pone.0303729.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/1270a671b759/pone.0303729.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/6d693a2a33e1/pone.0303729.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/f5931cb66b1e/pone.0303729.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/849e2f084d5b/pone.0303729.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/c8f80d7a2bda/pone.0303729.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/badcbb041108/pone.0303729.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/f1f995f765db/pone.0303729.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/96fd0953a941/pone.0303729.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/6e6e1501b7a2/pone.0303729.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/7f6f5d6dc475/pone.0303729.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/25d90dfc79a9/pone.0303729.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/1270a671b759/pone.0303729.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/6d693a2a33e1/pone.0303729.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/f5931cb66b1e/pone.0303729.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/849e2f084d5b/pone.0303729.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/c8f80d7a2bda/pone.0303729.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/badcbb041108/pone.0303729.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/f1f995f765db/pone.0303729.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/96fd0953a941/pone.0303729.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/6e6e1501b7a2/pone.0303729.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2376/11498738/7f6f5d6dc475/pone.0303729.g011.jpg

相似文献

1
Saudi Learner Translation Corpus: The design and compilation of an English-Arabic learner translation corpus.沙特学习者翻译语料库:英语-阿拉伯语学习者翻译语料库的设计与编制。
PLoS One. 2024 Oct 23;19(10):e0303729. doi: 10.1371/journal.pone.0303729. eCollection 2024.
2
English Translation Ability for Cultivation Mode Based on Multimodal Auxiliary Scaffolding Teaching Method.基于多模态辅助支架教学法的英语翻译能力培养模式。
Comput Intell Neurosci. 2022 Jul 19;2022:3556703. doi: 10.1155/2022/3556703. eCollection 2022.
3
Arabic version of the Palliative Care Self-Efficacy Scale: Translation, adaptation, and validation.《舒缓治疗自我效能感量表》阿拉伯文版:翻译、改编与验证。
Palliat Support Care. 2023 Oct;21(5):863-870. doi: 10.1017/S1478951522001456.
4
ParaMed: a parallel corpus for English-Chinese translation in the biomedical domain.ParaMed:一个用于生物医学领域英汉翻译的平行语料库。
BMC Med Inform Decis Mak. 2021 Sep 6;21(1):258. doi: 10.1186/s12911-021-01621-8.
5
Translation, cultural adaptation and linguistic validation of the postgraduate hospital educational environment measure into Arabic.将研究生医院教育环境量表翻译成阿拉伯语,并进行翻译、文化适应和语言验证。
BMC Med Educ. 2024 Jun 5;24(1):625. doi: 10.1186/s12909-024-05611-y.
6
A lexicon-based diachronic comparison of emotions and sentiments in literary translation: A case study of five Chinese versions of David Copperfield.基于词典的文学翻译中情感历时性对比研究:以《大卫·科波菲尔》五个中文译本为例。
PLoS One. 2024 Feb 13;19(2):e0297101. doi: 10.1371/journal.pone.0297101. eCollection 2024.
7
Accuracy of Google Translate in translating the directions and counseling points for top-selling drugs from English to Arabic, Chinese, and Spanish.谷歌翻译将畅销药物的使用说明和咨询要点从英语翻译成阿拉伯语、中文和西班牙语的准确性。
Am J Health Syst Pharm. 2021 Nov 9;78(22):2053-2058. doi: 10.1093/ajhp/zxab224.
8
Machine Translation System Using Deep Learning for English to Urdu.基于深度学习的英语到乌尔都语机器翻译系统。
Comput Intell Neurosci. 2022 Jan 3;2022:7873012. doi: 10.1155/2022/7873012. eCollection 2022.
9
Translator's Style Through Lexical Bundles: A Corpus-Driven Analysis of Two English Translations of Hongloumeng.通过词汇束探究译者风格:《红楼梦》两个英译本的语料库驱动分析
Front Psychol. 2021 May 28;12:633422. doi: 10.3389/fpsyg.2021.633422. eCollection 2021.
10
Pseudotext Injection and Advance Filtering of Low-Resource Corpus for Neural Machine Translation.用于神经机器翻译的低资源语料库的伪文本注入与预过滤
Comput Intell Neurosci. 2021 Apr 11;2021:6682385. doi: 10.1155/2021/6682385. eCollection 2021.

引用本文的文献

1
Designing AI-powered translation education tools: a framework for parallel sentence generation using SauLTC and LLMs.设计人工智能驱动的翻译教育工具:一个使用SauLTC和大语言模型进行平行句生成的框架。
PeerJ Comput Sci. 2025 Mar 31;11:e2788. doi: 10.7717/peerj-cs.2788. eCollection 2025.

本文引用的文献

1
The translation teaching platform based on multilingual corpora of Xi Jinping: The Governance of China: Design, resources and applications.基于习近平多语种语料库的翻译教学平台:《习近平谈治国理政》:设计、资源与应用。
Acta Psychol (Amst). 2024 Feb;242:104110. doi: 10.1016/j.actpsy.2023.104110. Epub 2023 Dec 11.
2
Myth or reality? Some directions on translation universals in recent corpus based case studies.神话还是现实?基于语料库的近期案例研究中翻译普遍性的一些方向。
Front Psychol. 2022 Nov 16;13:902400. doi: 10.3389/fpsyg.2022.902400. eCollection 2022.