Designing AI-powered translation education tools: a framework for parallel sentence generation using SauLTC and LLMs.

Authors

Aleedy Moneerh, Alshihri Fatma, Meshoul Souham, Al-Harthi Maha, Alramlawi Salwa, Aldaihani Badr, Shaiba Hadil, Atwell Eric

Affiliations

Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

School of Computer Science, University of Leeds, Leeds, United Kingdom.

Publication information

PeerJ Comput Sci. 2025 Mar 31;11:e2788. doi: 10.7717/peerj-cs.2788. eCollection 2025.

DOI:10.7717/peerj-cs.2788
PMID:40567735
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12190550/
Abstract

Translation education (TE) is labor-intensive and demands significant effort from educators. Computational tools powered by artificial intelligence (AI) can alleviate this burden by automating repetitive tasks, allowing instructors to focus on higher-level pedagogical aspects of translation. Integrating AI in this way has the potential to significantly enhance the efficiency and effectiveness of translation education. However, the development of effective AI-based tools for TE is hampered by a lack of high-quality, comprehensive datasets tailored to this specific need, especially for Arabic. While the Saudi Learner Translation Corpus (SauLTC), a unidirectional English-to-Arabic parallel corpus, constitutes a valuable resource, its current format is inadequate for generating the parallel sentences required for a didactic translation corpus. This article proposes leveraging large language models such as the Generative Pre-trained Transformer (GPT) to transform SauLTC into a parallel sentence corpus. Using cosine similarity and human evaluation, we assessed the quality of the generated parallel sentences, achieving promising results with an 85.2% similarity score using Language-agnostic BERT Sentence Embedding (LaBSE) in conjunction with GPT, outperforming the other embedding models investigated. The results demonstrate the potential of AI to address critical dataset challenges in pursuit of effective data-driven solutions to support translation education.
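The cosine-similarity check mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names and the toy 3-dimensional vectors below are hypothetical stand-ins for sentence embeddings (in the study, these would come from a model such as LaBSE), and averaging the per-pair scores is an assumption about how a single similarity figure might be summarized.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def mean_pair_similarity(source_vecs, target_vecs):
    """Average cosine similarity over aligned (source, target) pairs."""
    if not source_vecs or len(source_vecs) != len(target_vecs):
        raise ValueError("need two non-empty lists of equal length")
    scores = [cosine_similarity(s, t) for s, t in zip(source_vecs, target_vecs)]
    return sum(scores) / len(scores)


# Hypothetical toy embeddings: one vector per English sentence and one per
# candidate Arabic sentence. Real sentence embeddings have far more dimensions.
english_vecs = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]]
arabic_vecs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]

score = mean_pair_similarity(english_vecs, arabic_vecs)
print(f"mean similarity: {score:.3f}")
```

A score near 1 means the two embeddings point in nearly the same direction, which is read as the sentences being close in meaning; scores near 0 suggest a poor alignment that a human evaluator may want to inspect.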

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4627/12190550/126adc5c0c35/peerj-cs-11-2788-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4627/12190550/9876c54509ff/peerj-cs-11-2788-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4627/12190550/97edae1b5c3d/peerj-cs-11-2788-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4627/12190550/b04a3a9da7c6/peerj-cs-11-2788-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4627/12190550/c9351fac5465/peerj-cs-11-2788-g005.jpg
