Suppr超能文献

通过提示和LoRA微调,使用BLOOMZ-3b研究印度语言的翻译。

Investigating translation for Indic languages with BLOOMZ-3b through prompting and LoRA fine-tuning.

作者信息

Nair Aarathi Rajagopalan, Gupta Deepa, Premjith B

机构信息

Department of Computer Science and Engineering, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Bengaluru, India.

Amrita School of Artificial Intelligence, Amrita Vishwa Vidyapeetham, Coimbatore, India.

出版信息

Sci Rep. 2024 Oct 15;14(1):24202. doi: 10.1038/s41598-024-74617-9.

Abstract

In the domain of natural language processing, the rise of Large Language Models and Generative AI represents a noteworthy transition, enabling machines to understand and generate text resembling that produced by humans. This research conducts a thorough examination of this transformative technology, with a focus on its influence on machine translation. The study explores the translation landscape between English and Indic languages, which include Hindi, Kannada, Malayalam, Tamil, and Telugu. To address this, the Large Language Model, BLOOMZ-3b, is utilized, which has been primarily developed for a text generation task. Multiple prompting engineering techniques for machine translation are prominently explored. The study further traverse fine-tuning the BLOOMZ-3b model using a Parameter Efficient Fine-Tuning technique called Low Rank Adaptation, aiming to reduce computational complexity. Hence, by combining innovative prompting approaches using BLOOMZ-3b model and fine-tuning the model, it contributes to continuous development of machine translation technologies beyond traditional borders of what can be done with respect to language processing. In this regard, not only does this research shed light on the intricacy of translation problems but it also sets a precedence for optimizing or adapting big language models to various languages which end up advancing Artificial Intelligence and Natural Language Processing at large.

摘要

在自然语言处理领域,大语言模型和生成式人工智能的兴起代表了一个值得关注的转变,使机器能够理解和生成类似于人类产生的文本。本研究对这项变革性技术进行了全面考察,重点关注其对机器翻译的影响。该研究探索了英语与印度语(包括印地语、卡纳达语、马拉雅拉姆语、泰米尔语和泰卢固语)之间的翻译情况。为解决此问题,使用了主要为文本生成任务而开发的大语言模型BLOOMZ - 3b。显著探索了多种用于机器翻译的提示工程技术。该研究进一步使用一种名为低秩自适应的参数高效微调技术对BLOOMZ - 3b模型进行微调,旨在降低计算复杂度。因此,通过结合使用BLOOMZ - 3b模型的创新提示方法并对模型进行微调,有助于机器翻译技术超越传统语言处理边界的持续发展。在这方面,这项研究不仅揭示了翻译问题的复杂性,还为优化或使大语言模型适应各种语言树立了先例,最终推动了人工智能和自然语言处理的整体发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17a7/11480041/d1aaee76385d/41598_2024_74617_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验