Nair Aarathi Rajagopalan, Gupta Deepa, Premjith B
Department of Computer Science and Engineering, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Bengaluru, India.
Amrita School of Artificial Intelligence, Amrita Vishwa Vidyapeetham, Coimbatore, India.
Sci Rep. 2024 Oct 15;14(1):24202. doi: 10.1038/s41598-024-74617-9.
In the domain of natural language processing, the rise of Large Language Models and Generative AI represents a noteworthy transition, enabling machines to understand and generate text resembling that produced by humans. This research conducts a thorough examination of this transformative technology, with a focus on its influence on machine translation. The study explores the translation landscape between English and Indic languages, which include Hindi, Kannada, Malayalam, Tamil, and Telugu. To address this, the Large Language Model, BLOOMZ-3b, is utilized, which has been primarily developed for a text generation task. Multiple prompting engineering techniques for machine translation are prominently explored. The study further traverse fine-tuning the BLOOMZ-3b model using a Parameter Efficient Fine-Tuning technique called Low Rank Adaptation, aiming to reduce computational complexity. Hence, by combining innovative prompting approaches using BLOOMZ-3b model and fine-tuning the model, it contributes to continuous development of machine translation technologies beyond traditional borders of what can be done with respect to language processing. In this regard, not only does this research shed light on the intricacy of translation problems but it also sets a precedence for optimizing or adapting big language models to various languages which end up advancing Artificial Intelligence and Natural Language Processing at large.
在自然语言处理领域,大语言模型和生成式人工智能的兴起代表了一个值得关注的转变,使机器能够理解和生成类似于人类产生的文本。本研究对这项变革性技术进行了全面考察,重点关注其对机器翻译的影响。该研究探索了英语与印度语(包括印地语、卡纳达语、马拉雅拉姆语、泰米尔语和泰卢固语)之间的翻译情况。为解决此问题,使用了主要为文本生成任务而开发的大语言模型BLOOMZ - 3b。显著探索了多种用于机器翻译的提示工程技术。该研究进一步使用一种名为低秩自适应的参数高效微调技术对BLOOMZ - 3b模型进行微调,旨在降低计算复杂度。因此,通过结合使用BLOOMZ - 3b模型的创新提示方法并对模型进行微调,有助于机器翻译技术超越传统语言处理边界的持续发展。在这方面,这项研究不仅揭示了翻译问题的复杂性,还为优化或使大语言模型适应各种语言树立了先例,最终推动了人工智能和自然语言处理的整体发展。