Suppr超能文献

多语言大语言模型调查

A survey of multilingual large language models.

作者信息

Qin Libo, Chen Qiguang, Zhou Yuhang, Chen Zhi, Li Yinghui, Liao Lizi, Li Min, Che Wanxiang, Yu Philip S

机构信息

School of Computer Science and Engineering, Central South University, Changsha 410083, China.

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, Harbin 150001, China.

出版信息

Patterns (N Y). 2025 Jan 10;6(1):101118. doi: 10.1016/j.patter.2024.101118.

Abstract

Multilingual large language models (MLLMs) leverage advanced large language models to process and respond to queries across multiple languages, achieving significant success in polyglot tasks. Despite these breakthroughs, a comprehensive survey summarizing existing approaches and recent developments remains absent. To this end, this paper presents a unified and thorough review of the field, highlighting recent progress and emerging trends in MLLM research. The contributions of this paper are as follows. (1) Extensive survey: to our knowledge, this is the pioneering thorough review of multilingual alignment in MLLMs. (2) Unified taxonomy: we provide a unified framework to summarize the current progress in MLLMs. (3) Emerging frontiers: key emerging frontiers are identified, alongside a discussion of associated challenges. (4) Abundant resources: we collect abundant open-source resources, including relevant papers, data corpora, and leaderboards. We hope our work can provide the community quick access and spur breakthrough research in MLLMs.

摘要

多语言大语言模型(MLLMs)利用先进的大语言模型来处理和回答多种语言的查询,在多语言任务中取得了显著成功。尽管有这些突破,但仍缺乏对现有方法和最新进展的全面综述。为此,本文对该领域进行了统一而全面的回顾,突出了多语言大语言模型研究的最新进展和新趋势。本文的贡献如下。(1)广泛的综述:据我们所知,这是对多语言大语言模型中多语言对齐的开创性全面综述。(2)统一的分类法:我们提供了一个统一的框架来总结多语言大语言模型的当前进展。(3)新兴前沿:确定了关键的新兴前沿,并讨论了相关挑战。(4)丰富的资源:我们收集了丰富的开源资源,包括相关论文、数据语料库和排行榜。我们希望我们的工作能为社区提供快速访问,并推动多语言大语言模型的突破性研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b4a/11783891/93fd4a7a734f/gr7.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验