Tiny Language Models for Automation and Control: Overview, Potential Applications, and Future Research Directions.

Authors

Lamaakal Ismail, Maleh Yassine, El Makkaoui Khalid, Ouahbi Ibrahim, Pławiak Paweł, Alfarraj Osama, Almousa May, Abd El-Latif Ahmed A

Affiliations

Multidisciplinary Faculty of Nador, Mohammed Premier University, Oujda 60000, Morocco.

National School of Applied Sciences, Sultan Moulay Slimane University, Beni Mellal 23000, Morocco.

Publication information

Sensors (Basel). 2025 Feb 21;25(5):1318. doi: 10.3390/s25051318.

DOI: 10.3390/s25051318
PMID: 40096098
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11902656/

Abstract

Large Language Models (LLMs), such as GPT and BERT, have significantly advanced Natural Language Processing (NLP), enabling high performance on complex tasks. However, their size and computational needs make LLMs unsuitable for deployment on resource-constrained devices, where efficiency, speed, and low power consumption are critical. Tiny Language Models (TLMs), also known as BabyLMs, offer compact alternatives by using advanced compression and optimization techniques to function effectively on devices such as smartphones, Internet of Things (IoT) systems, and embedded platforms. This paper provides a comprehensive survey of TLM architectures and methodologies, including key techniques such as knowledge distillation, quantization, and pruning. Additionally, it explores potential and emerging applications of TLMs in automation and control, covering areas such as edge computing, IoT, industrial automation, and healthcare. The survey discusses challenges unique to TLMs, such as trade-offs between model size and accuracy, limited generalization, and ethical considerations in deployment. Future research directions are also proposed, focusing on hybrid compression techniques, application-specific adaptations, and context-aware TLMs optimized for hardware-specific constraints. This paper aims to serve as a foundational resource for advancing TLM capabilities across diverse real-world applications.
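
The three compression techniques named in the abstract can be illustrated with a minimal pure-Python sketch. This is not code from the paper: the function names, the symmetric per-tensor int8 scheme, the global magnitude-pruning criterion, and the temperature-softened KL loss are all assumptions chosen for illustration, and real systems typically apply these ideas per layer with a tensor library.

```python
import math

def quantize_int8(weights):
    """Symmetric per-tensor quantization: floats -> int8 codes plus one scale (illustrative)."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer code and clamp to the signed 8-bit range used here.
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale

def dequantize(codes, scale):
    """Map int8 codes back to approximate float weights."""
    return [c * scale for c in codes]

def magnitude_prune(weights, sparsity):
    """Zero out the fraction `sparsity` of weights with the smallest magnitude."""
    n_drop = int(len(weights) * sparsity)
    by_magnitude = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    drop = set(by_magnitude[:n_drop])
    return [0.0 if i in drop else w for i, w in enumerate(weights)]

def softmax(logits, temperature=1.0):
    """Numerically stable softmax with a temperature parameter."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) between temperature-softened output distributions."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical toy weights and logits, used only to exercise the functions.
weights = [0.5, -1.27, 0.03, 0.9]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)
pruned = magnitude_prune(weights, sparsity=0.5)
loss = distillation_loss([2.0, 0.5, -1.0], [1.0, 0.8, -0.2])
```

In this sketch, quantization bounds the per-weight reconstruction error by roughly half the scale, pruning trades accuracy for sparsity through a single `sparsity` knob, and the distillation loss is zero only when the student's softened distribution matches the teacher's, which reflects the size-versus-accuracy trade-off the abstract mentions.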

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e4b/11902656/1724a624bca4/sensors-25-01318-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e4b/11902656/300163db786d/sensors-25-01318-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e4b/11902656/8ab7836138a2/sensors-25-01318-g002.jpg

Similar articles

1. Tiny Language Models for Automation and Control: Overview, Potential Applications, and Future Research Directions. Sensors (Basel). 2025 Feb 21;25(5):1318. doi: 10.3390/s25051318.
2. Industrial applications of large language models. Sci Rep. 2025 Apr 21;15(1):13755. doi: 10.1038/s41598-025-98483-1.
3. A Survey on Industrial Internet of Things: A Cyber-Physical Systems Perspective. IEEE Access. 2018;6. doi: 10.1109/access.2018.2884906.
4. AI augmented edge and fog computing for Internet of Health Things (IoHT). PeerJ Comput Sci. 2025 Jan 30;11:e2431. doi: 10.7717/peerj-cs.2431. eCollection 2025.
5. Neuromorphic Sentiment Analysis Using Spiking Neural Networks. Sensors (Basel). 2023 Sep 6;23(18):7701. doi: 10.3390/s23187701.
6. Developing healthcare language model embedding spaces. Artif Intell Med. 2024 Dec;158:103009. doi: 10.1016/j.artmed.2024.103009. Epub 2024 Oct 31.
7. Leveraging Large Language Models for Improved Understanding of Communications With Patients With Cancer in a Call Center Setting: Proof-of-Concept Study. J Med Internet Res. 2024 Dec 11;26:e63892. doi: 10.2196/63892.
8. A Survey on IoT Application Architectures. Sensors (Basel). 2024 Aug 17;24(16):5320. doi: 10.3390/s24165320.
9. Integrating meta-heuristic with named data networking for secure edge computing in IoT enabled healthcare monitoring system. Sci Rep. 2024 Sep 15;14(1):21532. doi: 10.1038/s41598-024-71506-z.
10. An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study. JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.
