• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于分层强化学习的协作带宽管理自适应门限。

Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning.

机构信息

Information and Communication Engineering Department, Dongguk University, Seoul 04620, Korea.

出版信息

Sensors (Basel). 2021 Oct 25;21(21):7053. doi: 10.3390/s21217053.

DOI:10.3390/s21217053
PMID:34770360
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8587839/
Abstract

With the increase in Internet of Things (IoT) devices and network communications, but with less bandwidth growth, the resulting constraints must be overcome. Due to the network complexity and uncertainty of emergency distribution parameters in smart environments, using predetermined rules seems illogical. Reinforcement learning (RL), as a powerful machine learning approach, can handle such smart environments without a trainer or supervisor. Recently, we worked on bandwidth management in a smart environment with several fog fragments using limited shared bandwidth, where IoT devices may experience uncertain emergencies in terms of the time and sequence needed for more bandwidth for further higher-level communication. We introduced fog fragment cooperation using an RL approach under a predefined fixed threshold constraint. In this study, we promote this approach by removing the fixed level of restriction of the threshold through hierarchical reinforcement learning (HRL) and completing the cooperation qualification. At the first learning hierarchy level of the proposed approach, the best threshold level is learned over time, and the final results are used by the second learning hierarchy level, where the fog node learns the best device for helping an emergency device by temporarily lending the bandwidth. Although equipping the method to the adaptive threshold and restricting fog fragment cooperation make the learning procedure more difficult, the HRL approach increases the method's efficiency in terms of time and performance.

摘要

随着物联网 (IoT) 设备和网络通信的增加,而带宽的增长却较少,因此必须克服由此产生的限制。由于智能环境中网络的复杂性和应急分配参数的不确定性,使用预定规则似乎不合逻辑。强化学习 (RL) 作为一种强大的机器学习方法,可以在没有培训师或监督者的情况下处理这种智能环境。最近,我们使用有限的共享带宽研究了具有多个雾碎片的智能环境中的带宽管理,其中 IoT 设备可能会遇到不确定的紧急情况,需要更多带宽以进行进一步的更高层次的通信。我们使用 RL 方法在预定义的固定阈值约束下引入了雾碎片合作。在这项研究中,我们通过层次强化学习 (HRL) 去除了阈值的固定限制水平并完成了合作资格,从而推动了这种方法。在提出的方法的第一个学习层次级别中,最佳阈值水平会随着时间的推移而学习,最终结果将由第二个学习层次级别使用,其中雾节点通过临时借用带宽来学习帮助应急设备的最佳设备。尽管将方法配备自适应阈值并限制雾碎片合作会使学习过程更加困难,但 HRL 方法在时间和性能方面提高了该方法的效率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/6b33aeab5685/sensors-21-07053-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/764af223df29/sensors-21-07053-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/c834a8694598/sensors-21-07053-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/aead52cbbd88/sensors-21-07053-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/0fbc1bb4c178/sensors-21-07053-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/fb04555686f5/sensors-21-07053-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/6b33aeab5685/sensors-21-07053-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/764af223df29/sensors-21-07053-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/c834a8694598/sensors-21-07053-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/aead52cbbd88/sensors-21-07053-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/0fbc1bb4c178/sensors-21-07053-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/fb04555686f5/sensors-21-07053-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba4f/8587839/6b33aeab5685/sensors-21-07053-g006.jpg

相似文献

1
Toward an Adaptive Threshold on Cooperative Bandwidth Management Based on Hierarchical Reinforcement Learning.基于分层强化学习的协作带宽管理自适应门限。
Sensors (Basel). 2021 Oct 25;21(21):7053. doi: 10.3390/s21217053.
2
Fog Fragment Cooperation on Bandwidth Management Based on Reinforcement Learning.基于强化学习的雾碎片带宽管理协作
Sensors (Basel). 2020 Dec 4;20(23):6942. doi: 10.3390/s20236942.
3
Detection of Malicious Cloud Bandwidth Consumption in Cloud Computing Using Machine Learning Techniques.利用机器学习技术检测云计算中的恶意带宽消耗。
Comput Intell Neurosci. 2022 Sep 5;2022:4003403. doi: 10.1155/2022/4003403. eCollection 2022.
4
Integrating Deep Learning-Based IoT and Fog Computing with Software-Defined Networking for Detecting Weapons in Video Surveillance Systems.将基于深度学习的物联网和雾计算与软件定义网络集成,用于检测视频监控系统中的武器。
Sensors (Basel). 2022 Jul 6;22(14):5075. doi: 10.3390/s22145075.
5
Intelligent decision making for energy efficient fog nodes selection and smart switching in the IOT: a machine learning approach.物联网中用于节能雾节点选择和智能切换的智能决策:一种机器学习方法。
PeerJ Comput Sci. 2024 Mar 20;10:e1833. doi: 10.7717/peerj-cs.1833. eCollection 2024.
6
Real-Time Task Assignment Approach Leveraging Reinforcement Learning with Evolution Strategies for Long-Term Latency Minimization in Fog Computing.利用强化学习与进化策略的实时任务分配方法,实现雾计算中的长期延迟最小化。
Sensors (Basel). 2018 Aug 27;18(9):2830. doi: 10.3390/s18092830.
7
Device-to-Device (D2D) Multi-Criteria Learning Algorithm Using Secured Sensors.利用安全传感器的设备到设备(D2D)多准则学习算法。
Sensors (Basel). 2022 Mar 9;22(6):2115. doi: 10.3390/s22062115.
8
Processing Complex Events in Fog-Based Internet of Things Systems for Smart Agriculture.在基于雾的物联网系统中处理智能农业的复杂事件。
Sensors (Basel). 2021 Oct 30;21(21):7226. doi: 10.3390/s21217226.
9
An analytical model to minimize the latency in healthcare internet-of-things in fog computing environment.一种在雾计算环境中最小化医疗物联网中延迟的分析模型。
PLoS One. 2019 Nov 13;14(11):e0224934. doi: 10.1371/journal.pone.0224934. eCollection 2019.
10
A Study on the Design of Fog Computing Architecture Using Sensor Networks.基于传感器网络的雾计算架构设计研究。
Sensors (Basel). 2018 Oct 26;18(11):3633. doi: 10.3390/s18113633.

本文引用的文献

1
Fog Fragment Cooperation on Bandwidth Management Based on Reinforcement Learning.基于强化学习的雾碎片带宽管理协作
Sensors (Basel). 2020 Dec 4;20(23):6942. doi: 10.3390/s20236942.
2
Smart homes - current features and future perspectives.智能家居——当前特点与未来展望。
Maturitas. 2009 Oct 20;64(2):90-7. doi: 10.1016/j.maturitas.2009.07.014. Epub 2009 Sep 2.