• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的多副本中继节点选择策略。

Multi-Copy Relay Node Selection Strategy Based on Reinforcement Learning.

机构信息

College of Information Science and Technology, Nanjing Forestry University, Nanjing 210000, China.

出版信息

Sensors (Basel). 2023 Jul 4;23(13):6131. doi: 10.3390/s23136131.

DOI:10.3390/s23136131
PMID:37447980
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10346703/
Abstract

Delay tolerant networks (DTNs), are characterized by their difficulty in establishing end-to-end paths and and large message propagation delays. To control network overhead costs, reduce message delays, and improve delivery rates in DTNs, it is essential to not only delete messages that have reached their destination but also to more precisely determine appropriate relay nodes. Based on the above goals, this paper constructs a multi-copy relay node selection router algorithm based on Q-lambda reinforcement learning with reference to the idea of community division (QLCR). In community division, if a node has the highestdegree, it is considered the core node, and nodes with similar interests and structural properties are divided into a community. Node degree refers to the number of nodes associated with the node, indicating its importance in the network. Structural similarity determines the distance between nodes. The selection of relay nodes considers node degree, interests, and structural similarity. The Q-lambda reinforcement learning algorithm enables each node to learn from the entire network, setting corresponding reward values based on encountered nodes meeting the specified conditions. Through iterative processes, the node with the most cumulative reward value is chosen as the final relay node. Experimental results demonstrate that the proposed algorithm achieves a high delivery rate while maintaining low network overhead and delay.

摘要

延迟容忍网络(DTN)的特点是难以建立端到端路径和较大的消息传播延迟。为了控制网络开销成本、降低消息延迟并提高 DTN 中的投递率,不仅要删除已到达目的地的消息,还要更准确地确定合适的中继节点。基于上述目标,本文构建了一种基于 Q-λ强化学习的多副本中继节点选择路由器算法,该算法借鉴了社区划分的思想(QLCR)。在社区划分中,如果一个节点的度值最高,则认为它是核心节点,并且具有相似兴趣和结构属性的节点被划分到一个社区中。节点度是指与节点相关联的节点数量,表明其在网络中的重要性。结构相似度决定了节点之间的距离。中继节点的选择考虑了节点度、兴趣和结构相似度。Q-λ强化学习算法使每个节点都可以从整个网络中学习,根据遇到的满足指定条件的节点设置相应的奖励值。通过迭代过程,选择累积奖励值最高的节点作为最终的中继节点。实验结果表明,所提出的算法在保持低网络开销和延迟的同时,实现了高投递率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/0509637aabd8/sensors-23-06131-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/ccd63fd9a633/sensors-23-06131-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/9b8b5da444cd/sensors-23-06131-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/cbcd9efdcd0a/sensors-23-06131-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/54fd2565a66b/sensors-23-06131-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/35084441eee8/sensors-23-06131-g005a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/0509637aabd8/sensors-23-06131-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/ccd63fd9a633/sensors-23-06131-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/9b8b5da444cd/sensors-23-06131-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/cbcd9efdcd0a/sensors-23-06131-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/54fd2565a66b/sensors-23-06131-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/35084441eee8/sensors-23-06131-g005a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02af/10346703/0509637aabd8/sensors-23-06131-g006.jpg

相似文献

1
Multi-Copy Relay Node Selection Strategy Based on Reinforcement Learning.基于强化学习的多副本中继节点选择策略。
Sensors (Basel). 2023 Jul 4;23(13):6131. doi: 10.3390/s23136131.
2
A Time Synchronization Protocol for Barrage Relay Networks.弹幕中继网络的时间同步协议。
Sensors (Basel). 2023 Feb 22;23(5):2447. doi: 10.3390/s23052447.
3
A Comprehensive Evaluation Algorithm of Multi-Point Relay Based on Link-State Awareness for UANETs.一种基于链路状态感知的水声自组织网络多点中继综合评估算法
Sensors (Basel). 2024 Mar 6;24(5):1702. doi: 10.3390/s24051702.
4
Reinforcement Learning-Based Multihop Relaying: A Decentralized Q-Learning Approach.基于强化学习的多跳中继:一种去中心化Q学习方法。
Entropy (Basel). 2021 Oct 6;23(10):1310. doi: 10.3390/e23101310.
5
ARNS: Adaptive Relay-Node Selection Method for Message Broadcasting in the Internet of Vehicles.ARNS:车联网中消息广播的自适应中继节点选择方法。
Sensors (Basel). 2020 Feb 29;20(5):1338. doi: 10.3390/s20051338.
6
Range aware message drop policy for delay tolerant networks.延迟容忍网络的范围感知消息丢弃策略
PeerJ Comput Sci. 2024 Jul 12;10:e2099. doi: 10.7717/peerj-cs.2099. eCollection 2024.
7
A Software-Defined Directional Q-Learning Grid-Based Routing Platform and Its Two-Hop Trajectory-Based Routing Algorithm for Vehicular Ad Hoc Networks.一种用于车载自组织网络的软件定义定向Q学习网格路由平台及其基于两跳轨迹的路由算法。
Sensors (Basel). 2022 Oct 27;22(21):8222. doi: 10.3390/s22218222.
8
A New Method of Identifying Core Designers and Teams Based on the Importance and Similarity of Networks.基于网络重要性和相似性的核心设计师和团队识别新方法
Comput Intell Neurosci. 2021 Jul 20;2021:3717733. doi: 10.1155/2021/3717733. eCollection 2021.
9
FSF: Applying Machine Learning Techniques to Data Forwarding in Socially Selfish Opportunistic Networks.FSF:将机器学习技术应用于社会自私机会网络中的数据转发
Sensors (Basel). 2019 May 23;19(10):2374. doi: 10.3390/s19102374.
10
Next-Hop Relay Selection for Ad Hoc Network-Assisted Train-to-Train Communications in the CBTC System.基于 CBTC 系统的车地通信中自组织网络辅助的列车间中继选择。
Sensors (Basel). 2023 Jun 25;23(13):5883. doi: 10.3390/s23135883.

引用本文的文献

1
A Fuzzy-Based Relay Security Algorithm for Wireless Sensor Networks.一种用于无线传感器网络的基于模糊的中继安全算法。
Sensors (Basel). 2025 Jul 16;25(14):4422. doi: 10.3390/s25144422.
2
Range aware message drop policy for delay tolerant networks.延迟容忍网络的范围感知消息丢弃策略
PeerJ Comput Sci. 2024 Jul 12;10:e2099. doi: 10.7717/peerj-cs.2099. eCollection 2024.

本文引用的文献

1
Untargeted region of interest selection for gas chromatography - mass spectrometry data using a pseudo F-ratio moving window.使用伪 F 比移动窗口对气相色谱 - 质谱数据进行无目标感兴趣区域选择。
J Chromatogr A. 2022 Oct 25;1682:463499. doi: 10.1016/j.chroma.2022.463499. Epub 2022 Sep 13.
2
A Joint Energy Replenishment and Data Collection Strategy in Heterogeneous Wireless Rechargeable Sensor Networks.异构无线可充电传感器网络中的联合能量补充与数据收集策略
Sensors (Basel). 2021 Apr 22;21(9):2930. doi: 10.3390/s21092930.
3
Uncovering the overlapping community structure of complex networks in nature and society.
揭示自然与社会中复杂网络的重叠群落结构。
Nature. 2005 Jun 9;435(7043):814-8. doi: 10.1038/nature03607.
4
Fast algorithm for detecting community structure in networks.网络中社区结构检测的快速算法。
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Jun;69(6 Pt 2):066133. doi: 10.1103/PhysRevE.69.066133. Epub 2004 Jun 18.
5
Finding and evaluating community structure in networks.在网络中寻找并评估社区结构。
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Feb;69(2 Pt 2):026113. doi: 10.1103/PhysRevE.69.026113. Epub 2004 Feb 26.
6
Defining and identifying communities in networks.定义和识别网络中的群落。
Proc Natl Acad Sci U S A. 2004 Mar 2;101(9):2658-63. doi: 10.1073/pnas.0400054101. Epub 2004 Feb 23.
7
Community structure in social and biological networks.社会和生物网络中的群落结构。
Proc Natl Acad Sci U S A. 2002 Jun 11;99(12):7821-6. doi: 10.1073/pnas.122653799.