Kabir Homayun, Tham Mau-Luen, Chang Yoong Choon, Chow Chee-Onn, Owada Yasunori
Department of Electrical and Electronic Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, Sungai Long Campus, Kajang 43000, Malaysia.
Department of Electrical Engineering, Faculty of Engineering, Universiti Malaya, Lembah Pantai, Kuala Lumpur 50603, Malaysia.
Sensors (Basel). 2023 Jul 17;23(14):6448. doi: 10.3390/s23146448.
Natural disasters, including earthquakes, floods, landslides, tsunamis, wildfires, and hurricanes, have become more common in recent years due to rapid climate change. For Post-Disaster Management (PDM), authorities deploy various types of user equipment (UE) for the search and rescue operation, for example, search and rescue robots, drones, medical robots, smartphones, etc., via the Internet of Robotic Things (IoRT) supported by cellular 4G/LTE/5G and beyond or other wireless technologies. For uninterrupted communication services, movable and deployable resource units (MDRUs) have been utilized where the base stations are damaged due to the disaster. In addition, power optimization of the networks by satisfying the quality of service (QoS) of each UE is a crucial challenge because of the electricity crisis after the disaster. In order to optimize the energy efficiency, UE throughput, and serving cell (SC) throughput by considering the stationary as well as movable UE without knowing the environmental priori knowledge in MDRUs aided two-tier heterogeneous networks (HetsNets) of IoRT, the optimization problem has been formulated based on emitting power allocation and user association combinedly in this article. This optimization problem is nonconvex and NP-hard where parameterized (discrete: user association and continuous: power allocation) action space is deployed. The new model-free hybrid action space-based algorithm called multi-pass deep Q network (MP-DQN) is developed to optimize this complex problem. Simulations results demonstrate that the proposed MP-DQN outperforms the parameterized deep Q network (P-DQN) approach, which is well known for solving parameterized action space, DQN, as well as traditional algorithms in terms of reward, average energy efficiency, UE throughput, and SC throughput for motionless as well as moveable UE.
近年来,由于气候变化迅速,包括地震、洪水、山体滑坡、海啸、野火和飓风在内的自然灾害变得更加频繁。在灾后管理(PDM)中,当局通过由蜂窝4G/LTE/5G及更高版本或其他无线技术支持的机器人物联网(IoRT),部署各种类型的用户设备(UE)进行搜索和救援行动,例如搜索和救援机器人、无人机、医疗机器人、智能手机等。对于不间断通信服务,在基站因灾难受损的地方使用了可移动和可部署资源单元(MDRU)。此外,由于灾后的电力危机,在满足每个UE的服务质量(QoS)的情况下优化网络的功率是一项关键挑战。为了在不了解MDRU辅助的IoRT两层异构网络(HetsNets)中的环境先验知识的情况下,通过考虑固定和移动UE来优化能源效率、UE吞吐量和服务小区(SC)吞吐量,本文结合发射功率分配和用户关联制定了优化问题。该优化问题是非凸且NP难的,其中部署了参数化(离散:用户关联和连续:功率分配)动作空间。开发了一种名为多通道深度Q网络(MP-DQN)的基于新的无模型混合动作空间的算法来优化这个复杂问题。仿真结果表明,所提出的MP-DQN在奖励、平均能源效率、UE吞吐量和SC吞吐量方面优于参数化深度Q网络(P-DQN)方法,P-DQN方法以解决参数化动作空间而闻名,在解决DQN以及传统算法方面,对于静止和移动UE都有优势。