Rashid Muhammad, Baloch Naveed Khan, Shafique Muhammad Akmal, Hussain Fawad, Saleem Shahroon, Zikria Yousaf Bin, Yu Heejung
Department of Computer Engineering, University of Engineering & Technology, Taxila 47050, Pakistan.
Department of Electrical Engineering, University of Engineering & Technology, Taxila 47050, Pakistan.
Sensors (Basel). 2020 Sep 18;20(18):5355. doi: 10.3390/s20185355.
Network-on-chip (NoC) architectures have become a popular communication platform for heterogeneous computing systems owing to their scalability and high performance. Aggressive technology scaling makes these architectures prone to both permanent and transient faults. This study focuses on the tolerance of a NoC router to permanent faults. A permanent fault in a NoC router severely impacts the performance of the entire network. Thus, it is necessary to incorporate component-level protection techniques in a router. In the proposed scheme, the input port utilizes a bypass path, virtual channel (VC) queuing, and VC closing strategies. Moreover, the routing computation stage utilizes spatial redundancy and double routing strategies, and the VC allocation stage utilizes spatial redundancy. The switch allocation stage utilizes run-time arbiter selection. The crossbar stage utilizes a triple bypass bus. The proposed router is highly fault-tolerant compared with the existing state-of-the-art fault-tolerant routers. The reliability of the proposed router is 7.98 times higher than that of the unprotected baseline router in terms of the mean-time-to-failure metric. The silicon protection factor metric is used to calculate the protection ability of the proposed router. Consequently, it is confirmed that the proposed router has a greater protection ability than the conventional fault-tolerant routers.
片上网络(NoC)架构因其可扩展性和高性能,已成为异构计算系统中一种流行的通信平台。激进的技术缩放使得这些架构容易出现永久性和瞬态故障。本研究聚焦于NoC路由器对永久性故障的容错能力。NoC路由器中的永久性故障会严重影响整个网络的性能。因此,有必要在路由器中纳入组件级保护技术。在所提出的方案中,输入端口采用旁路路径、虚拟通道(VC)排队和VC关闭策略。此外,路由计算阶段采用空间冗余和双重路由策略,VC分配阶段采用空间冗余。交换分配阶段采用运行时仲裁器选择。交叉开关阶段采用三重旁路总线。与现有的最先进容错路由器相比,所提出的路由器具有高度的容错能力。就平均无故障时间指标而言,所提出路由器的可靠性比未受保护的基准路由器高7.98倍。硅保护因子指标用于计算所提出路由器的保护能力。因此,可以确认所提出的路由器比传统容错路由器具有更强的保护能力。