Liu Shugang, Peng Zhan, Yu Qiangguo, Duan Linan
School of Physics and Electronic Science, Hunan University of Science and Technology, Xiangtan, 411201, China.
Key Laboratory of Intelligent Sensors and Advanced Sensing Materials of Hunan Province, Hunan University of Science and Technology, Xiangtan, 411201, China.
Sci Rep. 2024 Aug 23;14(1):19636. doi: 10.1038/s41598-024-70619-9.
Effectively compressing transmitted images while limiting the distortion of reconstructed images remains a challenge in image semantic communication. This paper proposes an image semantic communication model that integrates a dynamic decision generation network with a generative adversarial network to address both problems. At the transmitter, semantic encoding and a dynamic decision generation network extract and select features according to the channel's signal-to-noise ratio (SNR). This semantic approach compresses the transmitted images and thereby reduces communication traffic. At the receiver, the generator, which serves as the decoder, works with the discriminator network, and adversarial and perceptual losses improve image reconstruction quality. Experiments on the CIFAR-10 dataset show that, in an additive white Gaussian noise (AWGN) channel with an SNR of 3 dB, the scheme achieves a peak signal-to-noise ratio (PSNR) of 26 dB, a structural similarity of 0.9, and a compression ratio (CR) of 81.5%. Similarly, in the Rayleigh fading channel, the PSNR is 23 dB, the structural similarity is 0.8, and the CR is 80.5%. The learned perceptual image patch similarity (LPIPS) is below 0.008 in both channels. The results support the proposed semantic communication scheme as a deep learning-based joint source-channel coding method that offers a high CR and low distortion of reconstructed images.
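The channel conditions and the PSNR metric named in the abstract can be made concrete with a small simulation. The following is a minimal sketch and not the authors' implementation: the function names `awgn_channel`, `rayleigh_channel`, and `psnr` are hypothetical, symbols are assumed to be real-valued, pixel values are assumed to lie in [0, 1], and the Rayleigh model assumes a single fading gain per block that the receiver knows exactly.

```python
import math
import random


def awgn_channel(symbols, snr_db, rng):
    """Add white Gaussian noise scaled so the average SNR equals snr_db."""
    signal_power = sum(s * s for s in symbols) / len(symbols)
    noise_power = signal_power / (10 ** (snr_db / 10))  # dB -> linear ratio
    sigma = math.sqrt(noise_power)
    return [s + rng.gauss(0.0, sigma) for s in symbols]


def rayleigh_channel(symbols, snr_db, rng):
    """Block Rayleigh fading plus AWGN, equalized with a known gain (assumption)."""
    # Magnitude of a unit-variance complex Gaussian: Rayleigh with E[gain^2] = 1.
    gain = math.hypot(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) / math.sqrt(2)
    signal_power = sum(s * s for s in symbols) / len(symbols)
    sigma = math.sqrt(signal_power / (10 ** (snr_db / 10)))
    received = [gain * s + rng.gauss(0.0, sigma) for s in symbols]
    # The receiver divides out the gain, which also scales the noise by 1 / gain.
    return [r / gain for r in received]


def psnr(original, reconstructed, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two equal-length sequences."""
    mse = sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)
    if mse == 0:
        return math.inf  # identical inputs: no distortion
    return 10 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    rng = random.Random(0)
    # Stand-in for one flattened 32x32 RGB image with values in [0, 1].
    pixels = [rng.random() for _ in range(3 * 32 * 32)]
    print("AWGN, 3 dB:", psnr(pixels, awgn_channel(pixels, 3.0, rng)))
    print("Rayleigh, 3 dB:", psnr(pixels, rayleigh_channel(pixels, 3.0, rng)))
```

This sketch sends raw pixel values through the channel, so the PSNR it prints reflects unprocessed channel damage only. The PSNR figures reported in the abstract are obtained with the learned encoder and generator in the loop, which this example does not model.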