Bai Yanan, Liu Quanliang, Wu Wenyuan, Feng Yong
Chongqing Key Laboratory of Automated Reasoning and Cognition, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, China.
University of Chinese Academy of Sciences, Beijing, China.
Front Comput Neurosci. 2021 Dec 23;15:799977. doi: 10.3389/fncom.2021.799977. eCollection 2021.
The emerging topic of privacy-preserving deep learning as a service has attracted increasing attention in recent years, which focuses on building an efficient and practical neural network prediction framework to secure client and model-holder data privately on the cloud. In such a task, the time cost of performing the secure linear layers is expensive, where matrix multiplication is the atomic operation. Most existing mix-based solutions heavily emphasized employing BGV-based homomorphic encryption schemes to secure the linear layer on the CPU platform. However, they suffer an efficiency and energy loss when dealing with a larger-scale dataset, due to the complicated encoded methods and intractable ciphertext operations. To address it, we propose cuSCNN, a secure and efficient framework to perform the privacy prediction task of a convolutional neural network (CNN), which can flexibly perform on the GPU platform. Its main idea is 2-fold: (1) To avoid the trivia and complicated homomorphic matrix computations brought by BGV-based solutions, it adopts GSW-based homomorphic matrix encryption to efficiently enable the linear layers of CNN, which is a naive method to secure matrix computation operations. (2) To improve the computation efficiency on GPU, a hybrid optimization approach based on CUDA (Compute Unified Device Architecture) has been proposed to improve the parallelism level and memory access speed when performing the matrix multiplication on GPU. Extensive experiments are conducted on industrial datasets and have shown the superior performance of the proposed cuSCNN framework in terms of runtime and power consumption compared to the other frameworks.
近年来,作为一种服务的隐私保护深度学习这一新兴主题受到了越来越多的关注,它专注于构建一个高效实用的神经网络预测框架,以便在云端私密地保护客户端和模型持有者的数据。在这样的任务中,执行安全线性层的时间成本很高,其中矩阵乘法是原子操作。大多数现有的基于混合的解决方案严重强调采用基于BGV的同态加密方案来保护CPU平台上的线性层。然而,由于编码方法复杂且密文操作棘手,它们在处理大规模数据集时会出现效率和能量损失。为了解决这个问题,我们提出了cuSCNN,这是一个用于执行卷积神经网络(CNN)隐私预测任务的安全高效框架,它可以在GPU平台上灵活运行。其主要思想有两个方面:(1)为了避免基于BGV的解决方案带来的琐碎和复杂的同态矩阵计算,它采用基于GSW的同态矩阵加密来有效地实现CNN的线性层,这是一种保护矩阵计算操作的简单方法。(2)为了提高GPU上的计算效率,提出了一种基于CUDA(计算统一设备架构)的混合优化方法,以提高在GPU上执行矩阵乘法时的并行度和内存访问速度。在工业数据集上进行了广泛的实验,结果表明,与其他框架相比,所提出的cuSCNN框架在运行时和功耗方面具有卓越的性能。