Pervez Zeeshan, Ahmad Mahmood, Khattak Asad Masood, Ramzan Naeem, Khan Wajahat Ali
School of Engineering and Computing, University of the West of Scotland, Paisley, PA1 2BE, United Kingdom.
Ubiquitous Computing Lab, Department of Computer Engineering, Kyung Hee University, Global Campus, 1 Seocheon-dong, Giheung-gu, Yongin-si, Gyeonggi-do 446-701, South Korea.
PLoS One. 2017 Jul 10;12(7):e0179720. doi: 10.1371/journal.pone.0179720. eCollection 2017.
Public cloud storage services are becoming prevalent and myriad data sharing, archiving and collaborative services have emerged which harness the pay-as-you-go business model of public cloud. To ensure privacy and confidentiality often encrypted data is outsourced to such services, which further complicates the process of accessing relevant data by using search queries. Search over encrypted data schemes solve this problem by exploiting cryptographic primitives and secure indexing to identify outsourced data that satisfy the search criteria. Almost all of these schemes rely on exact matching between the encrypted data and search criteria. A few schemes which extend the notion of exact matching to similarity based search, lack realism as those schemes rely on trusted third parties or due to increase storage and computational complexity. In this paper we propose Oblivious Similarity based Search ([Formula: see text]) for encrypted data. It enables authorized users to model their own encrypted search queries which are resilient to typographical errors. Unlike conventional methodologies, [Formula: see text] ranks the search results by using similarity measure offering a better search experience than exact matching. It utilizes encrypted bloom filter and probabilistic homomorphic encryption to enable authorized users to access relevant data without revealing results of search query evaluation process to the untrusted cloud service provider. Encrypted bloom filter based search enables [Formula: see text] to reduce search space to potentially relevant encrypted data avoiding unnecessary computation on public cloud. The efficacy of [Formula: see text] is evaluated on Google App Engine for various bloom filter lengths on different cloud configurations.
公共云存储服务正变得越来越普遍,并且涌现出了无数的数据共享、存档和协作服务,这些服务利用了公共云的即付即用商业模式。为了确保隐私和机密性,通常会将加密数据外包给此类服务,这使得使用搜索查询访问相关数据的过程变得更加复杂。加密数据搜索方案通过利用密码原语和安全索引来识别满足搜索标准的外包数据,从而解决了这个问题。几乎所有这些方案都依赖于加密数据和搜索标准之间的精确匹配。一些将精确匹配概念扩展到基于相似性搜索的方案缺乏现实性,因为这些方案依赖于可信第三方,或者由于存储和计算复杂度的增加。在本文中,我们提出了用于加密数据的基于不经意相似性的搜索([公式:见正文])。它使授权用户能够对自己的加密搜索查询进行建模,这些查询能够抵御排版错误。与传统方法不同,[公式:见正文]通过使用相似性度量对搜索结果进行排序,提供了比精确匹配更好的搜索体验。它利用加密布隆过滤器和概率同态加密,使授权用户能够访问相关数据,而无需向不可信的云服务提供商透露搜索查询评估过程的结果。基于加密布隆过滤器的搜索使[公式:见正文]能够将搜索空间缩小到潜在相关的加密数据,避免在公共云上进行不必要的计算。在谷歌应用引擎上,针对不同云配置下的各种布隆过滤器长度,对[公式:见正文]的有效性进行了评估。