Namazi Mina, Farahpoor Mohammadali, Ayday Erman, Pérez-González Fernando
Internet Interdisciplinary Institute (IN3), Open University of Catalonia, Barcelona 08018, Spain.
Computer and Data Engineering School, Case Western Reserve University, Cleveland, OH 44106, United States.
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btae754.
The affordability of genome sequencing and the widespread availability of genomic data have opened up new medical possibilities. Nevertheless, they also raise significant concerns regarding privacy due to the sensitive information they encompass. These privacy implications act as barriers to medical research and data availability. Researchers have proposed privacy-preserving techniques to address this, with cryptography-based methods showing the most promise. However, existing cryptography-based designs lack (i) interoperability, (ii) scalability, (iii) a high degree of privacy (i.e. compromise one to have the other), or (iv) multiparty analyses support (as most existing schemes process genomic information of each party individually). Overcoming these limitations is essential to unlocking the full potential of genomic data while ensuring privacy and data utility. Further research and development are needed to advance privacy-preserving techniques in genomics, focusing on achieving interoperability and scalability, preserving data utility, and enabling secure multiparty computation.
This study aims to overcome the limitations of current cryptography-based techniques by employing a multi-key homomorphic encryption scheme. By utilizing this scheme, we have developed a comprehensive protocol capable of conducting diverse genomic analyses. Our protocol facilitates interoperability among individual genome processing and enables multiparty tests, analyses of genomic databases, and operations involving multiple databases. Consequently, our approach represents an innovative advancement in secure genomic data processing, offering enhanced protection and privacy measures.
All associated code and documentation are available at https://github.com/farahpoor/smkhe.
基因组测序的可承受性以及基因组数据的广泛可得性开启了新的医学可能性。然而,由于它们所包含的敏感信息,也引发了对隐私的重大担忧。这些隐私问题成为医学研究和数据可得性的障碍。研究人员提出了隐私保护技术来解决这一问题,其中基于密码学的方法最具前景。然而,现有的基于密码学的设计缺乏(i)互操作性,(ii)可扩展性,(iii)高度隐私性(即要实现一个就会牺牲另一个),或者(iv)多方分析支持(因为大多数现有方案分别处理各方的基因组信息)。克服这些限制对于释放基因组数据的全部潜力同时确保隐私和数据实用性至关重要。需要进一步的研究和开发来推进基因组学中的隐私保护技术,重点是实现互操作性和可扩展性、保留数据实用性以及实现安全的多方计算。
本研究旨在通过采用多密钥同态加密方案克服当前基于密码学技术的局限性。通过利用该方案,我们开发了一个能够进行各种基因组分析的综合协议。我们的协议促进了个体基因组处理之间的互操作性,并实现了多方测试、基因组数据库分析以及涉及多个数据库的操作。因此,我们的方法代表了安全基因组数据处理方面的创新进展,提供了增强的保护和隐私措施。
所有相关代码和文档可在https://github.com/farahpoor/smkhe获取。