Gürsoy Gamze, Chielle Eduardo, Brannon Charlotte M, Maniatakos Michail, Gerstein Mark
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
Department of Electrical and Computer Engineering, New York University Abu Dhabi, Saadiyat Island, Abu Dhabi, UAE.
Cell Syst. 2022 Feb 16;13(2):173-182.e3. doi: 10.1016/j.cels.2021.10.003. Epub 2021 Nov 9.
Genotype imputation is the inference of unknown genotypes using known population structure observed in large genomic datasets; it can further our understanding of phenotype-genotype relationships and is useful for QTL mapping and GWASs. However, the compute-intensive nature of genotype imputation can overwhelm local servers for computation and storage. Hence, many researchers are moving toward using cloud services, raising privacy concerns. We address these concerns by developing an efficient, privacy-preserving algorithm called p-Impute. Our method uses homomorphic encryption, allowing calculations on ciphertext, thereby avoiding the decryption of private genotypes in the cloud. It is similar to k-nearest neighbor approaches, inferring missing genotypes in a genomic block based on the SNP genotypes of genetically related individuals in the same block. Our results demonstrate accuracy in agreement with the state-of-the-art plaintext solutions. Moreover, p-Impute is scalable to real-world applications as its memory and time requirements increase linearly with the increasing number of samples. p-Impute is freely available for download here: https://doi.org/10.5281/zenodo.5542001.
基因型填充是利用在大型基因组数据集中观察到的已知群体结构来推断未知基因型;它可以加深我们对表型-基因型关系的理解,并且对数量性状基因座定位和全基因组关联研究很有用。然而,基因型填充的计算密集型性质可能会压垮用于计算和存储的本地服务器。因此,许多研究人员正转向使用云服务,这引发了隐私担忧。我们通过开发一种名为p-Impute的高效、隐私保护算法来解决这些担忧。我们的方法使用同态加密,允许对密文进行计算,从而避免在云中解密私人基因型。它类似于k近邻方法,基于同一基因组块中遗传相关个体的单核苷酸多态性基因型来推断该基因组块中缺失的基因型。我们的结果表明,其准确性与最先进的明文解决方案相当。此外,p-Impute可扩展到实际应用,因为其内存和时间需求随样本数量的增加而线性增加。p-Impute可在此处免费下载:https://doi.org/10.5281/zenodo.5542001。