Quantitative and Computational Biology Group, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany.
School of Biological Sciences, Seoul National University, Seoul, South Korea.
Nat Biotechnol. 2024 Feb;42(2):243-246. doi: 10.1038/s41587-023-01773-0. Epub 2023 May 8.
As structure prediction methods are generating millions of publicly available protein structures, searching these databases is becoming a bottleneck. Foldseek aligns the structure of a query protein against a database by describing tertiary amino acid interactions within proteins as sequences over a structural alphabet. Foldseek decreases computation times by four to five orders of magnitude with 86%, 88% and 133% of the sensitivities of Dali, TM-align and CE, respectively.
随着结构预测方法生成数以百万计的公开可用蛋白质结构,对这些数据库的搜索成为了一个瓶颈。Foldseek 通过将蛋白质中的三级氨基酸相互作用描述为结构字母表上的序列,将查询蛋白质的结构与数据库进行对齐。与 Dali、TM-align 和 CE 相比,Foldseek 的计算时间分别减少了四到五个数量级,灵敏度分别为 86%、88%和 133%。