Research and Development, PIRCHE AG, Berlin, Germany.
Center for Translational Immunology, University Medical Center, Utrecht, Netherlands.
HLA. 2024 Jan;103(1):e15260. doi: 10.1111/tan.15260. Epub 2023 Oct 18.
Allorecognition of donor HLA is a major risk factor for long-term kidney graft survival. Although several molecular matching algorithms have been proposed that compare physiochemical and structural features of the donors' and recipients' HLA proteins in order to predict their compatibility, the exact underlying mechanisms are still not fully understood. We hypothesized that the ElliPro approach of single ellipsoid fitting and protrusion ranking lacks sensitivity for the characteristic shape of HLA molecules and developed a prediction pipeline named Snowball that is fitting smaller ellipsoids iteratively to substructures. Aggregated protrusion ranks of locally fitted ellipsoids were calculated for 712 publicly available HLA structures and 78 predicted structures using AlphaFold 2. Amino-acid sequence and protrusion ranks were used to train deep neural network predictors to infer protrusion ranks for all known HLA sequences. Snowball protrusion ranks appear to be more sensitive than ElliPro scores in fine parts of the HLA such as the helix structures forming the HLA binding groove in particular when the ellipsoids are fitted to substructures considering atoms within a 15 Å radius. A cloud-based web service was implemented based on amino-acid matching considering both protein- and position-specific surface area and protrusion ranks extending the previously presented Snowflake prediction pipeline.
同种异体识别供体 HLA 是长期肾脏移植物存活的主要风险因素。尽管已经提出了几种分子匹配算法,这些算法比较供体和受者 HLA 蛋白的物理化学和结构特征,以预测它们的相容性,但确切的潜在机制仍不完全清楚。我们假设 ElliPro 的单椭球拟合和突出物排序方法缺乏对 HLA 分子特征形状的敏感性,并开发了一种名为 Snowball 的预测管道,该管道通过迭代拟合较小的椭球体来拟合亚结构。使用 AlphaFold 2 计算了 712 个公开的 HLA 结构和 78 个预测结构的局部拟合椭球体的聚合突出物等级。使用氨基酸序列和突出物等级来训练深度神经网络预测器,以推断所有已知 HLA 序列的突出物等级。Snowball 的突出物等级在 HLA 的精细部分(如形成 HLA 结合槽的螺旋结构)似乎比 ElliPro 分数更敏感,特别是当考虑到 15Å 半径内的原子时,将椭球体拟合到亚结构时。基于氨基酸匹配的基于云的网络服务实现了,同时考虑了蛋白质和位置特定的表面积以及突出物等级,扩展了之前提出的 Snowflake 预测管道。