Department of Visual and Data-Centric Computing, Zuse Institute Berlin, Takustraße 7, 14195, Berlin, Germany.
Department of CryoEM Technology, Max Planck Institute of Biochemistry, Am Klopferspitz 18, 82152, Martinsried, Germany.
BMC Bioinformatics. 2022 Aug 30;23(1):360. doi: 10.1186/s12859-022-04901-w.
Despite recent advances in cellular cryo-electron tomography (CET), developing automated tools for macromolecule identification in submolecular resolution remains challenging due to the lack of annotated data and high structural complexities. To date, the extent of the deep learning methods constructed for this problem is limited to conventional Convolutional Neural Networks (CNNs). Identifying macromolecules of different types and sizes is a tedious and time-consuming task. In this paper, we employ a capsule-based architecture to automate the task of macromolecule identification, that we refer to as 3D-UCaps. In particular, the architecture is composed of three components: feature extractor, capsule encoder, and CNN decoder. The feature extractor converts voxel intensities of input sub-tomograms to activities of local features. The encoder is a 3D Capsule Network (CapsNet) that takes local features to generate a low-dimensional representation of the input. Then, a 3D CNN decoder reconstructs the sub-tomograms from the given representation by upsampling.
We performed binary and multi-class localization and identification tasks on synthetic and experimental data. We observed that the 3D-UNet and the 3D-UCaps had an [Formula: see text]score mostly above 60% and 70%, respectively, on the test data. In both network architectures, we observed degradation of at least 40% in the [Formula: see text]-score when identifying very small particles (PDB entry 3GL1) compared to a large particle (PDB entry 4D8Q). In the multi-class identification task of experimental data, 3D-UCaps had an [Formula: see text]-score of 91% on the test data in contrast to 64% of the 3D-UNet. The better [Formula: see text]-score of 3D-UCaps compared to 3D-UNet is obtained by a higher precision score. We speculate this to be due to the capsule network employed in the encoder. To study the effect of the CapsNet-based encoder architecture further, we performed an ablation study and perceived that the [Formula: see text]-score is boosted as network depth is increased which is in contrast to the previously reported results for the 3D-UNet. To present a reproducible work, source code, trained models, data as well as visualization results are made publicly available.
Quantitative and qualitative results show that 3D-UCaps successfully perform various downstream tasks including identification and localization of macromolecules and can at least compete with CNN architectures for this task. Given that the capsule layers extract both the existence probability and the orientation of the molecules, this architecture has the potential to lead to representations of the data that are better interpretable than those of 3D-UNet.
尽管细胞冷冻电子断层成像术(CET)在最近取得了进展,但由于缺乏注释数据和高结构复杂性,开发用于亚分子分辨率下的大分子识别的自动化工具仍然具有挑战性。迄今为止,为解决此问题而构建的深度学习方法的程度仅限于传统的卷积神经网络(CNN)。识别不同类型和大小的大分子是一项繁琐且耗时的任务。在本文中,我们采用基于胶囊的架构来自动执行大分子识别任务,我们将其称为 3D-UCaps。具体来说,该架构由三个组件组成:特征提取器、胶囊编码器和 CNN 解码器。特征提取器将输入子断层图像的体素强度转换为局部特征的活动。编码器是一个 3D 胶囊网络(CapsNet),它采用局部特征来生成输入的低维表示。然后,通过上采样,3D CNN 解码器从给定表示中重建子断层图像。
我们在合成数据和实验数据上执行了二进制和多类定位和识别任务。我们观察到,在测试数据上,3D-UNet 和 3D-UCaps 的[公式:见文本]分数大多高于 60%和 70%,分别。在这两种网络架构中,当识别非常小的颗粒(PDB 条目 3GL1)时,与识别大颗粒(PDB 条目 4D8Q)相比,[公式:见文本]分数至少降低了 40%。在实验数据的多类识别任务中,3D-UCaps 在测试数据上的[公式:见文本]分数为 91%,而 3D-UNet 的分数为 64%。与 3D-UNet 相比,3D-UCaps 更高的[公式:见文本]分数是通过更高的精度分数获得的。我们推测这是由于编码器中使用的胶囊网络所致。为了进一步研究基于胶囊网络的编码器架构的效果,我们进行了一项消融研究,并发现随着网络深度的增加,[公式:见文本]分数得到了提高,这与之前报道的 3D-UNet 结果相反。为了呈现可重现的工作,我们公开了源代码、训练模型、数据以及可视化结果。
定量和定性结果表明,3D-UCaps 成功地执行了各种下游任务,包括大分子的识别和定位,并且至少可以与 CNN 架构竞争此任务。鉴于胶囊层既提取分子的存在概率又提取分子的方向,因此该架构有可能导致比 3D-UNet 更好解释的数据表示。