Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom.
Acta Crystallogr D Struct Biol. 2024 Oct 1;80(Pt 10):744-764. doi: 10.1107/S2059798324009276.
A group of three deep-learning tools, referred to collectively as CHiMP (Crystal Hits in My Plate), were created for analysis of micrographs of protein crystallization experiments at the Diamond Light Source (DLS) synchrotron, UK. The first tool, a classification network, assigns images into categories relating to experimental outcomes. The other two tools are networks that perform both object detection and instance segmentation, resulting in masks of individual crystals in the first case and masks of crystallization droplets in addition to crystals in the second case, allowing the positions and sizes of these entities to be recorded. The creation of these tools used transfer learning, where weights from a pre-trained deep-learning network were used as a starting point and repurposed by further training on a relatively small set of data. Two of the tools are now integrated at the VMXi macromolecular crystallography beamline at DLS, where they have the potential to absolve the need for any user input, both for monitoring crystallization experiments and for triggering in situ data collections. The third is being integrated into the XChem fragment-based drug-discovery screening platform, also at DLS, to allow the automatic targeting of acoustic compound dispensing into crystallization droplets.
一组名为 CHiMP(Crystal Hits in My Plate)的三个深度学习工具被创建,用于分析英国钻石光源(DLS)同步加速器上的蛋白质结晶实验的显微照片。第一个工具是一个分类网络,将图像分配到与实验结果相关的类别中。另外两个工具是执行对象检测和实例分割的网络,在第一种情况下生成单个晶体的掩模,在第二种情况下生成除晶体外还有结晶液滴的掩模,从而可以记录这些实体的位置和大小。这些工具的创建使用了迁移学习,其中从预先训练的深度学习网络的权重被用作起点,并通过在相对较小的数据集上进一步训练来重新利用。其中两个工具现在已经集成到 DLS 的 VMXi 大分子晶体学光束线上,它们有可能免除任何用户输入的需要,无论是用于监测结晶实验还是触发原位数据收集。第三个工具正在集成到 DLS 的 XChem 基于片段的药物发现筛选平台中,以允许自动将声学化合物分配到结晶液滴中。