Insitut für Nanostruktur und Festkörperphysik, Universität Hamburg, Hamburg, Germany.
Acta Crystallogr D Struct Biol. 2024 Oct 1;80(Pt 10):722-732. doi: 10.1107/S2059798324008519.
During the automatic processing of crystallographic diffraction experiments, beamstop shadows are often unaccounted for or only partially masked. As a result of this, outlier reflection intensities are integrated, which is a known issue. Traditional statistical diagnostics have only limited effectiveness in identifying these outliers, here termed Not-Excluded-unMasked-Outliers (NEMOs). The diagnostic tool AUSPEX allows visual inspection of NEMOs, where they form a typical pattern: clusters at the low-resolution end of the AUSPEX plots of intensities or amplitudes versus resolution. To automate NEMO detection, a new algorithm was developed by combining data statistics with a density-based clustering method. This approach demonstrates a promising performance in detecting NEMOs in merged data sets without disrupting existing data-reduction pipelines. Re-refinement results indicate that excluding the identified NEMOs can effectively enhance the quality of subsequent structure-determination steps. This method offers a prospective automated means to assess the efficacy of a beamstop mask, as well as highlighting the potential of modern pattern-recognition techniques for automating outlier exclusion during data processing, facilitating future adaptation to evolving experimental strategies.
在晶体学衍射实验的自动处理过程中,光束截止器的阴影通常未被考虑或仅部分屏蔽。因此,异常反射强度被积分,这是一个已知的问题。传统的统计诊断方法在识别这些异常值方面的效果有限,这里称之为未排除未屏蔽异常值(NEMO)。诊断工具 AUSPEX 允许对 NEMO 进行可视化检查,它们形成了一个典型的模式:在 AUSPEX 强度或振幅与分辨率的图谱的低分辨率端形成簇。为了实现 NEMO 的自动检测,我们结合数据统计和基于密度的聚类方法开发了一种新算法。该方法在不破坏现有数据还原管道的情况下,在合并数据集的 NEMO 检测方面表现出了有前景的性能。重新精修结果表明,排除已识别的 NEMO 可以有效提高后续结构确定步骤的质量。该方法为评估光束截止器的效果提供了一种有前途的自动化手段,并突出了现代模式识别技术在数据处理过程中自动排除异常值的潜力,为未来适应不断发展的实验策略提供了便利。