Suppr超能文献

稳健且自动的光束截止器阴影异常值剔除:在半监督学习策略下结合晶体学统计和现代聚类。

Robust and automatic beamstop shadow outlier rejection: combining crystallographic statistics with modern clustering under a semi-supervised learning strategy.

机构信息

Insitut für Nanostruktur und Festkörperphysik, Universität Hamburg, Hamburg, Germany.

出版信息

Acta Crystallogr D Struct Biol. 2024 Oct 1;80(Pt 10):722-732. doi: 10.1107/S2059798324008519.

Abstract

During the automatic processing of crystallographic diffraction experiments, beamstop shadows are often unaccounted for or only partially masked. As a result of this, outlier reflection intensities are integrated, which is a known issue. Traditional statistical diagnostics have only limited effectiveness in identifying these outliers, here termed Not-Excluded-unMasked-Outliers (NEMOs). The diagnostic tool AUSPEX allows visual inspection of NEMOs, where they form a typical pattern: clusters at the low-resolution end of the AUSPEX plots of intensities or amplitudes versus resolution. To automate NEMO detection, a new algorithm was developed by combining data statistics with a density-based clustering method. This approach demonstrates a promising performance in detecting NEMOs in merged data sets without disrupting existing data-reduction pipelines. Re-refinement results indicate that excluding the identified NEMOs can effectively enhance the quality of subsequent structure-determination steps. This method offers a prospective automated means to assess the efficacy of a beamstop mask, as well as highlighting the potential of modern pattern-recognition techniques for automating outlier exclusion during data processing, facilitating future adaptation to evolving experimental strategies.

摘要

在晶体学衍射实验的自动处理过程中,光束截止器的阴影通常未被考虑或仅部分屏蔽。因此,异常反射强度被积分,这是一个已知的问题。传统的统计诊断方法在识别这些异常值方面的效果有限,这里称之为未排除未屏蔽异常值(NEMO)。诊断工具 AUSPEX 允许对 NEMO 进行可视化检查,它们形成了一个典型的模式:在 AUSPEX 强度或振幅与分辨率的图谱的低分辨率端形成簇。为了实现 NEMO 的自动检测,我们结合数据统计和基于密度的聚类方法开发了一种新算法。该方法在不破坏现有数据还原管道的情况下,在合并数据集的 NEMO 检测方面表现出了有前景的性能。重新精修结果表明,排除已识别的 NEMO 可以有效提高后续结构确定步骤的质量。该方法为评估光束截止器的效果提供了一种有前途的自动化手段,并突出了现代模式识别技术在数据处理过程中自动排除异常值的潜力,为未来适应不断发展的实验策略提供了便利。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca07/11448920/d76861e013c0/d-80-00722-fig1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验