Lan Ti-Yen, Wierman Jennifer L, Tate Mark W, Philipp Hugh T, Martin-Garcia Jose M, Zhu Lan, Kissick David, Fromme Petra, Fischetti Robert F, Liu Wei, Elser Veit, Gruner Sol M
Laboratory of Atomic and Solid State Physics, Cornell University, Ithaca, NY 14853, USA.
Cornell High Energy Synchrotron Source (CHESS), Cornell University, Ithaca, NY 14853, USA.
IUCrJ. 2018 Jul 20;5(Pt 5):548-558. doi: 10.1107/S205225251800903X. eCollection 2018 Sep 1.
In recent years, the success of serial femtosecond crystallography and the paucity of beamtime at X-ray free-electron lasers have motivated the development of serial microcrystallography experiments at storage-ring synchrotron sources. However, especially at storage-ring sources, if a crystal is too small it will have suffered significant radiation damage before diffracting a sufficient number of X-rays into Bragg peaks for peak-indexing software to determine the crystal orientation. As a consequence, the data frames of small crystals often cannot be indexed and are discarded. Introduced here is a method based on the expand-maximize-compress (EMC) algorithm to solve protein structures, specifically from data frames for which indexing methods fail because too few X-rays are diffracted into Bragg peaks. The method is demonstrated on a real serial microcrystallography data set whose signals are too weak to be indexed by conventional methods. In spite of the daunting background scatter from the sample-delivery medium, it was still possible to solve the protein structure at 2.1 Å resolution. The ability of the EMC algorithm to analyze weak data frames will help to reduce sample consumption. It will also allow serial microcrystallography to be performed with crystals that are otherwise too small to be feasibly analyzed at storage-ring sources.
近年来,串行飞秒晶体学的成功以及X射线自由电子激光器束流时间的匮乏,促使人们在储存环同步辐射源上开展串行微晶学实验。然而,特别是在储存环光源下,如果晶体太小,在将足够数量的X射线衍射到布拉格峰以让峰索引软件确定晶体取向之前,它就会遭受严重的辐射损伤。因此,小晶体的数据帧常常无法被索引而被丢弃。本文介绍了一种基于扩展 - 最大化 - 压缩(EMC)算法来解析蛋白质结构的方法,具体针对那些因衍射到布拉格峰的X射线太少而索引方法失效的数据帧。该方法在一个真实的串行微晶学数据集上得到了验证,该数据集的信号太弱,无法用传统方法进行索引。尽管来自样品输送介质的背景散射令人望而生畏,但仍有可能以2.1 Å的分辨率解析蛋白质结构。EMC算法分析弱数据帧的能力将有助于减少样品消耗。它还将使串行微晶学能够使用那些在储存环光源下太小而无法进行可行分析的晶体来进行。