Department of Physical Chemistry, Kharkiv National University, Ukraine.
Department of Mathematics, Aston University, Birmingham, UK.
Faraday Discuss. 2022 Nov 8;240(0):152-167. doi: 10.1039/d2fd00053a.
It is very difficult to reconstruct computationally a large biomolecular complex in its biological entirety from experimental data. The resulting atomistic model should not contain gaps structurally and it should yield stable dynamics. We, for the first time, reconstruct from the published incomplete cryo-EM density a complete MS2 virus at atomistic resolution, that is, the capsid with the genome, and validate the result by all-atom molecular dynamics with explicit water. The available experimental data includes a high resolution protein capsid and an inhomogeneously resolved genome map. For the genomic RNA, apart from 16 hairpins with atomistic resolution, the strands near the capsid's inner surface were resolved up to the nucleic backbone level, and the innermost density was completely unresolved. As a result, only 242 nucleotides (out of 3569) were positioned, while only a fragmented backbone was outlined for the rest of the genome, making a detailed model reconstruction necessary. For model reconstruction, in addition to the available atomistic structure information, we extensively used the predicted secondary structure of the genome (base pairing). The technique was based on semi-automatic building of relatively large strands of RNA with subsequent manual positioning over the traced backbone. The entire virus structure (capsid + genome) was validated by a molecular dynamics run in physiological solution with ions at standard conditions confirming the stability of the model.
从实验数据中通过计算重建一个大型生物分子复合物的生物整体是非常困难的。所得到的原子模型在结构上不应有空白,并且应该产生稳定的动力学。我们首次从已发表的不完全冷冻电镜密度中,以原子分辨率重建了完整的 MS2 病毒,即带有基因组的衣壳,并通过带有显式水的全原子分子动力学验证了结果。可用的实验数据包括高分辨率的蛋白质衣壳和不均匀分辨率的基因组图谱。对于基因组 RNA,除了具有原子分辨率的 16 个发夹外,靠近衣壳内表面的链被解析到核酸主链水平,而最里面的密度完全无法解析。结果,只有 242 个核苷酸(3569 个中的 242 个)被定位,而其余基因组的骨架仅被勾勒出片段,因此需要进行详细的模型重建。对于模型重建,除了可用的原子结构信息外,我们还广泛使用了基因组的预测二级结构(碱基配对)。该技术基于相对较大的 RNA 链的半自动构建,随后手动定位在追踪的骨架上。整个病毒结构(衣壳+基因组)通过在生理溶液中进行分子动力学模拟得到验证,标准条件下的离子证实了模型的稳定性。