蛋白质的碎量子力学计算及其应用。
Fragment quantum mechanical calculation of proteins and its applications.
机构信息
State Key Laboratory of Precision Spectroscopy, Institute of Theoretical and Computational Science, East China Normal University , Shanghai 200062, China.
出版信息
Acc Chem Res. 2014 Sep 16;47(9):2748-57. doi: 10.1021/ar500077t. Epub 2014 May 22.
Conspectus The desire to study molecular systems that are much larger than what the current state-of-the-art ab initio or density functional theory methods could handle has naturally led to the development of novel approximate methods, including semiempirical approaches, reduced-scaling methods, and fragmentation methods. The major computational limitation of ab initio methods is the scaling problem, because the cost of ab initio calculation scales nth power or worse with system size. In the past decade, the fragmentation approach based on chemical locality has opened a new door for developing linear-scaling quantum mechanical (QM) methods for large systems and for applications to large molecular systems such as biomolecules. The fragmentation approach is highly attractive from a computational standpoint. First, the ab initio calculation of individual fragments can be conducted almost independently, which makes it suitable for massively parallel computations. Second, the electron properties, such as density and energy, are typically combined in a linear fashion to reproduce those for the entire molecular system, which makes the overall computation scale linearly with the size of the system. In this Account, two fragmentation methods and their applications to macromolecules are described. They are the electrostatically embedded generalized molecular fractionation with conjugate caps (EE-GMFCC) method and the automated fragmentation quantum mechanics/molecular mechanics (AF-QM/MM) approach. The EE-GMFCC method is developed from the MFCC approach, which was initially used to obtain accurate protein-ligand QM interaction energies. The main idea of the MFCC approach is that a pair of conjugate caps (concaps) is inserted at the location where the subsystem is divided by cutting the chemical bond. In addition, the pair of concaps is fused to form molecular species such that the overcounted effect from added concaps can be properly removed. By introducing the electrostatic embedding field in each fragment calculation and two-body interaction energy correction on top of the MFCC approach, the EE-GMFCC method is capable of accurately reproducing the QM molecular properties (such as the dipole moment, electron density, and electrostatic potential), the total energy, and the electrostatic solvation energy from full system calculations for proteins. On the other hand, the AF-QM/MM method was used for the efficient QM calculation of protein nuclear magnetic resonance (NMR) parameters, including the chemical shift, chemical shift anisotropy tensor, and spin-spin coupling constant. In the AF-QM/MM approach, each amino acid and all the residues in its vicinity are automatically assigned as the QM region through a distance cutoff for each residue-centric QM/MM calculation. Local chemical properties of the central residue can be obtained from individual QM/MM calculations. The AF-QM/MM approach precisely reproduces the NMR chemical shifts of proteins in the gas phase from full system QM calculations. Furthermore, via the incorporation of implicit and explicit solvent models, the protein NMR chemical shifts calculated by the AF-QM/MM method are in excellent agreement with experimental values. The applications of the AF-QM/MM method may also be extended to more general biological systems such as DNA/RNA and protein-ligand complexes.
概述 由于当前最先进的从头算或密度泛函理论方法无法处理更大的分子体系,人们自然希望研究这些体系,这导致了新的近似方法的发展,包括半经验方法、降尺度方法和碎片方法。从头算方法的主要计算限制是尺度问题,因为从头算计算的成本随系统大小呈 n 次幂或更差的比例增长。在过去的十年中,基于化学局部性的碎片方法为开发用于大系统的线性标度量子力学 (QM) 方法以及应用于生物分子等大的分子体系打开了一扇新的大门。从计算的角度来看,碎片方法非常有吸引力。首先,单个碎片的从头算计算几乎可以独立进行,这使其适合大规模并行计算。其次,电子性质(如密度和能量)通常以线性方式组合,以再现整个分子体系的性质,这使得整体计算与体系的大小呈线性关系。在本综述中,描述了两种碎片方法及其在大分子中的应用。它们是静电嵌入广义分子分馏与共轭帽(EE-GMFCC)方法和自动碎片量子力学/分子力学(AF-QM/MM)方法。EE-GMFCC 方法是从 MFCC 方法发展而来的,该方法最初用于获得准确的蛋白质-配体 QM 相互作用能。MFCC 方法的主要思想是在通过切断化学键分割子系统的位置插入一对共轭帽(concaps)。此外,通过融合这对 concaps 形成分子物种,可以正确去除添加的 concaps 引起的重复计算效应。通过在每个片段计算中引入静电嵌入场和在 MFCC 方法的基础上进行二体相互作用能校正,EE-GMFCC 方法能够准确地从全系统计算中重现蛋白质的 QM 分子性质(如偶极矩、电子密度和静电势)、总能量和静电溶剂化能。另一方面,AF-QM/MM 方法用于高效计算蛋白质核磁共振(NMR)参数,包括化学位移、化学位移各向异性张量和自旋-自旋耦合常数。在 AF-QM/MM 方法中,通过为每个残基中心的 QM/MM 计算设置距离截止值,自动将每个氨基酸及其附近的所有残基分配为 QM 区域。通过单独的 QM/MM 计算可以获得中心残基的局部化学性质。AF-QM/MM 方法可以从全系统 QM 计算中准确重现气相中蛋白质的 NMR 化学位移。此外,通过包含隐式和显式溶剂模型,AF-QM/MM 方法计算的蛋白质 NMR 化学位移与实验值非常吻合。AF-QM/MM 方法的应用也可以扩展到更一般的生物系统,如 DNA/RNA 和蛋白质-配体复合物。