Department of Engineering Science and Mechanics, Penn State University, State College, Pennsylvania; Department of Pharmacology, Penn State College of Medicine, Hershey, Pennsylvania.
Department of Pharmacology, Penn State College of Medicine, Hershey, Pennsylvania.
Biophys J. 2024 Sep 3;123(17):2671-2681. doi: 10.1016/j.bpj.2023.10.011. Epub 2023 Oct 14.
Fast and accurate 3D RNA structure prediction remains a major challenge in structural biology, mostly due to the size and flexibility of RNA molecules, as well as the lack of diverse experimentally determined structures of RNA molecules. Unlike DNA structure, RNA structure is far less constrained by basepair hydrogen bonding, resulting in an explosion of potential stable states. Here, we propose a convolutional neural network that predicts all pairwise distances between residues in an RNA, using a recently described smooth parametrization of Euclidean distance matrices. We achieve high-accuracy predictions on RNAs up to 100 nt in length in fractions of a second, a factor of 10 faster than existing molecular dynamics-based methods. We also convert our coarse-grained machine learning output into an all-atom model using discrete molecular dynamics with constraints. Our proposed computational pipeline predicts all-atom RNA models solely from the nucleotide sequence. However, this method suffers from the same limitation as nucleic acid molecular dynamics: the scarcity of available RNA crystal structures for training.
快速准确的三维 RNA 结构预测仍然是结构生物学的主要挑战,主要原因是 RNA 分子的大小和灵活性,以及缺乏多样化的实验确定的 RNA 分子结构。与 DNA 结构不同,RNA 结构受碱基对氢键的限制要小得多,从而导致潜在稳定状态的爆炸式增长。在这里,我们提出了一种卷积神经网络,它使用最近描述的欧几里得距离矩阵的平滑参数化来预测 RNA 中所有残基之间的成对距离。我们在几分之一秒内就能实现高达 100 个核苷酸长度的 RNA 的高精度预测,比现有的基于分子动力学的方法快 10 倍。我们还使用带约束的离散分子动力学将我们的粗粒度机器学习输出转换为全原子模型。我们提出的计算流水线仅从核苷酸序列预测全原子 RNA 模型。然而,这种方法与核酸分子动力学有相同的局限性:可用于训练的 RNA 晶体结构稀缺。