登革热二级结构预测中的MPGAfold

MPGAfold in dengue secondary structure prediction.

作者信息

Kasprzak Wojciech K, Shapiro Bruce A

机构信息

Basic Science Program, Leidos Biomedical Research, Inc., Frederick, MD, USA.

出版信息

Methods Mol Biol. 2014;1138:199-224. doi: 10.1007/978-1-4939-0348-1_13.

DOI:10.1007/978-1-4939-0348-1_13

PMID:24696339

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6354254/

Abstract

This chapter presents the computational prediction of the secondary structures within the 5' and 3' untranslated regions of the dengue virus serotype 2 (DENV2), with the focus on the conformational prediction of the two dumbbell-like structures, 5' DB and 3' DB, found in the core region of the 3' untranslated region of DENV2. For secondary structure prediction purposes we used a 719 nt-long subgenomic RNA construct from DENV2, which we refer to as the minigenome. The construct combines the 5'-most 226 nt from the 5' UTR and a fragment of the capsid coding region with the last 42 nt from the non-structural protein NS5 coding region and the 451 nt of the 3' UTR. This minigenome has been shown to contain the elements needed for translation, as well as negative strand RNA synthesis. We present the Massively Parallel Genetic Algorithm MPGAfold, a non-deterministic algorithm, that was used to predict the secondary structures of the DENV2 719 nt long minigenome construct, as well as our computational workbench called StructureLab that was used to interactively explore the solution spaces produced by MPGAfold. The MPGAfold algorithm is first introduced at the conceptual level. Then specific parameters guiding its performance are discussed and illustrated with a representative selection of the results from the study. Plots of the solution spaces generated by MPGAfold illustrate the algorithm, while selected secondary structures focus on variable formation of the dumbbell structures and other identified structural motifs. They also serve as illustrations of some of the capabilities of the StructureLab workbench. Results of the computational structure determination calculations are discussed and compared to the experimental data.

摘要

本章介绍了登革病毒2型（DENV2）5'和3'非翻译区二级结构的计算预测，重点是在DENV2 3'非翻译区核心区域发现的两个哑铃状结构，即5' DB和3' DB的构象预测。为了进行二级结构预测，我们使用了来自DENV2的一个719 nt长的亚基因组RNA构建体，我们将其称为微型基因组。该构建体将5' UTR最前面的226 nt与衣壳编码区的一个片段，以及非结构蛋白NS5编码区的最后42 nt和3' UTR的451 nt结合在一起。已证明该微型基因组包含翻译所需的元件以及负链RNA合成所需的元件。我们介绍了大规模并行遗传算法MPGAfold，一种非确定性算法，它用于预测DENV2 719 nt长微型基因组构建体的二级结构，以及我们称为StructureLab的计算工作台，它用于交互式探索MPGAfold产生的解空间。首先在概念层面介绍MPGAfold算法。然后讨论指导其性能的特定参数，并用该研究结果的代表性选择进行说明。MPGAfold生成的解空间图说明了该算法，而选定的二级结构则侧重于哑铃结构和其他已识别结构基序的可变形成。它们还展示了StructureLab工作台的一些功能。讨论了计算结构确定计算的结果，并与实验数据进行了比较。