Biswas Abhishek, Ranjan Desh, Zubair Mohammad, Zeil Stephanie, Nasr Kamal Al, He Jing
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):578-586. doi: 10.1109/TCBB.2016.2543721. Epub 2016 Mar 17.
A key idea in de novo modeling of a medium-resolution density image obtained from cryo-electron microscopy is to compute the optimal mapping between the secondary structure traces observed in the density image and those predicted on the protein sequence. When secondary structures are not determined precisely, either from the image or from the amino acid sequence of the protein, the computational problem becomes more complex. We present an efficient method that addresses the secondary structure placement problem in presence of multiple secondary structure predictions and computes the optimal mapping. We tested the method using 12 simulated images from α-proteins and two Cryo-EM images of α-β proteins. We observed that the rank of the true topologies is consistently improved by using multiple secondary structure predictions instead of a single prediction. The results show that the algorithm is robust and works well even when errors/misses in the predicted secondary structures are present in the image or the sequence. The results also show that the algorithm is efficient and is able to handle proteins with as many as 33 helices.
从冷冻电子显微镜获得的中等分辨率密度图像的从头建模中的一个关键思想是计算在密度图像中观察到的二级结构迹线与在蛋白质序列上预测的二级结构迹线之间的最佳映射。当二级结构不能从图像或蛋白质的氨基酸序列中精确确定时,计算问题就会变得更加复杂。我们提出了一种有效的方法,该方法解决了存在多个二级结构预测时的二级结构放置问题,并计算出最佳映射。我们使用来自α-蛋白的12个模拟图像和α-β蛋白的两个冷冻电镜图像对该方法进行了测试。我们观察到,通过使用多个二级结构预测而不是单个预测,真实拓扑结构的排名持续得到改善。结果表明,该算法具有鲁棒性,即使图像或序列中存在预测二级结构中的错误/遗漏,该算法也能很好地工作。结果还表明,该算法效率高,能够处理多达33个螺旋的蛋白质。