Matveeva Olga V, Nazipova Nafisa N, Ogurtsov Aleksey Y, Shabalina Svetlana A
Department of Human Genetics, University of Utah Salt Lake City, UT, USA.
Front Genet. 2012 Aug 29;3:163. doi: 10.3389/fgene.2012.00163. eCollection 2012.
Small hairpin RNAs (shRNAs) became an important research tool in cell biology. Reliable design of these molecules is essential for the needs of large functional genomics projects. To optimize the design of efficient shRNAs, we performed comparative, thermodynamic, and correlation analyses of ~18,000 miR30-based shRNAs with known functional efficiencies, derived from the Sensor Assay project (Fellmann et al., 2011). We identified features of the shRNA guide strand that significantly correlate with the silencing efficiency and performed multiple regression analysis, using 4/5 of the data for training purposes and 1/5 for cross validation. A model that included the position-dependent nucleotide preferences was predictive in the cross-validation data subset (R = 0.39). However, a model, which in addition to the nucleotide preferences included thermodynamic shRNA features such as a thermodynamic duplex stability and position-dependent thermodynamic profile (dinucleotide free energy) was performing better (R = 0.53). Software "miR_Scan" was developed based upon the optimized models. Calculated mRNA target secondary structure stability showed correlation with shRNA silencing efficiency but failed to improve the model. Correlation analysis demonstrates that our algorithm for identification of efficient miR30-based shRNA molecules performs better than approaches that were developed for design of chemically synthesized siRNAs (R(max) = 0.36).
小发夹RNA(shRNA)成为细胞生物学中的一种重要研究工具。这些分子的可靠设计对于大型功能基因组学项目的需求至关重要。为了优化高效shRNA的设计,我们对约18,000个基于miR30的、具有已知功能效率的shRNA进行了比较、热力学和相关性分析,这些shRNA来自传感器检测项目(费尔曼等人,2011年)。我们确定了与沉默效率显著相关的shRNA引导链特征,并进行了多元回归分析,使用4/5的数据用于训练,1/5的数据用于交叉验证。一个包含位置依赖性核苷酸偏好的模型在交叉验证数据子集中具有预测性(R = 0.39)。然而,一个除了核苷酸偏好还包括热力学shRNA特征(如热力学双链稳定性和位置依赖性热力学概况(二核苷酸自由能))的模型表现更好(R = 0.53)。基于优化模型开发了软件“miR_Scan”。计算得出的mRNA靶标二级结构稳定性与shRNA沉默效率相关,但未能改进模型。相关性分析表明,我们用于鉴定基于miR30的高效shRNA分子的算法比为化学合成siRNA设计开发的方法表现更好(R(max) = 0.36)。