Department of Bioengineering, Rice University, Houston, Texas 77030, USA.
Systems, Synthetic, and Physical Biology, Rice University, Houston, Texas 77030, USA.
Nat Chem. 2018 Jan;10(1):91-98. doi: 10.1038/nchem.2877. Epub 2017 Nov 6.
Hybridization is a key molecular process in biology and biotechnology, but so far there is no predictive model for accurately determining hybridization rate constants based on sequence information. Here, we report a weighted neighbour voting (WNV) prediction algorithm, in which the hybridization rate constant of an unknown sequence is predicted based on similarity reactions with known rate constants. To construct this algorithm we first performed 210 fluorescence kinetics experiments to observe the hybridization kinetics of 100 different DNA target and probe pairs (36 nt sub-sequences of the CYCS and VEGF genes) at temperatures ranging from 28 to 55 °C. Automated feature selection and weighting optimization resulted in a final six-feature WNV model, which can predict hybridization rate constants of new sequences to within a factor of 3 with ∼91% accuracy, based on leave-one-out cross-validation. Accurate prediction of hybridization kinetics allows the design of efficient probe sequences for genomics research.
杂交是生物学和生物技术中的一个关键分子过程,但到目前为止,还没有基于序列信息准确预测杂交速率常数的预测模型。在这里,我们报告了一种加权邻位投票(WNV)预测算法,该算法基于与已知速率常数的相似反应来预测未知序列的杂交速率常数。为了构建该算法,我们首先进行了 210 次荧光动力学实验,以观察在 28 至 55°C 的温度范围内 100 对不同 DNA 靶标和探针(CYCS 和 VEGF 基因的 36 个核苷酸亚序列)的杂交动力学。自动特征选择和加权优化得到了最终的六个特征 WNV 模型,该模型可以在约 91%的准确率下,通过留一法交叉验证,将新序列的杂交速率常数预测到 3 倍以内。杂交动力学的准确预测可以为基因组学研究设计高效的探针序列。