Pine Angela C, Brooke Greg N, Marco Antonio
School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, UK.
NAR Genom Bioinform. 2023 Jan 10;5(1):lqac098. doi: 10.1093/nargab/lqac098. eCollection 2023 Mar.
DNAzymes are short pieces of DNA with catalytic activity, capable of cleaving RNA. DNAzymes have multiple applications as biosensors and in therapeutics. The high specificity and low toxicity of these molecules make them particularly suitable as therapeutics, and clinical trials have shown that they are effective in patients. However, the development of DNAzymes has been limited due to the lack of specific tools to identify efficient molecules, and users often resort to time-consuming/costly large-scale screens. Here, we propose a computational methodology to identify 10-23 DNAzymes that can be used to triage thousands of potential molecules, specific to a target RNA, to identify those that are predicted to be efficient. The method is based on a logistic regression and can be trained to incorporate additional DNAzyme efficiency data, improving its performance with time. We first trained the method with published data, and then we validated, and further refined it, by testing additional newly synthesized DNAzymes in the laboratory. We found that although binding free energy between the DNAzyme and its RNA target is the primary determinant of efficiency, other factors such as internal structure of the DNAzyme also have an important effect. A program implementing the proposed method is publicly available.
脱氧核酶是具有催化活性的短链DNA,能够切割RNA。脱氧核酶在生物传感器和治疗领域有多种应用。这些分子的高特异性和低毒性使其特别适合用于治疗,临床试验表明它们对患者有效。然而,由于缺乏识别高效分子的特定工具,脱氧核酶的开发受到了限制,用户通常不得不采用耗时/成本高昂的大规模筛选方法。在此,我们提出一种计算方法来识别10-23型脱氧核酶,可用于筛选针对目标RNA的数千种潜在分子,以识别那些预计高效的分子。该方法基于逻辑回归,并且可以进行训练以纳入更多脱氧核酶效率数据,随着时间的推移提高其性能。我们首先用已发表的数据训练该方法,然后通过在实验室测试额外新合成的脱氧核酶进行验证并进一步完善。我们发现,虽然脱氧核酶与其RNA靶标的结合自由能是效率的主要决定因素,但其他因素如脱氧核酶的内部结构也有重要影响。实现该方法的程序已公开可用。