Zaslavskiy Mikhail, Bertonati Claudia, Duchateau Philippe, Duclert Aymeric, Silva George H
Research and Development department, Cellectis, 8 rue de la Croix Jarry, Paris 75013, France.
BMC Bioinformatics. 2014 Jun 17;15:191. doi: 10.1186/1471-2105-15-191.
Meganucleases are important tools for genome engineering, providing an efficient way to generate DNA double-strand breaks at specific loci of interest. Numerous experimental efforts, ranging from in vivo selection to in silico modeling, have been made to re-engineer meganucleases to target relevant DNA sequences.
Here we present a novel in silico method for designing custom meganucleases that is based on the use of a machine learning approach. We compared it with existing in silico physical models and high-throughput experimental screening. The machine learning model was used to successfully predict active meganucleases for 53 new DNA targets.
This new method shows competitive performance compared with state-of-the-art in silico physical models, with up to a fourfold increase in terms of the design success rate. Compared to experimental high-throughput screening methods, it reduces the number of screening experiments needed by a factor of more than 100 without affecting final performance.
巨型核酸酶是基因组工程的重要工具,为在特定感兴趣位点产生DNA双链断裂提供了一种有效方法。从体内筛选到计算机模拟建模,人们进行了大量实验工作,以重新设计巨型核酸酶来靶向相关DNA序列。
在此,我们提出一种基于机器学习方法设计定制巨型核酸酶的新型计算机方法。我们将其与现有的计算机物理模型和高通量实验筛选进行了比较。该机器学习模型成功预测了53个新DNA靶点的活性巨型核酸酶。
与最先进的计算机物理模型相比,这种新方法表现出了有竞争力的性能,设计成功率提高了四倍。与实验性高通量筛选方法相比,它将所需的筛选实验数量减少了100倍以上,同时不影响最终性能。