Efficient design of meganucleases using a machine learning approach.

Author information

Zaslavskiy Mikhail, Bertonati Claudia, Duchateau Philippe, Duclert Aymeric, Silva George H

Affiliation

Research and Development department, Cellectis, 8 rue de la Croix Jarry, Paris 75013, France.

Publication information

BMC Bioinformatics. 2014 Jun 17;15:191. doi: 10.1186/1471-2105-15-191.

DOI:10.1186/1471-2105-15-191

PMID:24934562

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4065607/

Abstract

BACKGROUND

Meganucleases are important tools for genome engineering, providing an efficient way to generate DNA double-strand breaks at specific loci of interest. Numerous experimental efforts, ranging from in vivo selection to in silico modeling, have been made to re-engineer meganucleases to target relevant DNA sequences.

RESULTS

Here we present a novel in silico method for designing custom meganucleases that is based on the use of a machine learning approach. We compared it with existing in silico physical models and high-throughput experimental screening. The machine learning model was used to successfully predict active meganucleases for 53 new DNA targets.

CONCLUSIONS

This new method shows competitive performance compared with state-of-the-art in silico physical models, with up to a fourfold increase in terms of the design success rate. Compared to experimental high-throughput screening methods, it reduces the number of screening experiments needed by a factor of more than 100 without affecting final performance.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a6f/4065607/61d0f50fa477/1471-2105-15-191-1.jpg
