Wolf Benedict, Shehu Pegi, Brenker Luca, von Bachmann Anna-Lisa, Kroell Ann-Sophie, Southern Nicholas, Holderbach Stefan, Eigenmann Joshua, Aschenbrenner Sabine, Mathony Jan, Niopek Dominik
Institute of Pharmacy and Molecular Biotechnology, Faculty of Engineering Sciences, Heidelberg University, Heidelberg, Germany.
Department of Biology, Technical University of Darmstadt, Darmstadt, Germany.
Nat Methods. 2025 Aug;22(8):1698-1706. doi: 10.1038/s41592-025-02741-z. Epub 2025 Aug 4.
Domain insertion engineering is a powerful approach to juxtapose otherwise separate biological functions, resulting in proteins with new-to-nature activities. A prominent example are switchable protein variants, created by receptor domain insertion into effector proteins. Identifying suitable, allosteric sites for domain insertion, however, typically requires extensive screening and optimization. We present ProDomino, a machine learning pipeline to rationalize domain recombination, trained on a semisynthetic protein sequence dataset derived from naturally occurring intradomain insertion events. ProDomino robustly identifies domain insertion sites in proteins of biotechnological relevance, which we experimentally validated in Escherichia coli and human cells. Finally, we used light- and chemically regulated receptor domains as inserts and demonstrate the rapid, model-guided creation of potent, single-component opto- and chemogenetic protein switches. These include novel CRISPR-Cas9 and -Cas12a variants for inducible genome engineering in human cells. Our work enables one-shot domain insertion engineering and substantially accelerates the design of customized allosteric proteins.
结构域插入工程是一种强大的方法,可将原本分离的生物学功能并列在一起,从而产生具有新型天然活性的蛋白质。一个突出的例子是可切换蛋白变体,它是通过将受体结构域插入效应蛋白中而产生的。然而,识别适合结构域插入的变构位点通常需要广泛的筛选和优化。我们展示了ProDomino,这是一种用于合理化结构域重组的机器学习流程,它在源自天然结构域内插入事件的半合成蛋白质序列数据集上进行训练。ProDomino能够可靠地识别具有生物技术相关性的蛋白质中的结构域插入位点,我们在大肠杆菌和人类细胞中对其进行了实验验证。最后,我们使用光控和化学调控的受体结构域作为插入片段,并展示了在模型指导下快速创建有效的单组分光遗传学和化学遗传学蛋白开关。这些包括用于人类细胞中诱导性基因组工程的新型CRISPR-Cas9和-Cas12a变体。我们的工作实现了一次性结构域插入工程,并大大加速了定制变构蛋白的设计。