Humphreys Ian R, Zhang Jing, Baek Minkyung, Wang Yaxi, Krishnakumar Aditya, Pei Jimin, Anishchenko Ivan, Tower Catherine A, Jackson Blake A, Warrier Thulasi, Hung Deborah T, Peterson S Brook, Mougous Joseph D, Cong Qian, Baker David
Department of Biochemistry, University of Washington, Seattle, WA, USA.
Institute for Protein Design, University of Washington, Seattle, WA, USA.
Nat Microbiol. 2024 Oct;9(10):2642-2652. doi: 10.1038/s41564-024-01791-x. Epub 2024 Sep 18.
Identification of bacterial protein-protein interactions and predicting the structures of these complexes could aid in the understanding of pathogenicity mechanisms and developing treatments for infectious diseases. Here we developed RoseTTAFold2-Lite, a rapid deep learning model that leverages residue-residue coevolution and protein structure prediction to systematically identify and structurally characterize protein-protein interactions at the proteome-wide scale. Using this pipeline, we searched through 78 million pairs of proteins across 19 human bacterial pathogens and identified 1,923 confidently predicted complexes involving essential genes and 256 involving virulence factors. Many of these complexes were not previously known; we experimentally tested 12 such predictions, and half of them were validated. The predicted interactions span core metabolic and virulence pathways ranging from post-transcriptional modification to acid neutralization to outer-membrane machinery and should contribute to our understanding of the biology of these important pathogens and the design of drugs to combat them.
鉴定细菌蛋白质-蛋白质相互作用并预测这些复合物的结构,有助于理解致病机制并开发传染病治疗方法。在此,我们开发了RoseTTAFold2-Lite,这是一种快速深度学习模型,它利用残基-残基共进化和蛋白质结构预测,在全蛋白质组范围内系统地鉴定蛋白质-蛋白质相互作用并对其进行结构表征。使用该流程,我们在19种人类细菌病原体的7800万对蛋白质中进行搜索,确定了1923个涉及必需基因的可靠预测复合物和256个涉及毒力因子的复合物。其中许多复合物以前并不为人所知;我们对12个这样的预测进行了实验测试,其中一半得到了验证。预测的相互作用涵盖了从转录后修饰到酸中和再到外膜机制的核心代谢和毒力途径,应该有助于我们理解这些重要病原体的生物学特性以及设计对抗它们的药物。