Department of Computer and Information Science, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Bioengineering, Department of Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA 19104, USA.
Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Bioengineering, Department of Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA 19104, USA.
Cell Host Microbe. 2023 Aug 9;31(8):1260-1274.e6. doi: 10.1016/j.chom.2023.07.001. Epub 2023 Jul 28.
Molecular de-extinction could offer avenues for drug discovery by reintroducing bioactive molecules that are no longer encoded by extant organisms. To prospect for antimicrobial peptides encrypted within extinct and extant human proteins, we introduce the panCleave random forest model for proteome-wide cleavage site prediction. Our model outperformed multiple protease-specific cleavage site classifiers for three modern human caspases, despite its pan-protease design. Antimicrobial activity was observed in vitro for modern and archaic protein fragments identified with panCleave. Lead peptides showed resistance to proteolysis and exhibited variable membrane permeabilization. Additionally, representative modern and archaic protein fragments showed anti-infective efficacy against A. baumannii in both a skin abscess infection model and a preclinical murine thigh infection model. These results suggest that machine-learning-based encrypted peptide prospection can identify stable, nontoxic peptide antibiotics. Moreover, we establish molecular de-extinction through paleoproteome mining as a framework for antibacterial drug discovery.
分子灭绝后复活技术可以通过重新引入不再由现存生物编码的生物活性分子,为药物发现提供途径。为了寻找已灭绝和现存人类蛋白质中编码的抗菌肽,我们引入了 panCleave 随机森林模型,用于预测蛋白质组范围的切割位点。尽管我们的模型是一种泛蛋白酶设计,但它在三种现代人类半胱天冬酶的蛋白酶特异性切割位点分类器中表现出色。panCleave 鉴定的现代和古代蛋白质片段在体外表现出抗菌活性。先导肽显示出对蛋白水解的抗性,并表现出可变的膜通透性。此外,代表性的现代和古代蛋白质片段在皮肤脓肿感染模型和临床前鼠大腿感染模型中均显示出对鲍曼不动杆菌的抗感染功效。这些结果表明,基于机器学习的加密肽探测可以识别稳定、无毒的肽类抗生素。此外,我们通过古蛋白组挖掘建立了分子灭绝后复活技术,作为抗菌药物发现的框架。