Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States.
J Am Chem Soc. 2020 Mar 4;142(9):4114-4120. doi: 10.1021/jacs.9b13786. Epub 2020 Feb 21.
This report describes the first application of the novel NMR-based machine learning tool "Small Molecule Accurate Recognition Technology" (SMART 2.0) for mixture analysis and subsequent accelerated discovery and characterization of new natural products. The concept was applied to the extract of a filamentous marine cyanobacterium known to be a prolific producer of cytotoxic natural products. This environmental extract was roughly fractionated, and then prioritized and guided by cancer cell cytotoxicity, NMR-based SMART 2.0, and MS-based molecular networking. This led to the isolation and rapid identification of a new chimeric swinholide-like macrolide, symplocolide A, as well as the annotation of swinholide A, samholides A-I, and several new derivatives. The planar structure of symplocolide A was confirmed to be a structural hybrid between swinholide A and luminaolide B by 1D/2D NMR and LC-MS analysis. A second example applies SMART 2.0 to the characterization of structurally novel cyclic peptides, and compares this approach to the recently appearing "atomic sort" method. This study exemplifies the revolutionary potential of combined traditional and deep learning-assisted analytical approaches to overcome longstanding challenges in natural products drug discovery.
本报告描述了新型基于 NMR 的机器学习工具“小分子精确识别技术”(SMART 2.0)在混合物分析中的首次应用,以及随后对新天然产物的加速发现和鉴定。该概念应用于一种已知是细胞毒性天然产物丰富生产者的丝状海洋蓝细菌提取物。对该环境提取物进行了大致的分级,然后根据癌细胞毒性、基于 NMR 的 SMART 2.0 和基于 MS 的分子网络对其进行优先排序和指导。这导致了一种新的嵌合 swinholide 样大环内酯 symplocolide A 的分离和快速鉴定,以及 swinholide A、samholides A-I 和几种新衍生物的注释。通过 1D/2D NMR 和 LC-MS 分析,确定 symplocolide A 的平面结构是 swinholide A 和 luminaolide B 的结构杂合体。第二个例子将 SMART 2.0 应用于结构新颖的环状肽的表征,并将这种方法与最近出现的“原子排序”方法进行比较。本研究例证了结合传统和深度学习辅助分析方法的革命性潜力,以克服天然产物药物发现中长期存在的挑战。