Jia Chen, Li Xiaofang, Hu Song, Liu Guohong, Fang Jiansong, Zhou Xiaoxia, Yan Xiliang, Yan Bing
Institute of Environmental Research at Greater Bay Area, Key Laboratory for Water Quality and Conservation of the Pearl River Delta, Ministry of Education, Guangzhou University, Guangzhou 510006, China.
School of Health, Guangzhou Vocational and Technical University of Science and Technology, Guangzhou 510555, China.
Anal Chem. 2025 Jan 14;97(1):783-792. doi: 10.1021/acs.analchem.4c05311. Epub 2024 Dec 20.
Traditional Chinese medicine (TCM) has been a cornerstone of health care for centuries, valued for its preventive and therapeutic properties. However, recent decades have revealed significant toxicological concerns associated with TCMs due to their complex chemical compositions. Traditional QSAR (quantitative structure-activity relationships) models, which predict toxicity based on chemical structures, face challenges with the intricate nature of TCM compounds. In this study, we effectively resolved this issue by correlating the toxicity of TCMs with advanced analytical descriptors from electron ionization mass spectra (EI-MS) data. The optimal classification model achieved a balanced accuracy of over 0.74. Through interpretable machine learning models, we identified specific toxic components, such as 13-hexyloxacyclotridec-10-en-2-one and loliolide. We applied molecular dynamics (MD) simulations to explore the interactions of identified toxic components with crucial protein targets, using hepatic cytochrome P450 3A4 as an example. This novel approach not only enhances our understanding of the toxicological profiles of TCMs but also maximizes their therapeutic benefits while minimizing adverse effects. More importantly, our findings support the application of analytical descriptor-based machine learning in predicting the toxicity of unknown mixtures in the real environment.
几个世纪以来,传统中医一直是医疗保健的基石,因其预防和治疗功效而备受重视。然而,近几十年来,由于中药化学成分复杂,人们发现了与中药相关的重大毒理学问题。传统的定量构效关系(QSAR)模型基于化学结构预测毒性,但面对中药化合物的复杂性质时面临挑战。在本研究中,我们通过将中药的毒性与电子电离质谱(EI-MS)数据中的先进分析描述符相关联,有效解决了这个问题。最优分类模型的平衡准确率超过0.74。通过可解释的机器学习模型,我们确定了特定的有毒成分,如13-己基氧杂环十三碳-10-烯-2-酮和黑麦草内酯。我们以肝细胞色素P450 3A4为例,应用分子动力学(MD)模拟来探索已确定的有毒成分与关键蛋白质靶点之间的相互作用。这种新方法不仅增强了我们对中药毒理学特征的理解,还在将不良反应降至最低的同时,最大限度地发挥了它们的治疗益处。更重要的是,我们的研究结果支持基于分析描述符的机器学习在预测实际环境中未知混合物毒性方面的应用。