Han Lu, Shan Guangcun, Chu Bingfeng, Wang Hongyu, Wang Zhongjian, Gao Shengqiao, Zhou Wenxia
Beijing, 100850 China State Key Laboratory of Toxicology and Medical Countermeasures, Beijing Institute of Pharmacology and Toxicology.
Beijing, 100083 China School of Instrumentation Science and Opto-Electronics Engineering and Beijing Advanced Innovation Center for Big Data-based Precision Medicine, Beihang University.
Cogn Neurodyn. 2023 Jun;17(3):803-811. doi: 10.1007/s11571-021-09727-5. Epub 2021 Nov 5.
The novel coronavirus disease, COVID-19, has rapidly spread worldwide. Developing methods to identify the therapeutic activity of drugs based on phenotypic data can improve the efficiency of drug development. Here, a state-of-the-art machine-learning method was used to identify drug mechanism of actions (MoAs) based on the cell image features of 1105 drugs in the LINCS database. As the multi-dimensional features of cell images are affected by non-experimental factors, the characteristics of similar drugs vary considerably, and it is difficult to effectively identify the MoA of drugs as there is substantial noise. By applying the supervised information theoretic metric-learning (ITML) algorithm, a linear transformation made drugs with the same MoA aggregate. By clustering drugs to communities and performing enrichment analysis, we found that transferred image features were more conducive to the recognition of drug MoAs. Image features analysis showed that different features play important roles in identifying different drug functions. Drugs that significantly affect cell survival or proliferation, such as cyclin-dependent kinase inhibitors, were more likely to be enriched in communities, whereas other drugs might be decentralized. Chloroquine and clomiphene, which block the entry of virus, were clustered into the same community, indicating that similar MoA could be reflected by the cell image. Overall, the findings of the present study laid the foundation for the discovery of MoAs of new drugs, based on image data. In addition, it provided a new method of drug repurposing for COVID-19.
The online version contains supplementary material available at 10.1007/s11571-021-09727-5.
新型冠状病毒病(COVID-19)已在全球迅速传播。开发基于表型数据识别药物治疗活性的方法可以提高药物开发效率。在此,使用一种先进的机器学习方法,基于LINCS数据库中1105种药物的细胞图像特征来识别药物作用机制(MoA)。由于细胞图像的多维特征受非实验因素影响,相似药物的特征差异很大,且存在大量噪声,难以有效识别药物的作用机制。通过应用监督信息理论度量学习(ITML)算法,线性变换使具有相同作用机制的药物聚集在一起。通过将药物聚类到群落并进行富集分析,我们发现转移后的图像特征更有利于识别药物的作用机制。图像特征分析表明,不同特征在识别不同药物功能中发挥重要作用。显著影响细胞存活或增殖的药物,如细胞周期蛋白依赖性激酶抑制剂,更有可能在群落中富集,而其他药物可能分散分布。阻断病毒进入的氯喹和克罗米芬被聚类到同一个群落,表明细胞图像可以反映相似的作用机制。总体而言,本研究结果为基于图像数据发现新药的作用机制奠定了基础。此外,它为COVID-19提供了一种新的药物重新利用方法。
在线版本包含可在10.1007/s11571-021-09727-5获取的补充材料。