Das Dibyajyoti, Chakrabarty Broto, Srinivasan Rajgopal, Roy Arijit
TCS Research (Life Sciences Division), Tata Consultancy Services Ltd, Hyderabad 500081, India.
J Chem Inf Model. 2023 Apr 10;63(7):1882-1893. doi: 10.1021/acs.jcim.2c01301. Epub 2023 Mar 27.
Drug-induced gene expression profiling provides a lot of useful information covering various aspects of drug discovery and development. Most importantly, this knowledge can be used to discover drugs' mechanisms of action. Recently, deep learning-based drug design methods are in the spotlight due to their ability to explore huge chemical space and design property-optimized target-specific drug molecules. Recent advances in accessibility of open-source drug-induced transcriptomic data along with the ability of deep learning algorithms to understand hidden patterns have opened opportunities for designing drug molecules based on desired gene expression signatures. In this study, we propose a deep learning model, Gex2SGen (Gene Expression 2 SMILES Generation), to generate novel drug-like molecules based on desired gene expression profiles. The model accepts desired gene expression profiles in a cell-specific manner as input and designs drug-like molecules which can elicit the required transcriptomic profile. The model was first tested against individual gene-knocked-out transcriptomic profiles, where the newly designed molecules showed high similarity with known inhibitors of the knocked-out target genes. The model was next applied on a triple negative breast cancer signature profile, where it could generate novel molecules, highly similar to known anti-breast cancer drugs. Overall, this work provides a generalized method, where the method first learned the molecular signature of a given cell due to a specific condition, and designs new small molecules with drug-like properties.
药物诱导的基因表达谱分析提供了许多有用信息,涵盖药物发现和开发的各个方面。最重要的是,这些知识可用于发现药物的作用机制。最近,基于深度学习的药物设计方法备受关注,因为它们有能力探索巨大的化学空间并设计出性质优化的靶向特定药物分子。开源药物诱导转录组数据可及性的最新进展,以及深度学习算法理解隐藏模式的能力,为基于所需基因表达特征设计药物分子带来了机遇。在本研究中,我们提出了一种深度学习模型Gex2SGen(基因表达2 SMILES生成),以基于所需基因表达谱生成新型类药物分子。该模型以细胞特异性方式接受所需基因表达谱作为输入,并设计出能够引发所需转录组谱的类药物分子。该模型首先针对单个基因敲除转录组谱进行测试,新设计的分子与敲除靶基因的已知抑制剂显示出高度相似性。接下来,该模型应用于三阴性乳腺癌特征谱,能够生成与已知抗乳腺癌药物高度相似的新型分子。总体而言,这项工作提供了一种通用方法,该方法首先了解特定条件下给定细胞的分子特征,然后设计具有类药物性质的新小分子。