Hattori Kazunari, Wakabayashi Hiroaki, Tamaki Kenta
Medicinal Chemistry Technologies and Research Informatics, Pfizer Global Research and Development, Nagoya Laboratories, Pfizer Inc., 5-2 Taketoyo, Aichi 470-2393, Japan.
J Chem Inf Model. 2008 Jan;48(1):135-42. doi: 10.1021/ci7002686. Epub 2008 Jan 5.
In drug discovery programs, predicting key example compounds in competitors' patent applications is important work for scientists working in the same or in related research areas. In general, medicinal chemists are responsible for this work, and they attempt to guess the identity of key compounds based on information provided in patent applications, such as biological data, scale of reaction, and/or optimization of the salt form for a particular compound. However, this is sometimes made difficult by the lack of such information. This paper describes a method for predicting key compounds in competitors' patent applications by using only structural information of example compounds. Based on the assumption that medicinal chemists usually carry out extensive structure--activity relationship (SAR) studies around key compounds, the method identifies compounds located at the centers of densely populated regions in the patent examples' chemical space, as represented by Extended Connectivity Fingerprints (ECFPs). For the validation of the method, a total of 30 patents containing structures of launched drugs were selected to test whether or not the method is able to predict key compounds (the launched drugs). In 17 out of the 30 patents (57%), the method was able to successfully predict the key compounds. The result indicates that our method could provide an alternative approach to predicting key compounds in cases where the conventional medicinal chemist's approach does not work well. This method could also be used as a complement to the traditional medicinal chemist's approach.
在药物研发项目中,预测竞争对手专利申请中的关键示例化合物,对于从事相同或相关研究领域的科学家而言是一项重要工作。一般来说,药物化学家负责此项工作,他们会根据专利申请中提供的信息,如生物数据、反应规模和/或特定化合物盐形式的优化等,尝试猜测关键化合物的身份。然而,有时由于缺乏此类信息,这项工作会变得困难。本文描述了一种仅使用示例化合物的结构信息来预测竞争对手专利申请中关键化合物的方法。基于药物化学家通常会围绕关键化合物开展广泛的构效关系(SAR)研究这一假设,该方法将位于专利示例化学空间中人口密集区域中心的化合物识别为关键化合物,化学空间由扩展连接指纹(ECFP)表示。为验证该方法,总共选择了30项包含已上市药物结构的专利,以测试该方法是否能够预测关键化合物(已上市药物)。在30项专利中的17项(57%)中,该方法能够成功预测关键化合物。结果表明,在传统药物化学家的方法效果不佳的情况下,我们的方法可以提供一种预测关键化合物的替代方法。该方法也可作为传统药物化学家方法的补充。