Galek Peter T A, Fábián László, Motherwell W D Samuel, Allen Frank H, Feeder Neil
Cambridge Crystallographic Data Centre, 12 Union Road, Cambridge CB2 1EZ, England.
Acta Crystallogr B. 2007 Oct;63(Pt 5):768-82. doi: 10.1107/S0108768107030996. Epub 2007 Sep 14.
A new method is presented to predict which donors and acceptors form hydrogen bonds in a crystal structure, based on the statistical analysis of hydrogen bonds in the Cambridge Structural Database (CSD). The method is named the logit hydrogen-bonding propensity (LHP) model. The approach has a potential application in identifying both likely and unusual hydrogen bonding, which can help to rationalize stable and metastable crystalline forms, of relevance to drug development in the pharmaceutical industry. Whilst polymorph prediction techniques are widely used, the LHP model is knowledge-based and is not restricted by the computational issues of polymorph prediction, and as such may form a valuable precursor to polymorph screening. Model construction applies logistic regression, using training data obtained with a new survey method based on the CSD system. The survey categorizes the hydrogen bonds and extracts model parameter values using descriptive structural and chemical properties from three-dimensional organic crystal structures. LHP predictions from a fitted model are made using two-dimensional observables alone. In the initial cases analysed, the model is highly accurate, achieving approximately 90% correct classification of both observed hydrogen bonds and non-interacting donor-acceptor pairs. Extensive statistical validation shows the LHP model to be robust across a range of small-molecule organic crystal structures.
基于剑桥结构数据库(CSD)中氢键的统计分析,提出了一种预测晶体结构中哪些供体和受体形成氢键的新方法。该方法被命名为对数几率氢键倾向(LHP)模型。该方法在识别可能的和不寻常的氢键方面具有潜在应用,这有助于阐明稳定和亚稳晶型,这与制药行业的药物开发相关。虽然多晶型预测技术被广泛使用,但LHP模型是基于知识的,不受多晶型预测计算问题的限制,因此可能成为多晶型筛选的有价值的先导。模型构建应用逻辑回归,使用基于CSD系统的新调查方法获得的训练数据。该调查对氢键进行分类,并使用三维有机晶体结构中的描述性结构和化学性质提取模型参数值。拟合模型的LHP预测仅使用二维可观测值进行。在最初分析的案例中,该模型非常准确,对观察到的氢键和非相互作用的供体-受体对的正确分类率约为90%。广泛的统计验证表明,LHP模型在一系列小分子有机晶体结构中是稳健的。