Gabler Stephan, Soelter Jan, Hussain Taufia, Sachse Silke, Schmuker Michael
Theoretical Neuroscience, Institute of Biology, Dept. of Biology, Chemistry, Pharmacy, Freie Universität Berlin, Königin-Luise-Str. 1-3, D-14195 Berlin, Germany.
Bernstein Center for Computational Neuroscience Berlin, Philippstr. 13, Haus 6, D-10115 Berlin, Germany.
Mol Inform. 2013 Oct;32(9-10):855-65. doi: 10.1002/minf.201300037. Epub 2013 Oct 2.
Responses of olfactory receptors (ORs) can be predicted by applying machine learning methods on a multivariate encoding of an odorant's chemical structure. Physicochemical descriptors that encode features of the molecular graph are a popular choice for such an encoding. Here, we explore the EVA descriptor set, which encodes features derived from the vibrational spectrum of a molecule. We assessed the performance of Support Vector Regression (SVR) and Random Forest Regression (RFR) to predict the gradual response of Drosophila ORs. We compared a 27-dimensional variant of the EVA descriptor against a set of 1467 descriptors provided by the eDragon software package, and against a 32-dimensional subset thereof that has been proposed as the basis for an odor metric consisting of 32 descriptors (HADDAD). The best prediction performance was reproducibly achieved using SVR on the highest-dimensional feature set. The low-dimensional EVA and HADDAD feature sets predicted odor-OR interactions with similar accuracy. Adding charge and polarizability information to the EVA descriptor did not improve the results but rather decreased predictive power. Post-hoc in vivo measurements confirmed these results. Our findings indicate that EVA provides a meaningful low-dimensional representation of odor space, although EVA hardly outperformed "classical" descriptor sets.
通过对气味剂化学结构的多变量编码应用机器学习方法,可以预测嗅觉受体(OR)的反应。编码分子图特征的物理化学描述符是这种编码的常用选择。在这里,我们探索了EVA描述符集,它编码从分子振动光谱中得出的特征。我们评估了支持向量回归(SVR)和随机森林回归(RFR)预测果蝇OR逐渐反应的性能。我们将EVA描述符的27维变体与eDragon软件包提供的一组1467个描述符进行了比较,并与其中一个32维子集进行了比较,该子集已被提议作为由32个描述符组成的气味度量(HADDAD)的基础。在最高维特征集上使用SVR可重复地实现最佳预测性能。低维的EVA和HADDAD特征集以相似的准确度预测气味-OR相互作用。向EVA描述符添加电荷和极化率信息并没有改善结果,反而降低了预测能力。事后体内测量证实了这些结果。我们的研究结果表明,EVA提供了气味空间有意义的低维表示,尽管EVA几乎没有优于“经典”描述符集。