Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL, 60208, USA.
Institute for Public Health and Medicine, Feinberg School of Medicine, Center for Health Information Partnerships, Northwestern University, Chicago, IL, 60611, USA.
Mol Inform. 2019 Nov;38(11-12):e1900038. doi: 10.1002/minf.201900038. Epub 2019 Sep 10.
Organic solar cells are an inexpensive, flexible alternative to traditional silicon-based solar cells but disadvantaged by low power conversion efficiency due to empirical design and complex manufacturing processes. This process can be accelerated by generating a comprehensive set of potential candidates. However, this would require a laborious trial and error method of modeling all possible polymer configurations. A machine learning model has the potential to accelerate the process of screening potential donor candidates by associating structural features of the compound using molecular fingerprints with their highest occupied molecular orbital energies. In this paper, extremely randomized tree learning models are employed for the prediction of HOMO values for donor compounds, and a web application is developed. The proposed models outperform neural networks trained on molecular fingerprints as well as SMILES, as well as other state-of-the-art architectures such as Chemception and Molecular Graph Convolution on two datasets of varying sizes.
有机太阳能电池是一种廉价、灵活的传统硅基太阳能电池替代品,但由于经验设计和复杂的制造工艺,其功率转换效率较低。通过生成一套全面的潜在候选物,可以加速这一过程。然而,这将需要一种繁琐的试错方法来对所有可能的聚合物构型进行建模。机器学习模型有可能通过使用分子指纹将化合物的结构特征与其最高占据分子轨道能量相关联,从而加速筛选潜在供体候选物的过程。在本文中,极端随机树学习模型被用于预测供体化合物的 HOMO 值,并开发了一个网络应用程序。所提出的模型在两个不同大小的数据集上的表现优于基于分子指纹和 SMILES 训练的神经网络,以及 Cheception 和分子图卷积等其他最先进的架构。