Ma Xing-Yu, Lyu Hou-Yi, Dong Xue-Juan, Zhang Zhen, Hao Kuan-Rong, Yan Qing-Bo, Su Gang
School of Physical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
Center of Materials Science and Optoelectronics Engineering, College of Materials Science and Optoelectronic Technology, University of Chinese Academy of Sciences, Beijing 100049, China.
J Phys Chem Lett. 2021 Jan 28;12(3):973-981. doi: 10.1021/acs.jpclett.0c03136. Epub 2021 Jan 19.
Regression machine learning is widely applied to predict various materials. However, insufficient materials data usually leads to poor performance. Here, we develop a new voting data-driven method that could generally improve the performance of the regression learning model for accurately predicting properties of materials. We apply it to investigate a large family (2135) of two-dimensional hexagonal binary compounds focusing on ferroelectric properties and find that the performance of the model for electric polarization is indeed greatly improved, where 38 stable ferroelectrics with out-of-plane polarization including 31 metals and 7 semiconductors are screened out. By unsupervised learning, actionable information such as how the number and orbital radius of valence electrons, ionic polarizability, and electronegativity of constituent atoms affect polarization was extracted. Our voting data-driven method not only reduces the size of materials data for constructing a reliable learning model but also enables one to make precise predictions for targeted functional materials.
回归机器学习被广泛应用于预测各种材料。然而,材料数据不足通常会导致性能不佳。在此,我们开发了一种新的投票数据驱动方法,该方法通常可以提高回归学习模型准确预测材料性能的性能。我们将其应用于研究一个包含2135种二维六方二元化合物的大家族,重点关注铁电性能,发现该模型在电极化方面的性能确实得到了极大提升,从中筛选出了38种具有面外极化的稳定铁电体,包括31种金属和7种半导体。通过无监督学习,提取了诸如价电子数量和轨道半径、离子极化率以及组成原子的电负性如何影响极化等可操作信息。我们的投票数据驱动方法不仅减少了构建可靠学习模型所需的材料数据量,还能使人们对目标功能材料进行精确预测。