Liang Fengfeng, Zhang Yu, Zhou Chuntian, Zhang Heng, Liu Guangjie, Zhu Jinlong
School of Computer Science and Technology, Changchun Normal University, Changchun, China.
PLoS One. 2024 Oct 2;19(10):e0311228. doi: 10.1371/journal.pone.0311228. eCollection 2024.
Nanoparticles exhibit broad applications in materials mechanics, medicine, energy and other fields. The ordered arrangement of nanoparticles is very important to fully understand their properties and functionalities. However, in materials science, the acquisition of training images requires a large number of professionals and the labor cost is extremely high, so there are usually very few training samples in the field of materials. In this study, a segmentation method of nanoparticle topological structure based on synthetic data (SD) is proposed, which aims to solve the issue of small data in the field of materials. Our findings reveal that the combination of SD generated by rendering software with merely 15% Authentic Data (AD) shows better performance in training deep learning model. The trained U-Net model shows that Miou of 0.8476, accuracy of 0.9970, Kappa of 0.8207, and Dice of 0.9103, respectively. Compared with data enhancement alone, our approach yields a 1% improvement in the Miou metric. These results show that our proposed strategy can achieve better prediction performance without increasing the cost of data acquisition.
纳米颗粒在材料力学、医学、能源等领域有着广泛的应用。纳米颗粒的有序排列对于充分理解其性质和功能非常重要。然而,在材料科学中,获取训练图像需要大量专业人员,人工成本极高,因此材料领域的训练样本通常很少。在本研究中,提出了一种基于合成数据(SD)的纳米颗粒拓扑结构分割方法,旨在解决材料领域数据量少的问题。我们的研究结果表明,由渲染软件生成的合成数据与仅15%的真实数据(AD)相结合,在训练深度学习模型时表现出更好的性能。训练后的U-Net模型的平均交并比为0.8476,准确率为0.9970,卡帕系数为0.8207,骰子系数为0.9103。与单独的数据增强相比,我们的方法在平均交并比指标上提高了1%。这些结果表明,我们提出的策略在不增加数据采集成本的情况下可以实现更好的预测性能。