Wan Yingtong, Wang Wanru, Zhang Meng, Peng Wei, Tang He
School of Industrial Design, Hubei University of Technology, Wuhan 430068, China.
School of Software Engineering, Huazhong University of Science and Technology, Wuhan 430074, China.
Sensors (Basel). 2024 Dec 27;25(1):96. doi: 10.3390/s25010096.
This paper tackles the challenge of accurately segmenting images of Ming-style furniture, an important aspect of China's cultural heritage, to aid in its preservation and analysis. Existing vision foundation models, like the segment anything model (SAM), struggle with the complex structures of Ming furniture due to the need for manual prompts and imprecise segmentation outputs. To address these limitations, we introduce two key innovations: the material attribute prompter (MAP), which automatically generates prompts based on the furniture's material properties, and the structure refinement module (SRM), which enhances segmentation by combining high- and low-level features. Additionally, we present the MF2K dataset, which includes 2073 images annotated with pixel-level masks across eight materials and environments. Our experiments demonstrate that the proposed method significantly improves the segmentation accuracy, outperforming state-of-the-art models in terms of the mean intersection over union (mIoU). Ablation studies highlight the contributions of the MAP and SRM to both the performance and computational efficiency. This work offers a powerful automated solution for segmenting intricate furniture structures, facilitating digital preservation and in-depth analysis of Ming-style furniture.
本文应对了精确分割明式家具图像这一挑战,明式家具是中国文化遗产的一个重要方面,此举有助于其保护与分析。现有的视觉基础模型,如分割一切模型(SAM),由于需要手动提示且分割输出不精确,在处理明式家具的复杂结构时面临困难。为解决这些局限性,我们引入了两项关键创新:材质属性提示器(MAP),它基于家具的材质属性自动生成提示;以及结构细化模块(SRM),它通过结合高低层特征来增强分割效果。此外,我们还展示了MF2K数据集,该数据集包含2073张图像,这些图像在八种材质和环境下都标注有像素级掩码。我们的实验表明,所提出的方法显著提高了分割精度,在平均交并比(mIoU)方面优于现有最先进的模型。消融研究突出了MAP和SRM对性能和计算效率的贡献。这项工作为分割复杂的家具结构提供了一个强大的自动化解决方案,有助于明式家具的数字保护和深入分析。