Geng Tao, Yang Yuxuan
Nanjing Forestry University, Nanjing, China.
PLoS One. 2025 Sep 4;20(9):e0331240. doi: 10.1371/journal.pone.0331240. eCollection 2025.
Interior design is a complex and creative discipline involving aesthetics, functionality, ergonomics, and materials science. Effective solutions must meet diverse requirements, typically producing multiple deliverables such as renderings and design drawings from various perspectives. Consequently, interior design processes are often inefficient and demand significant creativity. With advances in machine learning, generative models have emerged as a promising means of improving efficiency by creating designs from text descriptions or sketches. However, few generative works focus on interior design, leading to substantial discrepancies between outputs and practical needs, such as differences in size, spatial scope, and the lack of controllable generation quality. To address these challenges, we propose DiffDesign, a controllable diffusion model with meta priors for efficient interior design generation. Specifically, we utilize the generative priors of a 2D diffusion model pre-trained on a large image dataset as our rendering backbone. We further guide the denoising process by disentangling cross-attention control over design attributes, such as appearance, pose, and size, and introduce an optimal transfer-based alignment module to enforce view consistency. Simultaneously, we construct an interior design-specific dataset, DesignHelper, consisting of over 400 solutions across more than 15 spatial types and 15 design styles. This dataset helps fine-tune DiffDesign. Extensive experiments conducted on various benchmark datasets demonstrate the effectiveness and robustness of DiffDesign.
室内设计是一门复杂且富有创造性的学科,涉及美学、功能性、人体工程学和材料科学。有效的解决方案必须满足多样化的需求,通常会产生多种交付成果,例如从不同视角生成的效果图和设计图纸。因此,室内设计流程往往效率低下且需要大量创造力。随着机器学习的进步,生成模型已成为一种有前景的手段,可通过根据文本描述或草图创建设计来提高效率。然而,很少有生成作品专注于室内设计,导致输出结果与实际需求之间存在很大差异,例如尺寸、空间范围的差异以及缺乏可控制的生成质量。为应对这些挑战,我们提出了DiffDesign,这是一种具有元先验的可控扩散模型,用于高效的室内设计生成。具体而言,我们将在大型图像数据集上预训练的二维扩散模型的生成先验用作渲染主干。我们通过解开对外观、姿态和尺寸等设计属性的交叉注意力控制来进一步引导去噪过程,并引入基于最优转移的对齐模块以确保视图一致性。同时,我们构建了一个特定于室内设计的数据集DesignHelper,其中包含超过15种空间类型和15种设计风格的400多个解决方案。该数据集有助于对DiffDesign进行微调。在各种基准数据集上进行的大量实验证明了DiffDesign的有效性和鲁棒性。