• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于绘画风格生成的带有扩散模型的新型灵活身份网络。

A novel flexible identity-net with diffusion models for painting-style generation.

作者信息

Zhao Yifei, Liang Ziqi, Qiu Yingrui, Wang Xiaona

机构信息

School of New Media, Beijing Institute of Graphic Communication, Beijing, 102600, China.

出版信息

Sci Rep. 2025 Jul 31;15(1):27896. doi: 10.1038/s41598-025-12434-4.

DOI:10.1038/s41598-025-12434-4
PMID:40744991
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12313998/
Abstract

Art's unique style and creativity are essential in defining a work's identity, conveying emotions, and shaping audience perception. Recent advancements in diffusion models have revolutionized art design, animation, and gaming, particularly in generating original artwork and visual identities. However, traditional creative processes face challenges such as slow innovation, high costs, and limited scalability. Consequently, deep learning has emerged as a promising solution for enhancing painting-style creative design. In this paper, we present the Painting-Style Design Assistant Network (PDANet), a groundbreaking network architecture designed for advanced style transformation. Our work is supported by the Painting-42 dataset, a meticulously curated collection of 4055 artworks from 42 illustrious Chinese painters, capturing the aesthetic nuances of Chinese painting and offering invaluable design references. Additionally, we introduce a lightweight Identity-Net, designed to enhance large-scale text-to-image (T2I) models by aligning internal knowledge with external control signals. This innovative Identity-Net seamlessly integrates image prompts into the U-Net encoder, enabling the generation of diverse and consistent images. Through extensive quantitative and qualitative evaluations, our approach has demonstrated superior performance compared to existing methods, producing high-quality, versatile content with broad applicability across various creative domains. Our work not only advances the field of AI-driven art but also offers a new paradigm for the future of creative design. The code and data are available at  https://github.com/aigc-hi/PDANet .

摘要

艺术独特的风格和创造力对于定义作品的身份、传达情感以及塑造观众认知至关重要。扩散模型的最新进展彻底改变了艺术设计、动画和游戏领域,尤其是在生成原创艺术作品和视觉形象方面。然而,传统的创作过程面临着诸如创新缓慢、成本高昂和可扩展性有限等挑战。因此,深度学习已成为增强绘画风格创意设计的一个有前景的解决方案。在本文中,我们展示了绘画风格设计辅助网络(PDANet),这是一种为高级风格转换设计的开创性网络架构。我们的工作得到了绘画 - 42数据集的支持,该数据集精心挑选了42位杰出中国画家的4055幅艺术作品,捕捉了中国绘画的美学细微差别并提供了宝贵的设计参考。此外,我们引入了一个轻量级身份网络,旨在通过将内部知识与外部控制信号对齐来增强大规模文本到图像(T2I)模型。这个创新的身份网络将图像提示无缝集成到U-Net编码器中,能够生成多样且一致的图像。通过广泛的定量和定性评估,我们的方法与现有方法相比表现出卓越的性能,能够生成高质量、多功能的内容,在各个创意领域具有广泛的适用性。我们的工作不仅推动了人工智能驱动艺术的领域发展,还为创意设计的未来提供了一种新范式。代码和数据可在https://github.com/aigc-hi/PDANet获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/d44cdf7766f8/41598_2025_12434_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/6912cdfa753b/41598_2025_12434_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/693cc55bed1b/41598_2025_12434_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/65466904f644/41598_2025_12434_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/e70c94e663fe/41598_2025_12434_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/26eaec0c99cd/41598_2025_12434_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/9f3fb6cc7301/41598_2025_12434_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/df08922a35c3/41598_2025_12434_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/cbc1c697c4c7/41598_2025_12434_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/daa0277b2222/41598_2025_12434_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/d44cdf7766f8/41598_2025_12434_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/6912cdfa753b/41598_2025_12434_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/693cc55bed1b/41598_2025_12434_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/65466904f644/41598_2025_12434_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/e70c94e663fe/41598_2025_12434_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/26eaec0c99cd/41598_2025_12434_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/9f3fb6cc7301/41598_2025_12434_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/df08922a35c3/41598_2025_12434_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/cbc1c697c4c7/41598_2025_12434_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/daa0277b2222/41598_2025_12434_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2154/12313998/d44cdf7766f8/41598_2025_12434_Fig10_HTML.jpg

相似文献

1
A novel flexible identity-net with diffusion models for painting-style generation.一种用于绘画风格生成的带有扩散模型的新型灵活身份网络。
Sci Rep. 2025 Jul 31;15(1):27896. doi: 10.1038/s41598-025-12434-4.
2
Short-Term Memory Impairment短期记忆障碍
3
NADM: Noise-Aware Diffusion Model for Landscape Painting Video Generation.NADM:用于山水画视频生成的噪声感知扩散模型。
IEEE Trans Cybern. 2025 Aug;55(8):3686-3698. doi: 10.1109/TCYB.2025.3576752.
4
A medical image classification method based on self-regularized adversarial learning.基于自正则化对抗学习的医学图像分类方法。
Med Phys. 2024 Nov;51(11):8232-8246. doi: 10.1002/mp.17320. Epub 2024 Jul 30.
5
Influence of early through late fusion on pancreas segmentation from imperfectly registered multimodal magnetic resonance imaging.早期至晚期融合对来自配准不完善的多模态磁共振成像的胰腺分割的影响。
J Med Imaging (Bellingham). 2025 Mar;12(2):024008. doi: 10.1117/1.JMI.12.2.024008. Epub 2025 Apr 26.
6
Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力:结合混合方法和多模态分析的综合框架
JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.
7
Lightweight cross-resolution coarse-to-fine network for efficient deformable medical image registration.用于高效可变形医学图像配准的轻量级跨分辨率粗到细网络
Med Phys. 2025 Apr 25. doi: 10.1002/mp.17827.
8
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
9
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
10
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.

本文引用的文献

1
A fuzzy control algorithm based on artificial intelligence for the fusion of traditional Chinese painting and AI painting.一种基于人工智能的模糊控制算法,用于传统中国画与人工智能绘画的融合。
Sci Rep. 2024 Aug 1;14(1):17846. doi: 10.1038/s41598-024-68375-x.
2
Integrated visual transformer and flash attention for lip-to-speech generation GAN.用于唇语语音生成 GAN 的集成视觉转换器和闪存注意
Sci Rep. 2024 Feb 24;14(1):4525. doi: 10.1038/s41598-024-55248-6.
3
Efficient quantization of painting images by relevant colors.通过相关颜色实现绘画图像的高效量化。
Sci Rep. 2023 Feb 21;13(1):3034. doi: 10.1038/s41598-023-29380-8.
4
Layer separation mapping and consolidation evaluation of a fifteenth century panel painting using terahertz time-domain imaging.使用太赫兹时域成像对一幅十五世纪木板油画进行分层映射和加固评估。
Sci Rep. 2022 Dec 5;12(1):21038. doi: 10.1038/s41598-022-25013-8.
5
Label-free prediction of cell painting from brightfield images.无标记物预测明场图像中的细胞染色。
Sci Rep. 2022 Jun 15;12(1):10001. doi: 10.1038/s41598-022-12914-x.
6
Universality and superiority in preference for chromatic composition of art paintings.艺术画作色彩构成偏好中的普遍性与优越性。
Sci Rep. 2022 Mar 11;12(1):4294. doi: 10.1038/s41598-022-08365-z.