Toward Unbiased High-Quality Portraits through Latent-Space Evaluation.

Authors

Almhaithawi Doaa, Bellini Alessandro, Cerquitelli Tania

Affiliations

Department of Control and Computer Engineering, Politecnico di Torino, 10129 Torino, Italy.

Prime Lab, Mathema s.r.l., 50142 Florence, Italy.

Publication information

J Imaging. 2024 Jun 28;10(7):157. doi: 10.3390/jimaging10070157.

DOI:10.3390/jimaging10070157
PMID:39057728
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11278512/
Abstract

Images, texts, voices, and signals can be synthesized by latent spaces in a multidimensional vector, which can be explored without the hurdles of noise or other interfering factors. In this paper, we present a practical use case that demonstrates the power of latent space in exploring complex realities such as image space. We focus on DaVinciFace, an AI-based system that explores the StyleGAN2 space to create a high-quality portrait for anyone in the style of the Renaissance genius Leonardo da Vinci. The user enters one of their portraits and receives the corresponding Da Vinci-style portrait as an output. Since most of Da Vinci's artworks depict young and beautiful women (e.g., "La Belle Ferroniere", "Beatrice de' Benci"), we investigate the ability of DaVinciFace to account for other social categorizations, including gender, race, and age. The experimental results evaluate the effectiveness of our methodology on 1158 portraits acting on the vector representations of the latent space to produce high-quality portraits that retain the facial features of the subject's social categories, and conclude that sparser vectors have a greater effect on these features. To objectively evaluate and quantify our results, we solicited human feedback via a crowd-sourcing campaign. Analysis of the human feedback showed a high tolerance for the loss of important identity features in the resulting portraits when the Da Vinci style is more pronounced, with some exceptions, including Africanized individuals.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af6e/11278512/b6254eb1f351/jimaging-10-00157-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af6e/11278512/4a51bac8dd14/jimaging-10-00157-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af6e/11278512/b21f2d37d269/jimaging-10-00157-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af6e/11278512/5053682c275d/jimaging-10-00157-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af6e/11278512/7f5e3f2e3fae/jimaging-10-00157-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af6e/11278512/a927e2cd4806/jimaging-10-00157-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af6e/11278512/8b1ae24dade5/jimaging-10-00157-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af6e/11278512/8348be64be32/jimaging-10-00157-g008.jpg
