
Diverse Dataset for Eyeglasses Detection: Extending the Flickr-Faces-HQ (FFHQ) Dataset.

Author

Matuzevičius Dalius

Affiliation

Department of Electronic Systems, Vilnius Gediminas Technical University (VILNIUS TECH), 10105 Vilnius, Lithuania.

Publication

Sensors (Basel). 2024 Dec 1;24(23):7697. doi: 10.3390/s24237697.

DOI:10.3390/s24237697
PMID:39686233
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11645010/
Abstract

Facial analysis is an important area of research in computer vision and machine learning, with applications spanning security, healthcare, and user interaction systems. The data-centric AI approach emphasizes the importance of high-quality, diverse, and well-annotated datasets in driving advances in this field. However, current facial datasets, such as Flickr-Faces-HQ (FFHQ), lack detailed annotations for detecting facial accessories, particularly eyeglasses. This work addresses that limitation by extending the FFHQ dataset with precise bounding box annotations for eyeglasses detection, enhancing its utility for data-centric AI applications. The extended dataset comprises 70,000 images, including over 16,000 images containing eyewear, and it exceeds the CelebAMask-HQ dataset in size and diversity. A semi-automated protocol was employed to generate accurate bounding box annotations efficiently, minimizing the need for extensive manual labeling. The enriched dataset serves as a resource for training and benchmarking eyewear detection models. In addition, baseline benchmark results for eyeglasses detection are presented using deep learning methods, including YOLOv8 and MobileNetV3. The evaluation, conducted through cross-dataset validation, demonstrated the robustness of models trained on the extended FFHQ dataset, which outperformed models trained on the existing alternative, CelebAMask-HQ. The extended dataset has been made publicly available and is expected to support future research and development in eyewear detection, contributing to advances in facial analysis and related fields.
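To make the notion of a bounding box annotation concrete, the following is a minimal Python sketch, not the author's pipeline. It assumes an eyeglasses box is given as pixel corners `(x_min, y_min, x_max, y_max)`, which the abstract does not specify, and uses the 1024×1024 resolution of the original FFHQ images. The helper names `to_yolo_label` and `iou` are hypothetical. The first converts a box to the normalized `class x_center y_center width height` text line commonly used to train YOLO-family detectors; the second computes intersection over union, the overlap measure typically used when scoring predicted boxes against annotations.

```python
from typing import Tuple

# Assumed box layout (not stated in the abstract): pixel corners.
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def to_yolo_label(box: Box, img_w: int, img_h: int, class_id: int = 0) -> str:
    """Convert a pixel-corner box to a normalized YOLO-style label line."""
    x_min, y_min, x_max, y_max = box
    x_center = (x_min + x_max) / 2 / img_w
    y_center = (y_min + y_max) / 2 / img_h
    width = (x_max - x_min) / img_w
    height = (y_max - y_min) / img_h
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two pixel-corner boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    # Hypothetical eyeglasses box on a 1024x1024 FFHQ image.
    print(to_yolo_label((256, 384, 768, 512), 1024, 1024))
    # Overlap between a hypothetical annotation and a prediction.
    print(iou((256, 384, 768, 512), (260, 380, 760, 520)))
```

In a cross-dataset evaluation of the kind the abstract describes, a detector trained on one dataset would be scored on another by comparing its predicted boxes with that dataset's annotations using an overlap measure such as this one.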



