Kasarda John P, Zhang Angela, Tong Hua, Tan Yuan, Wang Ruizi, Verstynen Timothy, Tarr Michael J
Department of Psychology, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
Entertainment Technology Center, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
Sci Rep. 2025 Mar 18;15(1):9287. doi: 10.1038/s41598-025-93036-y.
The modern study of perceptual learning across humans, non-human animals, and artificial agents requires large-scale datasets with flexible, customizable, and controllable features for distinguishing between categories. To support this research, we developed the Oomplet Dataset Toolkit (ODT), an open-source, publicly available toolbox capable of generating 9.1 million unique visual stimuli across ten feature dimensions. Each stimulus is a cartoon-like humanoid character, termed an "Oomplet," designed to be an instance within clearly defined visual categories that are engaging and suitable for use with diverse groups, including children. Experiments show that adults can use four to five of the ten dimensions as single classification criteria in simple perceptual discrimination tasks, underscoring the toolkit's flexibility. With the ODT, researchers can dynamically generate large, novel stimulus sets to study perceptual learning across biological and artificial contexts.
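The stimulus count follows from the factorial combination of the feature dimensions. As a rough, hypothetical sketch (the actual ODT dimensions and per-dimension level counts are defined by the toolkit and are not reproduced here), ten dimensions with about five discrete levels each would yield a space on the same order as the reported 9.1 million unique stimuli:

```python
from math import prod
from itertools import islice, product

# Hypothetical level counts for ten feature dimensions. These are NOT the
# actual ODT dimension definitions; they only illustrate how a factorial
# design over ten dimensions reaches a stimulus count on the order of the
# 9.1 million reported above (5**10 = 9,765,625).
levels_per_dimension = [5] * 10

total = prod(levels_per_dimension)
print(f"Unique feature combinations: {total:,}")  # 9,765,625

# Each stimulus can be identified by a tuple of level indices, one per dimension.
for stimulus in islice(product(*(range(n) for n in levels_per_dimension)), 3):
    print(stimulus)
```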