• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

八千万张小图片:用于非参数化物体与场景识别的大型数据集。

80 million tiny images: a large data set for nonparametric object and scene recognition.

作者信息

Torralba Antonio, Fergus Rob, Freeman William T

机构信息

Computer Science and Artificial Intelligence Lab (CSAIL), Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA 02139, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1958-70. doi: 10.1109/TPAMI.2008.128.

DOI:10.1109/TPAMI.2008.128
PMID:18787244
Abstract

With the advent of the Internet, billions of images are now freely available online and constitute a dense sampling of the visual world. Using a variety of non-parametric methods, we explore this world with the aid of a large dataset of 79,302,017 images collected from the Internet. Motivated by psychophysical results showing the remarkable tolerance of the human visual system to degradations in image resolution, the images in the dataset are stored as 32 x 32 color images. Each image is loosely labeled with one of the 75,062 non-abstract nouns in English, as listed in the Wordnet lexical database. Hence the image database gives a comprehensive coverage of all object categories and scenes. The semantic information from Wordnet can be used in conjunction with nearest-neighbor methods to perform object classification over a range of semantic levels minimizing the effects of labeling noise. For certain classes that are particularly prevalent in the dataset, such as people, we are able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors.

摘要

随着互联网的出现,数十亿张图像如今可在网上免费获取,构成了视觉世界的密集采样。借助从互联网收集的包含79302017张图像的大型数据集,我们使用各种非参数方法来探索这个世界。受心理物理学结果的启发,这些结果表明人类视觉系统对图像分辨率下降具有显著的耐受性,数据集中的图像存储为32×32的彩色图像。每张图像都用Wordnet词汇数据库中列出的75062个非抽象英语名词之一进行了大致标注。因此,图像数据库全面涵盖了所有对象类别和场景。来自Wordnet的语义信息可与最近邻方法结合使用,在一系列语义级别上执行对象分类,以最小化标注噪声的影响。对于数据集中特别普遍的某些类别,例如人,我们能够证明其识别性能与特定类别的Viola-Jones风格检测器相当。

相似文献

1
80 million tiny images: a large data set for nonparametric object and scene recognition.八千万张小图片:用于非参数化物体与场景识别的大型数据集。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1958-70. doi: 10.1109/TPAMI.2008.128.
2
Automatic semantic annotation of real-world web images.真实世界网络图像的自动语义标注
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1933-44. doi: 10.1109/TPAMI.2008.125.
3
Tiny videos: a large data set for nonparametric video retrieval and frame classification.微小视频:用于非参数视频检索和帧分类的大数据集。
IEEE Trans Pattern Anal Mach Intell. 2011 Mar;33(3):618-30. doi: 10.1109/TPAMI.2010.118.
4
Geometry-based image retrieval in binary image databases.二值图像数据库中基于几何的图像检索
IEEE Trans Pattern Anal Mach Intell. 2008 Jun;30(6):1003-13. doi: 10.1109/TPAMI.2008.37.
5
Localized content-based image retrieval.基于内容的局部图像检索。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1902-12. doi: 10.1109/TPAMI.2008.112.
6
Supervised learning of semantic classes for image annotation and retrieval.用于图像标注和检索的语义类别的监督学习。
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):394-410. doi: 10.1109/TPAMI.2007.61.
7
Homotopic image pseudo-invariants for openset object recognition and image retrieval.用于开集目标识别与图像检索的同伦图像伪不变量。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1891-901. doi: 10.1109/TPAMI.2008.143.
8
Document image retrieval through word shape coding.通过单词形状编码进行文档图像检索。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1913-8. doi: 10.1109/TPAMI.2008.89.
9
Context-based object-class recognition and retrieval by generalized correlograms.基于上下文的广义相关图对象类别识别与检索
IEEE Trans Pattern Anal Mach Intell. 2007 Oct;29(10):1818-33. doi: 10.1109/TPAMI.2007.1098.
10
Universal and adapted vocabularies for generic visual categorization.用于通用视觉分类的通用和适应性词汇表。
IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1243-56. doi: 10.1109/TPAMI.2007.70755.

引用本文的文献

1
Energy-based jamming pattern open set recognition via spiking wavelet transformer.基于能量的干扰模式开放集识别:通过脉冲小波变换器实现
PLoS One. 2025 Jun 26;20(6):e0325381. doi: 10.1371/journal.pone.0325381. eCollection 2025.
2
A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.医学可穿戴计算中时间序列数据增强的综合调查与比较分析
PLoS One. 2025 Mar 18;20(3):e0315343. doi: 10.1371/journal.pone.0315343. eCollection 2025.
3
Domain adaptation in small-scale and heterogeneous biological datasets.
小规模和异构生物数据集中的域适应
Sci Adv. 2024 Dec 20;10(51):eadp6040. doi: 10.1126/sciadv.adp6040.
4
Exploring feature sparsity for out-of-distribution detection.探索用于分布外检测的特征稀疏性。
Sci Rep. 2024 Nov 18;14(1):28444. doi: 10.1038/s41598-024-79934-7.
5
The development of a machine learning model to train junior ophthalmologists in diagnosing the pre-clinical keratoconus.一种用于培训初级眼科医生诊断临床前期圆锥角膜的机器学习模型的开发。
Front Med (Lausanne). 2024 Sep 18;11:1458356. doi: 10.3389/fmed.2024.1458356. eCollection 2024.
6
A method for small-sized wheat seedlings detection: from annotation mode to model construction.一种用于小型小麦幼苗检测的方法:从标注模式到模型构建。
Plant Methods. 2024 Jan 29;20(1):15. doi: 10.1186/s13007-024-01147-w.
7
Real-Time Detection of an Undercarriage Based on Receptive Field Blocks and Coordinate Attention.基于感受野模块和坐标注意力的起落架实时检测
Sensors (Basel). 2023 Dec 16;23(24):9861. doi: 10.3390/s23249861.
8
Random pruning: channel sparsity by expectation scaling factor.随机剪枝:通过期望缩放因子实现通道稀疏性
PeerJ Comput Sci. 2023 Sep 5;9:e1564. doi: 10.7717/peerj-cs.1564. eCollection 2023.
9
Global contextual attention augmented YOLO with ConvMixer prediction heads for PCB surface defect detection.基于全局上下文注意力增强 YOLO 的 ConvMixer 预测头的 PCB 表面缺陷检测。
Sci Rep. 2023 Jun 16;13(1):9805. doi: 10.1038/s41598-023-36854-2.
10
Challenging deep learning models with image distortion based on the abutting grating illusion.基于邻接光栅错觉,利用图像失真对深度学习模型进行挑战。
Patterns (N Y). 2023 Feb 28;4(3):100695. doi: 10.1016/j.patter.2023.100695. eCollection 2023 Mar 10.