• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种新颖的特征融合模型,用于模仿摄影师的主动观察,以实现面向体育教育的场景重新构图。

A novel feature fusion model to mimic photographers' active observation for scenery recomposition toward physical education.

作者信息

Tang Dongyang, Wang Shang

机构信息

College of Physical Education, Wuhan Sports University, Wuhan, 430079, Hubei, China.

College of Computer Sciences, Beijing Technology and Business University, Beijing, China.

出版信息

Sci Rep. 2025 May 26;15(1):18361. doi: 10.1038/s41598-025-02678-5.

DOI:10.1038/s41598-025-02678-5
PMID:40419604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12106664/
Abstract

We develop a novel computational model to mimic photographers' observation techniques for scene decomposition. Central to our model is a hierarchical structure designed to capture human gaze dynamics accurately using the Binarized Normed Gradients (BING) objectness metric for identifying meaningful scene patches. We introduce a strategy called Locality-preserved and Observer-like Active Learning (LOAL) that constructs gaze shift paths (GSP) incrementally, allowing user interaction in the feature selection process. The GSPs are processed through a multi-layer aggregating algorithm, producing deep feature representations encoded into a Gaussian mixture model (GMM), which underpins our image retargeting approach. Our empirical analyses, supported by a user study, show that our method outperforms comparable techniques significantly, achieving a precision rate 3.2% higher than the second-best performer while halving the testing time. This streamlined approach blends aesthetics with algorithmic efficiency, enhancing AI-driven scene analysis.

摘要

我们开发了一种新颖的计算模型,以模仿摄影师用于场景分解的观察技术。我们模型的核心是一种层次结构,旨在使用二值化规范梯度(BING)目标度量准确捕捉人类注视动态,以识别有意义的场景块。我们引入了一种称为局部性保留和观察者式主动学习(LOAL)的策略,该策略逐步构建注视转移路径(GSP),允许用户在特征选择过程中进行交互。GSP通过多层聚合算法进行处理,生成编码为高斯混合模型(GMM)的深度特征表示,这是我们图像重定目标方法的基础。我们的实证分析在用户研究的支持下表明,我们的方法明显优于可比技术,精度率比第二好的方法高3.2%,同时测试时间减半。这种简化的方法将美学与算法效率相结合,增强了人工智能驱动的场景分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/46a4ff90720b/41598_2025_2678_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/5693d1d89b5d/41598_2025_2678_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/b5bb56200e5b/41598_2025_2678_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/035c5274d174/41598_2025_2678_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/c40b9337b3d0/41598_2025_2678_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/72f080183d2b/41598_2025_2678_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/c18e461fa1df/41598_2025_2678_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/46a4ff90720b/41598_2025_2678_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/5693d1d89b5d/41598_2025_2678_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/b5bb56200e5b/41598_2025_2678_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/035c5274d174/41598_2025_2678_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/c40b9337b3d0/41598_2025_2678_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/72f080183d2b/41598_2025_2678_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/c18e461fa1df/41598_2025_2678_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e13/12106664/46a4ff90720b/41598_2025_2678_Fig6_HTML.jpg

相似文献

1
A novel feature fusion model to mimic photographers' active observation for scenery recomposition toward physical education.一种新颖的特征融合模型,用于模仿摄影师的主动观察,以实现面向体育教育的场景重新构图。
Sci Rep. 2025 May 26;15(1):18361. doi: 10.1038/s41598-025-02678-5.
2
Multi-task feature integration and interactive active learning for scene image resizing.用于场景图像缩放的多任务特征集成与交互式主动学习
Sci Rep. 2025 May 19;15(1):17381. doi: 10.1038/s41598-025-98917-w.
3
Deep Active Learning with Contaminated Tags for Image Aesthetics Assessment.用于图像美学评估的带有污染标签的深度主动学习
IEEE Trans Image Process. 2018 Apr 18. doi: 10.1109/TIP.2018.2828326.
4
Scene categorization by Hessian-regularized active perceptual feature selection.基于黑塞正则化主动感知特征选择的场景分类
Sci Rep. 2025 Jan 4;15(1):739. doi: 10.1038/s41598-024-84181-x.
5
Perceptually Aware Image Retargeting for Mobile Devices.移动端有感知的图像重定向。
IEEE Trans Image Process. 2018 May;27(5):2301-2313. doi: 10.1109/TIP.2017.2779272.
6
A novel binary hashing for agricultural scenery classification.一种用于农业场景分类的新型二进制哈希算法。
Sci Rep. 2024 Nov 11;14(1):27602. doi: 10.1038/s41598-024-77685-z.
7
Bioinspired Scene Classification by Deep Active Learning With Remote Sensing Applications.基于深度学习的主动学习在遥感场景分类中的应用
IEEE Trans Cybern. 2022 Jul;52(7):5682-5694. doi: 10.1109/TCYB.2020.2981480. Epub 2022 Jul 4.
8
Scene Categorization Using Deeply Learned Gaze Shifting Kernel.基于深度学习的注视转移核的场景分类。
IEEE Trans Cybern. 2019 Jun;49(6):2156-2167. doi: 10.1109/TCYB.2018.2820731. Epub 2018 May 11.
9
Scene Categorization by Deeply Learning Gaze Behavior in a Semisupervised Context.在半监督环境下通过深度学习注视行为进行场景分类
IEEE Trans Cybern. 2021 Aug;51(8):4265-4276. doi: 10.1109/TCYB.2019.2913016. Epub 2021 Aug 4.
10
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.

本文引用的文献

1
Sparse R-CNN: An End-to-End Framework for Object Detection.稀疏区域卷积神经网络(Sparse R-CNN):一种用于目标检测的端到端框架。
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15650-15664. doi: 10.1109/TPAMI.2023.3292030. Epub 2023 Nov 3.
2
Feature Map Distillation of Thin Nets for Low-Resolution Object Recognition.用于低分辨率目标识别的细网络特征图蒸馏
IEEE Trans Image Process. 2022;31:1364-1379. doi: 10.1109/TIP.2022.3141255. Epub 2022 Jan 25.
3
Recurrently exploring class-wise attention in a hybrid convolutional and bidirectional LSTM network for multi-label aerial image classification.
在用于多标签航空图像分类的混合卷积和双向长短期记忆网络中反复探索类别注意力。
ISPRS J Photogramm Remote Sens. 2019 Mar;149:188-199. doi: 10.1016/j.isprsjprs.2019.01.015.
4
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.
5
mCENTRIST: A Multi-Channel Feature Generation Mechanism for Scene Categorization.mCENTRIST:一种用于场景分类的多通道特征生成机制。
IEEE Trans Image Process. 2014 Feb;23(2):823-36. doi: 10.1109/TIP.2013.2295756.
6
Scene Parsing From an MAP Perspective.基于 MAP 的场景解析。
IEEE Trans Cybern. 2015 Sep;45(9):1876-86. doi: 10.1109/TCYB.2014.2361489. Epub 2014 Nov 4.
7
Discovering discriminative graphlets for aerial image categories recognition.发现有判别力的图元以识别航空图像类别。
IEEE Trans Image Process. 2013 Dec;22(12):5071-84. doi: 10.1109/TIP.2013.2278465. Epub 2013 Aug 14.
8
Biologically inspired feature manifold for scene classification.基于生物学启发的场景分类特征流形。
IEEE Trans Image Process. 2010 Jan;19(1):174-84. doi: 10.1109/TIP.2009.2032939.
9
Saliency, attention, and visual search: an information theoretic approach.显著性、注意力与视觉搜索:一种信息论方法。
J Vis. 2009 Mar 13;9(3):5.1-24. doi: 10.1167/9.3.5.
10
Geometric mean for subspace selection.用于子空间选择的几何均值。
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):260-74. doi: 10.1109/TPAMI.2008.70.