• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过视图兼容特征融合实现特定对象的多视图分类

Object-Specific Multiview Classification Through View-Compatible Feature Fusion.

作者信息

Perez Soler Javier, Guardiola Jose-Luis, García Sastre Nicolás, Garrigues Carbó Pau, Sanchis Hernández Miguel, Perez-Cortes Juan-Carlos

机构信息

Instituto Tecnológico de Informática (ITI), C. Nicolás Copérnico, 7, 46022 Valencia, Spain.

Departamento de Informática de Sistemas y Computadores (DISCA), Universitat Politècnica de València (UPV), 46022 Valencia, Spain.

出版信息

Sensors (Basel). 2025 Jul 2;25(13):4127. doi: 10.3390/s25134127.

DOI:10.3390/s25134127
PMID:40648382
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12252050/
Abstract

Multi-view classification (MVC) typically focuses on categorizing objects into distinct classes by employing multiple perspectives of the same objects. However, in numerous real-world applications, such as industrial inspection and quality control, there is an increasing need to distinguish particular objects from a pool of similar ones while simultaneously disregarding unknown objects. In these scenarios, relying on a single image may not provide sufficient information to effectively identify the scrutinized object, as different perspectives may reveal distinct characteristics that are essential for accurate classification. Most existing approaches operate within closed-set environments and are focused on generalization, which makes them less effective in distinguishing individual objects from others. This limitations are particularly problematic in industrial quality assessment, where distinguishing between specific objects and discarding unknowns is crucial. To address this challenge, we introduce a View-Compatible Feature Fusion (VCFF) method that utilizes images from predetermined positions as an accurate solution for multi-view classification of specific objects. Unlike other approaches, VCFF explicitly integrates pose information during the fusion process. It does not merely use pose as auxiliary data but employs it to align and selectively fuse features from different views. This mathematically explicit fusion of rotations, based on relative poses, allows VCFF to effectively combine multi-view information, enhancing classification accuracy. Through experimental evaluations, we demonstrate that the proposed VCFF method outperforms state-of-the-art MVC algorithms, especially in open-set scenarios, where the set of possible objects is not fully known in advance. Remarkably, VCFF achieves an average precision of 1.0 using only 8 cameras, whereas existing methods require 20 cameras to reach a maximum of 0.95. In terms of AUC-ROC under the constraint of fewer than 3σ false positives-a critical metric in industrial inspection-current state-of-the-art methods achieve up to 0.72, while VCFF attains a perfect score of 1.0 with just eight cameras. Furthermore, our approach delivers highly accurate rotation estimation, maintaining an error margin slightly above 2° when sampling at 4° intervals.

摘要

多视图分类(MVC)通常专注于通过采用同一物体的多个视角将物体分类到不同类别中。然而,在许多实际应用中,如工业检测和质量控制,越来越需要从一组相似物体中区分出特定物体,同时忽略未知物体。在这些场景中,仅依靠单一图像可能无法提供足够信息来有效识别被检查物体,因为不同视角可能会揭示对准确分类至关重要的不同特征。大多数现有方法在封闭集环境中运行,专注于泛化,这使得它们在将单个物体与其他物体区分开来时效果较差。这种局限性在工业质量评估中尤其成问题,因为区分特定物体和丢弃未知物体至关重要。为应对这一挑战,我们引入了一种视图兼容特征融合(VCFF)方法,该方法利用来自预定位置的图像作为特定物体多视图分类的准确解决方案。与其他方法不同,VCFF在融合过程中明确整合姿态信息。它不仅仅将姿态用作辅助数据,而是利用它来对齐并选择性地融合来自不同视图的特征。基于相对姿态的这种旋转的数学上明确的融合,使VCFF能够有效地组合多视图信息,提高分类精度。通过实验评估,我们证明所提出的VCFF方法优于现有最先进的MVC算法,特别是在开放集场景中,在这种场景下可能物体的集合事先并不完全已知。值得注意的是,VCFF仅使用8个摄像头就能达到1.0的平均精度,而现有方法需要20个摄像头才能达到最高0.95的精度。在工业检测的关键指标——误报率低于3σ的约束下的AUC-ROC方面,现有最先进的方法最高达到0.72,而VCFF仅用8个摄像头就能达到完美的1.0分数。此外,我们的方法提供了高度准确的旋转估计,以4°间隔采样时误差 margin略高于2°。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/37655e6451bc/sensors-25-04127-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/7772522df32d/sensors-25-04127-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/7002f5dc9c09/sensors-25-04127-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/f7237ebd78f9/sensors-25-04127-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/37655e6451bc/sensors-25-04127-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/7772522df32d/sensors-25-04127-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/7002f5dc9c09/sensors-25-04127-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/f7237ebd78f9/sensors-25-04127-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bb1/12252050/37655e6451bc/sensors-25-04127-g006.jpg

相似文献

1
Object-Specific Multiview Classification Through View-Compatible Feature Fusion.通过视图兼容特征融合实现特定对象的多视图分类
Sensors (Basel). 2025 Jul 2;25(13):4127. doi: 10.3390/s25134127.
2
Short-Term Memory Impairment短期记忆障碍
3
Sexual Harassment and Prevention Training性骚扰与预防培训
4
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
5
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
6
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
7
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
8
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
9
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
10
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

本文引用的文献

1
Effective Mean Square Differences: A Matching Algorithm for Highly Similar Sheet Metal Parts.有效均方差:一种用于高度相似钣金零件的匹配算法。
Sensors (Basel). 2023 Aug 21;23(16):7300. doi: 10.3390/s23167300.
2
EventDTW: An Improved Dynamic Time Warping Algorithm for Aligning Biomedical Signals of Nonuniform Sampling Frequencies.事件 DTW:一种用于对齐非均匀采样频率生物医学信号的改进动态时间规整算法。
Sensors (Basel). 2020 May 9;20(9):2700. doi: 10.3390/s20092700.
3
A System for In-Line 3D Inspection without Hidden Surfaces.一种无隐藏面在线三维检测系统。
Sensors (Basel). 2018 Sep 7;18(9):2993. doi: 10.3390/s18092993.
4
SeqViews2SeqLabels: Learning 3D Global Features via Aggregating Sequential Views by RNN With Attention.SeqViews2SeqLabels:通过 RNN 与注意力机制聚合序列视图学习 3D 全局特征。
IEEE Trans Image Process. 2019 Feb;28(2):658-672. doi: 10.1109/TIP.2018.2868426. Epub 2018 Sep 3.
5
VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification.VERAM:用于 3D 形状分类的视图增强递归注意力模型。
IEEE Trans Vis Comput Graph. 2019 Dec;25(12):3244-3257. doi: 10.1109/TVCG.2018.2866793. Epub 2018 Aug 23.
6
Inductive Multi-Hypergraph Learning and Its Application on View-Based 3D Object Classification.基于图的多视图 3D 目标分类的归纳式多超图学习
IEEE Trans Image Process. 2018 Dec;27(12):5957-5968. doi: 10.1109/TIP.2018.2862625. Epub 2018 Aug 2.
7
A MATHEMATICAL THEORY OF ADAPTIVE CONTROL PROCESSES.自适应控制过程的数学理论
Proc Natl Acad Sci U S A. 1959 Aug;45(8):1288-90. doi: 10.1073/pnas.45.8.1288.
8
Real-time 3D human capture system for mixed-reality art and entertainment.用于混合现实艺术与娱乐的实时3D人体捕捉系统。
IEEE Trans Vis Comput Graph. 2005 Nov-Dec;11(6):706-21. doi: 10.1109/TVCG.2005.105.