• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AutoAMS:基于自动化注意力的多模态图学习架构搜索。

AutoAMS: Automated attention-based multi-modal graph learning architecture search.

机构信息

School of Computer Science and Engineering, Central South University, Changsha, 410083, Hunan, China.

School of Computer Science and Engineering, Central South University, Changsha, 410083, Hunan, China.

出版信息

Neural Netw. 2024 Nov;179:106427. doi: 10.1016/j.neunet.2024.106427. Epub 2024 Jun 22.

DOI:10.1016/j.neunet.2024.106427
PMID:39003983
Abstract

Multi-modal attention mechanisms have been successfully used in multi-modal graph learning for various tasks. However, existing attention-based multi-modal graph learning (AMGL) architectures heavily rely on manual design, requiring huge effort and expert experience. Meanwhile, graph neural architecture search (GNAS) has made great progress toward automatically designing graph-based learning architectures. However, it is challenging to directly adopt existing GNAS methods to search for better AMGL architectures because of the search spaces that only focus on designing graph neural network architectures and the search objective that ignores multi-modal interactive information between modalities and long-term content dependencies within different modalities. To address these issues, we propose an automated attention-based multi-modal graph learning architecture search (AutoAMS) framework, which can automatically design the optimal AMGL architectures for different multi-modal tasks. Specifically, we design an effective attention-based multi-modal (AM) search space consisting of four sub-spaces, which can jointly support the automatic search of multi-modal attention representation and other components of multi-modal graph learning architecture. In addition, a novel search objective based on an unsupervised multi-modal reconstruction loss and task-specific loss is introduced to search and train AMGL architectures. The search objective can extract the global features and capture multi-modal interactions from multiple modalities. The experimental results on multi-modal tasks show strong evidence that AutoAMS is capable of designing high-performance AMGL architectures.

摘要

多模态注意力机制已成功应用于多模态图学习的各种任务中。然而,现有的基于注意力的多模态图学习 (AMGL) 架构严重依赖于手动设计,需要大量的努力和专家经验。同时,图神经架构搜索 (GNAS) 在自动设计基于图的学习架构方面取得了重大进展。然而,由于搜索空间仅关注于设计图神经网络架构,以及搜索目标忽略了模态之间的多模态交互信息和不同模态内部的长期内容依赖关系,因此直接采用现有的 GNAS 方法来搜索更好的 AMGL 架构具有挑战性。为了解决这些问题,我们提出了一种自动化基于注意力的多模态图学习架构搜索 (AutoAMS) 框架,该框架可以为不同的多模态任务自动设计最佳的 AMGL 架构。具体来说,我们设计了一个有效的基于注意力的多模态 (AM) 搜索空间,该空间由四个子空间组成,可以共同支持多模态注意力表示和多模态图学习架构的其他组件的自动搜索。此外,引入了一种基于无监督多模态重建损失和任务特定损失的新搜索目标,以搜索和训练 AMGL 架构。搜索目标可以从多个模态中提取全局特征并捕获多模态交互。在多模态任务上的实验结果有力地证明了 AutoAMS 能够设计高性能的 AMGL 架构。

相似文献

1
AutoAMS: Automated attention-based multi-modal graph learning architecture search.AutoAMS:基于自动化注意力的多模态图学习架构搜索。
Neural Netw. 2024 Nov;179:106427. doi: 10.1016/j.neunet.2024.106427. Epub 2024 Jun 22.
2
Multi-View Graph Neural Architecture Search for Biomedical Entity and Relation Extraction.用于生物医学实体和关系提取的多视图图神经网络架构搜索
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1221-1233. doi: 10.1109/TCBB.2022.3205113. Epub 2023 Apr 3.
3
Video Moment Retrieval With Cross-Modal Neural Architecture Search.基于跨模态神经架构搜索的视频瞬间检索
IEEE Trans Image Process. 2022;31:1204-1216. doi: 10.1109/TIP.2022.3140611. Epub 2022 Jan 19.
4
CLIP-Based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-Modal Hashing Retrieval.基于 CLIP 的自适应图注意力网络的大规模无监督多模态哈希检索
Sensors (Basel). 2023 Mar 24;23(7):3439. doi: 10.3390/s23073439.
5
Auto-GNN: Neural architecture search of graph neural networks.自动图神经网络:图神经网络的神经架构搜索
Front Big Data. 2022 Nov 17;5:1029307. doi: 10.3389/fdata.2022.1029307. eCollection 2022.
6
Alzheimer's disease diagnosis from multi-modal data via feature inductive learning and dual multilevel graph neural network.基于特征归纳学习和双重多级图神经网络的多模态数据阿尔茨海默病诊断。
Med Image Anal. 2024 Oct;97:103213. doi: 10.1016/j.media.2024.103213. Epub 2024 May 28.
7
Triplet-aware graph neural networks for factorized multi-modal knowledge graph entity alignment.基于三元组感知图神经网络的分解式多模态知识图实体对齐方法。
Neural Netw. 2024 Nov;179:106479. doi: 10.1016/j.neunet.2024.106479. Epub 2024 Jun 20.
8
A Comprehensive Survey on Deep Graph Representation Learning.关于深度图表示学习的全面调查。
Neural Netw. 2024 May;173:106207. doi: 10.1016/j.neunet.2024.106207. Epub 2024 Feb 27.
9
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
10
A Multi-Objective Evolutionary Approach Based on Graph-in-Graph for Neural Architecture Search of Convolutional Neural Networks.基于图中图的多目标进化方法用于卷积神经网络的神经结构搜索。
Int J Neural Syst. 2021 Sep;31(9):2150035. doi: 10.1142/S0129065721500350. Epub 2021 Jul 24.