• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Radial Graph Convolutional Network for Visual Question Generation.

作者信息

Xu Xing, Wang Tan, Yang Yang, Hanjalic Alan, Shen Heng Tao

出版信息

IEEE Trans Neural Netw Learn Syst. 2021 Apr;32(4):1654-1667. doi: 10.1109/TNNLS.2020.2986029. Epub 2021 Apr 2.

DOI:10.1109/TNNLS.2020.2986029
PMID:32340964
Abstract

In this article, we address the problem of visual question generation (VQG), a challenge in which a computer is required to generate meaningful questions about an image targeting a given answer. The existing approaches typically treat the VQG task as a reversed visual question answer (VQA) task, requiring the exhaustive match among all the image regions and the given answer. To reduce the complexity, we propose an innovative answer-centric approach termed radial graph convolutional network (Radial-GCN) to focus on the relevant image regions only. Our Radial-GCN method can quickly find the core answer area in an image by matching the latent answer with the semantic labels learned from all image regions. Then, a novel sparse graph of the radial structure is naturally built to capture the associations between the core node (i.e., answer area) and peripheral nodes (i.e., other areas); the graphic attention is subsequently adopted to steer the convolutional propagation toward potentially more relevant nodes for final question generation. Extensive experiments on three benchmark data sets show the superiority of our approach compared with the reference methods. Even in the unexplored challenging zero-shot VQA task, the synthesized questions by our method remarkably boost the performance of several state-of-the-art VQA methods from 0% to over 40%. The implementation code of our proposed method and the successfully generated questions are available at https://github.com/Wangt-CN/VQG-GCN.

摘要

相似文献

1
Radial Graph Convolutional Network for Visual Question Generation.
IEEE Trans Neural Netw Learn Syst. 2021 Apr;32(4):1654-1667. doi: 10.1109/TNNLS.2020.2986029. Epub 2021 Apr 2.
2
Rich Visual Knowledge-Based Augmentation Network for Visual Question Answering.用于视觉问答的基于丰富视觉知识的增强网络
IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4362-4373. doi: 10.1109/TNNLS.2020.3017530. Epub 2021 Oct 5.
3
Interpretable medical image Visual Question Answering via multi-modal relationship graph learning.基于多模态关系图学习的可解释医学图像视觉问答。
Med Image Anal. 2024 Oct;97:103279. doi: 10.1016/j.media.2024.103279. Epub 2024 Jul 20.
4
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool.逆向视觉问答:一个新的基准和 VQA 诊断工具。
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):460-474. doi: 10.1109/TPAMI.2018.2880185. Epub 2018 Nov 9.
5
Ask Questions With Double Hints: Visual Question Generation With Answer-Awareness and Region-Reference.带有双重提示的提问:具有答案感知和区域参考的视觉问题生成
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):9648-9660. doi: 10.1109/TPAMI.2024.3425222. Epub 2024 Nov 6.
6
Exploring Duality in Visual Question-Driven Top-Down Saliency.探索视觉问题驱动的自上而下显著性中的二元性。
IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2672-2679. doi: 10.1109/TNNLS.2019.2933439. Epub 2019 Sep 2.
7
Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding.知识引导的视觉问题推理:深度表示嵌入面临的挑战
IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):2758-2767. doi: 10.1109/TNNLS.2020.3045034. Epub 2022 Jul 6.
8
CGUN-2A: Deep Graph Convolutional Network via Contrastive Learning for Large-Scale Zero-Shot Image Classification.CGUN-2A:基于对比学习的大规模零样本图像分类深度图卷积网络。
Sensors (Basel). 2022 Dec 18;22(24):9980. doi: 10.3390/s22249980.
9
Knowledge-Augmented Visual Question Answering With Natural Language Explanation.
IEEE Trans Image Process. 2024;33:2652-2664. doi: 10.1109/TIP.2024.3379900. Epub 2024 Apr 3.
10
Adversarial Learning With Multi-Modal Attention for Visual Question Answering.用于视觉问答的多模态注意力对抗学习
IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):3894-3908. doi: 10.1109/TNNLS.2020.3016083. Epub 2021 Aug 31.