Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105525
PIRA download icon_1.1View/Download Full Text
Title: Aligned dual channel graph convolutional network for visual question answering
Authors: Huang, Q
Wei, J
Cai, Y
Zheng, C
Chen, J
Leung, HF
Li, Q 
Issue Date: 2020
Source: In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, p. 7166-7176. Stroudsburg, PA, USA: Association for Computational Linguistics (ACL), 2020
Abstract: Visual question answering aims to answer the natural language question about a given image. Existing graph-based methods only focus on the relations between objects in an image and neglect the importance of the syntactic dependency relations between words in a question. To simultaneously capture the relations between objects in an image and the syntactic dependency relations between words in a question, we propose a novel dual channel graph convolutional network (DC-GCN) for better combining visual and textual advantages. The DC-GCN model consists of three parts: an I-GCN module to capture the relations between objects in an image, a Q-GCN module to capture the syntactic dependency relations between words in a question, and an attention alignment module to align image representations and question representations. Experimental results show that our model achieves comparable performance with the state-of-the-art approaches.
Publisher: Association for Computational Linguistics (ACL)
ISBN: 978-1-952148-25-5
DOI: 10.18653/v1/2020.acl-main.642
Description: 58th Annual Meeting of the Association for Computational Linguistics, Online, July 5th-10th, 2020
Rights: © 2020 Association for Computational Linguistics
This publication is licensed on a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/)
The following publication Qingbao Huang, Jielong Wei, Yi Cai, Changmeng Zheng, Junying Chen, Ho-fung Leung, and Qing Li. 2020. Aligned Dual Channel Graph Convolutional Network for Visual Question Answering. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7166–7176, Online. Association for Computational Linguistics is available at https://doi.org/10.18653/v1/2020.acl-main.642.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
2020.acl-main.642.pdf3.81 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

98
Last Week
3
Last month
Citations as of Nov 30, 2025

Downloads

24
Citations as of Nov 30, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.