Progressive Attention Memory Network for Movie Story Question Answering

Kim, Junyeong; Ma, Minuk; Kim, Kyungsu; Kim, Sungjin; Yoo, Chang D.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1904.08607 (cs)

[Submitted on 18 Apr 2019]

Title:Progressive Attention Memory Network for Movie Story Question Answering

Authors:Junyeong Kim, Minuk Ma, Kyungsu Kim, Sungjin Kim, Chang D. Yoo

View PDF

Abstract:This paper proposes the progressive attention memory network (PAMN) for movie story question answering (QA). Movie story QA is challenging compared to VQA in two aspects: (1) pinpointing the temporal parts relevant to answer the question is difficult as the movies are typically longer than an hour, (2) it has both video and subtitle where different questions require different modality to infer the answer. To overcome these challenges, PAMN involves three main features: (1) progressive attention mechanism that utilizes cues from both question and answer to progressively prune out irrelevant temporal parts in memory, (2) dynamic modality fusion that adaptively determines the contribution of each modality for answering the current question, and (3) belief correction answering scheme that successively corrects the prediction score on each candidate answer. Experiments on publicly available benchmark datasets, MovieQA and TVQA, demonstrate that each feature contributes to our movie story QA architecture, PAMN, and improves performance to achieve the state-of-the-art result. Qualitative analysis by visualizing the inference mechanism of PAMN is also provided.

Comments:	CVPR 2019, Accepted
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1904.08607 [cs.CV]
	(or arXiv:1904.08607v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1904.08607

Submission history

From: Junyeong Kim [view email]
[v1] Thu, 18 Apr 2019 06:52:17 UTC (6,951 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Progressive Attention Memory Network for Movie Story Question Answering

Submission history

Access Paper:

Current browse context:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Progressive Attention Memory Network for Movie Story Question Answering

Submission history

Access Paper:

Current browse context:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators