Object and Text-guided Semantics for CNN-based Activity Recognition

Eum, Sungmin; Reale, Christopher; Kwon, Heesung; Bonial, Claire; Voss, Clare

arXiv:1805.01818 (cs)

[Submitted on 4 May 2018]

Abstract: Many previous methods have demonstrated the importance of considering semantically relevant objects in video-based human activity recognition, yet none has harnessed large text corpora to relate objects to activities and carried that knowledge into the training of a unified deep convolutional neural network. We present a novel activity recognition CNN that co-learns the object recognition task in an end-to-end multitask learning scheme, improving on baseline activity recognition performance. We further improve on the multitask learning approach by exploiting a text-guided semantic space to select the objects most relevant to the target activities. To the best of our knowledge, we are the first to investigate this approach.
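The two ideas in the abstract, text-guided object selection and a joint activity/object objective, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, since the abstract does not give details: the toy 3-dimensional vectors stand in for embeddings learned from a text corpus, cosine similarity stands in for whatever relevance measure the semantic space provides, and the weighted sum of two cross-entropy terms (with the hypothetical parameter `object_weight`) is one common way to write a multitask loss, not necessarily the authors' formulation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def select_relevant_objects(activity_vecs, object_vecs, top_k):
    """For each activity, keep the top_k objects closest to it in the (assumed) text space."""
    selected = set()
    for act_vec in activity_vecs.values():
        ranked = sorted(
            object_vecs,
            key=lambda name: cosine(act_vec, object_vecs[name]),
            reverse=True,
        )
        selected.update(ranked[:top_k])
    return sorted(selected)


def cross_entropy(probs, target_index):
    """Negative log-likelihood of the target class under a probability vector."""
    return -math.log(probs[target_index])


def multitask_loss(activity_probs, activity_label,
                   object_probs, object_label, object_weight=0.5):
    """Assumed joint objective: activity loss plus a weighted object loss."""
    return (cross_entropy(activity_probs, activity_label)
            + object_weight * cross_entropy(object_probs, object_label))


# Toy, hand-made vectors (hypothetical; real ones would come from a text corpus).
ACTIVITY_VECS = {
    "playing_guitar": [0.9, 0.1, 0.0],
    "riding_horse": [0.0, 0.2, 0.9],
}
OBJECT_VECS = {
    "guitar": [1.0, 0.0, 0.1],
    "saddle": [0.1, 0.1, 1.0],
    "toaster": [0.0, 1.0, 0.0],
}

relevant = select_relevant_objects(ACTIVITY_VECS, OBJECT_VECS, top_k=1)
print(relevant)
```

In this sketch only the selected objects would be used as targets for the object-recognition task, so an unrelated class such as `toaster` does not contribute to the shared features.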

Comments:	Submitted to ICIP 2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1805.01818 [cs.CV]
	(or arXiv:1805.01818v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1805.01818

Submission history

From: Sungmin Eum
[v1] Fri, 4 May 2018 15:09:48 UTC (1,067 KB)
