Ensembles of Deep Neural Networks for Action Recognition in Still Images

Mohammadi, Sina; Majelan, Sina Ghofrani; Shokouhi, Shahriar B.

doi:10.1109/ICCKE48569.2019.8965014

arXiv:2003.09893 (cs)

[Submitted on 22 Mar 2020]

Abstract: Despite notable recent improvements in feature extraction and classification, human action recognition remains challenging, especially in still images, which, unlike videos, contain no motion. Methods proposed for recognizing human actions in videos therefore cannot be applied to still images. A major challenge in still-image action recognition is the lack of sufficiently large datasets, which makes deep Convolutional Neural Networks (CNNs) prone to overfitting during training. In this paper, we take advantage of pre-trained CNNs and employ transfer learning to address the lack of massive labeled action recognition datasets. Furthermore, since the last layer of the CNN carries class-specific information, we apply an attention mechanism to the output feature maps of the CNN to extract more discriminative and powerful features for classifying human actions. We also use eight different pre-trained CNNs in our framework and investigate their performance on the Stanford 40 dataset. Finally, we propose using ensemble learning to enhance the overall accuracy of action classification by combining the predictions of multiple models. The best setting of our method achieves 93.17% accuracy on the Stanford 40 dataset.
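
The abstract describes combining the predictions of multiple models but does not say how they are combined. The following is a minimal sketch of one common choice, averaging the per-class softmax probabilities of each model and taking the arg max. The averaging rule, the function names `softmax` and `ensemble_predict`, and the toy logits are assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def ensemble_predict(per_model_logits):
    """Average class probabilities over models (assumed combination rule).

    per_model_logits: list of arrays, each of shape (num_images, num_classes).
    Returns the predicted class index per image and the averaged probabilities.
    """
    probs = np.stack([softmax(l) for l in per_model_logits], axis=0)
    mean_probs = probs.mean(axis=0)  # shape: (num_images, num_classes)
    return mean_probs.argmax(axis=-1), mean_probs


# Toy data (hypothetical): 3 models, 2 images, 4 classes.
logits_a = np.array([[2.0, 0.5, 0.1, 0.0], [0.0, 0.2, 0.1, 3.0]])
logits_b = np.array([[0.3, 1.9, 0.2, 0.1], [0.1, 0.0, 0.4, 2.5]])
logits_c = np.array([[2.2, 0.4, 0.0, 0.3], [0.2, 0.1, 0.0, 1.8]])

labels, mean_probs = ensemble_predict([logits_a, logits_b, logits_c])
print(labels)
```

In this toy example the second model disagrees with the other two on the first image, and the averaged probabilities still favor the class chosen by the majority. Other combination rules, such as weighted averaging or majority voting, would fit the same interface.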

Comments:	5 pages, 2 figures, 3 tables, Accepted by ICCKE 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2003.09893 [cs.CV]
	(or arXiv:2003.09893v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2003.09893
Journal reference:	2019 9th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, Iran, 2019, pp. 315-318
Related DOI:	https://doi.org/10.1109/ICCKE48569.2019.8965014

Submission history

From: Sina Mohammadi
[v1] Sun, 22 Mar 2020 13:44:09 UTC (811 KB)
