CNNs with Multi-Level Attention for Domain Generalization

Ballas, Aristotelis; Diou, Christos

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.00502 (cs)

[Submitted on 2 Apr 2023]

Title:CNNs with Multi-Level Attention for Domain Generalization

Authors:Aristotelis Ballas, Christos Diou

View PDF

Abstract:In the past decade, deep convolutional neural networks have achieved significant success in image classification and ranking and have therefore found numerous applications in multimedia content retrieval. Still, these models suffer from performance degradation when neural networks are tested on out-of-distribution scenarios or on data originating from previously unseen data Domains. In the present work, we focus on this problem of Domain Generalization and propose an alternative neural network architecture for robust, out-of-distribution image classification. We attempt to produce a model that focuses on the causal features of the depicted class for robust image classification in the Domain Generalization setting. To achieve this, we propose attending to multiple-levels of information throughout a Convolutional Neural Network and leveraging the most important attributes of an image by employing trainable attention mechanisms. To validate our method, we evaluate our model on four widely accepted Domain Generalization benchmarks, on which our model is able to surpass previously reported baselines in three out of four datasets and achieve the second best score in the fourth one.

Comments:	Accepted for publication in ICMR '23 (ACM International Conference on Multimedia Retrieval). This is a preprint of the final version
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2304.00502 [cs.CV]
	(or arXiv:2304.00502v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.00502

Submission history

From: Aristotelis Ballas [view email]
[v1] Sun, 2 Apr 2023 10:34:40 UTC (8,847 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CNNs with Multi-Level Attention for Domain Generalization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CNNs with Multi-Level Attention for Domain Generalization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators