Revisiting Local Descriptor based Image-to-Class Measure for Few-shot Learning

Li, Wenbin; Wang, Lei; Xu, Jinglin; Huo, Jing; Gao, Yang; Luo, Jiebo

arXiv:1903.12290 (cs)

[Submitted on 28 Mar 2019 (v1), last revised 10 Apr 2019 (this version, v2)]

Abstract: Few-shot learning in image classification aims to learn a classifier that can classify images when only a few training examples are available for each class. Recent work has achieved promising classification performance, usually with an image-level feature based measure. In this paper, we argue that a measure at such a level may not be effective enough given the scarcity of examples in few-shot learning. Instead, we think a local descriptor based image-to-class measure should be used, inspired by its surprising success in the heyday of local invariant features. Specifically, building upon the recent episodic training mechanism, we propose a Deep Nearest Neighbor Neural Network (DN4 for short) and train it in an end-to-end manner. Its key difference from the literature is that the image-level feature based measure in the final layer is replaced by a local descriptor based image-to-class measure. This measure is computed online via a $k$-nearest neighbor search over the deep local descriptors of convolutional feature maps. The proposed DN4 not only learns the optimal deep local descriptors for the image-to-class measure, but also exploits the higher efficiency of such a measure when examples are scarce, thanks to the exchangeability of visual patterns across images of the same class. Our work leads to a simple, effective, and computationally efficient framework for few-shot learning. Experiments on benchmark datasets consistently show its superiority over the related state-of-the-art, with the largest absolute improvement of $17\%$ over the next best. The source code is available from this https URL.
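The image-to-class measure described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes cosine similarity as the descriptor similarity, sums the top-$k$ similarities per query descriptor, and uses plain Python lists in place of convolutional feature maps. The names `cosine`, `image_to_class_score`, and `classify` are hypothetical and chosen only for illustration.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def image_to_class_score(query: List[Vector], class_pool: List[Vector], k: int = 1) -> float:
    """Score one query image against one class.

    `query` holds the local descriptors of the query image; `class_pool`
    holds the local descriptors pooled from all support images of the class.
    For each query descriptor, the k highest similarities to the pool are
    summed (assumption: cosine similarity, sum aggregation).
    """
    total = 0.0
    for q in query:
        sims = sorted((cosine(q, d) for d in class_pool), reverse=True)
        total += sum(sims[:k])
    return total


def classify(query: List[Vector], pools: Dict[str, List[Vector]], k: int = 1) -> str:
    """Return the class whose descriptor pool gives the highest score."""
    return max(pools, key=lambda c: image_to_class_score(query, pools[c], k))


# Toy 2-D descriptors; real descriptors would come from a CNN feature map.
pools = {
    "cat": [[1.0, 0.0], [0.9, 0.1]],
    "dog": [[0.0, 1.0], [0.1, 0.9]],
}
query = [[1.0, 0.1], [0.8, 0.0]]
print(classify(query, pools, k=1))  # cat
```

Because the class pool mixes descriptors from every support image of a class, a query descriptor can match a visual pattern from any of them, which is one way to read the "exchangeability of visual patterns" the abstract mentions.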

Comments:	accepted by CVPR 2019. The code link: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1903.12290 [cs.CV]
	(or arXiv:1903.12290v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1903.12290

Submission history

From: Wenbin Li
[v1] Thu, 28 Mar 2019 22:02:24 UTC (449 KB)
[v2] Wed, 10 Apr 2019 02:14:33 UTC (449 KB)
