NCAP: Scene Text Image Super-Resolution with Non-CAtegorical Prior

Park, Dongwoo; Ko, Suk Pil


arXiv:2504.00410 (cs)

[Submitted on 1 Apr 2025]



Abstract: Scene text image super-resolution (STISR) enhances the resolution and quality of low-resolution scene text images. Unlike earlier studies that treated scene text images as natural images, recent methods that use a text prior (TP) extracted from a pre-trained text recognizer have shown strong performance. However, two major issues emerge. (1) Explicit categorical priors such as TP can harm STISR when they are incorrect. We show that these explicit priors are unstable and propose replacing them with a Non-CAtegorical Prior (NCAP) built from penultimate-layer representations. (2) Pre-trained recognizers used to generate TP struggle with low-resolution images. To address this, most studies jointly train the recognizer with the STISR network to bridge the domain gap between low- and high-resolution images, but this can cause an overconfidence phenomenon in the prior modality. We highlight this issue and propose mitigating it by mixing hard and soft labels. Experiments on the TextZoom dataset demonstrate an improvement of 3.5%, while our method significantly enhances generalization performance by 14.8% across four text recognition datasets. Our method generalizes to all TP-guided STISR networks.
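The abstract does not state how hard and soft labels are mixed, so the following is only a minimal sketch of one common way to do it, not the authors' formulation. It assumes a convex combination of a one-hot (hard) label and a soft distribution, with a hypothetical mixing weight `alpha`; the function names, the logits, and the choice of a softmax output as the soft label are all illustrative assumptions.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def mix_labels(hard_index, soft_probs, alpha):
    """Blend a one-hot label with a soft distribution.

    alpha is a hypothetical mixing weight (an assumption, not from the paper):
    alpha = 1.0 keeps only the hard label, alpha = 0.0 keeps only the soft one.
    """
    return [
        alpha * (1.0 if i == hard_index else 0.0) + (1.0 - alpha) * p
        for i, p in enumerate(soft_probs)
    ]

def cross_entropy(target, logits):
    """Cross-entropy between a target distribution and predicted logits."""
    probs = softmax(logits)
    return -sum(t * math.log(p) for t, p in zip(target, probs) if t > 0.0)

# Illustrative values only: three character classes, ground truth is class 0.
soft_logits = [2.0, 1.0, 0.1]       # assumed source of the soft label
predicted_logits = [1.5, 0.5, 0.2]  # assumed recognizer output being trained

soft = softmax(soft_logits)
target = mix_labels(hard_index=0, soft_probs=soft, alpha=0.5)
loss = cross_entropy(target, predicted_logits)
```

Because the mixed target still sums to one but is no longer one-hot, the loss stops rewarding the recognizer for pushing a single class probability all the way to 1, which is the general intuition behind using softened targets against overconfidence.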

Comments:	WACV 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.00410 [cs.CV]
	(or arXiv:2504.00410v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.00410

Submission history

From: Dongwoo Park
[v1] Tue, 1 Apr 2025 04:14:07 UTC (8,675 KB)
