RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model

Chen, Keyan; Liu, Chenyang; Chen, Hao; Zhang, Haotian; Li, Wenyuan; Zou, Zhengxia; Shi, Zhenwei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.16269 (cs)

[Submitted on 28 Jun 2023 (v1), last revised 29 Nov 2023 (this version, v2)]

Title:RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model

Authors:Keyan Chen, Chenyang Liu, Hao Chen, Haotian Zhang, Wenyuan Li, Zhengxia Zou, Zhenwei Shi

View PDF

Abstract:Leveraging the extensive training data from SA-1B, the Segment Anything Model (SAM) demonstrates remarkable generalization and zero-shot capabilities. However, as a category-agnostic instance segmentation method, SAM heavily relies on prior manual guidance, including points, boxes, and coarse-grained masks. Furthermore, its performance in remote sensing image segmentation tasks remains largely unexplored and unproven. In this paper, we aim to develop an automated instance segmentation approach for remote sensing images, based on the foundational SAM model and incorporating semantic category information. Drawing inspiration from prompt learning, we propose a method to learn the generation of appropriate prompts for SAM. This enables SAM to produce semantically discernible segmentation results for remote sensing images, a concept we have termed RSPrompter. We also propose several ongoing derivatives for instance segmentation tasks, drawing on recent advancements within the SAM community, and compare their performance with RSPrompter. Extensive experimental results, derived from the WHU building, NWPU VHR-10, and SSDD datasets, validate the effectiveness of our proposed method. The code for our method is publicly available at this http URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.16269 [cs.CV]
	(or arXiv:2306.16269v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.16269

Submission history

From: Keyan Chen [view email]
[v1] Wed, 28 Jun 2023 14:51:34 UTC (43,624 KB)
[v2] Wed, 29 Nov 2023 12:47:59 UTC (21,582 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators