Galileo: Learning Global & Local Features of Many Remote Sensing Modalities

Tseng, Gabriel; Fuller, Anthony; Reil, Marlena; Herzog, Henry; Beukema, Patrick; Bastani, Favyen; Green, James R.; Shelhamer, Evan; Kerner, Hannah; Rolnick, David

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.09356 (cs)

[Submitted on 13 Feb 2025 (v1), last revised 4 Jun 2025 (this version, v3)]

Title:Galileo: Learning Global & Local Features of Many Remote Sensing Modalities

Authors:Gabriel Tseng, Anthony Fuller, Marlena Reil, Henry Herzog, Patrick Beukema, Favyen Bastani, James R. Green, Evan Shelhamer, Hannah Kerner, David Rolnick

View PDF

Abstract:We introduce a highly multimodal transformer to represent many remote sensing modalities - multispectral optical, synthetic aperture radar, elevation, weather, pseudo-labels, and more - across space and time. These inputs are useful for diverse remote sensing tasks, such as crop mapping and flood detection. However, learning shared representations of remote sensing data is challenging, given the diversity of relevant data modalities, and because objects of interest vary massively in scale, from small boats (1-2 pixels and fast) to glaciers (thousands of pixels and slow). We present a novel self-supervised learning algorithm that extracts multi-scale features across a flexible set of input modalities through masked modeling. Our dual global and local contrastive losses differ in their targets (deep representations vs. shallow input projections) and masking strategies (structured vs. not). Our Galileo is a single generalist model that outperforms SoTA specialist models for satellite images and pixel time series across eleven benchmarks and multiple tasks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.09356 [cs.CV]
	(or arXiv:2502.09356v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.09356

Submission history

From: Gabriel Tseng [view email]
[v1] Thu, 13 Feb 2025 14:21:03 UTC (1,649 KB)
[v2] Wed, 28 May 2025 09:46:10 UTC (3,764 KB)
[v3] Wed, 4 Jun 2025 14:07:47 UTC (3,764 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Galileo: Learning Global & Local Features of Many Remote Sensing Modalities

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Galileo: Learning Global & Local Features of Many Remote Sensing Modalities

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators