A Survey on Remote Sensing Foundation Models: From Vision to Multimodality

28 March 2025
Ziyue Huang
Hongxi Yan
Qiqi Zhan
Shuai Yang
Mingming Zhang
Yiming Lei
Chenkai Zhang
Zeming Liu
Qingjie Liu
Yunhong Wang
arXiv: 2503.22081

Papers citing "A Survey on Remote Sensing Foundation Models: From Vision to Multimodality"

46 / 96 papers shown
Title
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks
Xizhou Zhu
Jinguo Zhu
Hao Li
Xiaoshi Wu
Xiaogang Wang
Hongsheng Li
Xiaohua Wang
Jifeng Dai
122
133
0
02 Dec 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT TPM
485
7,837
0
11 Nov 2021
LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs
Christoph Schuhmann
Richard Vencu
Romain Beaumont
R. Kaczmarczyk
Clayton Mullis
Aarush Katta
Theo Coombes
J. Jitsev
Aran Komatsuzaki
VLM MLLM CLIP
243
1,444
0
03 Nov 2021
LoveDA: A Remote Sensing Land-Cover Dataset for Domain Adaptive Semantic Segmentation
Junjue Wang
Zhuo Zheng
A. Ma
Xiaoyan Lu
Yanfei Zhong
106
341
0
17 Oct 2021
Self-supervised Learning is More Robust to Dataset Imbalance
Hong Liu
Jeff Z. HaoChen
Adrien Gaidon
Tengyu Ma
OOD SSL
69
167
0
11 Oct 2021
Deep Long-Tailed Learning: A Survey
Yifan Zhang
Bingyi Kang
Bryan Hooi
Shuicheng Yan
Jiashi Feng
VLM
113
589
0
09 Oct 2021
Geographical Knowledge-driven Representation Learning for Remote Sensing Images
Wenyuan Li
Keyan Chen
Hao Chen
Zhenwei Shi
SSL
77
69
0
12 Jul 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL AI4TS AI4CE ALM AIMat
511
10,563
0
17 Jun 2021
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
300
2,848
0
15 Jun 2021
BigEarthNet-MM: A Large Scale Multi-Modal Multi-Label Benchmark Archive for Remote Sensing Image Classification and Retrieval
Gencer Sumbul
Arne de Wall
Tristan Kreuziger
F. Marcelino
H. Costa
P. Benevides
M. Caetano
Begüm Demir
Volker Markl
98
133
0
17 May 2021
Self-Supervised Learning of Remote Sensing Scene Representations Using Contrastive Multiview Coding
Vladan Stojnić
Vladimir Risojević
SSL
83
112
0
14 Apr 2021
Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data
Oscar Manas
Alexandre Lacoste
Xavier Giró-i-Nieto
David Vazquez
Pau Rodríguez López
95
266
0
30 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
470
21,656
0
25 Mar 2021
FAIR1M: A Benchmark Dataset for Fine-grained Object Recognition in High-Resolution Remote Sensing Imagery
Xian Sun
Peijin Wang
Zhiyuan Yan
F. Xu
Ruiping Wang
...
Tao Xu
M. Weinmann
Stefan Hinz
Cheng Wang
Kun Fu
ObjD AI4TS
74
370
0
09 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP VLM
1.0K
29,926
0
26 Feb 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
545
3,742
0
24 Feb 2021
Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss
Xue Yang
Junchi Yan
Qi Ming
Wentao Wang
Xiaopeng Zhang
Qi Tian
194
410
0
28 Jan 2021
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding
Maryam Rahnemoonfar
Tashnim Chowdhury
Argho Sarkar
D. Varshney
M. Yari
Robin Murphy
79
257
0
05 Dec 2020
Geography-Aware Self-Supervised Learning
Kumar Ayush
Burak Uzkent
Chenlin Meng
Kumar Tanmay
Marshall Burke
David B. Lobell
Stefano Ermon
SSL
91
235
0
19 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
684
41,563
0
22 Oct 2020
Remote Sensing Image Scene Classification with Self-Supervised Paradigm under Limited Labeled Samples
Chao Tao
Ji Qi
Weipeng Lu
Hao Wang
Haifeng Li
SSL
77
104
0
02 Oct 2020
Lightweight Temporal Self-Attention for Classifying Satellite Image Time Series
Vivien Sainte Fare Garnot
Loic Landrieu
64
79
0
01 Jul 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
423
6,849
0
13 Jun 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
908
42,520
0
28 May 2020
RSVQA: Visual Question Answering for Remote Sensing Data
Sylvain Lobry
Diego Marcos
J. Murray
D. Tuia
118
221
0
16 Mar 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
398
18,913
0
13 Feb 2020
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
216
12,146
0
13 Nov 2019
Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark
Ke Li
G. Wan
Gong Cheng
L. Meng
Junwei Han
79
1,463
0
31 Aug 2019
SEN12MS -- A Curated Dataset of Georeferenced Multi-Spectral Sentinel-1/2 Imagery for Deep Learning and Data Fusion
M. Schmitt
Lloyd H. Hughes
C. Qiu
Xiaoxiang Zhu
70
260
0
18 Jun 2019
iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images
Syed Waqas Zamir
Aditya Arora
Akshita Gupta
Salman Khan
Guolei Sun
Fahad Shahbaz Khan
Fan Zhu
Ling Shao
Guisong Xia
X. Bai
SSeg VLM
78
348
0
30 May 2019
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
Chris Choy
JunYoung Gwak
Silvio Savarese
3DPC
179
1,796
0
18 Apr 2019
BigEarthNet: A Large-Scale Benchmark Archive For Remote Sensing Image Understanding
Gencer Sumbul
Marcela Charfuelan
Begüm Demir
Volker Markl
101
455
0
16 Feb 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM SSL SSeg
1.8K
95,324
0
11 Oct 2018
SpaceNet: A Remote Sensing Dataset and Challenge Series
A. V. Etten
David Lindenbaum
Todd M. Bacastow
84
394
0
03 Jul 2018
xView: Objects in Context in Overhead Imagery
Darius Lam
Richard Kuzma
Kevin McGee
Samuel Dooley
Michael Laielli
Matthew K. Klaric
Yaroslav Bulatov
Brendan McCord
ObjD
55
323
0
22 Feb 2018
Exploring Models and Data for Remote Sensing Image Caption Generation
Xiaoqiang Lu
Binqiang Wang
Xiangtao Zheng
Xuelong Li
63
477
0
21 Dec 2017
DOTA: A Large-scale Dataset for Object Detection in Aerial Images
Gui-Song Xia
X. Bai
Jian Ding
Zhen Zhu
Serge J. Belongie
Jiebo Luo
Mihai Datcu
Marcello Pelillo
Liangpei Zhang
ObjD
129
2,189
0
28 Nov 2017
Functional Map of the World
Gordon A. Christie
Neil Fendley
James Wilson
R. Mukherjee
VGen
85
399
0
21 Nov 2017
EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification
P. Helber
B. Bischke
Andreas Dengel
Damian Borth
158
1,834
0
31 Aug 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
819
132,725
0
12 Jun 2017
PatternNet: A Benchmark Dataset for Performance Evaluation of Remote Sensing Image Retrieval
Weixun Zhou
Shawn D. Newsam
Congmin Li
Z. Shao
160
461
0
11 Jun 2017
Remote Sensing Image Scene Classification: Benchmark and State of the Art
Gong Cheng
Junwei Han
Xiaoqiang Lu
108
2,269
0
01 Mar 2017
AID: A Benchmark Dataset for Performance Evaluation of Aerial Scene Classification
Gui-Song Xia
Jingwen Hu
Fan Hu
Baoguang Shi
X. Bai
Yanfei Zhong
Liangpei Zhang
82
1,734
0
18 Aug 2016
Recent Advances in Convolutional Neural Networks
Jiuxiang Gu
Zhenhua Wang
Jason Kuen
Lianyang Ma
Amir Shahroudy
...
Xingxing Wang
Li Wang
Gang Wang
Jianfei Cai
Tsuhan Chen
245
5,239
0
22 Dec 2015
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,641
0
10 Dec 2015
Microsoft COCO Captions: Data Collection and Evaluation Server
Xinlei Chen
Hao Fang
Tsung-Yi Lin
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollár
C. L. Zitnick
242
2,497
0
01 Apr 2015