ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.11300
  4. Cited By
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large
  Vision-Language Model for Remote Sensing
v1v2v3v4v5 (latest)

RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing

20 June 2023
Zilun Zhang
Tiancheng Zhao
Yulong Guo
Yuxiang Cai
    DiffMVLM
ArXiv (abs)PDFHTMLGithub (260★)

Papers citing "RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing"

21 / 71 papers shown
Title
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLMCLIP
469
3,906
0
11 Feb 2021
ViLT: Vision-and-Language Transformer Without Convolution or Region
  Supervision
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
Wonjae Kim
Bokyung Son
Ildoo Kim
VLMCLIP
139
1,761
0
05 Feb 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Xiang Lisa Li
Percy Liang
252
4,305
0
01 Jan 2021
Geography-Aware Self-Supervised Learning
Geography-Aware Self-Supervised Learning
Kumar Ayush
Burak Uzkent
Chenlin Meng
Kumar Tanmay
Marshall Burke
David B. Lobell
Stefano Ermon
SSL
87
235
0
19 Nov 2020
The color out of space: learning self-supervised representations for
  Earth Observation imagery
The color out of space: learning self-supervised representations for Earth Observation imagery
Stefano Vincenzi
Angelo Porrello
Pietro Buzzega
Marco Cipriano
Pietro Fronte
Roberto Cuccu
C. Ippoliti
A. Conte
Simone Calderara
SSL
79
63
0
22 Jun 2020
AdapterFusion: Non-Destructive Task Composition for Transfer Learning
AdapterFusion: Non-Destructive Task Composition for Transfer Learning
Jonas Pfeiffer
Aishwarya Kamath
Andreas Rucklé
Kyunghyun Cho
Iryna Gurevych
CLLMoMe
158
859
0
01 May 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
...
Houdong Hu
Li Dong
Furu Wei
Yejin Choi
Jianfeng Gao
VLM
148
1,947
0
13 Apr 2020
RSVQA: Visual Question Answering for Remote Sensing Data
RSVQA: Visual Question Answering for Remote Sensing Data
Sylvain Lobry
Diego Marcos
J. Murray
D. Tuia
118
221
0
16 Mar 2020
Improved Baselines with Momentum Contrastive Learning
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
508
3,449
0
09 Mar 2020
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
Weijie Su
Xizhou Zhu
Yue Cao
Bin Li
Lewei Lu
Furu Wei
Jifeng Dai
VLMMLLMSSL
184
1,668
0
22 Aug 2019
GeoSQA: A Benchmark for Scenario-based Question Answering in the
  Geography Domain at High School Level
GeoSQA: A Benchmark for Scenario-based Question Answering in the Geography Domain at High School Level
Zixian Huang
Yulin Shen
Xiao Li
Yuang Wei
Gong Cheng
Lin Zhou
Xinyu Dai
Yuzhong Qu
RALMELM
50
24
0
20 Aug 2019
VisualBERT: A Simple and Performant Baseline for Vision and Language
VisualBERT: A Simple and Performant Baseline for Vision and Language
Liunian Harold Li
Mark Yatskar
Da Yin
Cho-Jui Hsieh
Kai-Wei Chang
VLM
155
1,967
0
09 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for
  Vision-and-Language Tasks
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSLVLM
255
3,699
0
06 Aug 2019
BigEarthNet: A Large-Scale Benchmark Archive For Remote Sensing Image
  Understanding
BigEarthNet: A Large-Scale Benchmark Archive For Remote Sensing Image Understanding
Gencer Sumbul
Marcela Charfuelan
Begüm Demir
Volker Markl
98
455
0
16 Feb 2019
Exploring Models and Data for Remote Sensing Image Caption Generation
Exploring Models and Data for Remote Sensing Image Caption Generation
Xiaoqiang Lu
Binqiang Wang
Xiangtao Zheng
Xuelong Li
63
477
0
21 Dec 2017
DOTA: A Large-scale Dataset for Object Detection in Aerial Images
DOTA: A Large-scale Dataset for Object Detection in Aerial Images
Gui-Song Xia
X. Bai
Jian Ding
Zhen Zhu
Serge J. Belongie
Jiebo Luo
Mihai Datcu
Marcello Pelillo
Liangpei Zhang
ObjD
129
2,189
0
28 Nov 2017
Functional Map of the World
Functional Map of the World
Gordon A. Christie
Neil Fendley
James Wilson
R. Mukherjee
VGen
82
399
0
21 Nov 2017
EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and
  Land Cover Classification
EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification
P. Helber
B. Bischke
Andreas Dengel
Damian Borth
158
1,834
0
31 Aug 2017
Remote Sensing Image Scene Classification: Benchmark and State of the
  Art
Remote Sensing Image Scene Classification: Benchmark and State of the Art
Gong Cheng
Junwei Han
Xiaoqiang Lu
108
2,269
0
01 Mar 2017
Billion-scale similarity search with GPUs
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
257
3,741
0
28 Feb 2017
AID: A Benchmark Dataset for Performance Evaluation of Aerial Scene
  Classification
AID: A Benchmark Dataset for Performance Evaluation of Aerial Scene Classification
Gui-Song Xia
Jingwen Hu
Fan Hu
Baoguang Shi
X. Bai
Yanfei Zhong
Liangpei Zhang
82
1,734
0
18 Aug 2016
Previous
12