Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.12856
Cited By
SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing
20 December 2023
Zhecheng Wang
R. Prabha
Tianyuan Huang
Jiajun Wu
Ram Rajagopal
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"
15 / 15 papers shown
Title
GlobalGeoTree: A Multi-Granular Vision-Language Dataset for Global Tree Species Classification
Yang Mu
Zhitong Xiong
Yi Wang
Muhammad Shahzad
Franz Essl
Mark van Kleunen
Xiao Xiang Zhu
VLM
63
0
0
18 May 2025
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data
Jeremy Irvin
Emily Ruoyu Liu
Joyce Chuyi Chen
Ines Dormoy
Jinyoung Kim
Samar Khanna
Zhuo Zheng
Stefano Ermon
MLLM
VLM
103
8
0
28 Jan 2025
Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques
Lijie Tao
Han Zhang
Haizhao Jing
Yu Liu
Kelu Yao
Guoting Wei
Xizhe Xue
70
0
0
03 Jan 2025
GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding
Yimiao Zhou
Mengcheng Lan
Xiang Li
Yiping Ke
Yiping Ke
Xue Jiang
Qingyun Li
Xue Yang
Wayne Zhang
ObjD
VLM
170
6
0
16 Nov 2024
GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing
Hosam Elgendy
Ahmed Sharshar
Ahmed Aboeitta
Yasser Ashraf
Mohsen Guizani
52
2
0
25 Oct 2024
RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models
Junyao Ge
Xu Zhang
Yang Zheng
Kaitai Guo
Jimin Liang
70
2
0
27 Aug 2024
When and why vision-language models behave like bags-of-words, and what to do about it?
Mert Yuksekgonul
Federico Bianchi
Pratyusha Kalluri
Dan Jurafsky
James Zou
VLM
CoGe
54
384
0
04 Oct 2022
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Xi Chen
Tianlin Li
Soravit Changpinyo
A. Piergiovanni
Piotr Padlewski
...
Andreas Steiner
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
MLLM
VLM
63
704
0
14 Sep 2022
Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information
Zhiqiang Yuan
Wenkai Zhang
Changyuan Tian
Xuee Rong
Zhengyuan Zhang
Hongqi Wang
Kun Fu
Xian Sun
48
124
0
21 Apr 2022
ReforesTree: A Dataset for Estimating Tropical Forest Carbon Stock with Deep Learning and Aerial Imagery
Gyri Reiersen
David Dao
Björn Lütjens
Konstantin Klemmer
Kenza Amara
Attila Steinegger
Ce Zhang
Xiaoxia Zhu
92
32
0
26 Jan 2022
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
390
3,778
0
11 Feb 2021
Geography-Aware Self-Supervised Learning
Kumar Ayush
Burak Uzkent
Chenlin Meng
Kumar Tanmay
Marshall Burke
David B. Lobell
Stefano Ermon
SSL
61
231
0
19 Nov 2020
Learning Visual Representations with Caption Annotations
Mert Bulent Sariyildiz
J. Perez
Diane Larlus
VLM
SSL
66
159
0
04 Aug 2020
BigEarthNet: A Large-Scale Benchmark Archive For Remote Sensing Image Understanding
Gencer Sumbul
Marcela Charfuelan
Begüm Demir
Volker Markl
71
447
0
16 Feb 2019
Tile2Vec: Unsupervised representation learning for spatially distributed data
Neal Jean
Sherrie Wang
Anshul Samar
G. Azzari
David B. Lobell
Stefano Ermon
SSL
55
197
0
08 May 2018
1