2305.05726
Cited By
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
9 May 2023
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
Papers citing
"Vision-Language Models in Remote Sensing: Current Progress and Future Trends"
26 / 26 papers shown
Title
GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis
Angelos Zavras
Dimitrios Michail
Xiao Xiang Zhu
Begüm Demir
Ioannis Papoutsis
VLM
86
0
0
13 Feb 2025
Vision-Language Models for Edge Networks: A Comprehensive Survey
Ahmed Sharshar
Latif U. Khan
Waseem Ullah
Mohsen Guizani
VLM
70
3
0
11 Feb 2025
Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention
Shanwen Wang
Changrui Chen
Xin Sun
Danfeng Hong
Jungong Han
34
0
0
18 Jan 2025
Generalization-Enhanced Few-Shot Object Detection in Remote Sensing
Hui Lin
Nan Li
Pengjuan Yao
Kexin Dong
Yuhan Guo
Danfeng Hong
Yuhang Zhang
Congcong Wen
110
4
0
05 Jan 2025
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen
Deyao Zhu
Xiaoqian Shen
Xiang Li
Zechun Liu
Pengchuan Zhang
Raghuraman Krishnamoorthi
Vikas Chandra
Yunyang Xiong
Mohamed Elhoseiny
MLLM
160
441
0
14 Oct 2023
PatFig: Generating Short and Long Captions for Patent Figures
Dana Aubakirova
Kim Gerdes
Lufei Liu
17
9
0
15 Sep 2023
Geo-Information Harvesting from Social Media Data
Xiao Xiang Zhu
Yuanyuan Wang
M. Kochupillai
Martin Werner
Matthias Häberle
...
D. Tuia
Alex Levering
Nathan Jacobs
Anna M. Kruspe
Karam Abdulahhad
27
10
0
01 Nov 2022
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Yangfan Zhan
Zhitong Xiong
Yuan Yuan
71
106
0
23 Oct 2022
EarthNets: Empowering AI in Earth Observation
Zhitong Xiong
Fahong Zhang
Yi Wang
Yilei Shi
Xiao Xiang Zhu
86
73
0
10 Oct 2022
Language-aware Domain Generalization Network for Cross-Scene Hyperspectral Image Classification
Yuxiang Zhang
Mengmeng Zhang
Wei Li
Shuai Wang
Ran Tao
VLM
34
110
0
06 Sep 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,953
0
04 Mar 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
392
4,137
0
28 Jan 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
367
8,495
0
28 Jan 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,443
0
11 Nov 2021
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
336
2,267
0
02 Sep 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Nayeon Lee
Weicheng Kuo
Huayu Chen
VLM
ObjD
225
898
0
28 Apr 2021
Transformer in Transformer
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
287
1,524
0
27 Feb 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
277
3,623
0
24 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,781
0
24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
301
3,700
0
11 Feb 2021
RSVQA: Visual Question Answering for Remote Sensing Data
Sylvain Lobry
Diego Marcos
J. Murray
D. Tuia
72
205
0
16 Mar 2020
Frustratingly Simple Few-Shot Object Detection
Xin Wang
Thomas E. Huang
Trevor Darrell
Joseph E. Gonzalez
Fisher Yu
ObjD
95
544
0
16 Mar 2020
Meta R-CNN : Towards General Solver for Instance-level Few-shot Learning
Xiaopeng Yan
Ziliang Chen
Anni Xu
Xiaoxi Wang
Xiaodan Liang
Liang Lin
ObjD
160
446
0
28 Sep 2019
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,220
0
16 Nov 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
L. van der Maaten
Kilian Q. Weinberger
PINN
3DV
261
36,371
0
25 Aug 2016
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
242
31,257
0
16 Jan 2013