2211.05778
Cited By
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
10 November 2022
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
Xizhou Zhu
Xiaowei Hu
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
Papers citing
"InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions"
50 / 321 papers shown
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection
Shizhou Zhang
Dexuan Kong
Yinghui Xing
Yue Lu
Lingyan Ran
Guoqiang Liang
Hexu Wang
Yanning Zhang
38
5
0
19 Sep 2024
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
63
5
0
15 Sep 2024
Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation
Hugo Porta
Emanuele Dalsasso
Diego Marcos
D. Tuia
95
0
0
14 Sep 2024
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage
Denis Zavadski
Damjan Kalšan
Carsten Rother
DiffM
MDE
28
5
0
13 Sep 2024
SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality
Chenyang Lei
Liyi Chen
Jun Cen
Xiao Chen
Zhen Lei
Felix Heide
Ziwei Liu
Qifeng Chen
Zhaoxiang Zhang
55
0
0
12 Sep 2024
UdeerLID+: Integrating LiDAR, Image, and Relative Depth with Semi-Supervised
Tao Ni
Xin Zhan
Tao Luo
Wenbin Liu
Zhan Shi
Junbo Chen
14
0
0
10 Sep 2024
ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions
Furqan Ahmed Shaik
Sandeep Nagar
Aiswarya Maturi
Harshit Kumar Sankhla
Dibyendu Ghosh
Anshuman Majumdar
Srikanth Vidapanakal
Kunal Chaudhary
Sunny Manchanda
Girish Varma
45
0
0
09 Sep 2024
UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images
Lulin Li
Ben Chen
Xuechao Zou
Junliang Xing
Pin Tao
Mamba
53
1
0
05 Sep 2024
AnyGraph: Graph Foundation Model in the Wild
Lianghao Xia
Chao Huang
OOD
42
11
0
20 Aug 2024
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks
Dongshuo Yin
Leiyi Hu
Bin Li
Youqun Zhang
Xue Yang
46
7
0
15 Aug 2024
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation
Beoungwoo Kang
Seunghun Moon
Yubin Cho
Hyunwoo Yu
Suk-Ju Kang
ViT
MedIm
32
8
0
14 Aug 2024
Segment Using Just One Example
Pratik Vora
Sudipan Saha
VLM
19
1
0
14 Aug 2024
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training
Zhuoyan Liu
Bo Wang
Ye Li
ViT
33
0
0
11 Aug 2024
SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving
Peiru Zheng
Yun Zhao
Zhan Gong
Hong Zhu
Shaohua Wu
MLLM
43
7
0
31 Jul 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
33
5
0
26 Jul 2024
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen
Han Ding
Bunyamin Sisman
Yi Tian Xu
Ouye Xie
Benjamin Z. Yao
Son Dinh Tran
Belinda Zeng
DiffM
45
4
0
24 Jul 2024
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation
Xiaoshuai Hao
Ruikai Li
Hui Zhang
Dingzhe Li
Rong Yin
Sangil Jung
Seungsang Park
ByungIn Yoo
Haimei Zhao
Jing Zhang
40
7
0
16 Jul 2024
Beyond Mask: Rethinking Guidance Types in Few-shot Segmentation
Shijie Chang
Youwei Pang
Xiaoqi Zhao
Lihe Zhang
Huchuan Lu
45
1
0
16 Jul 2024
Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding
Danish Nazir
Timo Bartels
Jan Piewek
Thorsten Bagdonat
Tim Fingscheidt
35
0
0
15 Jul 2024
DeepGate3: Towards Scalable Circuit Representation Learning
Zhengyuan Shi
Ziyang Zheng
Sadaf Khan
Qiang Xu
Min Li
Qiang Xu
GNN
AI4CE
49
9
0
15 Jul 2024
Visual Prompt Selection for In-Context Learning Segmentation
Wei Suo
Lanqing Lai
Mengyang Sun
Hanwang Zhang
Peng Wang
Yanning Zhang
VLM
55
3
0
14 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
34
4
0
14 Jul 2024
SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation
Lin Zhang
Wenbo Gao
Jie Yi
Yunyun Yang
46
0
0
14 Jul 2024
Introducing VaDA: Novel Image Segmentation Model for Maritime Object Segmentation Using New Dataset
Yongjin Kim
Jinbum Park
Sanha Kang
Hanguen Kim
45
1
0
12 Jul 2024
Open Panoramic Segmentation
Junwei Zheng
Ruiping Liu
Yufan Chen
Kunyu Peng
Chengzhi Wu
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
VLM
44
8
0
02 Jul 2024
VIPriors 4: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
Jan van Gemert
VLM
44
0
0
26 Jun 2024
Depth-Guided Semi-Supervised Instance Segmentation
Xin Chen
Jie Hu
Xiawu Zheng
Jianghang Lin
Liujuan Cao
Rongrong Ji
ISeg
3DV
55
1
0
25 Jun 2024
XAMI -- A Benchmark Dataset for Artefact Detection in XMM-Newton Optical Images
Elisabeta-Iulia Dima
Pablo Gómez
Sandor Kruk
Peter Kretschmar
Simon Rosen
Călin-Adrian Popa
45
0
0
25 Jun 2024
Speeding Up Image Classifiers with Little Companions
Yang Liu
Kowshik Thopalli
Jayaraman J. Thiagarajan
VLM
34
0
0
24 Jun 2024
UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery
Pengfei Zhang
Chang Li
Yongjun Zhang
Rongjun Qin
26
1
0
23 Jun 2024
Segmentation of Non-Small Cell Lung Carcinomas: Introducing DRU-Net and Multi-Lens Distortion
Soroush Oskouei
Marit Valla
André Pedersen
Erik Smistad
V. G. Dale
...
T. Langø
M. Ramnefjell
L. A. Akslen
Gabriel Kiss
H. Sorger
39
0
0
20 Jun 2024
LGmap: Local-to-Global Mapping Network for Online Long-Range Vectorized HD Map Construction
Kuang Wu
Sulei Nian
Can Shen
Chuan Yang
Zhanbin Li
27
3
0
20 Jun 2024
Beyond Visual Appearances: Privacy-sensitive Objects Identification via Hybrid Graph Reasoning
Zhuohang Jiang
Bingkui Tong
Xia Du
Ahmed Alhammadi
Jizhe Zhou
60
1
0
18 Jun 2024
Coarse-Fine Spectral-Aware Deformable Convolution For Hyperspectral Image Reconstruction
Jincheng Yang
Lishun Wang
Miao Cao
Huan Wang
Yinping Zhao
Xin Yuan
20
0
0
18 Jun 2024
Is Your HD Map Constructor Reliable under Sensor Corruptions?
Xiaoshuai Hao
Mengchuan Wei
Yifan Yang
Haimei Zhao
Hui Zhang
Yi Zhou
Qiang Wang
Weiming Li
Lingdong Kong
Jing Zhang
3DV
64
8
0
18 Jun 2024
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Di Wang
Meiqi Hu
Yao Jin
Yuchun Miao
Jiaqi Yang
...
Lefei Zhang
Chen Wu
Bo Du
Dacheng Tao
Liangpei Zhang
66
27
0
17 Jun 2024
PIG: Prompt Images Guidance for Night-Time Scene Parsing
Zhifeng Xie
Rui Qiu
Sen Wang
Xin Tan
Yuan Xie
Lizhuang Ma
48
2
0
15 Jun 2024
MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report
Zhongyu Yang
Mai Liu
Jinluo Xie
Yueming Zhang
Chen Shen
Wei Shao
Jichao Jiao
Tengfei Xing
Runbo Hu
Pengfei Xu
44
2
0
14 Jun 2024
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
59
337
0
13 Jun 2024
Enhancing Domain Adaptation through Prompt Gradient Alignment
Hoang Phan
Lam C. Tran
Quyen Tran
Trung Le
52
0
0
13 Jun 2024
RWKV-CLIP: A Robust Vision-Language Representation Learner
Tiancheng Gu
Kaicheng Yang
Xiang An
Ziyong Feng
Dongnan Liu
Weidong Cai
Jiankang Deng
VLM
CLIP
40
14
0
11 Jun 2024
Technical Report for CVPR 2024 WeatherProof Dataset Challenge: Semantic Segmentation on Paired Real Data
Guojin Cao
Jiaxu Li
Jia He
Ying Min
Yunhao Zhang
40
0
0
09 Jun 2024
Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation
Jun Yu
Yunxiang Zhang
Fengzhao Sun
Leilei Wang
Renjie Lu
46
0
0
09 Jun 2024
Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper
Chih-Kai Yang
Kuan Po Huang
Hung-yi Lee
48
3
0
09 Jun 2024
A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+
Jianzhao Wang
Yanyan Wei
Dehua Hu
Yilin Zhang
Shengeng Tang
Kun Li
Zhao Zhang
44
0
0
08 Jun 2024
Generalist Multimodal AI: A Review of Architectures, Challenges and Opportunities
Sai Munikoti
Ian Stewart
Sameera Horawalavithana
Henry Kvinge
Tegan H. Emerson
Sandra E Thompson
Karl Pazdernik
38
2
0
08 Jun 2024
Parameter-Inverted Image Pyramid Networks
Xizhou Zhu
Xue Yang
Zhaokai Wang
Hao Li
Wenhan Dou
Junqi Ge
Lewei Lu
Ping Luo
Jifeng Dai
54
0
0
06 Jun 2024
M3LEO: A Multi-Modal, Multi-Label Earth Observation Dataset Integrating Interferometric SAR and RGB Data
Matthew J Allen
Francisco Dorr
Joseph A. Gallego-Mejia
Laura Martínez-Ferrer
Anna Jungbluth
Freddie Kalaitzis
Raúl Ramos-Pollán
33
3
0
06 Jun 2024
Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge
Nan Zhang
Xidan Zhang
Jianing Wei
Fangjun Wang
Zhiming Tan
MDE
36
0
0
06 Jun 2024
GrootVL: Tree Topology is All You Need in State Space Model
Yicheng Xiao
Lin Song
Shaoli Huang
Jiangshan Wang
Siyu Song
Yixiao Ge
Xiu Li
Ying Shan
Mamba
47
11
0
04 Jun 2024