arXiv:2111.09883
Swin Transformer V2: Scaling Up Capacity and Resolution
18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng Zhang
Li Dong
Furu Wei
Baining Guo
ViT
Papers citing
"Swin Transformer V2: Scaling Up Capacity and Resolution"
50 / 824 papers shown
Title
Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection
Jiangning Zhang
Xuhai Chen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
Ming-Hsuan Yang
Dacheng Tao
33
24
0
12 Dec 2023
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection
Joonhyun Jeong
Geondo Park
Jayeon Yoo
Hyungsik Jung
Heesu Kim
VLM
ObjD
43
10
0
12 Dec 2023
Adjustable Robust Transformer for High Myopia Screening in Optical Coherence Tomography
Xiao Ma
Zetian Zhang
Zexuan Ji
Kun Huang
Na Su
Songtao Yuan
Qiang Chen
33
1
0
12 Dec 2023
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan
Jiageng Zhang
A. Taalimi
Fan Yang
Xingyi Zhou
Chaochao Yan
Liang-Chieh Chen
Yeqing Li
ViT
33
5
0
11 Dec 2023
Transformer-based Selective Super-Resolution for Efficient Image Refinement
Tianyi Zhang
Kishore Kasichainula
Yaoxin Zhuo
Baoxin Li
Jae-sun Seo
Yu Cao
28
7
0
10 Dec 2023
The Counterattack of CNNs in Self-Supervised Learning: Larger Kernel Size might be All You Need
Tianjin Huang
Tianlong Chen
Zhangyang Wang
Shiwei Liu
34
1
0
09 Dec 2023
Model Evaluation for Domain Identification of Unknown Classes in Open-World Recognition: A Proposal
Gusti Ahmad Fanshuri Alfarisy
O. A. Malik
Ong Wee Hong
16
0
0
09 Dec 2023
Vision-based Learning for Drones: A Survey
Jiaping Xiao
Rangya Zhang
Yuhang Zhang
Mir Feroskhan
37
4
0
08 Dec 2023
Moirai: Towards Optimal Placement for Distributed Inference on Heterogeneous Devices
Beibei Zhang
Hongwei Zhu
Feng Gao
Zhihui Yang
Xiaoyang Sean Wang
29
1
0
07 Dec 2023
Scaling transformer neural networks for skillful and reliable medium-range weather forecasting
Tung Nguyen
Rohan Shah
Hritik Bansal
T. Arcomano
Sandeep Madireddy
R. Maulik
V. Kotamarthi
Ian Foster
Aditya Grover
AI4TS
19
59
0
06 Dec 2023
On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm
Peng Sun
Bei Shi
Daiwei Yu
Tao Lin
DD
31
40
0
06 Dec 2023
UPOCR: Towards Unified Pixel-Level OCR Interface
Dezhi Peng
Zhenhua Yang
Jiaxin Zhang
Chongyu Liu
Yongxin Shi
Kai Ding
Fengjun Guo
Lianwen Jin
41
10
0
05 Dec 2023
Simplifying Neural Network Training Under Class Imbalance
Ravid Shwartz-Ziv
Micah Goldblum
Yucen Lily Li
C. Bayan Bruss
Andrew Gordon Wilson
36
14
0
05 Dec 2023
Rejuvenating image-GPT as Strong Visual Representation Learners
Sucheng Ren
Zeyu Wang
Hongru Zhu
Junfei Xiao
Alan Yuille
Cihang Xie
VLM
57
7
0
04 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang
Jieru Mei
Alan Yuille
VLM
35
55
0
04 Dec 2023
Improving Normalization with the James-Stein Estimator
Seyedalireza Khoshsirat
Chandra Kambhamettu
26
5
0
01 Dec 2023
TeG-DG: Textually Guided Domain Generalization for Face Anti-Spoofing
Lianrui Mu
Jianhong Bai
Xiaoxuan He
Jiangnan Ye
Xiaoyu Liang
Yuchen Yang
Jiedong Zhuang
Haoji Hu
29
2
0
30 Nov 2023
LLVMs4Protest: Harnessing the Power of Large Language and Vision Models for Deciphering Protests in the News
Yongjun Zhang
16
0
0
30 Nov 2023
A Graph-Based Approach for Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
44
10
0
29 Nov 2023
PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with Confidence-Level Prediction and Pose Tokens
Sebastian Stapf
Tobias Bauernfeind
Marco Riboldi
ViT
25
1
0
29 Nov 2023
Group-wise Sparse and Explainable Adversarial Attacks
Shpresim Sadiku
Moritz Wagner
Sebastian Pokutta
AAML
15
0
0
29 Nov 2023
LEOD: Label-Efficient Object Detection for Event Cameras
Ziyi Wu
Mathias Gehrig
Qing Lyu
Xudong Liu
Igor Gilitschenski
42
14
0
29 Nov 2023
PHG-Net: Persistent Homology Guided Medical Image Classification
Yao Peng
Hongxiao Wang
Milan Sonka
Danny Z. Chen
14
3
0
28 Nov 2023
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
Dai Shi
ViT
23
77
0
28 Nov 2023
DyRA: Portable Dynamic Resolution Adjustment Network for Existing Detectors
Daeun Seo
Hoeseok Yang
Hyungshin Kim
43
0
0
28 Nov 2023
Optimal Transport Aggregation for Visual Place Recognition
Sergio Izquierdo
Javier Civera
OT
43
62
0
27 Nov 2023
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Xiaohan Ding
Yiyuan Zhang
Yixiao Ge
Sijie Zhao
Lin Song
Xiangyu Yue
Ying Shan
VLM
AI4TS
SSL
29
104
0
27 Nov 2023
Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search
Zhiqi Lin
Youshan Miao
Guanbin Xu
Cheng Li
Olli Saarikivi
Saeed Maleki
Fan Yang
25
6
0
26 Nov 2023
CalibFormer: A Transformer-based Automatic LiDAR-Camera Calibration Network
Yuxuan Xiao
Yao Li
Chengzhen Meng
Xingchen Li
Jianmin Ji
Yanyong Zhang
29
9
0
26 Nov 2023
Advancing Vision Transformers with Group-Mix Attention
Chongjian Ge
Xiaohan Ding
Zhan Tong
Li Yuan
Jiangliu Wang
Yibing Song
Ping Luo
112
16
0
26 Nov 2023
Occlusion Sensitivity Analysis with Augmentation Subspace Perturbation in Deep Feature Space
Pedro Valois
Koichiro Niinuma
Kazuhiro Fukui
AAML
32
4
0
25 Nov 2023
Adapter is All You Need for Tuning Visual Tasks
Dongshuo Yin
Leiyi Hu
Bin Li
Youqun Zhang
18
15
0
25 Nov 2023
Hardware Resilience Properties of Text-Guided Image Classifiers
Syed Talal Wasim
Kabila Haile Soboka
Abdulrahman Mahmoud
Salman Khan
David Brooks
Gu-Yeon Wei
VLM
27
1
0
23 Nov 2023
T-Rex: Counting by Visual Prompting
Qing Jiang
Feng Li
Tianhe Ren
Shilong Liu
Zhaoyang Zeng
Kent Yu
Lei Zhang
24
12
0
22 Nov 2023
Learning to Optimise Wind Farms with Graph Transformers
Siyi Li
Arnaud Robert
A. A. Faisal
M. Piggott
29
5
0
21 Nov 2023
PMP-Swin: Multi-Scale Patch Message Passing Swin Transformer for Retinal Disease Classification
Zhihan Yang
Zhiming Cheng
Tengjin Weng
Shucheng He
Yaqi Wang
Xin Ye
Shuai Wang
MedIm
11
1
0
20 Nov 2023
CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement
Boni Hu
Lin Chen
Runjian Chen
Shuhui Bu
Pengcheng Han
Haowei Li
21
1
0
20 Nov 2023
SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints
Aditya Nalgunda Ganesh
ViT
21
1
0
19 Nov 2023
Benchmarking Feature Extractors for Reinforcement Learning-Based Semiconductor Defect Localization
Enrique Dehaerne
Bappaditya Dey
Sandip Halder
S. de Gendt
30
1
0
18 Nov 2023
Deep Tensor Network
Yifan Zhang
37
0
0
18 Nov 2023
Controlling the Output of a Generative Model by Latent Feature Vector Shifting
Róbert Belanec
Peter Lacko
Kristína Malinovská
22
1
0
15 Nov 2023
SparseSpikformer: A Co-Design Framework for Token and Weight Pruning in Spiking Transformer
Yue Liu
Shanlin Xiao
Bo Li
Zhiyi Yu
45
3
0
15 Nov 2023
Improved Dense Nested Attention Network Based on Transformer for Infrared Small Target Detection
Chun Bao
Jie Cao
Yaqian Ning
Tianhua Zhao
Zhijun Li
Zechen Wang
Li Zhang
Qun Hao
22
4
0
15 Nov 2023
Explicit Change Relation Learning for Change Detection in VHR Remote Sensing Images
Dalong Zheng
Zebin Wu
Jia-Wei Liu
Chih-Cheng Hung
Zhihui Wei
19
0
0
14 Nov 2023
SynthEnsemble: A Fusion of CNN, Vision Transformer, and Hybrid Models for Multi-Label Chest X-Ray Classification
S. M. N. Ashraf
Md. Adyelullahil Mamun
Hasnat Md. Abdullah
Rabiul Alam
ViT
MedIm
15
7
0
13 Nov 2023
NDDepth: Normal-Distance Assisted Monocular Depth Estimation and Completion
Shuwei Shao
Zhongcai Pei
Weihai Chen
Peter C. Y. Chen
Zhengguo Li
30
5
0
13 Nov 2023
Polar-Net: A Clinical-Friendly Model for Alzheimer's Disease Detection in OCTA Images
Shouyue Liu
Jinkui Hao
Yanwu Xu
Huazhu Fu
Xinyu Guo
Jiang-Dong Liu
Yalin Zheng
Yonghuai Liu
Jiong Zhang
Yitian Zhao
17
6
0
10 Nov 2023
Analysis of NaN Divergence in Training Monocular Depth Estimation Model
Bum Jun Kim
Hyeonah Jang
Sang Woo Kim
43
0
0
07 Nov 2023
GTP-ViT: Efficient Vision Transformers via Graph-based Token Propagation
Xuwei Xu
Sen Wang
Yudong Chen
Yanping Zheng
Zhewei Wei
Jiajun Liu
ViT
32
9
0
06 Nov 2023
Patch-based Selection and Refinement for Early Object Detection
Tianyi Zhang
Kishore Kasichainula
Yaoxin Zhuo
Baoxin Li
Jae-sun Seo
Yu Cao
46
5
0
03 Nov 2023