Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.09883
Cited By
v1
v2 (latest)
Swin Transformer V2: Scaling Up Capacity and Resolution
18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng Zhang
Li Dong
Furu Wei
B. Guo
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (14834★)
Papers citing
"Swin Transformer V2: Scaling Up Capacity and Resolution"
50 / 840 papers shown
Title
GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing
Khawar Islam
M. Zaheer
Arif Mahmood
Karthik Nandakumar
Naveed Akhtar
DiffM
201
2
0
03 Dec 2024
Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods
Jiamian Hu
Yuanyuan Hong
Yihua Chen
He Wang
Moriaki Yasuhara
120
1
0
03 Dec 2024
MeasureNet: Measurement Based Celiac Disease Identification
Aayush Kumar Tyagi
Vaibhav Mishra
Ashok Tiwari
Lalita Mehra
Prasenjit Das
G. Makharia
Prathosh AP
Mausam
122
0
0
02 Dec 2024
STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation
Sunghun Yang
Minhyeok Lee
Suhwan Cho
Jungho Lee
Sangyoun Lee
MDE
192
0
0
02 Dec 2024
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Alice Heiman
Xiaoman Zhang
E. Chen
Sung Eun Kim
Pranav Rajpurkar
HILM
MedIm
159
0
0
27 Nov 2024
Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning
Hoàng-Ân Lê
P. Berg
Minh Pham
126
1
0
26 Nov 2024
GeoFormer: A Multi-Polygon Segmentation Transformer
Maxim Khomiakov
Michael Riis Andersen
J. Frellsen
105
1
0
25 Nov 2024
Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data Processing
Hao Liu
Mamba
AI4CE
148
2
0
22 Nov 2024
ReXrank: A Public Leaderboard for AI-Powered Radiology Report Generation
Xiaoman Zhang
Hong-Yu Zhou
Xiaoli Yang
Oishi Banerjee
J. N. Acosta
Josh Miller
Ouwen Huang
Pranav Rajpurkar
LM&MA
170
5
0
22 Nov 2024
Can Reasons Help Improve Pedestrian Intent Estimation? A Cross-Modal Approach
Vaishnavi Khindkar
V. Balasubramanian
Chetan Arora
A. Subramanian
C. V. Jawahar
116
0
0
20 Nov 2024
Emotional Images: Assessing Emotions in Images and Potential Biases in Generative Models
Maneet Mehta
Cody Buntain
EGVM
64
2
0
08 Nov 2024
Confidence Calibration of Classifiers with Many Classes
Adrien LeCoz
Stéphane Herbin
Faouzi Adjed
UQCV
82
1
0
05 Nov 2024
AM Flow: Adapters for Temporal Processing in Action Recognition
Tanay Agrawal
Abid Ali
A. Dantcheva
François Brémond
68
0
0
04 Nov 2024
MamT
4
^4
4
: Multi-view Attention Networks for Mammography Cancer Classification
Alisher Ibragimov
Sofya Senotrusova
Arsenii Litvinov
E. Ushakov
E. Karpulevich
Yury Markin
59
0
0
03 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
36
0
0
31 Oct 2024
DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination
Jia Fu
Xiao Zhang
Sepideh Pashami
Fatemeh Rahimian
Anders Holst
DiffM
AAML
73
0
0
31 Oct 2024
Context-Aware Token Selection and Packing for Enhanced Vision Transformer
Tianyi Zhang
B. Li
Jae-sun Seo
Yu Cao
72
0
0
31 Oct 2024
Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets
Adrian Iordache
B. Alexe
Radu Tudor Ionescu
137
1
0
29 Oct 2024
SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection
Jia Wei
Yun Li
Xiaomao Fan
Wenjun Ma
Meiyu Qiu
Hongyu Chen
Wenbin Lei
30
0
0
29 Oct 2024
Enhancing Community Vision Screening -- AI Driven Retinal Photography for Early Disease Detection and Patient Trust
Xiaofeng Lei
Yih-Chung Tham
Jocelyn Hui Lin Goh
Yangqin Feng
Yang Bai
Z. Soh
Rick Siow Mong Goh
Xinxing Xu
Yong Liu
Ching-Yu Cheng
23
0
0
27 Oct 2024
PESFormer: Boosting Macro- and Micro-expression Spotting with Direct Timestamp Encoding
Wang-Wang Yu
Kai-Fu Yang
Xiangrui Hu
Jingwen Jiang
Hong-Mei Yan
Yong-Jie Li
62
0
0
24 Oct 2024
FIPER: Generalizable Factorized Features for Robust Low-Level Vision Models
Yang-Che Sun
Cheng Yu Yeo
Ernie Chu
Jun-Cheng Chen
Yu-Lun Liu
SupR
120
0
0
23 Oct 2024
LoRA-C: Parameter-Efficient Fine-Tuning of Robust CNN for IoT Devices
Chuntao Ding
Xu Cao
Jianhang Xie
Linlin Fan
Shangguang Wang
Zhichao Lu
87
1
0
22 Oct 2024
Test-time Adversarial Defense with Opposite Adversarial Path and High Attack Time Cost
Cheng-Han Yeh
Kuanchun Yu
Chun-Shien Lu
DiffM
AAML
151
0
0
22 Oct 2024
Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation?
Lingao Xiao
Yang He
DD
81
7
0
21 Oct 2024
D-SarcNet: A Dual-stream Deep Learning Framework for Automatic Analysis of Sarcomere Structures in Fluorescently Labeled hiPSC-CMs
Huyen Le
Khiet Dang
N. H. Nguyen
Mai Tran
Hieu Pham
28
0
0
19 Oct 2024
Towards Zero-Shot Camera Trap Image Categorization
Jiří Vyskočil
Lukas Picek
VLM
50
0
0
16 Oct 2024
Transformer based super-resolution downscaling for regional reanalysis: Full domain vs tiling approaches
Antonio Pérez
Mario Santa Cruz
Daniel San Martín
José Manuel Gutiérrez
48
0
0
16 Oct 2024
Hespi: A pipeline for automatically detecting information from hebarium specimen sheets
Robert Turnbull
Emily Fitzgerald
Karen Thompson
Joanne L. Birch
41
0
0
11 Oct 2024
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
Nguyen Huu Bao Long
Chenyu Zhang
Yuzhi Shi
Tsubasa Hirakawa
Takayoshi Yamashita
Tohgoroh Matsui
H. Fujiyoshi
66
2
0
11 Oct 2024
HorGait: A Hybrid Model for Accurate Gait Recognition in LiDAR Point Cloud Planar Projections
Jiaxing Hao
Yanxi Wang
Zhigang Chang
Hongmin Gao
Zihao Cheng
Chen Wu
Xin Zhao
Peiye Fang
Rachmat Muwardi
ViT
92
0
0
11 Oct 2024
When Graph meets Multimodal: Benchmarking and Meditating on Multimodal Attributed Graphs Learning
Hao Yan
Cuiping Li
Zhigang Yu
Jun Yin
Ruochen Liu
Peiyan Zhang
Weihao Han
Mingzheng Li
Zhengxin Zeng
60
1
0
11 Oct 2024
IceDiff: High Resolution and High-Quality Sea Ice Forecasting with Generative Diffusion Prior
Jingyi Xu
Siwei Tu
Weidong Yang
Shuhao Li
Keyi Liu
Yeqi Luo
Lipeng Ma
Ben Fei
Junlin Wu
DiffM
AI4Cl
54
2
0
10 Oct 2024
Iterative Optimization Annotation Pipeline and ALSS-YOLO-Seg for Efficient Banana Plantation Segmentation in UAV Imagery
Ang He
Ximei Wu
Xing Xu
Jing Chen
Xiaobin Guo
Sheng Xu
68
1
0
09 Oct 2024
CALoR: Towards Comprehensive Model Inversion Defense
Hongyao Yu
Yixiang Qiu
Hao Fang
Bin Chen
Sijin Yu
Bin Wang
Shu-Tao Xia
Ke Xu
73
1
0
08 Oct 2024
GLRT-Based Metric Learning for Remote Sensing Object Retrieval
Linping Zhang
Yu Liu
Xueqian Wang
Gang Li
You He
66
0
0
08 Oct 2024
Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
Fang Gao
XueTao Li
Jiabao Wang
Shengheng Ma
Jun Yu
41
0
0
08 Oct 2024
MetaDD: Boosting Dataset Distillation with Neural Network Architecture-Invariant Generalization
Yunlong Zhao
Xiaoheng Deng
Xiu Su
Hongyan Xu
Xiuxing Li
Yijing Liu
Shan You
FedML
DD
82
1
0
07 Oct 2024
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time
Chiao-An Yang
Ziwei Liu
Raymond A. Yeh
54
1
0
01 Oct 2024
CBAM-SwinT-BL: Small Rail Surface Defect Detection Method Based on Swin Transformer with Block Level CBAM Enhancement
Jiayi Zhao
Alison Wun-lam Yeung
Ali Muhammad
Songjiang Lai
Vincent To-Yee NG
48
3
0
30 Sep 2024
Universal Medical Image Representation Learning with Compositional Decoders
Kaini Wang
Ling Yang
Siping Zhou
Guangquan Zhou
Wentao Zhang
Bin Cui
Shuo Li
SSL
MedIm
80
0
0
30 Sep 2024
All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation
Xu Zhang
Peiyao Guo
Ming Lu
Zhan Ma
68
2
0
29 Sep 2024
Exploring Token Pruning in Vision State Space Models
Zheng Zhan
Zhenglun Kong
Yifan Gong
Yushu Wu
Zichong Meng
...
Xuan Shen
Stratis Ioannidis
Wei Niu
Pu Zhao
Yanzhi Wang
106
10
0
27 Sep 2024
Cottention: Linear Transformers With Cosine Attention
Gabriel Mongaras
Trevor Dohm
Eric C. Larson
78
0
0
27 Sep 2024
HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting
Nian Ran
Peng Xiao
Yue Wang
Wesley Shi
Jianxin Lin
Qi Meng
Richard Allmendinger
AI4Cl
179
0
0
27 Sep 2024
MALPOLON: A Framework for Deep Species Distribution Modeling
Théo Larcher
Lukás Picek
Benjamin Deneu
Titouan Lorieul
Maximilien Servajean
Alexis Joly
GP
45
0
0
26 Sep 2024
HydraViT: Stacking Heads for a Scalable ViT
Janek Haberer
A. Hojjat
Olaf Landsiedel
85
0
0
26 Sep 2024
TSCLIP: Robust CLIP Fine-Tuning for Worldwide Cross-Regional Traffic Sign Recognition
Guoyang Zhao
Fulong Ma
Weiqing Qi
Chenguang Zhang
Yuxuan Liu
Ming Liu
Jun Ma
VLM
CLIP
403
3
0
23 Sep 2024
Fake It till You Make It: Curricular Dynamic Forgery Augmentations towards General Deepfake Detection
Yuzhen Lin
Wentang Song
Bin Li
Yuezun Li
Jiangqun Ni
Han Chen
Qiushi Li
85
14
0
22 Sep 2024
Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification
Fatema Jannat
Sina Gholami
Jennifer I. Lim
Theodore Leng
Minhaj Nur Alam
Hamed Tabkhi
48
1
0
17 Sep 2024
Previous
1
2
3
4
5
6
...
15
16
17
Next