Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.09883
Cited By
Swin Transformer V2: Scaling Up Capacity and Resolution
18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Swin Transformer V2: Scaling Up Capacity and Resolution"
50 / 823 papers shown
Title
Towards Fundamentally Scalable Model Selection: Asymptotically Fast Update and Selection
Wenxiao Wang
Weiming Zhuang
Lingjuan Lyu
44
0
0
11 Jun 2024
ReduceFormer: Attention with Tensor Reduction by Summation
John Yang
Le An
Su Inn Park
31
0
0
11 Jun 2024
A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion
Xiaoli Zhang
Liying Wang
Libo Zhao
Xiongfei Li
Siwei Ma
42
0
0
11 Jun 2024
Multiplane Prior Guided Few-Shot Aerial Scene Rendering
Zihan Gao
Licheng Jiao
Lingling Li
Xu Liu
F. Liu
Puhua Chen
Yuwei Guo
29
3
0
07 Jun 2024
PALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training
Jiahao Fang
Huizheng Wang
Qize Yang
Dehao Kong
Xu Dai
Jinyi Deng
Yang Hu
Shouyi Yin
30
1
0
06 Jun 2024
OCCAM: Towards Cost-Efficient and Accuracy-Aware Classification Inference
Dujian Ding
Bicheng Xu
L. Lakshmanan
VLM
41
1
0
06 Jun 2024
LADI v2: Multi-label Dataset and Classifiers for Low-Altitude Disaster Imagery
Samuel Scheele
Katherine Picchione
Jeffrey Liu
32
0
0
04 Jun 2024
Generative Active Learning for Long-tailed Instance Segmentation
Muzhi Zhu
Chengxiang Fan
Hao Chen
Y. Liu
Weian Mao
Xiaogang Xu
Chunhua Shen
50
5
0
04 Jun 2024
GrootVL: Tree Topology is All You Need in State Space Model
Yicheng Xiao
Lin Song
Shaoli Huang
Jiangshan Wang
Siyu Song
Yixiao Ge
Xiu Li
Ying Shan
Mamba
44
10
0
04 Jun 2024
Prototypical Transformer as Unified Motion Learners
Cheng Han
Yawen Lu
Guohao Sun
James Liang
Zhiwen Cao
...
S. Dianat
Raghuveer M. Rao
Tong Geng
Zhiqiang Tao
Dongfang Liu
ViT
37
2
0
03 Jun 2024
On the Use of Anchoring for Training Vision Models
V. Narayanaswamy
Kowshik Thopalli
Rushil Anirudh
Yamen Mubarka
W. Sakla
Jayaraman J. Thiagarajan
50
0
0
01 Jun 2024
You Only Need Less Attention at Each Stage in Vision Transformers
Shuoxi Zhang
Hanpeng Liu
Stephen Lin
Kun He
53
5
0
01 Jun 2024
DS@BioMed at ImageCLEFmedical Caption 2024: Enhanced Attention Mechanisms in Medical Caption Generation through Concept Detection Integration
Nhi Ngoc-Yen Nguyen
Le-Huy Tu
Dieu-Phuong Nguyen
Nhat-Tan Do
Minh Triet Thai
Bao-Thien Nguyen-Tat
MedIm
34
1
0
01 Jun 2024
CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation
M. Rusanovsky
Or Hirschorn
S. Avidan
29
3
0
01 Jun 2024
YotoR-You Only Transform One Representation
José Ignacio Díaz Villa
P. Loncomilla
Javier Ruiz-del-Solar
ViT
44
0
0
30 May 2024
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
You Huang
Zongyu Lan
Liujuan Cao
Xianming Lin
Shengchuan Zhang
Guannan Jiang
Rongrong Ji
VLM
29
2
0
29 May 2024
Wavelet-Based Image Tokenizer for Vision Transformers
Zhenhai Zhu
Radu Soricut
ViT
50
3
0
28 May 2024
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao
Xinggang Wang
Lianghui Zhu
Qian Zhang
Chang Huang
57
4
0
28 May 2024
On Fairness of Low-Rank Adaptation of Large Models
Zhoujie Ding
Ken Ziyu Liu
Pura Peetathawatchai
Berivan Isik
Sanmi Koyejo
48
4
0
27 May 2024
Building Vision Models upon Heat Conduction
Zhaozhi Wang
Yue Liu
Yunfan Liu
Hongtian Yu
Yaowei Wang
QiXiang Ye
ViT
VLM
58
0
0
26 May 2024
ModelLock: Locking Your Model With a Spell
Yifeng Gao
Yuhua Sun
Xingjun Ma
Zuxuan Wu
Yu-Gang Jiang
VLM
48
1
0
25 May 2024
Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification
Chak Fong Chong
Jielong Guo
Xu Yang
Wei Ke
Yapeng Wang
VLM
32
0
0
24 May 2024
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Ting Liu
Xuyang Liu
Liangtao Shi
Zunnan Xu
Siteng Huang
Yi Xin
Quanjun Yin
43
5
0
23 May 2024
ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution
Guillaume Couairon
Christian Lessig
A. Charantonis
C. Monteleoni
27
1
0
23 May 2024
Scalable Visual State Space Model with Fractal Scanning
Lv Tang
Haoke Xiao
Peng-Tao Jiang
Hao Zhang
Jinwei Chen
Bo-wen Li
Mamba
48
7
0
23 May 2024
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
Yuheng Shi
Minjing Dong
Chang Xu
Mamba
48
32
0
23 May 2024
Configuring Data Augmentations to Reduce Variance Shift in Positional Embedding of Vision Transformers
Bum Jun Kim
Sang Woo Kim
ViT
43
1
0
23 May 2024
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate
A. Fuller
Daniel G. Kyrollos
Yousef Yassin
James R. Green
52
2
0
22 May 2024
Counterfactual Gradients-based Quantification of Prediction Trust in Neural Networks
Mohit Prabhushankar
Ghassan AlRegib
UQCV
29
0
0
22 May 2024
OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models
Zhaojian Yu
Yinghao Wu
Zhuotao Deng
Yansong Tang
Xiao-Ping Zhang
49
2
0
21 May 2024
Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy
Feng Wang
M. C. Gursoy
Senem Velipasalar
43
0
0
15 May 2024
Resource Efficient Perception for Vision Systems
M. I. A V Subramanyam
Niyati Singal
Vinay K. Verma
46
0
0
12 May 2024
Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba
Hongwei Ren
Yue Zhou
Jiadong Zhu
Haotian Fu
Yulong Huang
Xiaopeng Lin
Yuetong Fang
Fei Ma
Hao Yu
Bo-Xun Cheng
Mamba
43
9
0
09 May 2024
Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement
Jiesong Bai
Yuhao Yin
Qiyuan He
Yuanxian Li
Xiaofeng Zhang
Mamba
40
28
0
06 May 2024
Multimodal Sense-Informed Prediction of 3D Human Motions
Zhenyu Lou
Qiongjie Cui
Haofan Wang
Xu Tang
Hong Zhou
33
5
0
05 May 2024
U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
Yuchuan Tian
Zhijun Tu
Hanting Chen
Jie Hu
Chao Xu
Yunhe Wang
38
16
0
04 May 2024
Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers
Palawat Busaranuvong
Emmanuel O. Agu
Deepak Kumar
Shefalika Gautam
Reza Saadati Fard
B. Tulu
Diane Strong
MedIm
23
0
0
01 May 2024
Analyzing and Exploring Training Recipes for Large-Scale Transformer-Based Weather Prediction
Jared Willard
Peter Harrington
Shashank Subramanian
Ankur Mahesh
Travis A. O'Brien
William D. Collins
AI4TS
47
7
0
30 Apr 2024
Large Language Model Informed Patent Image Retrieval
Hao-Cheng Lo
Jung-Mei Chu
Jieh Hsiang
Chun-Chieh Cho
VLM
30
2
0
30 Apr 2024
Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing
Leonardo Rossi
Vittorio Bernuzzi
Tomaso Fontanini
Massimo Bertozzi
Andrea Prati
53
3
0
29 Apr 2024
A Survey on Diffusion Models for Time Series and Spatio-Temporal Data
Yiyuan Yang
Ming Jin
Haomin Wen
Chaoli Zhang
Yuxuan Liang
...
Bin Yang
Zenglin Xu
Jiang Bian
Shirui Pan
Qingsong Wen
DiffM
AI4TS
SyDa
37
39
0
29 Apr 2024
HIPer: A Human-Inspired Scene Perception Model for Multifunctional Mobile Robots
Florenz Graf
Jochen Lindermayr
Birgit Graf
Werner Kraus
Marco F. Huber
46
3
0
27 Apr 2024
PromptCIR: Blind Compressed Image Restoration with Prompt Learning
Bingchen Li
Xin Li
Yiting Lu
Ruoyu Feng
Mengxi Guo
Shijie Zhao
Li Zhang
Zhibo Chen
39
13
0
26 Apr 2024
Detection of Peri-Pancreatic Edema using Deep Learning and Radiomics Techniques
Ziliang Hong
Debesh Jha
Koushik Biswas
Zheyu Zhang
Yury Velichko
...
Amir Borhani
B. Turkbey
A. Medetalibeyoğlu
Gorkem Durak
Ulas Bagci
MedIm
24
1
0
25 Apr 2024
NTIRE 2024 Quality Assessment of AI-Generated Content Challenge
Xiaohong Liu
Xiongkuo Min
Guangtao Zhai
Chunyi Li
Tengchuan Kou
...
Qi Yan
Youran Qu
Xiaohui Zeng
Lele Wang
Renjie Liao
58
29
0
25 Apr 2024
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
46
38
0
24 Apr 2024
Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains
Eunsu Baek
Keondo Park
Jiyoon Kim
Hyung-Sin Kim
OODD
OOD
31
4
0
24 Apr 2024
Vision Transformer-based Adversarial Domain Adaptation
Yahan Li
Yuan Wu
ViT
33
0
0
24 Apr 2024
CKGConv: General Graph Convolution with Continuous Kernels
Liheng Ma
Soumyasundar Pal
Yitian Zhang
Jiaming Zhou
Yingxue Zhang
Mark J. Coates
37
3
0
21 Apr 2024
Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering
Jie Ma
Min Hu
Pinghui Wang
Wangchun Sun
Lingyun Song
Hongbin Pei
Jun Liu
Youtian Du
35
4
0
18 Apr 2024
Previous
1
2
3
4
5
6
...
15
16
17
Next