ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.15156
  4. Cited By
Blending Anti-Aliasing into Vision Transformer

Blending Anti-Aliasing into Vision Transformer

28 October 2021
Shengju Qian
Hao Shao
Yi Zhu
Mu Li
Jiaya Jia
ArXivPDFHTML

Papers citing "Blending Anti-Aliasing into Vision Transformer"

19 / 19 papers shown
Title
Spectral-Adaptive Modulation Networks for Visual Perception
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
42
0
0
31 Mar 2025
Universal Functional Regression with Neural Operator Flows
Universal Functional Regression with Neural Operator Flows
Yaozhong Shi
Angela F. Gao
Zachary E. Ross
Kamyar Azizzadenesheli
37
3
0
03 Apr 2024
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
Stephanie Fu
Mark Hamilton
Laura E. Brandt
Axel Feldmann
Zhoutong Zhang
William T. Freeman
MDE
30
49
0
15 Mar 2024
When Semantic Segmentation Meets Frequency Aliasing
When Semantic Segmentation Meets Frequency Aliasing
Linwei Chen
Lin Gu
Ying Fu
51
5
0
14 Mar 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Linwei Chen
Lin Gu
Ying Fu
34
22
0
08 Mar 2024
Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning
  for End-to-end Navigation of Autonomous Vehicles
Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning for End-to-end Navigation of Autonomous Vehicles
Dong Hu
Chao Huang
Jingda Wu
Hongbo Gao
38
5
0
20 Feb 2024
SPANet: Frequency-balancing Token Mixer using Spectral Pooling
  Aggregation Modulation
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
20
8
0
22 Aug 2023
Optimizing PatchCore for Few/many-shot Anomaly Detection
Optimizing PatchCore for Few/many-shot Anomaly Detection
Joao Santos
Triet Tran
Oliver Rippel
23
10
0
20 Jul 2023
ReasonNet: End-to-End Driving with Temporal and Global Reasoning
ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Hao Shao
Letian Wang
Ruobing Chen
Steven L. Waslander
Hongsheng Li
Y. Liu
LRM
30
71
0
17 May 2023
AIM: Adapting Image Models for Efficient Video Action Recognition
AIM: Adapting Image Models for Efficient Video Action Recognition
Taojiannan Yang
Yi Zhu
Yusheng Xie
Aston Zhang
C. L. P. Chen
Mu Li
ViT
46
144
0
06 Feb 2023
What Makes for Good Tokenizers in Vision Transformer?
What Makes for Good Tokenizers in Vision Transformer?
Shengju Qian
Yi Zhu
Wenbo Li
Mu Li
Jiaya Jia
ViT
34
13
0
21 Dec 2022
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion
  Transformer
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Hao Shao
Letian Wang
Ruobing Chen
Hongsheng Li
Y. Liu
38
195
0
28 Jul 2022
MLP-Mixer: An all-MLP Architecture for Vision
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
271
2,603
0
04 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
317
5,775
0
29 Apr 2021
VidTr: Video Transformer Without Convolutions
VidTr: Video Transformer Without Convolutions
Yanyi Zhang
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Biagio Brattoli
Hao Chen
I. Marsic
Joseph Tighe
ViT
136
193
0
23 Apr 2021
On Aliased Resizing and Surprising Subtleties in GAN Evaluation
On Aliased Resizing and Surprising Subtleties in GAN Evaluation
Gaurav Parmar
Richard Y. Zhang
Jun-Yan Zhu
EGVM
25
74
0
22 Apr 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction
  without Convolutions
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
277
3,622
0
24 Feb 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,777
0
24 Feb 2021
Semantic Understanding of Scenes through the ADE20K Dataset
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
253
1,828
0
18 Aug 2016
1