2107.12292
Cited By
Contextual Transformer Networks for Visual Recognition
26 July 2021
Yehao Li
Ting Yao
Yingwei Pan
Tao Mei
ViT
Github (532★)
Papers citing
"Contextual Transformer Networks for Visual Recognition"
50 / 85 papers shown
Title
CBAGAN-RRT: Convolutional Block Attention Generative Adversarial Network for Sampling-Based Path Planning
Abhinav Sagar
Sai Teja Gilukara
65
0
0
01 Jul 2025
A Decade of You Only Look Once (YOLO) for Object Detection
Leo Thomas Ramos
Angel D. Sappa
132
0
0
24 Apr 2025
LSNet: See Large, Focus Small
Ao Wang
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
98
0
0
29 Mar 2025
OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation
Mallika Garg
Debashis Ghosh
P. M. Pradhan
3DH
94
0
0
27 Mar 2025
Quantum Complex-Valued Self-Attention Model
Fu Chen
Qinglin Zhao
Li Feng
Longfei Tang
Yangbin Lin
Haitao Huang
MQ
139
0
0
24 Mar 2025
Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions
Xumin Gao
M. Stevens
Grzegorz Cielniak
24
0
0
15 Nov 2024
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
111
2
0
12 Nov 2024
Deep Learning for 3D Point Cloud Enhancement: A Survey
Siwen Quan
Junhao Yu
Ziming Nie
Muze Wang
Sijia Feng
Pei An
Jiaqi Yang
3DPC
78
4
0
30 Oct 2024
Improving Text-guided Object Inpainting with Semantic Pre-inpainting
Yifu Chen
Jingwen Chen
Yingwei Pan
Yehao Li
Ting Yao
Zhineng Chen
Tao Mei
DiffM
70
7
0
12 Sep 2024
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
68
10
0
27 Aug 2024
Brain Tumor Segmentation in MRI Images with 3D U-Net and Contextual Transformer
Thien-Qua T. Nguyen
Hieu-Nghia Nguyen
Thanh-Hieu Bui
Thien B. Nguyen-Tat
V. M. Ngo
ViT
MedIm
70
2
0
11 Jul 2024
MVAD: A Multiple Visual Artifact Detector for Video Streaming
Chen Feng
Duolikun Danier
Fan Zhang
David Bull
73
0
0
31 May 2024
Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
77
3
0
22 May 2024
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
Badri N. Patro
Suhas Ranganath
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
99
3
0
26 Mar 2024
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Zhongwei Zhang
Fuchen Long
Yingwei Pan
Zhaofan Qiu
Ting Yao
Yang Cao
Tao Mei
VGen
95
29
0
25 Mar 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Rui Zhu
Yingwei Pan
Yehao Li
Ting Yao
Zhenglong Sun
Tao Mei
C. Chen
117
26
0
25 Mar 2024
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Ting Yao
Yehao Li
Yingwei Pan
Tao Mei
ViT
80
21
0
18 Mar 2024
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers
Haoyang Liu
Aditya Singh
Yijiang Li
Haohan Wang
AAML
ViT
130
1
0
15 Mar 2024
SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing
Sheng Li
Geng Yuan
Yuezhen Dai
Youtao Zhang
Yanzhi Wang
Xulong Tang
92
19
0
30 Jan 2024
SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks
Serdar Erişen
SSeg
72
12
0
28 Jan 2024
BD-MSA: Body decouple VHR Remote Sensing Image Change Detection method guided by multi-scale feature information aggregation
Yonghui Tan
Xiaolong Li
Yishu Chen
Jinquan Ai
47
3
0
09 Jan 2024
Progressive Feedback-Enhanced Transformer for Image Forgery Localization
Haochen Zhu
Gang Cao
Xianglin Huang
ViT
114
7
0
15 Nov 2023
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models
Haibo Yang
Yang Chen
Yingwei Pan
Ting Yao
Zhineng Chen
Tao Mei
71
20
0
09 Nov 2023
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
105
41
0
09 Nov 2023
Control3D: Towards Controllable Text-to-3D Generation
Yang Chen
Yingwei Pan
Yehao Li
Ting Yao
Tao Mei
DiffM
97
49
0
09 Nov 2023
Scattering Vision Transformer: Spectral Mixing Matters
Badri N. Patro
Vijay Srinivas Agneeswaran
88
15
0
02 Nov 2023
RMT: Retentive Networks Meet Vision Transformers
Qihang Fan
Huaibo Huang
Mingrui Chen
Hongmin Liu
Ran He
ViT
157
91
0
20 Sep 2023
CCSPNet-Joint: Efficient Joint Training Method for Traffic Sign Detection Under Extreme Conditions
Haoqin Hong
Yue Zhou
Xiangyu Shu
Xianfang Hu
ViT
57
3
0
13 Sep 2023
DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images
Xuechao Zou
Keqin Li
Junliang Xing
Yu-an Zhang
Shiying Wang
Lei Jin
Pin Tao
DiffM
85
33
0
08 Aug 2023
Learning to Generate Training Datasets for Robust Semantic Segmentation
Marwane Hariat
Olivier Laurent
Rémi Kazmierczak
Shihao Zhang
Andrei Bursuc
Angela Yao
Gianni Franchi
UQCV
60
2
0
01 Aug 2023
DualAttNet: Synergistic Fusion of Image-level and Fine-Grained Disease Attention for Multi-Label Lesion Detection in Chest X-rays
Qing Xu
Wenting Duan
MedIm
24
7
0
23 Jun 2023
The 2023 Video Similarity Dataset and Challenge
Ed Pizzi
Giorgos Kordopatis-Zilos
Hiral Patel
Gheorghe Postelnicu
Sugosh Nagavara Ravindra
A. Gupta
Symeon Papadopoulos
Giorgos Tolias
Matthijs Douze
76
7
0
15 Jun 2023
Progressive Sub-Graph Clustering Algorithm for Semi-Supervised Domain Adaptation Speaker Verification
Zhuo Li
Jingze Lu
Z. Zhao
Wenchao Wang
Pengyuan Zhang
61
1
0
22 May 2023
The HCCL system for VoxCeleb Speaker Recognition Challenge 2022
Zhenduo Zhao
Zhuo Li
Wenchao Wang
Pengyuan Zhang
54
4
0
22 May 2023
Remembering What Is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction
Tharindu Fernando
Harshala Gammulle
Sridha Sridharan
Simon Denman
Clinton Fookes
3DH
56
1
0
19 May 2023
Fusion-S2iGan: An Efficient and Effective Single-Stage Framework for Speech-to-Image Generation
Zhenxing Zhang
Lambert Schomaker
41
3
0
17 May 2023
Faster OreFSDet : A Lightweight and Effective Few-shot Object Detector for Ore Images
Yang Zhang
Lei Cheng
Yuting Peng
C. Xu
Yanwei Fu
Bo Wu
Guodong Sun
ObjD
123
7
0
02 May 2023
Feature-compatible Progressive Learning for Video Copy Detection
Wenhao Wang
Yifan Sun
Yi Yang
93
3
0
20 Apr 2023
LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
Xianbiao Qi
Jianan Wang
Yihao Chen
Yukai Shi
Lei Zhang
98
20
0
19 Apr 2023
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
87
49
0
13 Apr 2023
Multi-site, Multi-domain Airway Tree Modeling (ATM'22): A Public Benchmark for Pulmonary Airway Segmentation
Minghui Zhang
Yang Wu
Hanxiao Zhang
Yulei Qin
Hao Zheng
...
Raúl San José Estépar
C. Espinosa
Jiayuan Sun
Guang-Zhong Yang
Yun Gu
64
12
0
10 Mar 2023
A Convolutional Vision Transformer for Semantic Segmentation of Side-Scan Sonar Data
Hayat Rajani
N. Gracias
Rafael García
ViT
49
14
0
24 Feb 2023
Improving Scene Text Image Super-resolution via Dual Prior Modulation Network
Shipeng Zhu
Zuoyan Zhao
Pengfei Fang
H. Xue
SupR
DiffM
93
25
0
21 Feb 2023
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
142
6
0
16 Feb 2023
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
Jiayu Jiao
Yuyao Tang
Kun-Li Channing Lin
Yipeng Gao
Jinhua Ma
Yaowei Wang
Wei-Shi Zheng
MedIm
ViT
98
151
0
03 Feb 2023
Two-stage Contextual Transformer-based Convolutional Neural Network for Airway Extraction from CT Images
Yanan Wu
Shuiqing Zhao
Shouliang Qi
Jie Feng
H. Pang
...
Long Bai
Meng-Yi Li
Shuyue Xia
W. Qian
Hongliang Ren
ViT
MedIm
76
26
0
15 Dec 2022
CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection
Kevin Hyekang Joo
Khoa T. Vo
Kashu Yamazaki
Ngan Le
52
51
0
09 Dec 2022
Rega-Net:Retina Gabor Attention for Deep Convolutional Neural Networks
Chun Bao
Jie Cao
Yaqian Ning
Yang Cheng
Q. Hao
47
1
0
23 Nov 2022
Vision Transformers in Medical Imaging: A Review
Emerald U. Henry
Onyeka Emebob
C. Omonhinmin
ViT
MedIm
76
35
0
18 Nov 2022
Dynamic Temporal Filtering in Video Models
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Chong-Wah Ngo
Tao Mei
AI4TS
95
18
0
15 Nov 2022