Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.12872
Cited By
End-to-End Object Detection with Transformers
26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"End-to-End Object Detection with Transformers"
50 / 5,127 papers shown
Title
DeepViT: Towards Deeper Vision Transformer
Daquan Zhou
Bingyi Kang
Xiaojie Jin
Linjie Yang
Xiaochen Lian
Zihang Jiang
Qibin Hou
Jiashi Feng
ViT
42
510
0
22 Mar 2021
Incorporating Convolution Designs into Visual Transformers
Kun Yuan
Shaopeng Guo
Ziwei Liu
Aojun Zhou
F. Yu
Wei Wu
ViT
38
467
0
22 Mar 2021
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
Ning Wang
Wen-gang Zhou
Jie Wang
Houqiang Li
ViT
25
518
0
22 Mar 2021
Learning Calibrated-Guidance for Object Detection in Aerial Images
Zongqi Wei
Dong Liang
Dong-Ming Zhang
Liyan Zhang
Qixiang Geng
Mingqiang Wei
Huiyu Zhou
27
35
0
21 Mar 2021
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Stéphane dÁscoli
Hugo Touvron
Matthew L. Leavitt
Ari S. Morcos
Giulio Biroli
Levent Sagun
ViT
34
803
0
19 Mar 2021
CE-FPN: Enhancing Channel Information for Object Detection
Yihao Luo
Xiang Cao
Juntao Zhang
Xiang Cao
Jingjuan Guo
Haibo Shen
Tianjiang Wang
Qi Feng
ObjD
24
146
0
19 Mar 2021
Scalable Vision Transformers with Hierarchical Pooling
Zizheng Pan
Bohan Zhuang
Jing Liu
Haoyu He
Jianfei Cai
ViT
25
126
0
19 Mar 2021
UNETR: Transformers for 3D Medical Image Segmentation
Ali Hatamizadeh
Yucheng Tang
Vishwesh Nath
Dong Yang
Andriy Myronenko
Bennett Landman
H. Roth
Daguang Xu
ViT
MedIm
57
1,533
0
18 Mar 2021
3D Human Pose Estimation with Spatial and Temporal Transformers
Ce Zheng
Sijie Zhu
Matías Mendieta
Taojiannan Yang
C. L. P. Chen
Zhengming Ding
ViT
39
437
0
18 Mar 2021
Consistency-based Active Learning for Object Detection
Weiping Yu
Sijie Zhu
Taojiannan Yang
C. L. P. Chen
ObjD
20
50
0
18 Mar 2021
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
Mandela Patrick
Yuki M. Asano
Bernie Huang
Ishan Misra
Florian Metze
Joao Henriques
Andrea Vedaldi
AI4TS
18
33
0
18 Mar 2021
Hierarchical Attention-based Age Estimation and Bias Estimation
Shakediel Hiba
Y. Keller
CVBM
27
10
0
17 Mar 2021
You Only Look One-level Feature
Qiang Chen
Yingming Wang
Tong Yang
X. Zhang
Jian Cheng
Jian-jun Sun
ObjD
29
516
0
17 Mar 2021
QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection
Chenhongyi Yang
Zehao Huang
Naiyan Wang
ObjD
24
225
0
16 Mar 2021
TransFG: A Transformer Architecture for Fine-grained Recognition
Ju He
Jieneng Chen
Shuai Liu
Adam Kortylewski
Cheng Yang
Yutong Bai
Changhu Wang
ViT
33
375
0
14 Mar 2021
Probabilistic two-stage detection
Xingyi Zhou
V. Koltun
Philipp Krahenbuhl
ObjD
30
224
0
12 Mar 2021
Unknown Object Segmentation from Stereo Images
M. Durner
W. Boerdijk
M. Sundermeyer
W. Friedl
Zoltán-Csaba Márton
Rudolph Triebel
31
34
0
11 Mar 2021
Involution: Inverting the Inherence of Convolution for Visual Recognition
Duo Li
Jie Hu
Changhu Wang
Xiangtai Li
Qi She
Lei Zhu
Tong Zhang
Qifeng Chen
BDL
17
304
0
10 Mar 2021
U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
Olivier Petit
Nicolas Thome
Clément Rambour
L. Soler
ViT
MedIm
24
236
0
10 Mar 2021
Reformulating HOI Detection as Adaptive Set Prediction
Mingfei Chen
Yue Liao
Si Liu
Zhiyuan Chen
Fei-Yue Wang
Chao Qian
27
142
0
10 Mar 2021
TransMed: Transformers Advance Multi-modal Medical Image Classification
Yin Dai
Yifan Gao
ViT
MedIm
32
280
0
10 Mar 2021
QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information
Masato Tamura
Hiroki Ohashi
Tomoaki Yoshinaga
28
207
0
09 Mar 2021
Causal Attention for Vision-Language Tasks
Xu Yang
Hanwang Zhang
Guojun Qi
Jianfei Cai
CML
28
148
0
05 Mar 2021
Perceiver: General Perception with Iterative Attention
Andrew Jaegle
Felix Gimeno
Andrew Brock
Andrew Zisserman
Oriol Vinyals
João Carreira
VLM
ViT
MDE
48
973
0
04 Mar 2021
Domain Generalization: A Survey
Kaiyang Zhou
Ziwei Liu
Yu Qiao
Tao Xiang
Chen Change Loy
OOD
AI4CE
66
980
0
03 Mar 2021
Generative Adversarial Transformers
Drew A. Hudson
C. L. Zitnick
ViT
23
179
0
01 Mar 2021
Universal-Prototype Enhancing for Few-Shot Object Detection
Aming Wu
Yahong Han
Linchao Zhu
Yi Yang
ObjD
28
84
0
01 Mar 2021
Panoramic Panoptic Segmentation: Towards Complete Surrounding Understanding via Unsupervised Contrastive Learning
A. Jaus
Kailun Yang
Rainer Stiefelhagen
37
36
0
01 Mar 2021
Transformer in Transformer
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
284
1,524
0
27 Feb 2021
SparseBERT: Rethinking the Importance Analysis in Self-attention
Han Shi
Jiahui Gao
Xiaozhe Ren
Hang Xu
Xiaodan Liang
Zhenguo Li
James T. Kwok
21
54
0
25 Feb 2021
Localization Distillation for Dense Object Detection
Zhaohui Zheng
Rongguang Ye
Ping Wang
Dongwei Ren
W. Zuo
Qibin Hou
Ming-Ming Cheng
ObjD
98
115
0
24 Feb 2021
Revisiting Classification Perspective on Scene Text Recognition
Hongxiang Cai
Jun Sun
Yichao Xiong
16
10
0
22 Feb 2021
Predicting times of waiting on red signals using BERT
Witold Szejgis
Anna Warno
P. Góra
21
1
0
20 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
269
179
0
17 Feb 2021
RMS-Net: Regression and Masking for Soccer Event Spotting
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
32
28
0
15 Feb 2021
Image Captioning using Multiple Transformers for Self-Attention Mechanism
Farrukh Olimov
Shikha Dubey
Labina Shrestha
Tran Trung Tin
M. Jeon
ViT
26
2
0
14 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,981
0
09 Feb 2021
Occluded Video Instance Segmentation: A Benchmark
Jiyang Qi
Yan Gao
Yao Hu
Xinggang Wang
Xiaoyu Liu
Xiang Bai
Serge J. Belongie
Alan Yuille
Philip H. S. Torr
S. Bai
VOS
VLM
27
135
0
02 Feb 2021
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
Shaoshuai Shi
Li Jiang
Jiajun Deng
Zhe Wang
Chaoxu Guo
Jianping Shi
Xiaogang Wang
Hongsheng Li
3DPC
139
404
0
31 Jan 2021
Augmenting Proposals by the Detector Itself
Xiaopei Wan
Zhenhua Guo
Chao He
Yujiu Yang
Fangbo Tao
ObjD
26
2
0
28 Jan 2021
Bottleneck Transformers for Visual Recognition
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
290
979
0
27 Jan 2021
Channel Boosting Feature Ensemble for Radar-based Object Detection
Shoaib Azam
Farzeen Munir
M. Jeon
26
7
0
10 Jan 2021
MSED: a multi-modal sleep event detection model for clinical sleep analysis
Alexander Neergaard Zahid
P. Jennum
Emmanuel Mignot
H. Sørensen
35
10
0
07 Jan 2021
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
F. Khan
M. Shah
ViT
227
2,428
0
04 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
246
565
0
31 Dec 2020
Reservoir Transformers
Sheng Shen
Alexei Baevski
Ari S. Morcos
Kurt Keutzer
Michael Auli
Douwe Kiela
27
17
0
30 Dec 2020
Inception Convolution with Efficient Dilation Search
Jie Liu
Chuming Li
Feng Liang
Chen Lin
Ming-hui Sun
Junjie Yan
Wanli Ouyang
Dong Xu
36
27
0
25 Dec 2020
Implicit Feature Pyramid Network for Object Detection
Tiancai Wang
X. Zhang
Jian-jun Sun
ObjD
13
27
0
25 Dec 2020
SceneFormer: Indoor Scene Generation with Transformers
Xinpeng Wang
Chandan Yeshwanth
Matthias Nießner
ViT
3DPC
18
147
0
17 Dec 2020
End-to-End Human Pose and Mesh Reconstruction with Transformers
Kevin Qinghong Lin
Lijuan Wang
Zicheng Liu
ViT
34
613
0
17 Dec 2020
Previous
1
2
3
...
101
102
103
Next