2005.12872
End-to-End Object Detection with Transformers
26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
Papers citing
"End-to-End Object Detection with Transformers"
50 / 5,221 papers shown
Title
VariabilityTrack: Multi-Object Tracking with Variable Speed Object Movement
Run Luo
Ji Wei
Qiao Lin
VOT
30
1
0
12 Mar 2022
One-stage Video Instance Segmentation: From Frame-in Frame-out to Clip-in Clip-out
Minghan Li
Lei Zhang
CLIP
VLM
39
1
0
12 Mar 2022
Joint CNN and Transformer Network via weakly supervised Learning for efficient crowd counting
Fusen Wang
Kai Liu
Fei Long
Nong Sang
Xiaofeng Xia
J. Sang
ViT
40
19
0
12 Mar 2022
EventFormer: AU Event Transformer for Facial Action Unit Event Detection
Yingjie Chen
Jiarui Zhang
Tao Wang
Yun Liang
ViT
29
0
0
12 Mar 2022
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy
Tianlong Chen
Zhenyu (Allen) Zhang
Yu Cheng
Ahmed Hassan Awadallah
Zhangyang Wang
ViT
41
37
0
12 Mar 2022
Deformable VisTR: Spatio temporal deformable attention for video instance segmentation
Sudhir Yarram
Jialian Wu
Pan Ji
Yi Tian Xu
Junsong Yuan
ViT
22
2
0
12 Mar 2022
Active Token Mixer
Guoqiang Wei
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
15
0
11 Mar 2022
TAPE: Task-Agnostic Prior Embedding for Image Restoration
Lin Liu
Lingxi Xie
Xiaopeng Zhang
Shanxin Yuan
Xiangyu Chen
Wen-gang Zhou
Houqiang Li
Qi Tian
10
53
0
11 Mar 2022
Towards Self-Supervised Learning of Global and Object-Centric Representations
Federico Baldassarre
Hossein Azizpour
SSL
3DPC
OCL
43
13
0
11 Mar 2022
Peng Cheng Object Detection Benchmark for Smart City
Yaowei Wang
Zhouxin Yang
R. Liu
Deng Li
Yuandu Lai
Leyuan Fang
Yahong Han
ObjD
3DPC
14
1
0
11 Mar 2022
Visualizing and Understanding Patch Interactions in Vision Transformer
Jie Ma
Yalong Bai
Bineng Zhong
Wei Zhang
Ting Yao
Tao Mei
ViT
23
32
0
11 Mar 2022
The Overlooked Classifier in Human-Object Interaction Recognition
Ying Jin
Yinpeng Chen
Lijuan Wang
Jianfeng Wang
Pei Yu
Lin Liang
Lei Li
Zicheng Liu
VLM
47
8
0
10 Mar 2022
Point Density-Aware Voxels for LiDAR 3D Object Detection
Jordan S. K. Hu
Tianshu Kuai
Steven L. Waslander
3DPC
27
161
0
10 Mar 2022
PETR: Position Embedding Transformation for Multi-View 3D Object Detection
Yingfei Liu
Tiancai Wang
Xinming Zhang
Jian Sun
3DPC
43
526
0
10 Mar 2022
Prediction-Guided Distillation for Dense Object Detection
Chenhongyi Yang
Mateusz Ochal
Amos Storkey
Elliot J. Crowley
32
28
0
10 Mar 2022
Representation Compensation Networks for Continual Semantic Segmentation
Chang-Bin Zhang
Jianqiang Xiao
Xialei Liu
Ying-Cong Chen
Ming-Ming Cheng
SSeg
CLL
37
93
0
10 Mar 2022
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
Boyu Chen
Peixia Li
Lei Bai
Leixian Qiao
Qiuhong Shen
Bo-wen Li
Weihao Gan
Wei Wu
Wanli Ouyang
ViT
VOT
22
183
0
10 Mar 2022
Domain Generalisation for Object Detection under Covariate and Concept Shift
Karthik Seemakurthy
E. Aptoula
Charles Fox
Petra Bosilj
ObjD
OOD
26
10
0
10 Mar 2022
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
Seonghyeon Kim
Seung Shin
Yoonsik Kim
Han-Cheol Cho
Taeho Kil
Jaeheung Surh
Seunghyun Park
Bado Lee
Youngmin Baek
22
8
0
10 Mar 2022
OpenTAL: Towards Open Set Temporal Action Localization
Wentao Bao
Qi Yu
Yu Kong
EDL
30
26
0
10 Mar 2022
Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice
Peihao Wang
Wenqing Zheng
Tianlong Chen
Zhangyang Wang
ViT
33
127
0
09 Mar 2022
Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction
Xiaoqi Zhao
Youwei Pang
Lihe Zhang
Huchuan Lu
29
21
0
09 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
34
301
0
09 Mar 2022
Nested Named Entity Recognition as Latent Lexicalized Constituency Parsing
Chao Lou
Aaron Courville
Kewei Tu
30
37
0
09 Mar 2022
Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction
Matthieu Zins
Gilles Simon
M. Berger
29
13
0
09 Mar 2022
ChiTransformer: Towards Reliable Stereo from Cues
Qing Su
Shihao Ji
MDE
ViT
18
12
0
09 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
24
13
0
08 Mar 2022
YouTube-GDD: A challenging gun detection dataset with rich contextual information
Yongxiang Gu
Xingbin Liao
Xiaolin Qin
13
6
0
08 Mar 2022
Lane Detection with Versatile AtrousFormer and Local Semantic Guidance
Jiaxing Yang
Lihe Zhang
Huchuan Lu
ViT
29
19
0
08 Mar 2022
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
Lang Peng
Zhirong Chen
Zhang-Hua Fu
Pengpeng Liang
Erkang Cheng
54
131
0
08 Mar 2022
Enhancing Door-Status Detection for Autonomous Mobile Robots during Environment-Specific Operational Use
Michele Antonazzi
Matteo Luperto
Nicola Basilico
N. A. Borghese
17
4
0
08 Mar 2022
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
59
1,375
0
07 Mar 2022
Deep Learning Serves Traffic Safety Analysis: A Forward-looking Review
Abolfazl Razi
Xiwen Chen
Huayu Li
Hao Wang
Brendan J. Russo
Yan Chen
Hongbin Yu
33
39
0
07 Mar 2022
Knowledge Amalgamation for Object Detection with Transformers
Haofei Zhang
Feng Mao
Mengqi Xue
Gongfan Fang
Zunlei Feng
Mingli Song
ViT
111
12
0
07 Mar 2022
Interactive Disambiguation for Behavior Tree Execution
Matteo Iovino
Fethiye Irmak Dogan
Iolanda Leite
Christian Smith
22
15
0
06 Mar 2022
Exploring Dual-task Correlation for Pose Guided Person Image Generation
Peng Zhang
Lingxiao Yang
Jianhuang Lai
Xiaohua Xie
ViT
29
81
0
06 Mar 2022
MetaFormer: A Unified Meta Framework for Fine-Grained Recognition
Qishuai Diao
Yi-Xin Jiang
Bin Wen
Jianxiang Sun
Zehuan Yuan
36
60
0
05 Mar 2022
Boosting Crowd Counting via Multifaceted Attention
Hui Lin
Zhiheng Ma
Rongrong Ji
Yaowei Wang
Xiaopeng Hong
23
145
0
05 Mar 2022
Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression
A. B. Koyuncu
Han Gao
Atanas Boev
Georgii Gaikov
Elena Alshina
Eckehard Steinbach
ViT
39
68
0
04 Mar 2022
Rethinking Efficient Lane Detection via Curve Modeling
Zhengyang Feng
Shaohua Guo
Xin Tan
Ke Xu
Min Wang
Lizhuang Ma
38
144
0
04 Mar 2022
DiT: Self-supervised Pre-training for Document Image Transformer
Junlong Li
Yiheng Xu
Tengchao Lv
Lei Cui
Chaoxi Zhang
Furu Wei
ViT
VLM
35
159
0
04 Mar 2022
ViT-P: Rethinking Data-efficient Vision Transformers from Locality
Bin Chen
Ran A. Wang
Di Ming
Xin Feng
ViT
18
7
0
04 Mar 2022
F2DNet: Fast Focal Detection Network for Pedestrian Detection
Abdul Hannan Khan
Mohsin Munir
L. V. Elst
Andreas Dengel
ObjD
24
24
0
04 Mar 2022
Patch Similarity Aware Data-Free Quantization for Vision Transformers
Zhikai Li
Liping Ma
Mengjuan Chen
Junrui Xiao
Qingyi Gu
MQ
ViT
19
44
0
04 Mar 2022
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation
Peng Li
Jiayin Zhao
Jingyao Wu
Chao Deng
Haoqian Wang
Tao Yu
26
19
0
04 Mar 2022
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Jean-Michel Fortin
Olivier Gamache
Vincent Grondin
F. Pomerleau
Philippe Giguère
27
22
0
03 Mar 2022
Efficient Video Instance Segmentation via Tracklet Query and Proposal
Jialian Wu
Sudhir Yarram
Hui Liang
Tian Lan
Junsong Yuan
J. Eledath
Gérard Medioni
21
37
0
03 Mar 2022
Correlation-Aware Deep Tracking
Fei Xie
Chunyu Wang
Guangting Wang
Yue Cao
Wankou Yang
Wenjun Zeng
VOT
32
119
0
03 Mar 2022
Multi-Tailed Vision Transformer for Efficient Inference
Yunke Wang
Bo Du
Wenyuan Wang
Chang Xu
ViT
213
6
0
03 Mar 2022
Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work
Khawar Islam
ViT
28
45
0
03 Mar 2022