Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.12872
Cited By
End-to-End Object Detection with Transformers
26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"End-to-End Object Detection with Transformers"
50 / 5,255 papers shown
Title
Recent Few-Shot Object Detection Algorithms: A Survey with Performance Comparison
Tianying Liu
Lu Zhang
Yang Wang
Jihong Guan
Yanwei Fu
Jiajia Zhao
Shuigeng Zhou
ObjD
AAML
18
27
0
27 Mar 2022
3D-OAE: Occlusion Auto-Encoders for Self-Supervised Learning on Point Clouds
Junsheng Zhou
Xin Wen
Baorui Ma
Yu-Shen Liu
Yue Gao
Yi Fang
Zhizhong Han
3DPC
33
17
0
26 Mar 2022
Transformer-empowered Multi-scale Contextual Matching and Aggregation for Multi-contrast MRI Super-resolution
Guangyuan Li
Jun Lv
Yapeng Tian
Qingyu Dou
Chengyan Wang
Chenliang Xu
Jing Qin
MedIm
26
57
0
26 Mar 2022
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection
Yue Liao
Aixi Zhang
Miao Lu
Yongliang Wang
Xiaobo Li
Si Liu
VLM
28
125
0
26 Mar 2022
Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness
Giulio Lovisotto
Nicole Finnie
Mauricio Muñoz
Chaithanya Kumar Mummadi
J. H. Metzen
AAML
ViT
30
32
0
25 Mar 2022
Efficient Visual Tracking via Hierarchical Cross-Attention Transformer
Xin Chen
Ben Kang
D. Wang
Dongdong Li
Huchuan Lu
ViT
28
48
0
25 Mar 2022
High-Performance Transformer Tracking
Xin Chen
B. Yan
Jiawen Zhu
Huchuan Lu
Xiang Ruan
D. Wang
ViT
23
33
0
25 Mar 2022
MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis
Liwen Xu
Z. Wang
Bingwen Wu
S. Lui
14
36
0
25 Mar 2022
Point2Seq: Detecting 3D Objects as Sequences
Yujing Xue
Jiageng Mao
Minzhe Niu
Hang Xu
Michael Bi Mi
Wei Zhang
Xiaogang Wang
Xinchao Wang
3DPC
44
15
0
25 Mar 2022
RayTran: 3D pose estimation and shape reconstruction of multiple objects from videos with ray-traced transformers
M. Tyszkiewicz
Kevis-Kokitsi Maninis
S. Popov
V. Ferrari
ViT
29
17
0
24 Mar 2022
EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation
Hansheng Chen
Pichao Wang
Fan Wang
Wei Tian
Lu Xiong
Hao Li
99
146
0
24 Mar 2022
Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer
Omkar Thawakar
Sanath Narayan
Jiale Cao
Hisham Cholakkal
Rao Muhammad Anwer
Muhammad Haris Khan
Salman Khan
M. Felsberg
Fahad Shahbaz Khan
ViT
12
14
0
24 Mar 2022
Global Tracking Transformers
Xingyi Zhou
Tianwei Yin
V. Koltun
Philipp Krahenbuhl
VOT
31
133
0
24 Mar 2022
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Likun Cai
Zhi-Li Zhang
Yi Zhu
Li Zhang
Mu Li
Xiangyang Xue
VLM
ObjD
43
40
0
24 Mar 2022
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning
Juncheng Li
Junlin Xie
Long Qian
Linchao Zhu
Siliang Tang
Fei Wu
Yi Yang
Yueting Zhuang
Xinze Wang
39
73
0
24 Mar 2022
Focus-and-Detect: A Small Object Detection Framework for Aerial Images
Onur Can Koyun
Reyhan Kevser Keser
Ibrahim Batuhan Akkaya
B. U. Toreyin
ObjD
13
69
0
24 Mar 2022
Object Memory Transformer for Object Goal Navigation
Rui Fukushima
Keita Ota
Asako Kanezaki
Y. Sasaki
Yusuke Yoshiyasu
20
34
0
24 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
F. Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViT
MedIm
27
28
0
24 Mar 2022
RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization
Yan Xu
Junyi Lin
Guofeng Zhang
Xiaogang Wang
Hongsheng Li
37
58
0
24 Mar 2022
Beyond Fixation: Dynamic Window Visual Transformer
Pengzhen Ren
Changlin Li
Guangrun Wang
Yun Xiao
Qing Du
Xiaodan Liang
Qing Du Xiaodan Liang Xiaojun Chang
ViT
33
32
0
24 Mar 2022
Sparse Instance Activation for Real-Time Instance Segmentation
Tianheng Cheng
Xinggang Wang
Shaoyu Chen
Wenqiang Zhang
Qian Zhang
Chang Huang
Zhaoxiang Zhang
Wenyu Liu
ISeg
33
125
0
24 Mar 2022
Bilaterally Slimmable Transformer for Elastic and Efficient Visual Question Answering
Zhou Yu
Zitian Jin
Jun Yu
Mingliang Xu
Hongbo Wang
Jianping Fan
33
4
0
24 Mar 2022
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
Renrui Zhang
Han Qiu
Tai Wang
Ziyu Guo
Xuan Xu
Xuanzhuo Xu
Ziteng Cui
Peng Gao
Hongsheng Li
Hongsheng Li
ViT
MDE
59
82
0
24 Mar 2022
Unsupervised Salient Object Detection with Spectral Cluster Voting
Gyungin Shin
Samuel Albanie
Weidi Xie
31
65
0
23 Mar 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Zhan Tong
Yibing Song
Jue Wang
Limin Wang
ViT
158
1,130
0
23 Mar 2022
DPST: De Novo Peptide Sequencing with Amino-Acid-Aware Transformers
Yan Yang
Zakir Hossain
Khan Asif
Liyuan Pan
Shafin Rahman
Eric A. Stone
9
4
0
23 Mar 2022
Scale-Equivalent Distillation for Semi-Supervised Object Detection
Qiushan Guo
Yao Mu
Jianyu Chen
Tianqi Wang
Yizhou Yu
Ping Luo
23
28
0
23 Mar 2022
Deep Frequency Filtering for Domain Generalization
Shiqi Lin
Zhizheng Zhang
Zhipeng Huang
Yan Lu
Cuiling Lan
...
Jiang Wang
Zicheng Liu
Amey Parulkar
V. Navkal
Zhibo Chen
35
50
0
23 Mar 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
40
1,538
0
23 Mar 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
38
263
0
22 Mar 2022
Open-Vocabulary DETR with Conditional Matching
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
ObjD
VLM
41
197
0
22 Mar 2022
Meta-attention for ViT-backed Continual Learning
Mengqi Xue
Haofei Zhang
Mingli Song
Mingli Song
CLL
32
42
0
22 Mar 2022
Weakly-Supervised Salient Object Detection Using Point Supervision
Shuyong Gao
Wei Zhang
Yan Wang
Qianyu Guo
Chenglong Zhang
Yang He
Wenqiang Zhang
23
53
0
22 Mar 2022
QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation
Yuxin Hong
Xuelin Qian
Simian Luo
Xiangyang Xue
Yanwei Fu
30
2
0
22 Mar 2022
TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers
Xuyang Bai
Zeyu Hu
Xinge Zhu
Qingqiu Huang
Yilun Chen
Hongbo Fu
Chiew-Lan Tai
ViT
3DPC
42
586
0
22 Mar 2022
Scalable Video Object Segmentation with Identification Mechanism
Zongxin Yang
Jiaxu Miao
Yunchao Wei
Wenguan Wang
Xiaohan Wang
Yi Yang
VOS
44
23
0
22 Mar 2022
Multi-Modal Learning for AU Detection Based on Multi-Head Fused Transformers
Xiang Zhang
L. Yin
ViT
24
12
0
22 Mar 2022
Global Matching with Overlapping Attention for Optical Flow Estimation
Shiyu Zhao
Long Zhao
Zhixing Zhang
Enyu Zhou
Dimitris N. Metaxas
3DPC
33
76
0
21 Mar 2022
Test-time Adaptation with Slot-Centric Models
Mihir Prabhudesai
Anirudh Goyal
S. Paul
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gaurav Aggarwal
Thomas Kipf
Deepak Pathak
Katerina Fragkiadaki
TTA
26
9
0
21 Mar 2022
Transforming Model Prediction for Tracking
Christoph Mayer
Martin Danelljan
Goutam Bhat
M. Paul
D. Paudel
Feng Yu
Luc Van Gool
64
229
0
21 Mar 2022
Masked Discrimination for Self-Supervised Learning on Point Clouds
Haotian Liu
Mu Cai
Yong Jae Lee
3DPC
21
164
0
21 Mar 2022
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark
Li Chen
Chonghao Sima
Yang Li
Zehan Zheng
Jiajie Xu
...
Hongyang Li
Conghui He
Jianping Shi
Yu Qiao
Junchi Yan
3DPC
ViT
35
181
0
21 Mar 2022
MixFormer: End-to-End Tracking with Iterative Mixed Attention
Yutao Cui
Jiang Cheng
Limin Wang
Gangshan Wu
VOT
34
454
0
21 Mar 2022
MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer
Kuan-Chih Huang
Tsung-Han Wu
Hung-Ting Su
Winston H. Hsu
ViT
MDE
15
159
0
21 Mar 2022
AnoViT: Unsupervised Anomaly Detection and Localization with Vision Transformer-based Encoder-Decoder
Yunseung Lee
Pilsung Kang
ViT
32
74
0
21 Mar 2022
ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer
Rui Yang
Hailong Ma
Jie Wu
Yansong Tang
Xuefeng Xiao
Min Zheng
Xiu Li
ViT
19
53
0
21 Mar 2022
DSRRTracker: Dynamic Search Region Refinement for Attention-based Siamese Multi-Object Tracking
Jia Wan
Hong Zhang
J. Zhang
Yuan Ding
Yifan Yang
Yan Li
Xuliang Li
VOT
46
5
0
21 Mar 2022
LocATe: End-to-end Localization of Actions in 3D with Transformers
Jiankai Sun
Bolei Zhou
Michael J. Black
Arjun Chandrasekaran
64
8
0
21 Mar 2022
FUTR3D: A Unified Sensor Fusion Framework for 3D Detection
Xuanyao Chen
Tianyuan Zhang
Yue Wang
Yilun Wang
Hang Zhao
3DPC
39
233
0
20 Mar 2022
Parallel Instance Query Network for Named Entity Recognition
Yongliang Shen
Xiaobin Wang
Zeqi Tan
Guangwei Xu
Pengjun Xie
Fei Huang
Weiming Lu
Yueting Zhuang
24
57
0
20 Mar 2022
Previous
1
2
3
...
88
89
90
...
104
105
106
Next