ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.12872
  4. Cited By
End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN
ArXivPDFHTML

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,221 papers shown
Title
VariabilityTrack:Multi-Object Tracking with Variable Speed Object Movement
Run Luo
Ji Wei
Qiao Lin
VOT
30
1
0
12 Mar 2022
One-stage Video Instance Segmentation: From Frame-in Frame-out to
  Clip-in Clip-out
One-stage Video Instance Segmentation: From Frame-in Frame-out to Clip-in Clip-out
Minghan Li
Lei Zhang
CLIP
VLM
39
1
0
12 Mar 2022
Joint CNN and Transformer Network via weakly supervised Learning for
  efficient crowd counting
Joint CNN and Transformer Network via weakly supervised Learning for efficient crowd counting
Fusen Wang
Kai Liu
Fei Long
Nong Sang
Xiaofeng Xia
J. Sang
ViT
40
19
0
12 Mar 2022
EventFormer: AU Event Transformer for Facial Action Unit Event Detection
EventFormer: AU Event Transformer for Facial Action Unit Event Detection
Yingjie Chen
Jiarui Zhang
Tao Wang
Yun Liang
ViT
29
0
0
12 Mar 2022
The Principle of Diversity: Training Stronger Vision Transformers Calls
  for Reducing All Levels of Redundancy
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy
Tianlong Chen
Zhenyu (Allen) Zhang
Yu Cheng
Ahmed Hassan Awadallah
Zhangyang Wang
ViT
41
37
0
12 Mar 2022
Deformable VisTR: Spatio temporal deformable attention for video
  instance segmentation
Deformable VisTR: Spatio temporal deformable attention for video instance segmentation
Sudhir Yarram
Jialian Wu
Pan Ji
Yi Tian Xu
Junsong Yuan
ViT
22
2
0
12 Mar 2022
Active Token Mixer
Active Token Mixer
Guoqiang Wei
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
15
0
11 Mar 2022
TAPE: Task-Agnostic Prior Embedding for Image Restoration
TAPE: Task-Agnostic Prior Embedding for Image Restoration
Lin Liu
Lingxi Xie
Xiaopeng Zhang
Shanxin Yuan
Xiangyu Chen
Wen-gang Zhou
Houqiang Li
Qi Tian
10
53
0
11 Mar 2022
Towards Self-Supervised Learning of Global and Object-Centric
  Representations
Towards Self-Supervised Learning of Global and Object-Centric Representations
Federico Baldassarre
Hossein Azizpour
SSL
3DPC
OCL
43
13
0
11 Mar 2022
Peng Cheng Object Detection Benchmark for Smart City
Peng Cheng Object Detection Benchmark for Smart City
Yaowei Wang
Zhouxin Yang
R. Liu
Deng Li
Yuandu Lai
Leyuan Fang
Yahong Han
ObjD
3DPC
14
1
0
11 Mar 2022
Visualizing and Understanding Patch Interactions in Vision Transformer
Visualizing and Understanding Patch Interactions in Vision Transformer
Jie Ma
Yalong Bai
Bineng Zhong
Wei Zhang
Ting Yao
Tao Mei
ViT
23
32
0
11 Mar 2022
The Overlooked Classifier in Human-Object Interaction Recognition
The Overlooked Classifier in Human-Object Interaction Recognition
Ying Jin
Yinpeng Chen
Lijuan Wang
Jianfeng Wang
Pei Yu
Lin Liang
Lei Li
Zicheng Liu
VLM
47
8
0
10 Mar 2022
Point Density-Aware Voxels for LiDAR 3D Object Detection
Point Density-Aware Voxels for LiDAR 3D Object Detection
Jordan S. K. Hu
Tianshu Kuai
Steven L. Waslander
3DPC
27
161
0
10 Mar 2022
PETR: Position Embedding Transformation for Multi-View 3D Object
  Detection
PETR: Position Embedding Transformation for Multi-View 3D Object Detection
Yingfei Liu
Tiancai Wang
Xinming Zhang
Jian Sun
3DPC
43
526
0
10 Mar 2022
Prediction-Guided Distillation for Dense Object Detection
Prediction-Guided Distillation for Dense Object Detection
Chenhongyi Yang
Mateusz Ochal
Amos Storkey
Elliot J. Crowley
32
28
0
10 Mar 2022
Representation Compensation Networks for Continual Semantic Segmentation
Representation Compensation Networks for Continual Semantic Segmentation
Chang-Bin Zhang
Jianqiang Xiao
Xialei Liu
Ying-Cong Chen
Mingg-Ming Cheng
SSeg
CLL
37
93
0
10 Mar 2022
Backbone is All Your Need: A Simplified Architecture for Visual Object
  Tracking
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
Boyu Chen
Peixia Li
Lei Bai
Leixian Qiao
Qiuhong Shen
Bo-wen Li
Weihao Gan
Wei Wu
Wanli Ouyang
ViT
VOT
22
183
0
10 Mar 2022
Domain Generalisation for Object Detection under Covariate and Concept
  Shift
Domain Generalisation for Object Detection under Covariate and Concept Shift
Karthik Seemakurthy
E. Aptoula
Charles Fox
Petra Bosilj
ObjD
OOD
26
10
0
10 Mar 2022
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
Seonghyeon Kim
Seung Shin
Yoonsik Kim
Han-Cheol Cho
Taeho Kil
Jaeheung Surh
Seunghyun Park
Bado Lee
Youngmin Baek
22
8
0
10 Mar 2022
OpenTAL: Towards Open Set Temporal Action Localization
OpenTAL: Towards Open Set Temporal Action Localization
Wentao Bao
Qi Yu
Yu Kong
EDL
30
26
0
10 Mar 2022
Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain
  Analysis: From Theory to Practice
Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice
Peihao Wang
Wenqing Zheng
Tianlong Chen
Zhangyang Wang
ViT
33
127
0
09 Mar 2022
Joint Learning of Salient Object Detection, Depth Estimation and Contour
  Extraction
Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction
Xiaoqi Zhao
Youwei Pang
Lihe Zhang
Huchuan Lu
29
21
0
09 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with
  Transformers
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
34
301
0
09 Mar 2022
Nested Named Entity Recognition as Latent Lexicalized Constituency
  Parsing
Nested Named Entity Recognition as Latent Lexicalized Constituency Parsing
Chao Lou
Aaron Courville
Kewei Tu
30
37
0
09 Mar 2022
Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and
  3D-Aware Ellipse Prediction
Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction
Matthieu Zins
Gilles Simon
M. Berger
29
13
0
09 Mar 2022
ChiTransformer:Towards Reliable Stereo from Cues
ChiTransformer:Towards Reliable Stereo from Cues
Qing Su
Shihao Ji
MDE
ViT
18
12
0
09 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for
  Segmentation
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
24
13
0
08 Mar 2022
YouTube-GDD: A challenging gun detection dataset with rich contextual
  information
YouTube-GDD: A challenging gun detection dataset with rich contextual information
Yongxiang Gu
Xingbin Liao
Xiaolin Qin
13
6
0
08 Mar 2022
Lane Detection with Versatile AtrousFormer and Local Semantic Guidance
Lane Detection with Versatile AtrousFormer and Local Semantic Guidance
Jiaxing Yang
Lihe Zhang
Huchuan Lu
ViT
29
19
0
08 Mar 2022
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary
  Camera Rigs
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
Lang Peng
Zhirong Chen
Zhang-Hua Fu
Pengpeng Liang
Erkang Cheng
54
131
0
08 Mar 2022
Enhancing Door-Status Detection for Autonomous Mobile Robots during
  Environment-Specific Operational Use
Enhancing Door-Status Detection for Autonomous Mobile Robots during Environment-Specific Operational Use
Michele Antonazzi
Matteo Luperto
Nicola Basilico
N. A. Borghese
17
4
0
08 Mar 2022
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
59
1,375
0
07 Mar 2022
Deep Learning Serves Traffic Safety Analysis: A Forward-looking Review
Deep Learning Serves Traffic Safety Analysis: A Forward-looking Review
Abolfazl Razi
Xiwen Chen
Huayu Li
Hao Wang
Brendan J. Russo
Yan Chen
Hongbin Yu
33
39
0
07 Mar 2022
Knowledge Amalgamation for Object Detection with Transformers
Knowledge Amalgamation for Object Detection with Transformers
Haofei Zhang
Feng Mao
Mengqi Xue
Gongfan Fang
Zunlei Feng
Mingli Song
Mingli Song
ViT
111
12
0
07 Mar 2022
Interactive Disambiguation for Behavior Tree Execution
Interactive Disambiguation for Behavior Tree Execution
Matteo Iovino
Fethiye Irmak Dogan
Iolanda Leite
Christian Smith
22
15
0
06 Mar 2022
Exploring Dual-task Correlation for Pose Guided Person Image Generation
Exploring Dual-task Correlation for Pose Guided Person Image Generation
Peng Zhang
Lingxiao Yang
Jianhuang Lai
Xiaohua Xie
ViT
29
81
0
06 Mar 2022
MetaFormer: A Unified Meta Framework for Fine-Grained Recognition
MetaFormer: A Unified Meta Framework for Fine-Grained Recognition
Qishuai Diao
Yi-Xin Jiang
Bin Wen
Jianxiang Sun
Zehuan Yuan
36
60
0
05 Mar 2022
Boosting Crowd Counting via Multifaceted Attention
Boosting Crowd Counting via Multifaceted Attention
Hui Lin
Zhiheng Ma
Rongrong Ji
Yaowei Wang
Xiaopeng Hong
23
145
0
05 Mar 2022
Contextformer: A Transformer with Spatio-Channel Attention for Context
  Modeling in Learned Image Compression
Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression
A. B. Koyuncu
Han Gao
Atanas Boev
Georgii Gaikov
Elena Alshina
Eckehard Steinbach
ViT
39
68
0
04 Mar 2022
Rethinking Efficient Lane Detection via Curve Modeling
Rethinking Efficient Lane Detection via Curve Modeling
Zhengyang Feng
Shaohua Guo
Xin Tan
Ke Xu
Min Wang
Lizhuang Ma
38
144
0
04 Mar 2022
DiT: Self-supervised Pre-training for Document Image Transformer
DiT: Self-supervised Pre-training for Document Image Transformer
Junlong Li
Yiheng Xu
Tengchao Lv
Lei Cui
Chaoxi Zhang
Furu Wei
ViT
VLM
35
159
0
04 Mar 2022
ViT-P: Rethinking Data-efficient Vision Transformers from Locality
ViT-P: Rethinking Data-efficient Vision Transformers from Locality
Bin Chen
Ran A. Wang
Di Ming
Xin Feng
ViT
18
7
0
04 Mar 2022
F2DNet: Fast Focal Detection Network for Pedestrian Detection
F2DNet: Fast Focal Detection Network for Pedestrian Detection
Abdul Hannan Khan
Mohsin Munir
L. V. Elst
Andreas Dengel
ObjD
24
24
0
04 Mar 2022
Patch Similarity Aware Data-Free Quantization for Vision Transformers
Patch Similarity Aware Data-Free Quantization for Vision Transformers
Zhikai Li
Liping Ma
Mengjuan Chen
Junrui Xiao
Qingyi Gu
MQ
ViT
19
44
0
04 Mar 2022
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field
  Disparity Estimation
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation
Peng Li
Jiayin Zhao
Jingyao Wu
Chao Deng
Haoqian Wang
Tao Yu
26
19
0
04 Mar 2022
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Jean-Michel Fortin
Olivier Gamache
Vincent Grondin
F. Pomerleau
Philippe Giguère
27
22
0
03 Mar 2022
Efficient Video Instance Segmentation via Tracklet Query and Proposal
Efficient Video Instance Segmentation via Tracklet Query and Proposal
Jialian Wu
Sudhir Yarram
Hui Liang
Tian Lan
Junsong Yuan
J. Eledath
Gérard Medioni
21
37
0
03 Mar 2022
Correlation-Aware Deep Tracking
Correlation-Aware Deep Tracking
Fei Xie
Chunyu Wang
Guangting Wang
Yue Cao
Wankou Yang
Wenjun Zeng
VOT
32
119
0
03 Mar 2022
Multi-Tailed Vision Transformer for Efficient Inference
Multi-Tailed Vision Transformer for Efficient Inference
Yunke Wang
Bo Du
Wenyuan Wang
Chang Xu
ViT
213
6
0
03 Mar 2022
Recent Advances in Vision Transformer: A Survey and Outlook of Recent
  Work
Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work
Khawar Islam
ViT
28
45
0
03 Mar 2022
Previous
123...909192...103104105
Next