ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.12872
  4. Cited By
End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN
ArXivPDFHTML

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,286 papers shown
Title
VMFormer: End-to-End Video Matting with Transformer
VMFormer: End-to-End Video Matting with Transformer
Jiacheng Li
Vidit Goel
Marianna Ohanyan
Shant Navasardyan
Yunchao Wei
Humphrey Shi
ViT
38
18
0
26 Aug 2022
From WSI-level to Patch-level: Structure Prior Guided Binuclear Cell
  Fine-grained Detection
From WSI-level to Patch-level: Structure Prior Guided Binuclear Cell Fine-grained Detection
Baomin Wang
G. Hu
Dan Chen
Lihua Hu
Cheng Li
Yu An
G. Hu
Guangyu Jia
21
1
0
26 Aug 2022
Few-Shot Learning Meets Transformer: Unified Query-Support Transformers
  for Few-Shot Classification
Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification
Xixi Wang
Tianlin Li
Bo Jiang
Bin Luo
45
43
0
26 Aug 2022
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted
  Window
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
Mocho Go
Hideyuki Tachibana
ViT
39
9
0
24 Aug 2022
Motion Robust High-Speed Light-Weighted Object Detection With Event
  Camera
Motion Robust High-Speed Light-Weighted Object Detection With Event Camera
Bing-Quan Liu
Chang Xu
Wen Yang
Huai Yu
Lei Yu
43
35
0
24 Aug 2022
Towards Efficient Use of Multi-Scale Features in Transformer-Based
  Object Detectors
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
Gongjie Zhang
Zhipeng Luo
Zichen Tian
Yingchen Yu
Jingyi Zhang
Shijian Lu
37
27
0
24 Aug 2022
A Spatio-Temporal Attentive Network for Video-Based Crowd Counting
A Spatio-Temporal Attentive Network for Video-Based Crowd Counting
Marco Avvenuti
Marco Bongiovanni
Luca Ciampi
Fabrizio Falchi
Claudio Gennaro
Nicola Messina
36
9
0
24 Aug 2022
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Yanbei Chen
Massimiliano Mancini
Xiatian Zhu
Zeynep Akata
50
115
0
24 Aug 2022
Federated Self-Supervised Contrastive Learning and Masked Autoencoder
  for Dermatological Disease Diagnosis
Federated Self-Supervised Contrastive Learning and Masked Autoencoder for Dermatological Disease Diagnosis
Yawen Wu
Dewen Zeng
Zhepeng Wang
Yi Sheng
Lei Yang
A. James
Yiyu Shi
Jingtong Hu
30
7
0
24 Aug 2022
SwinFIR: Revisiting the SwinIR with Fast Fourier Convolution and
  Improved Training for Image Super-Resolution
SwinFIR: Revisiting the SwinIR with Fast Fourier Convolution and Improved Training for Image Super-Resolution
Dafeng Zhang
Feiyu Huang
Shizhuo Liu
Xiaobing Wang
Zhezhu Jin
24
90
0
24 Aug 2022
Distance-Aware Occlusion Detection with Focused Attention
Distance-Aware Occlusion Detection with Focused Attention
Yongqian Li
Yucheng Tu
Xiaoxue Chen
Hao Zhao
Guyue Zhou
24
6
0
23 Aug 2022
DeepInteraction: 3D Object Detection via Modality Interaction
DeepInteraction: 3D Object Detection via Modality Interaction
Zeyu Yang
Jia-Qing Chen
Zhenwei Miao
Wei Li
Xiatian Zhu
Li Zhang
51
132
0
23 Aug 2022
A Comprehensive Study of Real-Time Object Detection Networks Across
  Multiple Domains: A Survey
A Comprehensive Study of Real-Time Object Detection Networks Across Multiple Domains: A Survey
Elahe Arani
Shruthi Gowda
Ratnajit Mukherjee
Omar Magdy
Senthilkumar S. Kathiresan
Bahram Zonooz
ObjD
OOD
37
27
0
23 Aug 2022
FocusFormer: Focusing on What We Need via Architecture Sampler
FocusFormer: Focusing on What We Need via Architecture Sampler
Jing Liu
Jianfei Cai
Bohan Zhuang
45
7
0
23 Aug 2022
Towards Accurate Facial Landmark Detection via Cascaded Transformers
Towards Accurate Facial Landmark Detection via Cascaded Transformers
Hui Li
Zidong Guo
Seon-Min Rhee
S. Han
Jae-Joon Han
CVBM
ViT
35
36
0
23 Aug 2022
Object Detection in Aerial Images with Uncertainty-Aware Graph Network
Object Detection in Aerial Images with Uncertainty-Aware Graph Network
Jongha Kim
Jinheon Baek
Sung Ju Hwang
35
0
0
23 Aug 2022
InstanceFormer: An Online Video Instance Segmentation Framework
InstanceFormer: An Online Video Instance Segmentation Framework
Rajat Koner
Tanveer Hannan
Suprosanna Shit
Sahand Sharifzadeh
Matthias Schubert
Thomas Seidl
Volker Tresp
ViT
37
14
0
22 Aug 2022
Design Automation for Fast, Lightweight, and Effective Deep Learning
  Models: A Survey
Design Automation for Fast, Lightweight, and Effective Deep Learning Models: A Survey
Dalin Zhang
Kaixuan Chen
Yan Zhao
B. Yang
Li-Ping Yao
Christian S. Jensen
53
3
0
22 Aug 2022
A Simple Baseline for Multi-Camera 3D Object Detection
A Simple Baseline for Multi-Camera 3D Object Detection
Yunpeng Zhang
Wenzhao Zheng
Zhengbiao Zhu
Guan Huang
Jie Zhou
Jiwen Lu
3DPC
27
19
0
22 Aug 2022
DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Jingyu Lin
Jie Jiang
Y. Yan
Chunchao Guo
Hongfa Wang
Wei Liu
Hanzi Wang
ViT
36
3
0
21 Aug 2022
SnowFormer: Context Interaction Transformer with Scale-awareness for
  Single Image Desnowing
SnowFormer: Context Interaction Transformer with Scale-awareness for Single Image Desnowing
Sixiang Chen
Tian-Chun Ye
Yun-Peng Liu
Erkang Chen
ViT
34
12
0
20 Aug 2022
Multiple Instance Neuroimage Transformer
Multiple Instance Neuroimage Transformer
Ayush Singla
Qingyu Zhao
Daniel K. Do
Yuyin Zhou
K. Pohl
Ehsan Adeli
ViT
MedIm
26
11
0
19 Aug 2022
Accelerating Vision Transformer Training via a Patch Sampling Schedule
Accelerating Vision Transformer Training via a Patch Sampling Schedule
Bradley McDanel
C. Huynh
ViT
30
1
0
19 Aug 2022
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview
  Pedestrian Detection with Attention
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention
Jinwoo Hwang
Philipp Benz
Tae-Hoon Kim
ViT
31
3
0
19 Aug 2022
Improved Image Classification with Token Fusion
Improved Image Classification with Token Fusion
Keong-Hun Choi
Jin-Woo Kim
Yaolong Wang
J. Ha
ViT
26
0
0
19 Aug 2022
Single-Stage Open-world Instance Segmentation with Cross-task
  Consistency Regularization
Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization
Xizhe Xue
Dongdong Yu
Lingqiao Liu
Yu Liu
Satoshi Tsutsui
Ying Li
Zehuan Yuan
Ping Song
Mike Zheng Shou
ISeg
30
4
0
18 Aug 2022
Open-Vocabulary Universal Image Segmentation with MaskCLIP
Open-Vocabulary Universal Image Segmentation with MaskCLIP
Zheng Ding
Jieke Wang
Zhuowen Tu
CLIP
ISeg
VLM
52
86
0
18 Aug 2022
GSRFormer: Grounded Situation Recognition Transformer with Alternate
  Semantic Attention Refinement
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Teruko Mitamura
Alexander G. Hauptmann
18
35
0
18 Aug 2022
Learning Spatial-Frequency Transformer for Visual Object Tracking
Learning Spatial-Frequency Transformer for Visual Object Tracking
Chuanming Tang
Tianlin Li
Yuanchao Bai
Zhe Wu
Jianlin Zhang
Yongmei Huang
ViT
42
43
0
18 Aug 2022
RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object
  Detection
RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection
Chang Xu
Jinwang Wang
Wen Yang
Huai Yu
Lei Yu
Guisong Xia
ObjD
44
130
0
18 Aug 2022
Unifying Visual Perception by Dispersible Points Learning
Unifying Visual Perception by Dispersible Points Learning
Jianming Liang
Guanglu Song
B. Leng
Yu Liu
VOS
OCL
12
3
0
18 Aug 2022
Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in
  Driving Scenes
Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in Driving Scenes
Yu-Huan Wu
Da Zhang
Le Zhang
Xin Zhan
Dengxin Dai
Yun-Hai Liu
Ming-Ming Cheng
3DPC
28
2
0
18 Aug 2022
Conviformers: Convolutionally guided Vision Transformer
Conviformers: Convolutionally guided Vision Transformer
Mohit Vaishnav
Thomas Fel
I. F. Rodriguez
Thomas Serre
ViT
43
1
0
17 Aug 2022
InterTrack: Interaction Transformer for 3D Multi-Object Tracking
InterTrack: Interaction Transformer for 3D Multi-Object Tracking
John Willes
Cody Reading
Steven L. Waslander
VOT
37
13
0
17 Aug 2022
Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model
Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model
Xiulong Yang
Sheng-Min Shih
Yinlin Fu
Xiaoting Zhao
Shihao Ji
DiffM
33
56
0
16 Aug 2022
Temporal Action Localization with Multi-temporal Scales
Temporal Action Localization with Multi-temporal Scales
Zan Gao
Xinglei Cui
Tao Zhuo
Zhiyong Cheng
An-an Liu
Meng Wang
Shenyong Chen
ViT
28
1
0
16 Aug 2022
HoW-3D: Holistic 3D Wireframe Perception from a Single Image
HoW-3D: Holistic 3D Wireframe Perception from a Single Image
Wenchao Ma
Bin Tan
Nan Xue
Tianfu Wu
Xianwei Zheng
Guisong Xia
3DV
20
11
0
15 Aug 2022
HighlightNet: Highlighting Low-Light Potential Features for Real-Time
  UAV Tracking
HighlightNet: Highlighting Low-Light Potential Features for Real-Time UAV Tracking
Changhong Fu
Haolin Dong
Junjie Ye
Guang-Zheng Zheng
Sihang Li
Jilin Zhao
37
14
0
14 Aug 2022
Flow-Guided Transformer for Video Inpainting
Flow-Guided Transformer for Video Inpainting
Kaiwen Zhang
Jingjing Fu
Dong Liu
ViT
40
68
0
14 Aug 2022
Differentiable Inductive Logic Programming in High-Dimensional Space
Differentiable Inductive Logic Programming in High-Dimensional Space
Stanislaw J. Purgal
David M. Cerna
C. Kaliszyk
20
0
0
13 Aug 2022
Recent Progress in Transformer-based Medical Image Analysis
Recent Progress in Transformer-based Medical Image Analysis
Zhao-cheng Liu
Qiujie Lv
Ziduo Yang
Yifan Li
Chau Hung Lee
Leizhao Shen
MedIm
52
58
0
13 Aug 2022
Semi-supervised Vision Transformers at Scale
Semi-supervised Vision Transformers at Scale
Zhaowei Cai
Avinash Ravichandran
Paolo Favaro
Manchen Wang
Davide Modolo
Rahul Bhotika
Zhuowen Tu
Stefano Soatto
ViT
37
55
0
11 Aug 2022
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative
  Grounding
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Zihan Ding
Zixiang Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Si Liu
27
12
0
11 Aug 2022
OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark
  under Heterogeneous AI Computing Platforms
OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark under Heterogeneous AI Computing Platforms
Jiafan Zhuang
Xiansong Huang
Yang Yang
Jiancong Chen
Yue Yu
Wei-Nan Gao
Ge Li
Jie Chen
Tong Zhang
VLM
27
5
0
11 Aug 2022
Finding Reusable Machine Learning Components to Build Programming
  Language Processing Pipelines
Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Patrick Flynn
T. Vanderbruggen
C. Liao
Pei-Hung Lin
M. Emani
Xipeng Shen
34
4
0
11 Aug 2022
Multi-scale Feature Aggregation for Crowd Counting
Multi-scale Feature Aggregation for Crowd Counting
Xiaoheng Jiang
Xinyi Wu
Hisham Cholakkal
Rao Muhammad Anwer
Jiale Xu
Bing Zhou
Yanwei Pang
Fahad Shahbaz Khan
16
1
0
10 Aug 2022
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with
  Transformer
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with Transformer
Zhi-Chun Luo
Changqing Zhou
Liang Pan
Gongjie Zhang
Ti Liu
Yueru Luo
Haiyu Zhao
Ziwei Liu
Shijian Lu
3DPC
23
15
0
10 Aug 2022
TSRFormer: Table Structure Recognition with Transformers
TSRFormer: Table Structure Recognition with Transformers
Weihong Lin
Zhengmao Sun
Chixiang Ma
Mingze Li
Jiawei Wang
Lei-huan Sun
Qiang Huo
ViT
LMTD
39
34
0
09 Aug 2022
Sports Video Analysis on Large-Scale Data
Sports Video Analysis on Large-Scale Data
Dekun Wu
Henghui Zhao
Xingce Bao
Richard P. Wildes
29
13
0
09 Aug 2022
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze
  Estimation
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
Bolin Lai
Miao Liu
Fiona Ryan
James M. Rehg
ViT
45
33
0
08 Aug 2022
Previous
123...767778...104105106
Next