ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.12872
  4. Cited By
End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN
ArXivPDFHTML

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,307 papers shown
Title
tSF: Transformer-based Semantic Filter for Few-Shot Learning
tSF: Transformer-based Semantic Filter for Few-Shot Learning
Jinxiang Lai
Siqian Yang
Wenlong Liu
Yi Zeng
Zhongyi Huang
Wenlong Wu
Jun Liu
Bin-Bin Gao
Chengjie Wang
VLM
28
19
0
02 Nov 2022
Siamese Transition Masked Autoencoders as Uniform Unsupervised Visual
  Anomaly Detector
Siamese Transition Masked Autoencoders as Uniform Unsupervised Visual Anomaly Detector
Haiming Yao
Xue Wang
Wenyong Yu
48
9
0
01 Nov 2022
Multi-Camera Calibration Free BEV Representation for 3D Object Detection
Multi-Camera Calibration Free BEV Representation for 3D Object Detection
Hongxiang Jiang
Wenming Meng
Hongmei Zhu
Qiaosheng Zhang
Jihao Yin
45
4
0
31 Oct 2022
LAD-RCNN:A Powerful Tool for Livestock Face Detection and Normalization
LAD-RCNN:A Powerful Tool for Livestock Face Detection and Normalization
Lin Sun
Guiqiong Liu
Xunping Jiang
Jing Liu
X. Wang
Hang Yang
Shiping Yang
CVBM
30
6
0
31 Oct 2022
Time-rEversed diffusioN tEnsor Transformer: A new TENET of Few-Shot
  Object Detection
Time-rEversed diffusioN tEnsor Transformer: A new TENET of Few-Shot Object Detection
Shan Zhang
Naila Murray
Lei Wang
Piotr Koniusz
ViT
45
16
0
30 Oct 2022
Relative Attention-based One-Class Adversarial Autoencoder for
  Continuous Authentication of Smartphone Users
Relative Attention-based One-Class Adversarial Autoencoder for Continuous Authentication of Smartphone Users
Mingming Hu
Kun Zhang
Ruibang You
Bibo Tu
AAML
35
1
0
30 Oct 2022
Two-Level Temporal Relation Model for Online Video Instance Segmentation
Two-Level Temporal Relation Model for Online Video Instance Segmentation
Ç. S. Çoban
Oguzhan Keskin
Jordi Pont-Tuset
Fatma Guney
VOS
40
0
0
30 Oct 2022
Pair DETR: Contrastive Learning Speeds Up DETR Training
Pair DETR: Contrastive Learning Speeds Up DETR Training
M. Iranmanesh
Xiaotong Chen
Kuo-Chin Lien
ViT
32
0
0
29 Oct 2022
ImplantFormer: Vision Transformer based Implant Position Regression
  Using Dental CBCT Data
ImplantFormer: Vision Transformer based Implant Position Regression Using Dental CBCT Data
Xinquan Yang
Xuguang Li
Xuechen Li
Pei-Yao Wu
Linlin Shen
Yongqiang Deng
MedIm
52
8
0
29 Oct 2022
Grafting Vision Transformers
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
29
3
0
28 Oct 2022
VLT: Vision-Language Transformer and Query Generation for Referring
  Segmentation
VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
Henghui Ding
Chang Liu
Suchen Wang
Xudong Jiang
86
116
0
28 Oct 2022
Towards Improving Workers' Safety and Progress Monitoring of
  Construction Sites Through Construction Site Understanding
Towards Improving Workers' Safety and Progress Monitoring of Construction Sites Through Construction Site Understanding
Mahdi Bonyani
Maryam Soleymani
29
0
0
27 Oct 2022
Li3DeTr: A LiDAR based 3D Detection Transformer
Li3DeTr: A LiDAR based 3D Detection Transformer
Gopi Krishna Erabati
Helder Araújo
ViT
3DPC
117
14
0
27 Oct 2022
MSF3DDETR: Multi-Sensor Fusion 3D Detection Transformer for Autonomous
  Driving
MSF3DDETR: Multi-Sensor Fusion 3D Detection Transformer for Autonomous Driving
Gopi Krishna Erabati
Helder Araújo
ViT
3DPC
29
3
0
27 Oct 2022
The 1st-place Solution for ECCV 2022 Multiple People Tracking in Group
  Dance Challenge
The 1st-place Solution for ECCV 2022 Multiple People Tracking in Group Dance Challenge
Yuang Zhang
Tiancai Wang
Weiyao Lin
Xiangyu Zhang
13
0
0
27 Oct 2022
Visual Semantic Parsing: From Images to Abstract Meaning Representation
Visual Semantic Parsing: From Images to Abstract Meaning Representation
M. A. Abdelsalam
Zhan Shi
Federico Fancellu
Kalliopi Basioti
Dhaivat Bhatt
Vladimir Pavlovic
Afsaneh Fazly
GNN
65
4
0
26 Oct 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task
  Learning with Model-Accelerator Co-design
M3^33ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
44
82
0
26 Oct 2022
End-to-end Tracking with a Multi-query Transformer
End-to-end Tracking with a Multi-query Transformer
Bruno Korbar
Andrew Zisserman
VOT
32
6
0
26 Oct 2022
Discovering Design Concepts for CAD Sketches
Discovering Design Concepts for CAD Sketches
Yuezhi Yang
Hao Pan
19
12
0
26 Oct 2022
Can Transformer Attention Spread Give Insights Into Uncertainty of
  Detected and Tracked Objects?
Can Transformer Attention Spread Give Insights Into Uncertainty of Detected and Tracked Objects?
Felicia Ruppel
F. Faion
Claudius Gläser
Klaus C. J. Dietmayer
21
0
0
26 Oct 2022
Cross-View Image Sequence Geo-localization
Cross-View Image Sequence Geo-localization
Xiaohan Zhang
Waqas Sultani
S. Wshah
29
22
0
25 Oct 2022
Refining Action Boundaries for One-stage Detection
Refining Action Boundaries for One-stage Detection
Hanyuan Wang
Majid Mirmehdi
Dima Damen
Toby Perrett
ObjD
37
1
0
25 Oct 2022
Search for Concepts: Discovering Visual Concepts Using Direct
  Optimization
Search for Concepts: Discovering Visual Concepts Using Direct Optimization
P. Reddy
Paul Guerrero
Niloy J. Mitra
OCL
26
4
0
25 Oct 2022
Pointly-Supervised Panoptic Segmentation
Pointly-Supervised Panoptic Segmentation
Junsong Fan
Zhaoxiang Zhang
Tieniu Tan
40
23
0
25 Oct 2022
A Comparative Attention Framework for Better Few-Shot Object Detection
  on Aerial Images
A Comparative Attention Framework for Better Few-Shot Object Detection on Aerial Images
Pierre Le Jeune
Anissa Zergaïnoh-Mokraoui
ObjD
49
3
0
25 Oct 2022
End-to-end Transformer for Compressed Video Quality Enhancement
End-to-end Transformer for Compressed Video Quality Enhancement
Li Yu
Wenshuai Chang
Shiyu Wu
Moncef Gabbouj
ViT
28
8
0
25 Oct 2022
Strong-TransCenter: Improved Multi-Object Tracking based on Transformers
  with Dense Representations
Strong-TransCenter: Improved Multi-Object Tracking based on Transformers with Dense Representations
Amit Galor
Roy Orfaig
B. Bobrovsky
VOT
50
6
0
24 Oct 2022
Video based Object 6D Pose Estimation using Transformers
Video based Object 6D Pose Estimation using Transformers
Apoorva Beedu
Huda AlAmri
Irfan Essa
ViT
24
8
0
24 Oct 2022
MetaFormer Baselines for Vision
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
45
162
0
24 Oct 2022
Towards Unifying Reference Expression Generation and Comprehension
Towards Unifying Reference Expression Generation and Comprehension
Duo Zheng
Tao Kong
Ya Jing
Jiaan Wang
Xiaojie Wang
ObjD
35
6
0
24 Oct 2022
Iterative Patch Selection for High-Resolution Image Recognition
Iterative Patch Selection for High-Resolution Image Recognition
Benjamin Bergner
C. Lippert
Aravindh Mahendran
36
13
0
24 Oct 2022
Holistically-Attracted Wireframe Parsing: From Supervised to
  Self-Supervised Learning
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning
Nan Xue
Tianfu Wu
Song Bai
Fu-Dong Wang
Gui-Song Xia
Lefei Zhang
Philip Torr
14
23
0
24 Oct 2022
BARS: A Benchmark for Airport Runway Segmentation
BARS: A Benchmark for Airport Runway Segmentation
Wenhui Chen
Zhijiang Zhang
Liang Yu
Yichun Tai
19
11
0
24 Oct 2022
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context
  Propagation in Transformers
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers
Zhuo Huang
Zhiyou Zhao
Banghuai Li
Jungong Han
3DPC
ViT
42
55
0
23 Oct 2022
Extending Phrase Grounding with Pronouns in Visual Dialogues
Extending Phrase Grounding with Pronouns in Visual Dialogues
Panzhong Lu
Xin Zhang
Meishan Zhang
Min Zhang
ObjD
37
4
0
23 Oct 2022
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing
  Data
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data
Yangfan Zhan
Zhitong Xiong
Yuan. Yuan
78
110
0
23 Oct 2022
Spectrum-BERT: Pre-training of Deep Bidirectional Transformers for
  Spectral Classification of Chinese Liquors
Spectrum-BERT: Pre-training of Deep Bidirectional Transformers for Spectral Classification of Chinese Liquors
Yansong Wang
Yundong Sun
Yan-Jiao Fu
Dongjie Zhu
Zhaoshuo Tian
32
6
0
22 Oct 2022
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using
  Strips Window Attention
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention
Chi Zhang
Lu Zhou
Lei Wang
Zaiyan Dai
Jun Yang
ViT
47
24
0
22 Oct 2022
Instance-Aware Image Completion
Instance-Aware Image Completion
Ji-Ho Cho
Minguk Kang
Vibhav Vineet
Jaesik Park
ISeg
VLM
28
2
0
22 Oct 2022
Face Pyramid Vision Transformer
Face Pyramid Vision Transformer
Khawar Islam
M. Zaheer
Arif Mahmood
ViT
CVBM
29
4
0
21 Oct 2022
Automatic Cattle Identification using YOLOv5 and Mosaic Augmentation: A
  Comparative Analysis
Automatic Cattle Identification using YOLOv5 and Mosaic Augmentation: A Comparative Analysis
Rabindra Dulal
Lihong Zheng
M. A. Kabir
S. McGrath
J. Medway
D. Swain
Will Swain
28
18
0
21 Oct 2022
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal
  Language Grounding
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding
Yuechen Wang
Wen-gang Zhou
Houqiang Li
AI4TS
24
12
0
21 Oct 2022
Boosting vision transformers for image retrieval
Boosting vision transformers for image retrieval
Chull Hwan Song
Jooyoung Yoon
Shunghyun Choi
Yannis Avrithis
ViT
47
33
0
21 Oct 2022
3D Human Pose Estimation in Multi-View Operating Room Videos Using
  Differentiable Camera Projections
3D Human Pose Estimation in Multi-View Operating Room Videos Using Differentiable Camera Projections
Beerend G. A. Gerats
J. Wolterink
I. A. Broeders
3DH
35
11
0
21 Oct 2022
AROS: Affordance Recognition with One-Shot Human Stances
AROS: Affordance Recognition with One-Shot Human Stances
Abel Pacheco-Ortega
W. Mayol-Cuevas
3DH
47
0
0
21 Oct 2022
CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement
  Transformers
CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers
Pedro Castro
Tae-Kyun Kim
40
30
0
21 Oct 2022
Rethinking Learning Approaches for Long-Term Action Anticipation
Rethinking Learning Approaches for Long-Term Action Anticipation
Megha Nawhal
Akash Abdu Jyothi
Greg Mori
AI4TS
39
27
0
20 Oct 2022
Transformer-based Action recognition in hand-object interacting
  scenarios
Transformer-based Action recognition in hand-object interacting scenarios
Hoseong Cho
Seungryul Baek
EgoV
42
2
0
20 Oct 2022
Transformer-based Global 3D Hand Pose Estimation in Two Hands
  Manipulating Objects Scenarios
Transformer-based Global 3D Hand Pose Estimation in Two Hands Manipulating Objects Scenarios
Hoseong Cho
Donguk Kim
Chanwoo Kim
Seongyeong Lee
Seungryul Baek
34
1
0
20 Oct 2022
Large-batch Optimization for Dense Visual Predictions
Large-batch Optimization for Dense Visual Predictions
Zeyue Xue
Jianming Liang
Guanglu Song
Zhuofan Zong
Liang Chen
Yu Liu
Ping Luo
VLM
52
9
0
20 Oct 2022
Previous
123...717273...105106107
Next