ResearchTrend.AI
End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,281 papers shown
Occlusion-Aware Instance Segmentation via BiLayer Network Architectures
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
ISeg
32
11
0
08 Aug 2022
Semi-Supervised Cross-Modal Salient Object Detection with U-Structure Networks
Yunqing Bao
Hang Dai
Abdulmotaleb Elsaddik
24
2
0
08 Aug 2022
3D Vision with Transformers: A Survey
Jean Lahoud
Jiale Cao
Fahad Shahbaz Khan
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Ming-Hsuan Yang
ViT
MedIm
47
32
0
08 Aug 2022
Global Hierarchical Attention for 3D Point Cloud Analysis
Dan Jia
Alexander Hermans
Bastian Leibe
3DPC
21
0
0
07 Aug 2022
Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation
Sebastian Lutz
R. Blythman
Koustav Ghosal
Matthew Moynihan
C. Simms
A. Smolic
ViT
34
15
0
07 Aug 2022
No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects
Raja Sunkara
Tie-Mei Luo
ObjD
27
323
0
07 Aug 2022
Blackbox Attacks via Surrogate Ensemble Search
Zikui Cai
Chengyu Song
S. Krishnamurthy
Amit K. Roy-Chowdhury
Ulugbek S. Kamilov
AAML
27
19
0
07 Aug 2022
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
Chaoqiang Zhao
Youming Zhang
Matteo Poggi
Fabio Tosi
Xianda Guo
Zheng Zhu
Guan Huang
Yang Tang
S. Mattoccia
ViT
MDE
49
176
0
06 Aug 2022
HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization
É. Mathian
H. Liu
L. Fernandez-Cuesta
Dimitris Samaras
M. Foll
L. Chen
ViT
33
12
0
06 Aug 2022
Transformer-based assignment decision network for multiple object tracking
Athena Psalta
Vasileios Tsironis
K. Karantzalos
VOT
79
9
0
06 Aug 2022
Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter
Aleksandar Stanić
Yujin Tang
David R Ha
Jürgen Schmidhuber
ELM
31
13
0
05 Aug 2022
ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding
Bingning Wang
Feiya Lv
Ting Yao
Yiming Yuan
Jin Ma
Yu Luo
Haijin Liang
31
3
0
05 Aug 2022
Multimodal Brain Disease Classification with Functional Interaction Learning from Single fMRI Volume
Weiming Dai
Zi-Ke Zhang
L. Tian
Shengyuan Yu
Shuhui Wang
Zhao Dong
Hairong Zheng
MedIm
33
2
0
05 Aug 2022
Calibrate the inter-observer segmentation uncertainty via diagnosis-first principle
Junde Wu
Huihui Fang
Hoayi Xiong
Lixin Duan
Mingkui Tan
Weihua Yang
Huiying Liu
Yanwu Xu
MedIm
55
1
0
05 Aug 2022
TransMatting: Enhancing Transparent Objects Matting with Transformers
Huanqia Cai
Fanglei Xue
Lele Xu
Lili Guo
ViT
21
20
0
05 Aug 2022
An Efficient Person Clustering Algorithm for Open Checkout-free Groceries
Junde Wu
Yu Zhang
Rao Fu
Yuanpei Liu
Jing Gao
29
1
0
05 Aug 2022
Vision-Centric BEV Perception: A Survey
Yuexin Ma
Tai Wang
Xuyang Bai
Huitong Yang
Yuenan Hou
Yaming Wang
Yu Qiao
Ruigang Yang
Tianyi Zhou
Xinge Zhu
66
130
0
04 Aug 2022
TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection
Zhipeng Luo
Gongjie Zhang
Changqing Zhou
Ti Liu
Shijian Lu
Liang Pan
3DPC
ViT
53
9
0
04 Aug 2022
DropKey
Bonan Li
Yinhan Hu
Xuecheng Nie
Congying Han
Xiangjian Jiang
Tiande Guo
Luoqi Liu
20
11
0
04 Aug 2022
MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth
Chenjie Cao
Xinlin Ren
Yanwei Fu
36
48
0
04 Aug 2022
End-to-end deep learning for directly estimating grape yield from ground-based imagery
A. Olenskyj
B. Sams
Zhenghao Fei
Vishal Singh
P. Raja
G. Bornhorst
J. M. Earles
31
28
0
04 Aug 2022
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
De-An Huang
Zhiding Yu
Anima Anandkumar
VLM
58
78
0
03 Aug 2022
Re-Attention Transformer for Weakly Supervised Object Localization
Hui Su
Yue Ye
Zhiwei Chen
Min-Gyoo Song
Lechao Cheng
WSOL
38
14
0
03 Aug 2022
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
Jun Wang
M. Gao
Yuqian Hu
Ramprasaath R. Selvaraju
Chetan Ramaiah
Ran Xu
J. JáJá
Larry S. Davis
ViT
35
17
0
03 Aug 2022
ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries
Junru Gu
Chenxu Hu
Tian-Yi Zhang
Xuanyao Chen
Yilun Wang
Yue Wang
Hang Zhao
33
98
0
02 Aug 2022
Unified Normalization for Accelerating and Stabilizing Transformers
Qiming Yang
Kai Zhang
Chaoxiang Lan
Zhi Yang
Zheyang Li
Wenming Tan
Jun Xiao
Shiliang Pu
23
8
0
02 Aug 2022
Making the Best of Both Worlds: A Domain-Oriented Transformer for Unsupervised Domain Adaptation
Wen-hui Ma
Jinming Zhang
Shuang Li
Chi Harold Liu
Yulin Wang
Wei Li
26
14
0
02 Aug 2022
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation
Ye Yu
Jialing Yuan
Gaurav Mittal
Fuxin Li
Mei Chen
VOS
67
28
0
01 Aug 2022
Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem
Zheng Wang
Wenjie Ruan
ViT
44
8
0
01 Aug 2022
Cross Attention Based Style Distribution for Controllable Person Image Synthesis
Xinyue Zhou
M. Yin
Xinyuan Chen
Li Sun
Changxin Gao
Qingli Li
DiffM
19
54
0
01 Aug 2022
Search for or Navigate to? Dual Adaptive Thinking for Object Navigation
Ronghao Dang
Liuyi Wang
Zongtao He
Shuai Su
Chengju Liu
Qi Chen
22
16
0
01 Aug 2022
One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement
Zihao Yin
Ping Gong
Chun-yu Wang
Yizhou Yu
Yizhou Wang
37
12
0
31 Jul 2022
Less is More: Consistent Video Depth Estimation with Masked Frames Modeling
Yiran Wang
Zhiyu Pan
Xingyi Li
Zhiguo Cao
Ke Xian
Jianming Zhang
35
27
0
31 Jul 2022
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning
Zhipeng Zhang
Zhimin Wei
Zhongzhen Huang
Rui Niu
Peng Wang
ObjD
LRM
25
9
0
31 Jul 2022
Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding
Hao-Kai Wen
Yunze Liu
Jingwei Huang
Bokun Duan
Li Yi
ViT
3DPC
33
26
0
30 Jul 2022
Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation Exploitation
Gongjie Zhang
Zhipeng Luo
Kaiwen Cui
Shijian Lu
Eric P. Xing
ViT
51
93
0
30 Jul 2022
Computer Vision Methods for the Microstructural Analysis of Materials: The State-of-the-art and Future Perspectives
Khaled Alrfou
Amir Kordijazi
Tian Zhao
3DV
50
6
0
29 Jul 2022
End-to-end View Synthesis via NeRF Attention
Zelin Zhao
Jiaya Jia
39
8
0
29 Jul 2022
Prompting for Multi-Modal Tracking
Jinyu Yang
Zhe Li
Fengcai Zheng
A. Leonardis
Jingkuan Song
26
89
0
29 Jul 2022
ScaleFormer: Revisiting the Transformer-based Backbones from a Scale-wise Perspective for Medical Image Segmentation
Huimin Huang
Shiao Xie
Lanfen Lin
Yutaro Iwamoto
X. Han
Yen-Wei Chen
Ruofeng Tong
ViT
MedIm
27
45
0
29 Jul 2022
Visual Recognition by Request
Chufeng Tang
Lingxi Xie
Xiaopeng Zhang
Xiaolin Hu
Qi Tian
VLM
21
15
0
28 Jul 2022
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion
Gongjie Zhang
Zhipeng Luo
Jiaxing Huang
Shijian Lu
Eric Xing
ViT
46
20
0
28 Jul 2022
A Transformer-based Generative Adversarial Network for Brain Tumor Segmentation
Liqun Huang
Long Chen
Bai-wen Zhang
S. Chai
MedIm
ViT
26
27
0
28 Jul 2022
Towards Large-Scale Small Object Detection: Survey and Benchmarks
Gong Cheng
Xiang Yuan
Xiwen Yao
Ke Yan
Qinghua Zeng
Xingxing Xie
Junwei Han
ObjD
41
308
0
28 Jul 2022
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Hao Shao
Letian Wang
Ruobing Chen
Hongsheng Li
Y. Liu
52
196
0
28 Jul 2022
Video Mask Transfiner for High-Quality Video Instance Segmentation
Lei Ke
Henghui Ding
Martin Danelljan
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
29
29
0
28 Jul 2022
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers
Junhyeong Cho
Youwang Kim
Tae-Hyun Oh
ViT
16
121
0
27 Jul 2022
AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing
Jiaxi Jiang
Paul Streli
Huajian Qiu
A. Fender
Larissa Laich
Patrick Snape
Christian Holz
26
106
0
27 Jul 2022
Membership Inference Attacks via Adversarial Examples
Hamid Jalalzai
Elie Kadoche
Rémi Leluc
Vincent Plassier
AAML
FedML
MIACV
55
7
0
27 Jul 2022
Iterative Scene Graph Generation
Siddhesh Khandelwal
Leonid Sigal
OCL
34
29
0
27 Jul 2022