Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.04159
Cited By
Deformable DETR: Deformable Transformers for End-to-End Object Detection
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 916 papers shown
Title
Arena: A Patch-of-Interest ViT Inference Acceleration System for Edge-Assisted Video Analytics
Haosong Peng
Wei Feng
Hao Li
Yufeng Zhan
Qihua Zhou
Yuanqing Xia
33
2
0
14 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
31
15
0
04 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
71
2
0
04 Apr 2024
Roadside Monocular 3D Detection via 2D Detection Prompting
Yechi Ma
Shuoquan Wei
Churun Zhang
Wei Hua
Yanan Li
Shu Kong
51
0
0
01 Apr 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
51
7
0
28 Mar 2024
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Zeren Chen
Zhelun Shi
Xiaoya Lu
Lehan He
Sucheng Qian
...
Zhen-fei Yin
Jing Shao
Jing Shao
Cewu Lu
Cewu Lu
38
5
0
28 Mar 2024
Cross-domain Fiber Cluster Shape Analysis for Language Performance Cognitive Score Prediction
Yui Lo
Yuqian Chen
Dongnan Liu
Wan Liu
L. Zekelman
...
Yogesh Rathi
N. Makris
A. Golby
Weidong Cai
L. O’Donnell
31
6
0
27 Mar 2024
Multiple Object Tracking as ID Prediction
Ruopeng Gao
Yijun Zhang
Limin Wang
55
12
0
25 Mar 2024
Cross-domain Multi-modal Few-shot Object Detection via Rich Text
Zeyu Shangguan
Daniel Seita
Mohammad Rostami
ObjD
55
1
0
24 Mar 2024
FaceXFormer: A Unified Transformer for Facial Analysis
Kartik Narayan
VS Vibashan
Rama Chellappa
Vishal M. Patel
ViT
59
12
0
19 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
42
7
0
18 Mar 2024
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Ziying Song
Lei Yang
Shaoqing Xu
Lin Liu
Dongyang Xu
Caiyan Jia
Feiyang Jia
Li-e Wang
3DPC
65
14
0
18 Mar 2024
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Justin Kay
T. Haucke
Suzanne Stathatos
Siqi Deng
Erik Young
Pietro Perona
Sara Beery
Grant Van Horn
56
4
0
18 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
29
3
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
43
1
0
15 Mar 2024
Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Hongyuan Yu
Cheng Wan
Mengchen Liu
Dongdong Chen
Bin Xiao
Xiyang Dai
Yan Huang
Yuan Lu
Liang Wang
73
5
0
15 Mar 2024
ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images
Fangqiang Ding
Yunzhou Zhu
Xiangyu Wen
Gaowen Liu
Chris Xiaoxuan Lu
42
2
0
14 Mar 2024
A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
Quoc-Vinh Lai-Dang
ViT
33
2
0
12 Mar 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
68
13
0
12 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
35
4
0
07 Mar 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
30
5
0
06 Mar 2024
EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
Jiacheng Lin
Jiajun Chen
Kunyu Peng
Xuan He
Zhiyong Li
Rainer Stiefelhagen
Kailun Yang
50
6
0
28 Feb 2024
EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor Neighborhoods
Huiyuan Xiong
Jun Shen
Taohong Zhu
Yuelong Pan
30
3
0
28 Feb 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
47
16
0
28 Feb 2024
SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection
Junsu Kim
Hoseong Cho
Jihyeon Kim
Yihalem Yimolal Tiruneh
Seungryul Baek
DiffM
43
20
0
27 Feb 2024
Deployment Prior Injection for Run-time Calibratable Object Detection
Mo Zhou
Yiding Yang
Haoxiang Li
Vishal M. Patel
Gang Hua
44
0
0
27 Feb 2024
Multi-Human Mesh Recovery with Transformers
Zeyu Wang
Zhenzhen Weng
Serena Yeung-Levy
3DH
32
1
0
26 Feb 2024
Learning Low-Rank Feature for Thorax Disease Classification
Rajeev Goel
Utkarsh Nath
Yancheng Wang
Alvin C. Silva
Teresa Wu
Yingzhen Yang
22
0
0
14 Feb 2024
Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection
Pengfei Zhou
Weiqing Min
Jiajun Song
Yang Zhang
Shuqiang Jiang
35
10
0
14 Feb 2024
Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles
Rui Song
Chenwei Liang
Hu Cao
Zhiran Yan
Walter Zimmer
Markus Gross
Andreas Festag
Alois C. Knoll
36
21
0
12 Feb 2024
CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention
Yifeng Bai
Zhirong Chen
Pengpeng Liang
Erkang Cheng
Erkang Cheng
ViT
36
8
0
09 Feb 2024
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
Chengjian Feng
Yujie Zhong
Zequn Jie
Weidi Xie
Lin Ma
ObjD
38
13
0
08 Feb 2024
VIALM: A Survey and Benchmark of Visually Impaired Assistance with Large Models
Yi Zhao
Yilin Zhang
Rong Xiang
Jing Li
Hillming Li
43
16
0
29 Jan 2024
BlenDA: Domain Adaptive Object Detection through diffusion-based blending
Tzuhsuan Huang
Chen-Che Huang
Chung-Hao Ku
Jun-Cheng Chen
36
5
0
18 Jan 2024
Stream Query Denoising for Vectorized HD Map Construction
Shuo Wang
Fan Jia
Yingfei Liu
Yucheng Zhao
Zehui Chen
Tiancai Wang
Chi Zhang
Xiangyu Zhang
Feng Zhao
36
19
0
17 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
77
1
0
15 Jan 2024
Surface Normal Estimation with Transformers
Barry Shichen Hu
Siyun Liang
Johannes Paetzold
H. Nguyen
Isao Echizen
Jiapeng Tang
ViT
3DPC
28
0
0
11 Jan 2024
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving
Tianyu Li
Peijin Jia
Bangjun Wang
Li Chen
Kun Jiang
Junchi Yan
Hongyang Li
38
34
0
26 Dec 2023
Transformer-Based Multi-Object Smoothing with Decoupled Data Association and Smoothing
Juliano Pinto
Georg Hess
Yuxuan Xia
H. Wymeersch
Lennart Svensson
VOT
27
3
0
22 Dec 2023
Point Deformable Network with Enhanced Normal Embedding for Point Cloud Analysis
Xingyilang Yin
Xi Yang
Liangchen Liu
Nannan Wang
Xinbo Gao
3DPC
31
3
0
20 Dec 2023
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu
Ran Xu
Senqiao Yang
Renrui Zhang
Qizhe Zhang
Zehui Chen
Yandong Guo
Shanghang Zhang
TTA
35
10
0
19 Dec 2023
Diffusion-Based Particle-DETR for BEV Perception
Asen Nachkov
Martin Danelljan
D. Paudel
Luc Van Gool
DiffM
37
3
0
18 Dec 2023
Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy
Junsu Kim
Sumin Hong
Chanwoo Kim
Jihyeon Kim
Yihalem Yimolal Tiruneh
Jeongwan On
Jihyun Song
Sunhwa Choi
Seungryul Baek
13
3
0
14 Dec 2023
Mixed Pseudo Labels for Semi-Supervised Object Detection
Ze-Yi Chen
Wenwei Zhang
Xinjiang Wang
Kai Chen
Zhi Wang
ObjD
40
10
0
12 Dec 2023
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
31
2
0
11 Dec 2023
RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos
Tanveer Hannan
Md. Mohaiminul Islam
Thomas Seidl
Gedas Bertasius
28
3
0
11 Dec 2023
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
Sheng Jin
Shuhuai Li
Tong Li
Wentao Liu
Chao Qian
Ping Luo
29
5
0
09 Dec 2023
Online Vectorized HD Map Construction using Geometry
Zhixin Zhang
Yiyuan Zhang
Xiaohan Ding
Fusheng Jin
Xiangyu Yue
3DPC
34
23
0
06 Dec 2023
Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection
Xubin Zhong
Changxing Ding
Yupeng Hu
Dacheng Tao
28
0
0
04 Dec 2023
Improving Interpretation Faithfulness for Vision Transformers
Lijie Hu
Yixin Liu
Ninghao Liu
Mengdi Huai
Lichao Sun
Di Wang
37
5
0
29 Nov 2023
Previous
1
2
3
4
5
...
17
18
19
Next