Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.04159
Cited By
Deformable DETR: Deformable Transformers for End-to-End Object Detection
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 916 papers shown
Title
A Graph-Based Approach for Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
25
10
0
29 Nov 2023
Learning Saliency From Fixations
Y. A. D. Djilali
Kevin McGuinness
Noel E. O'Connor
23
2
0
23 Nov 2023
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention
Zuyao Chen
Jinlin Wu
Zhen Lei
Zhaoxiang Zhang
Changwen Chen
25
11
0
18 Nov 2023
Multiple View Geometry Transformers for 3D Human Pose Estimation
Ziwei Liao
Jialiang Zhu
Chunyu Wang
Han Hu
Steven L. Waslander
ViT
28
2
0
18 Nov 2023
Improved TokenPose with Sparsity
Anning Li
ViT
34
0
0
16 Nov 2023
Contrastive Learning for Multi-Object Tracking with Transformers
Pierre-François De Plaen
Nicola Marinello
Marc Proesmans
Tinne Tuytelaars
Luc Van Gool
VOT
36
6
0
14 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
45
143
0
10 Nov 2023
PolyMaX: General Dense Prediction with Mask Transformer
Xuan S. Yang
Liangzhe Yuan
Kimberly Wilber
Astuti Sharma
Xiuye Gu
...
Stephanie Debats
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Liang-Chieh Chen
28
14
0
09 Nov 2023
Multi-Modal Gaze Following in Conversational Scenarios
Yuqi Hou
Zhongqun Zhang
Nora Horanyi
Jaewon Moon
Yihua Cheng
Hyung Jin Chang
21
5
0
09 Nov 2023
S
3
^3
3
AD: Semi-supervised Small Apple Detection in Orchard Environments
Robert Johanson
Christian Wilms
Ole Johannsen
Simone Frintrop
27
3
0
08 Nov 2023
Rotation Invariant Transformer for Recognizing Object in UAVs
Shuo Chen
Mang Ye
Bo Du
ViT
32
18
0
05 Nov 2023
Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation
Shashank Kotyan
Danilo Vasconcellos Vargas
ViT
27
2
0
01 Nov 2023
Audio-Visual Instance Segmentation
Ruohao Guo
Yaru Chen
Yanyu Qi
Wenzhen Yue
Dantong Niu
...
Wenzhen Yue
Ji Shi
Qixun Wang
Peiliang Zhang
Buwen Liang
VLM
VOS
31
2
0
28 Oct 2023
ConvBKI: Real-Time Probabilistic Semantic Mapping Network with Quantifiable Uncertainty
Joey Wilson
Yuewei Fu
Joshua Friesen
Parker Ewen
Andrew Capodieci
P. Jayakumar
Kira Barton
Maani Ghaffari
29
9
0
24 Oct 2023
Ranking-based Adaptive Query Generation for DETRs in Crowded Pedestrian Detection
Feng Gao
Jiaxu Leng
Ji Gan
Xinbo Gao
AI4TS
37
0
0
24 Oct 2023
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zhaohui Zheng
Yuming Chen
Qibin Hou
Xiang Li
Ping Wang
Ming-Ming Cheng
ObjD
27
3
0
20 Oct 2023
Minimalist and High-Performance Semantic Segmentation with Plain Vision Transformers
Yuanduo Hong
Jue Wang
Weichao Sun
Huihui Pan
VLM
ViT
37
7
0
19 Oct 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato
Bernhard Jaeger
Max Welling
Andreas Geiger
ViT
39
14
0
16 Oct 2023
Multimodal Object Query Initialization for 3D Object Detection
Mathijs R. van Geerenstein
Felicia Ruppel
Klaus C. J. Dietmayer
D. Gavrila
3DPC
30
2
0
16 Oct 2023
SeUNet-Trans: A Simple yet Effective UNet-Transformer Model for Medical Image Segmentation
Tan-Hanh Pham
Xianqi Li
Kim-Doang Nguyen
MedIm
ViT
26
8
0
16 Oct 2023
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
Yulong Shi
Mingwei Sun
Yongshuai Wang
Hui Sun
Zengqiang Chen
34
4
0
10 Oct 2023
Uni3DETR: Unified 3D Detection Transformer
Zhenyu Wang
Yali Li
Xi Chen
Hengshuang Zhao
Shengjin Wang
3DPC
44
18
0
09 Oct 2023
HOD: A Benchmark Dataset for Harmful Object Detection
Eungyeom Ha
Heemook Kim
Sung Chul Hong
Dongbin Na
27
8
0
08 Oct 2023
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs
Ao Wang
Hui Chen
Zijia Lin
Sicheng Zhao
J. Han
Guiguang Ding
ViT
34
6
0
27 Sep 2023
IFT: Image Fusion Transformer for Ghost-free High Dynamic Range Imaging
Hai-lin Wang
Wei Li
Yuanyuan Xi
Jie Hu
Hanting Chen
Longyu Li
Yun Wang
14
1
0
26 Sep 2023
UniBEV: Multi-modal 3D Object Detection with Uniform BEV Encoders for Robustness against Missing Sensor Modalities
Shiming Wang
Holger Caesar
Liangliang Nan
Julian F. P. Kooij
64
11
0
25 Sep 2023
Species196: A One-Million Semi-supervised Dataset for Fine-grained Species Recognition
W. He
Kai Han
Ying Nie
Chengcheng Wang
Yunhe Wang
VLM
48
6
0
25 Sep 2023
UniHead: Unifying Multi-Perception for Detection Heads
Hantao Zhou
Rui Yang
Yachao Zhang
Haoran Duan
Yawen Huang
R. Hu
Xiu Li
Yefeng Zheng
31
12
0
23 Sep 2023
TCOVIS: Temporally Consistent Online Video Instance Segmentation
Junlong Li
Ting Yu
Yongming Rao
Jie Zhou
Jiwen Lu
38
12
0
21 Sep 2023
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
35
1
0
20 Sep 2023
RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing
Jiahang Li
Yikang Zhang
Peng Yun
Guangliang Zhou
Qijun Chen
Rui Fan
ViT
OffRL
18
26
0
19 Sep 2023
OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking
Jianjun Gao
Yi Wang
Kim-Hui Yap
Kratika Garg
Bo Han
84
1
0
19 Sep 2023
ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection
Zhimeng Xin
Tianxu Wu
Shiming Chen
Yixiong Zou
Ling Shao
Xinge You
42
4
0
15 Sep 2023
Large-Vocabulary 3D Diffusion Model with Transformer
Ziang Cao
Fangzhou Hong
Tong Wu
Liang Pan
Ziwei Liu
DiffM
20
36
0
14 Sep 2023
Dynamic Visual Prompt Tuning for Parameter Efficient Transfer Learning
Chunqing Ruan
Hongjian Wang
VLM
VPVLM
32
1
0
12 Sep 2023
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
Teng Fu
Xiaocong Wang
Haiyang Yu
Ke Niu
Bin Li
Xiangyang Xue
VOT
ViT
39
6
0
09 Sep 2023
Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation
Shyam Nandan Rai
Fabio Cermelli
Barbara Caputo
Carlo Masone
ISeg
ViT
33
5
0
08 Sep 2023
DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection
Manlin Zhang
Jie Wu
Yuxi Ren
Ming Li
Jie Qin
Xuefeng Xiao
Wei Liu
Rui Wang
Min Zheng
Andy J. Ma
DiffM
33
20
0
07 Sep 2023
Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
36
14
0
07 Sep 2023
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples
Guanghui Li
Mingqi Gao
Heng Liu
Xiantong Zhen
Feng Zheng
VOS
31
3
0
05 Sep 2023
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Zhening Huang
Xiaoyang Wu
Xi Chen
Hengshuang Zhao
Lei Zhu
Joan Lasenby
ISeg
3DPC
VLM
55
46
0
01 Sep 2023
Complementing Onboard Sensors with Satellite Map: A New Perspective for HD Map Construction
Wenjie Gao
Jiawei Fu
Yanqing Shen
Haodong Jing
Shitao Chen
Nanning Zheng
33
15
0
29 Aug 2023
NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation
Tim Meinhardt
Matt Feiszli
Yuchen Fan
Laura Leal-Taixe
Rakesh Ranjan
ViT
19
5
0
29 Aug 2023
BIT: Bi-Level Temporal Modeling for Efficient Supervised Action Segmentation
Zijia Lu
Ehsan Elhamifar
48
2
0
28 Aug 2023
Forensic Histopathological Recognition via a Context-Aware MIL Network Powered by Self-Supervised Contrastive Learning
Chen Shen
Jun Zhang
Xinggong Liang
Zeyi Hao
Ke Li
Fan Wang
Zhenyuan Wang
C. Lian
19
2
0
27 Aug 2023
DISCO: Distribution-Aware Calibration for Object Detection with Noisy Bounding Boxes
Donghao Zhou
Jialin Li
Jinpeng Li
Jiancheng Huang
Qiang Nie
Yong-Jin Liu
Bin-Bin Gao
Qiong Wang
Pheng-Ann Heng
Guangyong Chen
38
3
0
23 Aug 2023
Motion-to-Matching: A Mixed Paradigm for 3D Single Object Tracking
Zhiheng Li
Yu Lin
Yubo Cui
Shuo Li
Zheng Fang
32
3
0
23 Aug 2023
Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection
Bin Zhang
Sen Wang
Yifan Liu
Brano Kusy
Xue Li
Jiajun Liu
ObjD
42
0
0
22 Aug 2023
A Unified Query-based Paradigm for Camouflaged Instance Segmentation
Do Dong
Jialun Pei
Rongrong Gao
Tian-Zhu Xiang
Shuo Wang
Huan Xiong
ISeg
26
12
0
14 Aug 2023
RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs
Zhouxia Wang
Jiawei Zhang
Tianshui Chen
Wenping Wang
Ping Luo
41
16
0
14 Aug 2023
Previous
1
2
3
4
5
6
...
17
18
19
Next