ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.10881
  4. Cited By
Rethinking Transformer-based Set Prediction for Object Detection

Rethinking Transformer-based Set Prediction for Object Detection

21 November 2020
Zhiqing Sun
Shengcao Cao
Yiming Yang
Kris M. Kitani
    ViT
ArXivPDFHTML

Papers citing "Rethinking Transformer-based Set Prediction for Object Detection"

50 / 151 papers shown
Title
SetKE: Knowledge Editing for Knowledge Elements Overlap
SetKE: Knowledge Editing for Knowledge Elements Overlap
Yifan Wei
Xiaoyan Yu
Ran Song
Hao Peng
Angsheng Li
KELM
58
0
0
29 Apr 2025
Les Dissonances: Cross-Tool Harvesting and Polluting in Multi-Tool Empowered LLM Agents
Les Dissonances: Cross-Tool Harvesting and Polluting in Multi-Tool Empowered LLM Agents
Zichuan Li
Jian Cui
Xiaojing Liao
Luyi Xing
LLMAG
40
0
0
04 Apr 2025
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding
Xin Gu
Yaojie Shen
Chenxi Luo
Tiejian Luo
Yan Huang
Yuewei Lin
Heng Fan
L. Zhang
63
1
0
16 Feb 2025
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs
Qizhen Lan
Qing Tian
52
0
0
15 Feb 2025
Slot-BERT: Self-supervised Object Discovery in Surgical Video
Slot-BERT: Self-supervised Object Discovery in Surgical Video
Guiqiu Liao
M. Jogan
Marcel Hussing
Kenta Nakahashi
Kazuhiro Yasufuku
Amin Madani
Eric Eaton
Daniel A. Hashimoto
133
0
0
21 Jan 2025
Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Yifang Xu
Yunzhuo Sun
Benxiang Zhai
Zien Xie
Youyao Jia
S. Du
47
2
0
18 Jan 2025
Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper
Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper
Helia Mohamadi
Mohammad Ali Keyvanrad
Mohammad Reza Mohammadi
38
0
0
23 Dec 2024
MapSAM: Adapting Segment Anything Model for Automated Feature Detection
  in Historical Maps
MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps
Xue Xia
Daiwei Zhang
Wenxuan Song
Wei Huang
L. Hurni
AI4TS
VLM
28
0
0
11 Nov 2024
Cross Resolution Encoding-Decoding For Detection Transformers
Cross Resolution Encoding-Decoding For Detection Transformers
Ashish Kumar
Jaesik Park
ViT
33
0
0
05 Oct 2024
OPUS: Occupancy Prediction Using a Sparse Set
OPUS: Occupancy Prediction Using a Sparse Set
Jiabao Wang
Zhaojiang Liu
Qiang Meng
Liujiang Yan
Ke Wang
Jie Yang
Wei Liu
Qibin Hou
Ming-Ming Cheng
30
9
0
14 Sep 2024
A Review of Transformer-Based Models for Computer Vision Tasks:
  Capturing Global Context and Spatial Relationships
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
32
7
0
27 Aug 2024
CPM: Class-conditional Prompting Machine for Audio-visual Segmentation
CPM: Class-conditional Prompting Machine for Audio-visual Segmentation
Yuanhong Chen
Chong Wang
Yuyuan Liu
Hu Wang
Gustavo Carneiro
40
2
0
07 Jul 2024
LFMamba: Light Field Image Super-Resolution with State Space Model
LFMamba: Light Field Image Super-Resolution with State Space Model
Wang xia
Yao Lu
Shunzhou Wang
Ziqi Wang
Peiqi Xia
Tianfei Zhou
Mamba
48
4
0
18 Jun 2024
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection
  Track for V3Det Challenge 2024
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024
Peixi Wu
Bosong Chai
Xuan Nie
Longquan Yan
Zeyu Wang
Qifan Zhou
Boning Wang
Yansong Peng
Hebei Li
ObjD
31
1
0
13 Jun 2024
Enhancing DETRs Variants through Improved Content Query and Similar
  Query Aggregation
Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Yingying Zhang
Chuangji Shi
Xin Guo
Jiangwei Lao
Jian Wang
Jiaotuan Wang
Jingdong Chen
39
2
0
06 May 2024
Two in One Go: Single-stage Emotion Recognition with Decoupled
  Subject-context Transformer
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Xinpeng Li
Teng Wang
Jian Zhao
Shuyi Mao
Jinbao Wang
Feng Zheng
Xiaojiang Peng
Xuelong Li
28
1
0
26 Apr 2024
A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation
A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation
Xin Zhang
Liangxiu Han
Tam Sobeih
Lianghao Han
Darren Dancey
52
1
0
26 Apr 2024
GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing
  Sparsity, Trained from Scratch on Small Datasets
GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets
Dongjing Shan
guiqiang chen
ViT
42
0
0
07 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
31
15
0
04 Apr 2024
Force-EvT: A Closer Look at Robotic Gripper Force Measurement with
  Event-based Vision Transformer
Force-EvT: A Closer Look at Robotic Gripper Force Measurement with Event-based Vision Transformer
Qianyu Guo
Ziqing Yu
Jiaming Fu
Yawen Lu
Yahya H. Zweiri
Dongming Gan
21
3
0
01 Apr 2024
On permutation-invariant neural networks
On permutation-invariant neural networks
Masanari Kimura
Ryotaro Shimizu
Yuki Hirakawa
Ryosuke Goto
Yuki Saito
OOD
AAML
38
12
0
26 Mar 2024
Semi-supervised Counting via Pixel-by-pixel Density Distribution
  Modelling
Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling
Hui Lin
Zhiheng Ma
Rongrong Ji
Yao Wang
Zhou Su
Xiaopeng Hong
Deyu Meng
33
2
0
23 Feb 2024
Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representations
Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representations
Bhishma Dedhia
N. Jha
OCL
48
1
0
02 Feb 2024
Small Object Detection by DETR via Information Augmentation and Adaptive
  Feature Fusion
Small Object Detection by DETR via Information Augmentation and Adaptive Feature Fusion
Ji Huang
Hui Wang
ViT
19
5
0
16 Jan 2024
MS-DETR: Efficient DETR Training with Mixed Supervision
MS-DETR: Efficient DETR Training with Mixed Supervision
Chuyang Zhao
Yifan Sun
Wenhao Wang
Qiang Chen
Errui Ding
Yi Yang
Jingdong Wang
MU
28
20
0
08 Jan 2024
LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient
  Image Recognition
LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition
Youbing Hu
Yun Cheng
Anqi Lu
Zhiqiang Cao
Dawei Wei
Jie Liu
Zhijun Li
ViT
16
6
0
08 Jan 2024
TPC-ViT: Token Propagation Controller for Efficient Vision Transformer
TPC-ViT: Token Propagation Controller for Efficient Vision Transformer
Wentao Zhu
21
2
0
03 Jan 2024
Transformer-Based Multi-Object Smoothing with Decoupled Data Association
  and Smoothing
Transformer-Based Multi-Object Smoothing with Decoupled Data Association and Smoothing
Juliano Pinto
Georg Hess
Yuxuan Xia
H. Wymeersch
Lennart Svensson
VOT
22
3
0
22 Dec 2023
Weakly Supervised Open-Vocabulary Object Detection
Weakly Supervised Open-Vocabulary Object Detection
Jianghang Lin
Yunhang Shen
Bingquan Wang
Shaohui Lin
Ke Li
Liujuan Cao
WSOD
28
6
0
19 Dec 2023
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal
  Sentence Grounding in Videos
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Pilhyeon Lee
Hyeran Byun
19
10
0
30 Nov 2023
Learning Saliency From Fixations
Learning Saliency From Fixations
Y. A. D. Djilali
Kevin McGuinness
Noel E. O'Connor
23
2
0
23 Nov 2023
Improving Robustness for Vision Transformer with a Simple Dynamic
  Scanning Augmentation
Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation
Shashank Kotyan
Danilo Vasconcellos Vargas
ViT
22
2
0
01 Nov 2023
Distractor-aware Event-based Tracking
Distractor-aware Event-based Tracking
Yingkai Fu
Meng Li
Wenxi Liu
Yuanchen Wang
Jiqing Zhang
Baocai Yin
Xiaopeng Wei
Xin Yang
26
8
0
22 Oct 2023
Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection
Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection
Junjia Huang
Haofeng Li
Xiang Wan
Guanbin Li
MedIm
ViT
43
10
0
22 Oct 2023
Transformers in Small Object Detection: A Benchmark and Survey of
  State-of-the-Art
Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art
Aref Miri Rekavandi
Shima Rashidi
F. Boussaïd
Stephen Hoefs
Emre Akbas
Bennamoun
ViT
46
23
0
10 Sep 2023
SG-Former: Self-guided Transformer with Evolving Token Reallocation
SG-Former: Self-guided Transformer with Evolving Token Reallocation
Sucheng Ren
Xingyi Yang
Songhua Liu
Xinchao Wang
ViT
27
41
0
23 Aug 2023
Knowing Where to Focus: Event-aware Transformer for Video Grounding
Knowing Where to Focus: Event-aware Transformer for Video Grounding
Jinhyun Jang
Jungin Park
Jin-Hwa Kim
Hyeongjun Kwon
K. Sohn
16
49
0
14 Aug 2023
LATR: 3D Lane Detection from Monocular Images with Transformer
LATR: 3D Lane Detection from Monocular Images with Transformer
Yueru Luo
Chaoda Zheng
Xu Yan
Tang Kun
Chao Zheng
Shuguang Cui
Zhen Li
ViT
33
32
0
08 Aug 2023
DiT: Efficient Vision Transformers with Dynamic Token Routing
DiT: Efficient Vision Transformers with Dynamic Token Routing
Yuchen Ma
Zhengcong Fei
Junshi Huang
ViT
24
2
0
07 Aug 2023
Less is More: Focus Attention for Efficient DETR
Less is More: Focus Attention for Efficient DETR
Dehua Zheng
Wenhui Dong
Hailin Hu
Xinghao Chen
Yunhe Wang
19
58
0
24 Jul 2023
Cascade-DETR: Delving into High-Quality Universal Object Detection
Cascade-DETR: Delving into High-Quality Universal Object Detection
Mingqiao Ye
Lei Ke
Siyuan Li
Yu-Wing Tai
Chi-Keung Tang
Martin Danelljan
F. I. F. Richard Yu
47
34
0
20 Jul 2023
Cross-Spatial Pixel Integration and Cross-Stage Feature Fusion Based
  Transformer Network for Remote Sensing Image Super-Resolution
Cross-Spatial Pixel Integration and Cross-Stage Feature Fusion Based Transformer Network for Remote Sensing Image Super-Resolution
Yuting Lu
Lingtong Min
Binglu Wang
Le Zheng
Xiaoxu Wang
Yongqiang Zhao
Teng Long
11
6
0
06 Jul 2023
Bridging the Performance Gap between DETR and R-CNN for Graphical Object
  Detection in Document Images
Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images
Tahira Shehzadi
K. Hashmi
D. Stricker
Marcus Liwicki
Muhammad Zeshan Afzal
21
7
0
23 Jun 2023
DEYOv2: Rank Feature with Greedy Matching for End-to-End Object
  Detection
DEYOv2: Rank Feature with Greedy Matching for End-to-End Object Detection
Hao Ouyang
34
4
0
15 Jun 2023
Object Detection with Transformers: A Review
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViT
MU
18
28
0
07 Jun 2023
PhenoBench -- A Large Dataset and Benchmarks for Semantic Image
  Interpretation in the Agricultural Domain
PhenoBench -- A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain
J. Weyler
Federico Magistri
E. Marks
Yue Linn Chong
Matteo Sodano
Gianmarco Roggiolani
Nived Chebrolu
C. Stachniss
Jens Behley
32
30
0
07 Jun 2023
Let There Be Order: Rethinking Ordering in Autoregressive Graph
  Generation
Let There Be Order: Rethinking Ordering in Autoregressive Graph Generation
Jie Bu
Kazi Sajeed Mehrab
Anuj Karpatne
21
3
0
24 May 2023
Hausdorff Distance Matching with Adaptive Query Denoising for Rotated
  Detection Transformer
Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection Transformer
Hakjin Lee
Minki Song
Jamyoung Koo
Junghoon Seo
32
7
0
12 May 2023
SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for
  Monocular 3D Object Detection
SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object Detection
Xuan He
Fan Yang
Kailun Yang
Jiacheng Lin
Haolong Fu
M. Wang
Jin Yuan
Zhiyong Li
ViT
20
12
0
12 May 2023
iDisc: Internal Discretization for Monocular Depth Estimation
iDisc: Internal Discretization for Monocular Depth Estimation
Luigi Piccinelli
Christos Sakaridis
F. I. F. Richard Yu
27
81
0
13 Apr 2023
1234
Next