Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.09630
Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
25 February 2019
S. Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"
50 / 1,082 papers shown
Title
SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding
Liangtao Shi
Ting Liu
Xiantao Hu
Yue Hu
Quanjun Yin
Richang Hong
ObjD
54
0
0
24 Feb 2025
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines
Xinyi Ying
Chao Xiao
Ruojing Li
Xu He
Boyang Li
...
Miao Li
Shilin Zhou
Wei An
Weidong Sheng
Li Liu
157
7
0
21 Feb 2025
Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models
Aryan Jadon
Avinash Patil
Shashank Kumar
SyDa
57
1
0
21 Feb 2025
FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models
Thomas Froech
Olaf Wysocki
Yan Xia
Junyu Xie
Benedikt Schwab
Daniel Cremers
T. H. Kolbe
41
0
0
20 Feb 2025
Dense Object Detection Based on De-homogenized Queries
Yueming Huang
Chenrui Ma
Hao Zhou
Hao Wu
Guowu Yuan
132
0
0
11 Feb 2025
Adaptive Perception for Unified Visual Multi-modal Object Tracking
Xiantao Hu
Bineng Zhong
Qihua Liang
Zhiyi Mo
Liangtao Shi
Ying Tai
Jian Yang
40
1
0
10 Feb 2025
RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images
Lei Yang
Guowu Yuan
Hao Zhou
Hongyu Liu
Jian Chen
Hao Wu
108
30
0
05 Feb 2025
YOLOSCM: An improved YOLO algorithm for cars detection
Changhui Deng
Lieyang Chen
Shinan Liu
78
0
0
23 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
53
1
0
18 Jan 2025
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
Sitong Gong
Yunzhi Zhuge
Lu Zhang
Zheng Yang
Pingping Zhang
Huchuan Lu
46
0
0
15 Jan 2025
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Zhimeng Xin
Tianxu Wu
Shiming Chen
Shuo Ye
Zijing Xie
Yixiong Zou
Xinge You
Yufei Guo
38
0
0
13 Jan 2025
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Xinyao Liao
Xiaoye Qu
Dangyang Chen
Yuanyuan Fu
61
0
0
10 Jan 2025
BEN: Using Confidence-Guided Matting for Dichotomous Image Segmentation
Maxwell Meyer
Jack Spruyt
53
0
0
08 Jan 2025
RDD4D: 4D Attention-Guided Road Damage Detection And Classification
Asma Alkalbani
Muhammad Saqib
Ahmed Salim Alrawahi
A. Anwar
Chandarnath Adak
Saeed Anwar
44
1
0
07 Jan 2025
EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union
Brian Hsuan-Cheng Liao
Chih-Hong Cheng
Hasan Esen
Alois Knoll
EgoV
48
0
0
03 Jan 2025
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
Yaxian Wang
Henghui Ding
Shuting He
Xudong Jiang
Bifan Wei
Jun Liu
ObjD
63
1
0
03 Jan 2025
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
67
4
0
31 Dec 2024
Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes
Lujia Lv
Di Wu
Yangyi Xia
Jia Wu
Xiaojing Liu
Yi He
41
0
0
31 Dec 2024
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Chengyou Jia
Minnan Luo
Zhuohang Dang
Guangwen Dai
Xiao Chang
Yufei Guo
DiffM
64
1
0
31 Dec 2024
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking
You Wu
Yongxin Li
Mengyuan Liu
Xucheng Wang
Xiangyang Yang
Hengzhou Ye
Dan Zeng
Qijun Zhao
Shuiwang Li
209
0
0
28 Dec 2024
Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared Small Target Detection
Jiangnan Yang
Shuangli Liu
Jingjun Wu
Xinyu Su
Nan Hai
Xueli Huang
84
2
0
22 Dec 2024
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Xiantao Hu
Ying Tai
Xu Zhao
Chen Zhao
Zhenyu Zhang
Jun Yu Li
Bineng Zhong
Jian Yang
91
8
0
20 Dec 2024
Robust Tracking via Mamba-based Context-aware Token Learning
Jinxia Xie
Bineng Zhong
Qihua Liang
Ning Li
Zhiyi Mo
Shuxiang Song
Mamba
84
3
0
18 Dec 2024
What is YOLOv6? A Deep Insight into the Object Detection Model
Athulya Sundaresan Geetha
3DH
VLM
ObjD
88
1
0
17 Dec 2024
Parallel CPU- and GPU-based connected component algorithms for event building for hybrid pixel detectors
Tomáš Čelko
František Mráz
Benedikt Bergmann
P. Mánek
73
0
0
16 Dec 2024
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning
Chang Xu
Ruixiang Zhang
Wen Yang
Haoran Zhu
Fang Xu
Jian Ding
Gui-Song Xia
ObjD
101
0
0
16 Dec 2024
Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning
Zhuyang Xie
Yan Yang
Yankai Yu
Jie Wang
Yongquan Jiang
Xiao-Jun Wu
88
0
0
16 Dec 2024
Exploring Enhanced Contextual Information for Video-Level Object Tracking
Ben Kang
Xin Chen
Simiao Lai
Yang Liu
Y. Liu
Dong Wang
Mamba
78
3
0
15 Dec 2024
Point Cloud to Mesh Reconstruction: A Focus on Key Learning-Based Paradigms
Fatima Zahra Iguenfer
Achraf Hsain
Hiba Amissa
Yousra Chtouki
3DV
3DPC
86
0
0
14 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
77
0
0
14 Dec 2024
Temporal Action Localization with Cross Layer Task Decoupling and Refinement
Qiang Li
Di Liu
Jun Kong
Sen Li
Hui Xu
Jianzhong Wang
90
0
0
12 Dec 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
109
1
0
12 Dec 2024
Mojito: Motion Trajectory and Intensity Control for Video Generation
Xuehai He
Shuohang Wang
Jianwei Yang
Xiaoxia Wu
Yansen Wang
Kuan-Chieh Wang
Z. Zhan
Olatunji Ruwase
Yelong Shen
Xinze Wang
VGen
91
1
0
12 Dec 2024
Swap Path Network for Robust Person Search Pre-training
Lucas Jaffe
A. Zakhor
3DPC
77
0
0
06 Dec 2024
VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval
Dhiman Paul
Md Rizwan Parvez
Nabeel Mohammed
Shafin Rahman
VGen
85
0
0
02 Dec 2024
CopyrightShield: Spatial Similarity Guided Backdoor Defense against Copyright Infringement in Diffusion Models
Zhixiang Guo
Siyuan Liang
Aishan Liu
Dacheng Tao
AAML
89
1
0
02 Dec 2024
MambaNUT: Nighttime UAV Tracking via Mamba-based Adaptive Curriculum Learning
You Wu
Xiangyang Yang
Xucheng Wang
Hengzhou Ye
Dan Zeng
Shuiwang Li
Mamba
98
0
0
01 Dec 2024
DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Ser-Nam Lim
R. Ramnath
74
1
0
29 Nov 2024
Improving Accuracy and Generalization for Efficient Visual Tracking
Ram J. Zaveri
Shivang Patel
Yu Gu
Gianfranco Doretto
VLM
94
0
0
28 Nov 2024
HUPE: Heuristic Underwater Perceptual Enhancement with Semantic Collaborative Learning
Zengxi Zhang
Zhiying Jiang
Long Ma
Jinyuan Liu
Xin-Yue Fan
Risheng Liu
79
2
0
27 Nov 2024
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
Qing Jiang
Gen Luo
Yuqin Yang
Yuda Xiong
Yihao Chen
Zhaoyang Zeng
Tianhe Ren
Lei Zhang
VLM
LRM
109
7
0
27 Nov 2024
Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models
Ronghuan Wu
Wanchao Su
Jing Liao
DiffM
76
1
0
25 Nov 2024
Leverage Task Context for Object Affordance Ranking
Haojie Huang
Hongchen Luo
Wei-dong Zhai
Yang Cao
Zheng-jun Zha
84
0
0
25 Nov 2024
Corner2Net: Detecting Objects as Cascade Corners
Chenglong Liu
Jintao Liu
Haorao Wei
Jinze Yang
Liangyu Xu
Yuchen Guo
Lu Fang
70
0
0
24 Nov 2024
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking
Chunhui Zhang
Li Liu
Hao-Kai Wen
Xi Zhou
Yufei Wang
Mamba
113
2
0
24 Nov 2024
3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality
Hanbeom Chang
Jongseong Brad Choi
C. Yeum
59
0
0
19 Nov 2024
WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images
Lars Nieradzik
Henrike Stephani
Jördis Sieburg-Rockel
Stephanie Helmling
Andrea Olbrich
Stephanie Wrage
J. Keuper
71
0
0
18 Nov 2024
Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Wentao Bao
Keqin Li
Yuxiao Chen
Deep Patel
Martin Renqiang Min
Yu Kong
VLM
ObjD
54
2
0
17 Nov 2024
RETR: Multi-View Radar Detection Transformer for Indoor Perception
Ryoma Yataka
Adriano Cardace
Peng Wang
P. Boufounos
R. Takahashi
46
1
0
15 Nov 2024
Grounded Video Caption Generation
Evangelos Kazakos
Cordelia Schmid
Josef Sivic
46
0
0
12 Nov 2024
Previous
1
2
3
4
5
...
20
21
22
Next