ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression
v1v2 (latest)

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXiv (abs)PDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,104 papers shown
Title
RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation
RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation
Chengbo Yuan
Suraj Joshi
Shaoting Zhu
Hang Su
Hang Zhao
Yang Gao
VGen
95
6
0
24 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Feng-Long Xie
Yongchao Xu
ObjD
126
0
0
24 Mar 2025
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
Wenrui Cai
Qingjie Liu
Yansen Wang
MoE
156
0
0
24 Mar 2025
MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking
MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking
Haolin Qin
Tingfa Xu
Tianhao Li
Zhenxiang Chen
Tao Feng
Jia-Nan Li
51
0
0
22 Mar 2025
Temporal Action Detection Model Compression by Progressive Block Drop
Temporal Action Detection Model Compression by Progressive Block Drop
Xiaoyong Chen
Yong Guo
Jiaming Liang
Sitong Zhuang
Runhao Zeng
Xiping Hu
92
0
0
21 Mar 2025
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory
Saket Gurukar
Asim Kadav
VLM
144
0
0
17 Mar 2025
Ship Detection in Remote Sensing Imagery for Arbitrarily Oriented Object Detection
Ship Detection in Remote Sensing Imagery for Arbitrarily Oriented Object Detection
Bibi Erum Ayesha
T. Satyanarayana Murthy
Palamakula Ramesh Babu
Ramu Kuchipudi
102
1
0
17 Mar 2025
STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans
STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans
Shashikant Verma
Harish Katti
Soumyaratna Debnath
Yamuna Swamy
Shanmuganathan Raman
482
0
0
17 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjDVLM
491
0
0
14 Mar 2025
Towards General Multimodal Visual Tracking
Andong Lu
Mai Wen
Jinhu Wang
Yuanzhi Guo
Chenglong Li
Jin Tang
Bin Luo
92
0
0
14 Mar 2025
Large-scale Pre-training for Grounded Video Caption Generation
Large-scale Pre-training for Grounded Video Caption Generation
Evangelos Kazakos
Cordelia Schmid
Josef Sivic
86
0
0
13 Mar 2025
Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking
Chaocan Xue
Bineng Zhong
Qihua Liang
Yaozong Zheng
Ning Li
Yuanliang Xue
Shuxiang Song
75
0
0
09 Mar 2025
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan
Zhenbang Ren
Haodi Wu
Wenjie Wei
Rui-jie Zhu
...
Jason K. Eshraghian
Haicheng Qu
Jing Zhang
Malu Zhang
Yiran Yang
103
1
0
09 Mar 2025
Advancing Autonomous Vehicle Intelligence: Deep Learning and Multimodal LLM for Traffic Sign Recognition and Robust Lane Detection
Chandan Kumar Sah
Ankit Kumar Shaw
Xiaoli Lian
Arsalan Shahid Baig
Tuopu Wen
Kun Jiang
Mengmeng Yang
Ke Wang
93
1
0
08 Mar 2025
Detection of Customer Interested Garments in Surveillance Video using Computer Vision
Earnest Paul Ijjina
A. Joshi
Goutham Kanahasabai
46
0
0
01 Mar 2025
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking
Jiawen Zhu
Huayi Tang
Xin Chen
Xinying Wang
Dong Wang
Huchuan Lu
98
2
0
01 Mar 2025
A Guide to Failure in Machine Learning: Reliability and Robustness from Foundations to Practice
Eric Heim
Oren Wright
David Shriver
OODFaML
130
0
0
01 Mar 2025
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
110
3
0
27 Feb 2025
MITracker: Multi-View Integration for Visual Object Tracking
MITracker: Multi-View Integration for Visual Object Tracking
Mengjie Xu
Yitao Zhu
Haotian Jiang
Jiaming Li
Zhenrong Shen
...
Haolin Huang
Xinyu Wang
Qing Yang
H. Zhang
Qian Wang
119
0
0
27 Feb 2025
LightFC-X: Lightweight Convolutional Tracker for RGB-X Tracking
LightFC-X: Lightweight Convolutional Tracker for RGB-X Tracking
Yunfeng Li
Bo Wang
Ye Li
93
0
0
25 Feb 2025
SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding
SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding
Liangtao Shi
Ting Liu
Xiantao Hu
Yue Hu
Quanjun Yin
Richang Hong
ObjD
116
0
0
24 Feb 2025
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines
Xinyi Ying
Chao Xiao
Ruojing Li
Xu He
Boyang Li
...
Miao Li
Shilin Zhou
Wei An
Weidong Sheng
Li Liu
215
7
0
21 Feb 2025
Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models
Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models
Aryan Jadon
Avinash Patil
Shashank Kumar
SyDa
85
1
0
21 Feb 2025
FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models
FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models
Thomas Froech
Olaf Wysocki
Yan Xia
Junyu Xie
Benedikt Schwab
Daniel Cremers
T. H. Kolbe
67
0
0
20 Feb 2025
Dense Object Detection Based on De-homogenized Queries
Dense Object Detection Based on De-homogenized Queries
Yueming Huang
Chenrui Ma
Hao Zhou
Hao Wu
Guowu Yuan
204
0
0
11 Feb 2025
Adaptive Perception for Unified Visual Multi-modal Object Tracking
Xiantao Hu
Bineng Zhong
Qihua Liang
Zhiyi Mo
Liangtao Shi
Ying Tai
Jian Yang
78
2
0
10 Feb 2025
RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images
RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images
Lei Yang
Guowu Yuan
Hao Zhou
Hongyu Liu
Jian Chen
Hao Wu
230
30
0
05 Feb 2025
YOLOSCM: An improved YOLO algorithm for cars detection
YOLOSCM: An improved YOLO algorithm for cars detection
Changhui Deng
Lieyang Chen
Shinan Liu
139
0
0
23 Jan 2025
PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth Cues
PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth Cues
Yanchao Wang
Dawei Zhang
Run Li
Zhonglong Zheng
Minglu Li
VOT
53
0
0
20 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
138
2
0
18 Jan 2025
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
Sitong Gong
Yunzhi Zhuge
Lu Zhang
Zhiyong Yang
Pingping Zhang
Huchuan Lu
95
3
0
15 Jan 2025
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Zhimeng Xin
Tianxu Wu
Shiming Chen
Shuo Ye
Zijing Xie
Yixiong Zou
Xinge You
Yufei Guo
50
0
0
13 Jan 2025
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Xinyao Liao
Xiaoye Qu
Dangyang Chen
Yuanyuan Fu
141
0
0
10 Jan 2025
BEN: Using Confidence-Guided Matting for Dichotomous Image Segmentation
BEN: Using Confidence-Guided Matting for Dichotomous Image Segmentation
Maxwell Meyer
Jack Spruyt
96
0
0
08 Jan 2025
RDD4D: 4D Attention-Guided Road Damage Detection And Classification
RDD4D: 4D Attention-Guided Road Damage Detection And Classification
Asma Alkalbani
Muhammad Saqib
Ahmed Salim Alrawahi
A. Anwar
Chandarnath Adak
Saeed Anwar
81
2
0
07 Jan 2025
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
Yaxian Wang
Henghui Ding
Shuting He
Xudong Jiang
Bifan Wei
Jun Liu
ObjD
109
2
0
03 Jan 2025
EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union
EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union
Brian Hsuan-Cheng Liao
Chih-Hong Cheng
Hasan Esen
Alois Knoll
EgoV
104
1
0
03 Jan 2025
Towards Visual Grounding: A Survey
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
282
5
0
31 Dec 2024
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Chengyou Jia
Minnan Luo
Zhuohang Dang
Guangwen Dai
Xiao Chang
Jiangming Wang
DiffM
134
1
0
31 Dec 2024
Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes
Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes
Lujia Lv
Di Wu
Yangyi Xia
Jia Wu
Xiaojing Liu
Yi He
82
0
0
31 Dec 2024
Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared
  Small Target Detection
Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared Small Target Detection
Jiangnan Yang
Shuangli Liu
Jingjun Wu
Xinyu Su
Nan Hai
Xueli Huang
171
10
0
22 Dec 2024
Exploiting Multimodal Spatial-temporal Patterns for Video Object
  Tracking
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Xiantao Hu
Ying Tai
Xu Zhao
Chen Zhao
Zhenyu Zhang
Jun Yu Li
Bineng Zhong
Jian Yang
173
12
0
20 Dec 2024
Robust Tracking via Mamba-based Context-aware Token Learning
Robust Tracking via Mamba-based Context-aware Token Learning
Jinxia Xie
Bineng Zhong
Qihua Liang
Ning Li
Zhiyi Mo
Shuxiang Song
Mamba
101
4
0
18 Dec 2024
What is YOLOv6? A Deep Insight into the Object Detection Model
What is YOLOv6? A Deep Insight into the Object Detection Model
Athulya Sundaresan Geetha
3DHVLMObjD
115
1
0
17 Dec 2024
Parallel CPU- and GPU-based connected component algorithms for event
  building for hybrid pixel detectors
Parallel CPU- and GPU-based connected component algorithms for event building for hybrid pixel detectors
Tomáš Čelko
František Mráz
Benedikt Bergmann
P. Mánek
83
0
0
16 Dec 2024
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic
  Unbiased Learning
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning
Chang Xu
Ruixiang Zhang
Wen Yang
Haoran Zhu
Fang Xu
Jian Ding
Gui-Song Xia
ObjD
157
1
0
16 Dec 2024
Exploring Temporal Event Cues for Dense Video Captioning in Cyclic
  Co-learning
Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning
Zhuyang Xie
Yan Yang
Yankai Yu
Jie Wang
Yongquan Jiang
Xiao-Jun Wu
104
0
0
16 Dec 2024
Exploring Enhanced Contextual Information for Video-Level Object
  Tracking
Exploring Enhanced Contextual Information for Video-Level Object Tracking
Ben Kang
Xin Chen
Simiao Lai
Yang Liu
Y. Liu
Dong Wang
Mamba
129
5
0
15 Dec 2024
Point Cloud to Mesh Reconstruction: A Focus on Key Learning-Based
  Paradigms
Point Cloud to Mesh Reconstruction: A Focus on Key Learning-Based Paradigms
Fatima Zahra Iguenfer
Achraf Hsain
Hiba Amissa
Yousra Chtouki
3DV3DPC
161
0
0
14 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt
  Paradigm
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
132
0
0
14 Dec 2024
Previous
12345...212223
Next