ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXivPDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,082 papers shown
Title
SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding
SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding
Liangtao Shi
Ting Liu
Xiantao Hu
Yue Hu
Quanjun Yin
Richang Hong
ObjD
54
0
0
24 Feb 2025
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines
Xinyi Ying
Chao Xiao
Ruojing Li
Xu He
Boyang Li
...
Miao Li
Shilin Zhou
Wei An
Weidong Sheng
Li Liu
157
7
0
21 Feb 2025
Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models
Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models
Aryan Jadon
Avinash Patil
Shashank Kumar
SyDa
57
1
0
21 Feb 2025
FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models
FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models
Thomas Froech
Olaf Wysocki
Yan Xia
Junyu Xie
Benedikt Schwab
Daniel Cremers
T. H. Kolbe
41
0
0
20 Feb 2025
Dense Object Detection Based on De-homogenized Queries
Dense Object Detection Based on De-homogenized Queries
Yueming Huang
Chenrui Ma
Hao Zhou
Hao Wu
Guowu Yuan
132
0
0
11 Feb 2025
Adaptive Perception for Unified Visual Multi-modal Object Tracking
Xiantao Hu
Bineng Zhong
Qihua Liang
Zhiyi Mo
Liangtao Shi
Ying Tai
Jian Yang
40
1
0
10 Feb 2025
RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images
RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images
Lei Yang
Guowu Yuan
Hao Zhou
Hongyu Liu
Jian Chen
Hao Wu
108
30
0
05 Feb 2025
YOLOSCM: An improved YOLO algorithm for cars detection
YOLOSCM: An improved YOLO algorithm for cars detection
Changhui Deng
Lieyang Chen
Shinan Liu
78
0
0
23 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
53
1
0
18 Jan 2025
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
Sitong Gong
Yunzhi Zhuge
Lu Zhang
Zheng Yang
Pingping Zhang
Huchuan Lu
46
0
0
15 Jan 2025
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Zhimeng Xin
Tianxu Wu
Shiming Chen
Shuo Ye
Zijing Xie
Yixiong Zou
Xinge You
Yufei Guo
38
0
0
13 Jan 2025
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Xinyao Liao
Xiaoye Qu
Dangyang Chen
Yuanyuan Fu
61
0
0
10 Jan 2025
BEN: Using Confidence-Guided Matting for Dichotomous Image Segmentation
BEN: Using Confidence-Guided Matting for Dichotomous Image Segmentation
Maxwell Meyer
Jack Spruyt
53
0
0
08 Jan 2025
RDD4D: 4D Attention-Guided Road Damage Detection And Classification
RDD4D: 4D Attention-Guided Road Damage Detection And Classification
Asma Alkalbani
Muhammad Saqib
Ahmed Salim Alrawahi
A. Anwar
Chandarnath Adak
Saeed Anwar
44
1
0
07 Jan 2025
EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union
EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union
Brian Hsuan-Cheng Liao
Chih-Hong Cheng
Hasan Esen
Alois Knoll
EgoV
48
0
0
03 Jan 2025
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
Yaxian Wang
Henghui Ding
Shuting He
Xudong Jiang
Bifan Wei
Jun Liu
ObjD
63
1
0
03 Jan 2025
Towards Visual Grounding: A Survey
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
67
4
0
31 Dec 2024
Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes
Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes
Lujia Lv
Di Wu
Yangyi Xia
Jia Wu
Xiaojing Liu
Yi He
41
0
0
31 Dec 2024
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Chengyou Jia
Minnan Luo
Zhuohang Dang
Guangwen Dai
Xiao Chang
Yufei Guo
DiffM
64
1
0
31 Dec 2024
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking
You Wu
Yongxin Li
Mengyuan Liu
Xucheng Wang
Xiangyang Yang
Hengzhou Ye
Dan Zeng
Qijun Zhao
Shuiwang Li
209
0
0
28 Dec 2024
Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared
  Small Target Detection
Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared Small Target Detection
Jiangnan Yang
Shuangli Liu
Jingjun Wu
Xinyu Su
Nan Hai
Xueli Huang
84
2
0
22 Dec 2024
Exploiting Multimodal Spatial-temporal Patterns for Video Object
  Tracking
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Xiantao Hu
Ying Tai
Xu Zhao
Chen Zhao
Zhenyu Zhang
Jun Yu Li
Bineng Zhong
Jian Yang
91
8
0
20 Dec 2024
Robust Tracking via Mamba-based Context-aware Token Learning
Robust Tracking via Mamba-based Context-aware Token Learning
Jinxia Xie
Bineng Zhong
Qihua Liang
Ning Li
Zhiyi Mo
Shuxiang Song
Mamba
84
3
0
18 Dec 2024
What is YOLOv6? A Deep Insight into the Object Detection Model
What is YOLOv6? A Deep Insight into the Object Detection Model
Athulya Sundaresan Geetha
3DH
VLM
ObjD
88
1
0
17 Dec 2024
Parallel CPU- and GPU-based connected component algorithms for event
  building for hybrid pixel detectors
Parallel CPU- and GPU-based connected component algorithms for event building for hybrid pixel detectors
Tomáš Čelko
František Mráz
Benedikt Bergmann
P. Mánek
73
0
0
16 Dec 2024
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic
  Unbiased Learning
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning
Chang Xu
Ruixiang Zhang
Wen Yang
Haoran Zhu
Fang Xu
Jian Ding
Gui-Song Xia
ObjD
101
0
0
16 Dec 2024
Exploring Temporal Event Cues for Dense Video Captioning in Cyclic
  Co-learning
Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning
Zhuyang Xie
Yan Yang
Yankai Yu
Jie Wang
Yongquan Jiang
Xiao-Jun Wu
88
0
0
16 Dec 2024
Exploring Enhanced Contextual Information for Video-Level Object
  Tracking
Exploring Enhanced Contextual Information for Video-Level Object Tracking
Ben Kang
Xin Chen
Simiao Lai
Yang Liu
Y. Liu
Dong Wang
Mamba
78
3
0
15 Dec 2024
Point Cloud to Mesh Reconstruction: A Focus on Key Learning-Based
  Paradigms
Point Cloud to Mesh Reconstruction: A Focus on Key Learning-Based Paradigms
Fatima Zahra Iguenfer
Achraf Hsain
Hiba Amissa
Yousra Chtouki
3DV
3DPC
86
0
0
14 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt
  Paradigm
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
77
0
0
14 Dec 2024
Temporal Action Localization with Cross Layer Task Decoupling and
  Refinement
Temporal Action Localization with Cross Layer Task Decoupling and Refinement
Qiang Li
Di Liu
Jun Kong
Sen Li
Hui Xu
Jianzhong Wang
90
0
0
12 Dec 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
109
1
0
12 Dec 2024
Mojito: Motion Trajectory and Intensity Control for Video Generation
Mojito: Motion Trajectory and Intensity Control for Video Generation
Xuehai He
Shuohang Wang
Jianwei Yang
Xiaoxia Wu
Yansen Wang
Kuan-Chieh Wang
Z. Zhan
Olatunji Ruwase
Yelong Shen
Xinze Wang
VGen
91
1
0
12 Dec 2024
Swap Path Network for Robust Person Search Pre-training
Swap Path Network for Robust Person Search Pre-training
Lucas Jaffe
A. Zakhor
3DPC
77
0
0
06 Dec 2024
VideoLights: Feature Refinement and Cross-Task Alignment Transformer for
  Joint Video Highlight Detection and Moment Retrieval
VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval
Dhiman Paul
Md Rizwan Parvez
Nabeel Mohammed
Shafin Rahman
VGen
85
0
0
02 Dec 2024
CopyrightShield: Spatial Similarity Guided Backdoor Defense against
  Copyright Infringement in Diffusion Models
CopyrightShield: Spatial Similarity Guided Backdoor Defense against Copyright Infringement in Diffusion Models
Zhixiang Guo
Siyuan Liang
Aishan Liu
Dacheng Tao
AAML
89
1
0
02 Dec 2024
MambaNUT: Nighttime UAV Tracking via Mamba-based Adaptive Curriculum Learning
MambaNUT: Nighttime UAV Tracking via Mamba-based Adaptive Curriculum Learning
You Wu
Xiangyang Yang
Xucheng Wang
Hengzhou Ye
Dan Zeng
Shuiwang Li
Mamba
98
0
0
01 Dec 2024
DLaVA: Document Language and Vision Assistant for Answer Localization
  with Enhanced Interpretability and Trustworthiness
DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Ser-Nam Lim
R. Ramnath
74
1
0
29 Nov 2024
Improving Accuracy and Generalization for Efficient Visual Tracking
Improving Accuracy and Generalization for Efficient Visual Tracking
Ram J. Zaveri
Shivang Patel
Yu Gu
Gianfranco Doretto
VLM
94
0
0
28 Nov 2024
HUPE: Heuristic Underwater Perceptual Enhancement with Semantic
  Collaborative Learning
HUPE: Heuristic Underwater Perceptual Enhancement with Semantic Collaborative Learning
Zengxi Zhang
Zhiying Jiang
Long Ma
Jinyuan Liu
Xin-Yue Fan
Risheng Liu
79
2
0
27 Nov 2024
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
Qing Jiang
Gen Luo
Yuqin Yang
Yuda Xiong
Yihao Chen
Zhaoyang Zeng
Tianhe Ren
Lei Zhang
VLM
LRM
109
7
0
27 Nov 2024
Chat2SVG: Vector Graphics Generation with Large Language Models and
  Image Diffusion Models
Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models
Ronghuan Wu
Wanchao Su
Jing Liao
DiffM
76
1
0
25 Nov 2024
Leverage Task Context for Object Affordance Ranking
Leverage Task Context for Object Affordance Ranking
Haojie Huang
Hongchen Luo
Wei-dong Zhai
Yang Cao
Zheng-jun Zha
84
0
0
25 Nov 2024
Corner2Net: Detecting Objects as Cascade Corners
Corner2Net: Detecting Objects as Cascade Corners
Chenglong Liu
Jintao Liu
Haorao Wei
Jinze Yang
Liangyu Xu
Yuchen Guo
Lu Fang
70
0
0
24 Nov 2024
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking
Chunhui Zhang
Li Liu
Hao-Kai Wen
Xi Zhou
Yufei Wang
Mamba
113
2
0
24 Nov 2024
3D Reconstruction by Looking: Instantaneous Blind Spot Detector for
  Indoor SLAM through Mixed Reality
3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality
Hanbeom Chang
Jongseong Brad Choi
C. Yeum
59
0
0
19 Nov 2024
WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images
Lars Nieradzik
Henrike Stephani
Jördis Sieburg-Rockel
Stephanie Helmling
Andrea Olbrich
Stephanie Wrage
J. Keuper
71
0
0
18 Nov 2024
Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Wentao Bao
Keqin Li
Yuxiao Chen
Deep Patel
Martin Renqiang Min
Yu Kong
VLM
ObjD
54
2
0
17 Nov 2024
RETR: Multi-View Radar Detection Transformer for Indoor Perception
RETR: Multi-View Radar Detection Transformer for Indoor Perception
Ryoma Yataka
Adriano Cardace
Peng Wang
P. Boufounos
R. Takahashi
46
1
0
15 Nov 2024
Grounded Video Caption Generation
Grounded Video Caption Generation
Evangelos Kazakos
Cordelia Schmid
Josef Sivic
46
0
0
12 Nov 2024
Previous
12345...202122
Next