Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.09630
Cited By
v1
v2 (latest)
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"
50 / 1,104 papers shown
Title
PR-DETR: Injecting Position and Relation Prior for Dense Video Captioning
Yizhe Li
Sanping Zhou
Zheng Qin
Le Wang
ViT
17
0
0
19 Jun 2025
BRISC: Annotated Dataset for Brain Tumor Segmentation and Classification with Swin-HAFNet
Amirreza Fateh
Yasin Rezvani
Sara Moayedi
Sadjad Rezvani
Fatemeh Fateh
Mansoor Fateh
20
0
0
17 Jun 2025
Text-Aware Image Restoration with Diffusion Models
Jaewon Min
J. Kim
Paul Hyunbin Cho
J. Lee
Jihye Park
Minkyu Park
S. Kim
Hyunhee Park
Seungryong Kim
51
0
0
11 Jun 2025
Data-Efficient Challenges in Visual Inductive Priors: A Retrospective
Robert-Jan Bruintjes
A. Lengyel
O. Kayhan
Davide Zambrano
Nergis Tomen
Hadi Jamali Rad
Jan van Gemert
VLM
31
0
0
10 Jun 2025
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
Zheda Mai
A. Chowdhury
Zihe Wang
Sooyoung Jeon
Lemeng Wang
Jiacheng Hou
Jihyung Kil
Wei-Lun Chao
CoGe
55
0
0
10 Jun 2025
Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations
Yizhen Li
Dell Zhang
Xuelong Li
Yiqing Shen
VLM
19
0
0
09 Jun 2025
MiMo-VL Technical Report
Xiaomi LLM-Core Team
Zihao Yue
Zhenru Lin
Yifan Song
Weikun Wang
...
Di Zhang
Chong Ma
Chang Liu
Can Cai
Bingquan Xia
OffRL
MoE
VLM
LRM
84
0
0
04 Jun 2025
HiLO: High-Level Object Fusion for Autonomous Driving using Transformers
Timo Osterburg
Franz Albers
Christopher P. Diehl
Rajesh Pushparaj
Torsten Bertram
50
0
0
03 Jun 2025
Conformal Object Detection by Sequential Risk Control
Léo Andéol
Luca Mossina
Adrien Mazoyer
Sébastien Gerchinovitz
63
0
0
29 May 2025
Can NeRFs See without Cameras?
Chaitanya Amballa
Sattwik Basu
Yu-Lin Wei
Zhijian Yang
Mehmet Ergezer
Romit Roy Choudhury
29
0
0
28 May 2025
CADReview: Automatically Reviewing CAD Programs with Error Detection and Correction
Jiali Chen
Xusen Hei
HongFei Liu
Yuancheng Wei
Zikun Deng
Jiayuan Xie
Yi Cai
Li Qing
55
0
0
28 May 2025
Fully Spiking Neural Networks for Unified Frame-Event Object Tracking
Jingjun Yang
Liangwei Fan
Jinpu Zhang
Xiangkai Lian
Hui Shen
D. Hu
12
0
0
27 May 2025
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
Fuwen Luo
Shengfeng Lou
C. L. Philip Chen
Ziyue Wang
Chenliang Li
...
Peng Li
Ming Yan
Ji Zhang
Fei Huang
Yang Liu
AI4TS
LRM
81
0
0
27 May 2025
MSA at SemEval-2025 Task 3: High Quality Weak Labeling and LLM Ensemble Verification for Multilingual Hallucination Detection
Baraa Hikal
Ahmed Nasreldin
Ali Hamdi
HILM
26
0
0
27 May 2025
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Guiping Cao
Tao Wang
Wenjian Huang
X. Lan
Jianguo Zhang
D. Jiang
ObjD
VLM
22
0
0
27 May 2025
CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features
X. Feng
D. Zhang
Shuyan Hu
X. Li
M. Wu
Jie Zhang
Xiaosha Chen
Kexin Huang
57
0
0
26 May 2025
From Data to Modeling: Fully Open-vocabulary Scene Graph Generation
Zuyao Chen
Jinlin Wu
Zhen Lei
Chang Wen Chen
41
0
0
26 May 2025
MLLMs are Deeply Affected by Modality Bias
Xu Zheng
Chenfei Liao
Yuqian Fu
Kaiyu Lei
Yuanhuiyi Lyu
...
Yu Jiang
N. Sebe
Dacheng Tao
Luc Van Gool
Xuming Hu
78
0
0
24 May 2025
The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts
Yuchen Zhang
Yaxiong Wang
Yujiao Wu
Lianwei Wu
Li Zhu
AAML
105
0
0
23 May 2025
Efficient Motion Prompt Learning for Robust Visual Tracking
Jie Zhao
Xin Chen
Yongsheng Yuan
Michael Felsberg
Dong Wang
Huchuan Lu
43
0
0
22 May 2025
Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking
Shiyu Xuan
Zechao Li
Jinhui Tang
97
0
0
19 May 2025
Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges
Yuan Zhang
Xinfeng Zhang
Xiaoming Qi Xinyu Wu
Feng Chen
Guanyu Yang
Huazhu Fu
MedIm
LM&MA
AI4CE
163
0
0
16 May 2025
Using Cross-Domain Detection Loss to Infer Multi-Scale Information for Improved Tiny Head Tracking
Jisu Kim
Alex Mattingly
Eung-Joo Lee
Benjamin S. Riggan
29
0
0
14 May 2025
Visually Interpretable Subtask Reasoning for Visual Question Answering
Yu Cheng
A. Goel
Hakan Bilen
LRM
66
0
0
12 May 2025
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
179
0
0
09 May 2025
CGTrack: Cascade Gating Network with Hierarchical Feature Aggregation for UAV Tracking
Weihong Li
Xiaoqiong Liu
Heng Fan
L. Zhang
64
0
0
09 May 2025
RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet
Eliraz Orfaig
Inna Stainvas
Igal Bilik
86
0
0
05 May 2025
Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging
Elena Mulero Ayllón
Massimiliano Mantegna
Linlin Shen
Paolo Soda
V. Guarrasi
M. Tortora
75
0
0
02 May 2025
Efficient Vision-based Vehicle Speed Estimation
Andrej Macko
Lukás Gajdosech
Viktor Kocur
447
0
0
02 May 2025
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
113
0
0
28 Apr 2025
Improving Open-World Object Localization by Discovering Background
Ashish Singh
Michael Jeffrey Jones
Kuan-Chuan Peng
A. Cherian
Moitreya Chatterjee
Erik Learned-Miller
ObjD
OCL
VLM
116
0
0
24 Apr 2025
Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Duy-Tho Le
Trung Pham
Jianfei Cai
H. Rezatofighi
69
0
0
23 Apr 2025
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
Jingchao Wang
Hong Wang
Wenlong Zhang
Kunhua Ji
Dingjiang Huang
Yefeng Zheng
ObjD
124
0
0
22 Apr 2025
EIoU-EMC: A Novel Loss for Domain-specific Nested Entity Recognition
Jian Zhang
Tianqing Zhang
Qi Li
Hongwei Wang
53
0
0
19 Apr 2025
FocusTrack: A Self-Adaptive Local Sampling Algorithm for Efficient Anti-UAV Tracking
Ying Wang
Tingfa Xu
Jianan Li
98
1
0
18 Apr 2025
EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery
Wei Zhang
Miaoxin Cai
Yaqian Ning
Tianze Zhang
Yin Zhuang
He Chen
Jun Li
Xuerui Mao
101
0
0
17 Apr 2025
Image Editing with Diffusion Models: A Survey
Jia Wang
Jie Hu
Xiaoqi Ma
Hanghang Ma
Xiaoming Wei
Enhua Wu
146
1
0
17 Apr 2025
Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes
Andreas Lau Hansen
Lukas Wanzeck
Dim P. Papadopoulos
65
0
0
17 Apr 2025
Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking
You Wu
Xucheng Wang
Xiangyang Yang
Mengyuan Liu
Dan Zeng
Hengzhou Ye
Shuiwang Li
103
0
0
12 Apr 2025
Light-YOLOv8-Flame: A Lightweight High-Performance Flame Detection Algorithm
Jiawei Lan
Zhibiao Wang
Haoyang Yu
Ye Tao
Wenhua Cui
189
0
0
11 Apr 2025
End-to-End Facial Expression Detection in Long Videos
Yini Fang
Alec Diallo
Yiqi Shi
F. Jumelle
Bertram Shi
CVBM
54
0
0
10 Apr 2025
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
Dibyadip Chatterjee
Edoardo Remelli
Yale Song
Bugra Tekin
Abhay Mittal
...
Shreyas Hampali
Eric Sauser
Shugao Ma
Angela Yao
Fadime Sener
VLM
101
0
0
10 Apr 2025
Few-Shot Adaptation of Grounding DINO for Agricultural Domain
Rajhans Singh
Rafael Bidese Puhl
Kshitiz Dhakal
Sudhir Sornapudi
83
0
0
09 Apr 2025
Are We Done with Object-Centric Learning?
Alexander Rubinstein
Ameya Prabhu
Matthias Bethge
Seong Joon Oh
OCL
1.1K
2
0
09 Apr 2025
PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario
Sriram Mandalika
Lalitha V
Athira Nambiar
82
2
0
08 Apr 2025
Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework
Nikhil Shivakumar Nayak
187
3
0
04 Apr 2025
COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking
Chunhui Zhang
Li Liu
Jialin Gao
Xin Sun
Hao Wen
Xi Zhou
Shiming Ge
Yucheng Wang
110
1
0
02 Apr 2025
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Kuang Wu
Chuan Yang
Zhanbin Li
94
0
0
27 Mar 2025
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Hang Zhou
Wei Ji
Rui Ma
Li Cheng
ViT
122
0
0
27 Mar 2025
RelTriple: Learning Plausible Indoor Layouts by Integrating Relationship Triples into the Diffusion Process
Kaifan Sun
Bingchen Yang
Peter Wonka
Jun Xiao
Haiyong Jiang
126
0
0
26 Mar 2025
1
2
3
4
...
21
22
23
Next