ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXivPDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,082 papers shown
Title
Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet
  Framework
Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework
Eliraz Orfaig
Inna Stainvas
Igal Bilik
44
0
0
05 Jun 2024
Mixup Augmentation with Multiple Interpolations
Mixup Augmentation with Multiple Interpolations
Lifeng Shen
Jincheng Yu
Hansi Yang
James T. Kwok
33
0
0
03 Jun 2024
Learning Manipulation by Predicting Interaction
Learning Manipulation by Predicting Interaction
Jia Zeng
Qingwen Bu
Bangjun Wang
Wenke Xia
Li Chen
...
Heming Cui
Bin Zhao
Xuelong Li
Yu Qiao
Hongyang Li
71
22
0
01 Jun 2024
Towards Unified Multi-granularity Text Detection with Interactive
  Attention
Towards Unified Multi-granularity Text Detection with Interactive Attention
Xingyu Wan
Chengquan Zhang
Pengyuan Lyu
Sen Fan
Zihan Ni
Kun Yao
Errui Ding
Jingdong Wang
67
1
0
30 May 2024
OED: Towards One-stage End-to-End Dynamic Scene Graph Generation
OED: Towards One-stage End-to-End Dynamic Scene Graph Generation
Guan-Bo Wang
Zhiming Li
Qingchao Chen
Yang Liu
48
9
0
27 May 2024
An Enhanced Encoder-Decoder Network Architecture for Reducing
  Information Loss in Image Semantic Segmentation
An Enhanced Encoder-Decoder Network Architecture for Reducing Information Loss in Image Semantic Segmentation
Zijun Gao
Qi Wang
Taiyuan Mei
X. Cheng
Yun Zi
Haowei Yang
53
11
0
26 May 2024
Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation
  Learning
Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning
Zhenyu Wei
Yujie He
Zhanchuan Cai
MDE
55
0
0
23 May 2024
A Multimodal Learning-based Approach for Autonomous Landing of UAV
A Multimodal Learning-based Approach for Autonomous Landing of UAV
Francisco Neves
Luís Branco
Maria Pereira
R. Claro
Andry Pinto
29
1
0
21 May 2024
Context-Enhanced Video Moment Retrieval with Large Language Models
Context-Enhanced Video Moment Retrieval with Large Language Models
Weijia Liu
Bo Miao
Jiuxin Cao
Xueling Zhu
Bo Liu
Mehwish Nasim
Ajmal Mian
57
2
0
21 May 2024
FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of
  Rotated Object Detection
FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection
Siliang Ma
Yong Xu
24
0
0
16 May 2024
MetaFruit Meets Foundation Models: Leveraging a Comprehensive
  Multi-Fruit Dataset for Advancing Agricultural Foundation Models
MetaFruit Meets Foundation Models: Leveraging a Comprehensive Multi-Fruit Dataset for Advancing Agricultural Foundation Models
Jiajia Li
Kyle Lammers
Xunyuan Yin
Xiang Yin
Long He
Renfu Lu
Zhaojian Li
35
3
0
14 May 2024
PotatoGANs: Utilizing Generative Adversarial Networks, Instance
  Segmentation, and Explainable AI for Enhanced Potato Disease Identification
  and Classification
PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification
Mohammad Shafiul Alam
Fatema Tuj Johora Faria
Mukaffi Bin Moin
Ahmed Al Wase
Md. Rabius Sani
Khan Md. Hasib
MedIm
23
2
0
12 May 2024
Replication Study and Benchmarking of Real-Time Object Detection Models
Replication Study and Benchmarking of Real-Time Object Detection Models
Pierre-Luc Asselin
Vincent Coulombe
William Guimont-Martin
William Larrivée-Hardy
57
0
0
11 May 2024
FlexEControl: Flexible and Efficient Multimodal Control for
  Text-to-Image Generation
FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation
Xuehai He
Jian Zheng
Jacob Zhiyuan Fang
Robinson Piramuthu
Mohit Bansal
Vicente Ordonez
Gunnar A. Sigurdsson
Nanyun Peng
Xin Eric Wang
DiffM
58
1
0
08 May 2024
Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using
  AI
Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI
Rikathi Pal
Sudeshna Mondal
Aditi Gupta
Priya Saha
Somoballi Ghoshal
Amlan Chakrabarti
S. Sur-Kolay
26
4
0
07 May 2024
Low-light Object Detection
Low-light Object Detection
Pengpeng Li
Hao Gu
Yang Yang
26
0
0
06 May 2024
Enhancing DETRs Variants through Improved Content Query and Similar
  Query Aggregation
Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Yingying Zhang
Chuangji Shi
Xin Guo
Jiangwei Lao
Jian Wang
Jiaotuan Wang
Jingdong Chen
47
2
0
06 May 2024
Federated Learning with Heterogeneous Data Handling for Robust Vehicular
  Object Detection
Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection
Ahmad Khalil
Tizian Dege
Pegah Golchin
Rostyslav Olshevskyi
Antonio Fernández Anta
Tobias Meuser
FedML
29
1
0
02 May 2024
New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for
  Vietnamese Multimodal Aspect-Category Sentiment Analysis
New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis
Quy Hoang Nguyen
Minh-Van Truong Nguyen
Kiet Van Nguyen
38
2
0
01 May 2024
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target
  Identification with Large Multimodal Models
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models
Hongzhan Lin
Zixin Chen
Ziyang Luo
Mingfei Cheng
Jing Ma
Guang Chen
61
6
0
01 May 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the
  Cross-domain Generalization
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
42
0
0
30 Apr 2024
Why does Knowledge Distillation Work? Rethink its Attention and Fidelity
  Mechanism
Why does Knowledge Distillation Work? Rethink its Attention and Fidelity Mechanism
Chenqi Guo
Shiwei Zhong
Xiaofeng Liu
Qianli Feng
Yinglong Ma
30
2
0
30 Apr 2024
Panoptic Segmentation and Labelling of Lumbar Spine Vertebrae using
  Modified Attention Unet
Panoptic Segmentation and Labelling of Lumbar Spine Vertebrae using Modified Attention Unet
Rikathi Pal
Priya Saha
Somoballi Ghoshal
Amlan Chakrabarti
S. Sur-Kolay
23
1
0
28 Apr 2024
Two in One Go: Single-stage Emotion Recognition with Decoupled
  Subject-context Transformer
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Xinpeng Li
Teng Wang
Jian Zhao
Shuyi Mao
Jinbao Wang
Feng Zheng
Xiaojiang Peng
Xuelong Li
30
1
0
26 Apr 2024
Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic
  Surgery
Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic Surgery
Yuyang Sheng
Sophia Bano
Matthew J. Clarkson
Mobarakol Islam
48
6
0
22 Apr 2024
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
Zhangjing Yang
Dun Liu
Wensheng Cheng
Jinqiao Wang
Yi Wu
VLM
36
2
0
22 Apr 2024
HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual
  Grounding
HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding
Linhui Xiao
Xiaoshan Yang
Fang Peng
Yaowei Wang
Changsheng Xu
ObjD
39
8
0
20 Apr 2024
Simultaneous Detection and Interaction Reasoning for Object-Centric
  Action Recognition
Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition
Xunsong Li
Pengzhan Sun
Yangcen Liu
Lixin Duan
Wen Li
50
3
0
18 Apr 2024
Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework
  through Prompt-based Localization
Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization
Yongdong Luo
Haojia Lin
Xiawu Zheng
Yigeng Jiang
Rongrong Ji
Jie Hu
Guannan Jiang
Songan Zhang
Rongrong Ji
39
0
0
17 Apr 2024
Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint
  Moment Retrieval and Highlight Detection
Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
Jin Yang
Ping Wei
Huan Li
Ziyang Ren
56
8
0
14 Apr 2024
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation
  in Operating Rooms
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms
Diandian Guo
Manxi Lin
Jialun Pei
He Tang
Yueming Jin
Pheng-Ann Heng
42
2
0
14 Apr 2024
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
Lewei Yao
Renjie Pi
Jianhua Han
Xiaodan Liang
Hang Xu
Wei Zhang
Zhenguo Li
Dan Xu
VLM
ObjD
53
20
0
14 Apr 2024
SFSORT: Scene Features-based Simple Online Real-Time Tracker
SFSORT: Scene Features-based Simple Online Real-Time Tracker
M. M. Morsali
Z. Sharifi
F. Fallah
S. Hashembeiki
H. Mohammadzade
S. B. Shouraki
VOT
37
3
0
11 Apr 2024
ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model
ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model
Lifan Jiang
Zhihui Wang
Changmiao Wang
Ming Li
Jiaxu Leng
DiffM
33
0
0
11 Apr 2024
MedRG: Medical Report Grounding with Multi-modal Large Language Model
MedRG: Medical Report Grounding with Multi-modal Large Language Model
K. Zou
Yang Bai
Zhihao Chen
Yang Zhou
Yidi Chen
Kai Ren
Meng Wang
Xuedong Yuan
Xiaojing Shen
Huazhu Fu
MedIm
55
4
0
10 Apr 2024
YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images
YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images
Chenguang Liu
Guangshuai Gao
Ziyue Huang
Zhenghui Hu
Qingjie Liu
Yunhong Wang
ObjD
39
15
0
09 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
78
2
0
06 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
36
15
0
04 Apr 2024
UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization
UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization
Tiantian Geng
Teng Wang
Yanfu Zhang
Jinming Duan
Weili Guan
Feng Zheng
39
0
0
04 Apr 2024
TE-TAD: Towards Full End-to-End Temporal Action Detection via
  Time-Aligned Coordinate Expression
TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression
Ho-Joong Kim
Jung-Ho Hong
Heejo Kong
Seong-Whan Lee
60
5
0
03 Apr 2024
EGTR: Extracting Graph from Transformer for Scene Graph Generation
EGTR: Extracting Graph from Transformer for Scene Graph Generation
Jinbae Im
Jeongyeon Nam
Nokyung Park
Hyungmin Lee
Seunghyun Park
ViT
49
19
0
02 Apr 2024
Red-Teaming Segment Anything Model
Red-Teaming Segment Anything Model
K. Jankowski
Bartlomiej Sobieski
Mateusz Kwiatkowski
J. Szulc
Michael F. Janik
Hubert Baniecki
P. Biecek
VLM
AAML
48
3
0
02 Apr 2024
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object
  Detection
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection
Tahira Shehzadi
K. Hashmi
Didier Stricker
Muhammad Zeshan Afzal
54
12
0
02 Apr 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Peng Gao
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Hongsheng Li
VLM
71
33
0
29 Mar 2024
ENet-21: An Optimized light CNN Structure for Lane Detection
ENet-21: An Optimized light CNN Structure for Lane Detection
Seyed Rasoul Hosseini
Mohammad Teshnehlab
29
3
0
28 Mar 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via
  Cycle-Modality Propagation
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPC
ObjD
45
7
0
28 Mar 2024
Infrared Small Target Detection with Scale and Location Sensitivity
Infrared Small Target Detection with Scale and Location Sensitivity
Qiankun Liu
Rui Liu
Bolun Zheng
Hongkui Wang
Ying Fu
46
26
0
28 Mar 2024
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
Qingping Sun
Yanjun Wang
Ailing Zeng
Wanqi Yin
Chen Wei
...
Haiyi Mei
Chi Sing Leung
Ziwei Liu
Lei Yang
Zhongang Cai
3DH
48
16
0
26 Mar 2024
Exploring Dynamic Transformer for Efficient Object Tracking
Exploring Dynamic Transformer for Efficient Object Tracking
Jiawen Zhu
Xin Chen
Haiwen Diao
Shuai Li
Jun-Yan He
Chenyang Li
Bin Luo
Dong Wang
Huchuan Lu
63
2
0
26 Mar 2024
Multiple Object Tracking as ID Prediction
Multiple Object Tracking as ID Prediction
Ruopeng Gao
Yijun Zhang
Limin Wang
66
12
0
25 Mar 2024
Previous
123456...202122
Next