ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXivPDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,082 papers shown
Title
ODTrack: Online Dense Temporal Token Learning for Visual Tracking
ODTrack: Online Dense Temporal Token Learning for Visual Tracking
Yaozong Zheng
Bineng Zhong
Qihua Liang
Zhiyi Mo
Shengping Zhang
Xianxian Li
VOT
31
48
0
03 Jan 2024
Temporal Adaptive RGBT Tracking with Modality Prompt
Temporal Adaptive RGBT Tracking with Modality Prompt
Hongyu Wang
Xiaotao Liu
Yifan Li
Meng Sun
Dian Yuan
Jing Liu
43
28
0
02 Jan 2024
Joint Generative Modeling of Scene Graphs and Images via Diffusion
  Models
Joint Generative Modeling of Scene Graphs and Images via Diffusion Models
Bicheng Xu
Qi Yan
Renjie Liao
Lele Wang
Leonid Sigal
DiffM
39
2
0
02 Jan 2024
Multiscale Vision Transformers meet Bipartite Matching for efficient
  single-stage Action Localization
Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization
Ioanna Ntinou
Enrique Sanchez
Georgios Tzimiropoulos
55
4
0
29 Dec 2023
Shape-IoU: More Accurate Metric considering Bounding Box Shape and Scale
Shape-IoU: More Accurate Metric considering Bounding Box Shape and Scale
Hao Zhang
Shuaijie Zhang
36
69
0
29 Dec 2023
Grasping, Part Identification, and Pose Refinement in One Shot with a
  Tactile Gripper
Grasping, Part Identification, and Pose Refinement in One Shot with a Tactile Gripper
Joyce Xin-Yan Lim
Quang Pham
19
1
0
29 Dec 2023
Bridging Modality Gap for Visual Grounding with Effecitve Cross-modal
  Distillation
Bridging Modality Gap for Visual Grounding with Effecitve Cross-modal Distillation
Jiaxi Wang
Wenhui Hu
Xueyang Liu
Beihu Wu
Yuting Qiu
Yingying Cai
26
0
0
29 Dec 2023
Generalized Mask-aware IoU for Anchor Assignment for Real-time Instance
  Segmentation
Generalized Mask-aware IoU for Anchor Assignment for Real-time Instance Segmentation
Baris Can Cam
Kemal Oksuz
Fehmi Kahraman
Z. S. Baltaci
Sinan Kalkan
Emre Akbas
17
0
0
28 Dec 2023
A Comprehensive Study of Object Tracking in Low-Light Environments
A Comprehensive Study of Object Tracking in Low-Light Environments
Anqi Yi
Nantheera Anantrasirichai
45
8
0
25 Dec 2023
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
Jiannan Wu
Yi Jiang
Bin Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
VOS
42
17
0
25 Dec 2023
Word length-aware text spotting: Enhancing detection and recognition in
  dense text image
Word length-aware text spotting: Enhancing detection and recognition in dense text image
Hao Wang
Huabing Zhou
Yanduo Zhang
Tao Lu
Jiayi Ma
38
1
0
25 Dec 2023
Cycle-Consistency Learning for Captioning and Grounding
Cycle-Consistency Learning for Captioning and Grounding
Ning Wang
Jiajun Deng
Mingbo Jia
ObjD
45
7
0
23 Dec 2023
FRED: Towards a Full Rotation-Equivariance in Aerial Image Object
  Detection
FRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection
Chanho Lee
Jinsu Son
Hyounguk Shon
Yunho Jeon
Junmo Kim
ObjD
30
8
0
22 Dec 2023
Context Enhanced Transformer for Single Image Object Detection
Context Enhanced Transformer for Single Image Object Detection
Seungjun An
Seonghoon Park
Gyeongnyeon Kim
Jeongyeol Baek
Byeongwon Lee
Seungryong Kim
ViT
43
3
0
22 Dec 2023
Prototype-based Cross-Modal Object Tracking
Prototype-based Cross-Modal Object Tracking
Lei Liu
Chenglong Li
Futian Wang
Longfeng Shen
Jin Tang
47
2
0
22 Dec 2023
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video
  Moment Retrieval
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval
Zhihang Liu
Jun Li
Hongtao Xie
Pandeng Li
Jiannan Ge
Sun-Ao Liu
Guoqing Jin
55
18
0
19 Dec 2023
Diffusion-Based Particle-DETR for BEV Perception
Diffusion-Based Particle-DETR for BEV Perception
Asen Nachkov
Martin Danelljan
D. Paudel
Luc Van Gool
DiffM
45
3
0
18 Dec 2023
Overcome the Fear Of Missing Out: Active Sensing UAV Scanning for
  Precision Agriculture
Overcome the Fear Of Missing Out: Active Sensing UAV Scanning for Precision Agriculture
Marios Krestenitis
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Konstantinos Ioannidis
Elias B. Kosmatopoulos
S. Vrochidis
55
6
0
15 Dec 2023
General Object Foundation Model for Images and Videos at Scale
General Object Foundation Model for Images and Videos at Scale
Junfeng Wu
Yi Jiang
Qihao Liu
Zehuan Yuan
Xiang Bai
Song Bai
VOS
VLM
43
39
0
14 Dec 2023
Class-Wise Buffer Management for Incremental Object Detection: An
  Effective Buffer Training Strategy
Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy
Junsu Kim
Sumin Hong
Chanwoo Kim
Jihyeon Kim
Yihalem Yimolal Tiruneh
Jeongwan On
Jihyun Song
Sunhwa Choi
Seungryul Baek
21
3
0
14 Dec 2023
UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation
UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation
Kefu Yi
Kai Luo
Xiaolei Luo
Jiangui Huang
Hao Wu
Rongdong Hu
Wei Hao
VOT
37
42
0
14 Dec 2023
Exploration of visual prompt in Grounded pre-trained open-set detection
Exploration of visual prompt in Grounded pre-trained open-set detection
Qibo Chen
Weizhong Jin
Shuchang Li
Mengdi Liu
Li Yu
Jian Jiang
Xiaozheng Wang
VLM
21
0
0
14 Dec 2023
SKDF: A Simple Knowledge Distillation Framework for Distilling
  Open-Vocabulary Knowledge to Open-world Object Detector
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector
Shuailei Ma
Yuefeng Wang
Ying-yu Wei
Jiaqi Fan
Enming Zhang
Xinyu Sun
Peihao Chen
ObjD
32
1
0
14 Dec 2023
Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable
  Attention and Query Aggregation
Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable Attention and Query Aggregation
Arul Selvam Periyasamy
Vladimir Tsaturyan
Sven Behnke
ViT
42
2
0
13 Dec 2023
Mono3DVG: 3D Visual Grounding in Monocular Images
Mono3DVG: 3D Visual Grounding in Monocular Images
Yangfan Zhan
Yuan. Yuan
Zhitong Xiong
MDE
36
9
0
13 Dec 2023
Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance
Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance
Kuan-Chih Huang
Yi-Hsuan Tsai
Ming-Hsuan Yang
3DPC
40
4
0
12 Dec 2023
Taking it further: leveraging pseudo labels for field delineation across
  label-scarce smallholder regions
Taking it further: leveraging pseudo labels for field delineation across label-scarce smallholder regions
Philippe Rufin
Sherrie Wang
Sá Nogueira Lisboa
Jan Hemmerling
M. Tulbure
P. Meyfroidt
15
3
0
12 Dec 2023
Edge Wasserstein Distance Loss for Oriented Object Detection
Edge Wasserstein Distance Loss for Oriented Object Detection
Yuke Zhu
Yumeng Ruan
Zihua Xiong
Sheng Guo
43
0
0
12 Dec 2023
ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for
  Cell Instance Segmentation
ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation
Ming Kang
Chee-Ming Ting
F. F. Ting
Raphaël C.-W. Phan
29
137
0
11 Dec 2023
You Only Learn One Query: Learning Unified Human Query for Single-Stage
  Multi-Person Multi-Task Human-Centric Perception
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
Sheng Jin
Shuhuai Li
Tong Li
Wentao Liu
Chao Qian
Ping Luo
44
5
0
09 Dec 2023
Design and Implementation of Automatic Assisted Aiming System For
  Robomaster EP Based on YOLOv5
Design and Implementation of Automatic Assisted Aiming System For Robomaster EP Based on YOLOv5
Junjia Qin
Kangli Xu
34
0
0
08 Dec 2023
Towards More Practical Group Activity Detection: A New Benchmark and
  Model
Towards More Practical Group Activity Detection: A New Benchmark and Model
Dongkeun Kim
Youngkil Song
Minsu Cho
Suha Kwak
33
2
0
05 Dec 2023
Lenna: Language Enhanced Reasoning Detection Assistant
Lenna: Language Enhanced Reasoning Detection Assistant
Fei Wei
Xinyu Zhang
Ailing Zhang
Bo Zhang
Xiangxiang Chu
MLLM
LRM
29
23
0
05 Dec 2023
Aligning and Prompting Everything All at Once for Universal Visual
  Perception
Aligning and Prompting Everything All at Once for Universal Visual Perception
Yunhang Shen
Chaoyou Fu
Peixian Chen
Mengdan Zhang
Ke Li
Xing Sun
Yunsheng Wu
Shaohui Lin
Rongrong Ji
VLM
ObjD
62
33
0
04 Dec 2023
Instance-guided Cartoon Editing with a Large-scale Dataset
Instance-guided Cartoon Editing with a Large-scale Dataset
Jian Lin
Chengze Li
Xueting Liu
Zhongping Ge
31
0
0
04 Dec 2023
Disentangled Interaction Representation for One-Stage Human-Object
  Interaction Detection
Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection
Xubin Zhong
Changxing Ding
Yupeng Hu
Dacheng Tao
36
0
0
04 Dec 2023
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal
  Sentence Grounding in Videos
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Pilhyeon Lee
Hyeran Byun
21
10
0
30 Nov 2023
ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model
ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model
Fukun Yin
Xin Chen
C. Zhang
Biao Jiang
Zibo Zhao
Jiayuan Fan
Gang Yu
Taihao Li
Tao Chen
37
20
0
29 Nov 2023
Centre Stage: Centricity-based Audio-Visual Temporal Action Detection
Centre Stage: Centricity-based Audio-Visual Temporal Action Detection
Hanyuan Wang
Majid Mirmehdi
Dima Damen
Toby Perrett
57
2
0
28 Nov 2023
REACT: Recognize Every Action Everywhere All At Once
REACT: Recognize Every Action Everywhere All At Once
N. V. R. Chappa
Pha Nguyen
P. Dobbs
Khoa Luu
46
6
0
27 Nov 2023
Segment Every Out-of-Distribution Object
Segment Every Out-of-Distribution Object
Wenjie Zhao
Jia Li
Xin Dong
Yu Xiang
Yunhui Guo
40
8
0
27 Nov 2023
Efficient Pre-training for Localized Instruction Generation of Videos
Efficient Pre-training for Localized Instruction Generation of Videos
Anil Batra
Davide Moltisanti
Laura Sevilla-Lara
Marcus Rohrbach
Frank Keller
41
0
0
27 Nov 2023
Temporal Action Localization for Inertial-based Human Activity
  Recognition
Temporal Action Localization for Inertial-based Human Activity Recognition
Marius Bock
Michael Moeller
Kristof Van Laerhoven
30
0
0
27 Nov 2023
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
Benjamin Kiefer
Lojze Žust
Matej Kristan
J. Pers
Matija Tersek
...
Magdalena Šumunec
Nadir Kapetanović
A. Michel
Wolfgang Gross
Martin Weinmann
33
4
0
23 Nov 2023
Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
  Advanced Object Detection
Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for Advanced Object Detection
Ahmed Sharshar
Aleksandr Matsun
21
2
0
21 Nov 2023
Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with
  Spatial Relation Matching
Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
Meng Chu
Zhedong Zheng
Wei Ji
Tingyu Wang
Tat-Seng Chua
28
10
0
21 Nov 2023
Attention-Based Real-Time Defenses for Physical Adversarial Attacks in
  Vision Applications
Attention-Based Real-Time Defenses for Physical Adversarial Attacks in Vision Applications
Giulio Rossolini
Alessandro Biondi
Giorgio Buttazzo
AAML
23
2
0
19 Nov 2023
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph
  Generation via Visual-Concept Alignment and Retention
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention
Zuyao Chen
Jinlin Wu
Zhen Lei
Zhaoxiang Zhang
Changwen Chen
40
11
0
18 Nov 2023
The Analysis and Extraction of Structure from Organizational Charts
The Analysis and Extraction of Structure from Organizational Charts
Nikhil Manali
David Doermann
Mahesh Desai
13
0
0
16 Nov 2023
Correlation-Guided Query-Dependency Calibration for Video Temporal
  Grounding
Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding
WonJun Moon
Sangeek Hyun
Subeen Lee
Jae-Pil Heo
34
4
0
15 Nov 2023
Previous
123...678...202122
Next