ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Communities
  3. ...

Neighbor communities

0 / 0 papers shown
Title
Top Contributors
Name# Papers# Citations
Social Events
DateLocationEvent
  1. Home
  2. Communities
  3. ObjD

Object Detection

ObjD
More data

Identifies and localizes objects within images or videos. Fundamental for surveillance, robotics, and autonomous vehicles.

Neighbor communities

51015

Featured Papers

0 / 0 papers shown
Title

All papers

50 / 2,004 papers shown
Title
Improving Classification of Occluded Objects through Scene Context
Improving Classification of Occluded Objects through Scene Context
Courtney M. King
Daniel D. Leeds
Damian Lyons
George Kalaitzis
ObjD
54
0
0
30 Oct 2025
RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models
RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models
Zijun Liao
Yian Zhao
Xin Shan
Yu Yan
Chang Liu
Lei Lu
Xiangyang Ji
Jie Chen
ObjDVLM
78
0
0
29 Oct 2025
Prototype-Driven Adaptation for Few-Shot Object Detection
Prototype-Driven Adaptation for Few-Shot Object Detection
Yushen Huang
Zhiming Wang
ObjD
0
0
0
29 Oct 2025
PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity
PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity
Yuqian Yuan
W. Zhang
Xin Li
Shihao Wang
Kehan Li
Wentong Li
Jun Xiao
Lei Zhang
Beng Chin Ooi
ObjD
78
0
0
27 Oct 2025
A Training-Free Framework for Open-Vocabulary Image Segmentation and Recognition with EfficientNet and CLIP
A Training-Free Framework for Open-Vocabulary Image Segmentation and Recognition with EfficientNet and CLIP
Ying Dai
Wei Yu Chen
ObjDVLM
29
0
0
22 Oct 2025
Comparative Analysis of Object Detection Algorithms for Surface Defect Detection
Comparative Analysis of Object Detection Algorithms for Surface Defect Detection
Arpan Maity
Tamal Ghosh
ObjD
20
0
0
21 Oct 2025
On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
Yehonathan Refael
Amit Aides
Aviad Barzilai
George Leifman
Genady Beryozkin
Vered Silverman
Bolous Jaber
Tomer Shekel
ObjD
101
0
0
20 Oct 2025
Beat Tracking as Object Detection
Beat Tracking as Object Detection
Jaehoon Ahn
Moon-Ryul Jung
ObjD
45
0
0
16 Oct 2025
CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection
CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection
Hojun Choi
Youngsun Lim
Jaeyo Shin
Hyunjung Shim
ObjDLRMVLM
60
0
0
16 Oct 2025
Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU
Accelerated Feature Detectors for Visual SLAM: A Comparative Study of FPGA vs GPU
Ruiqi Ye
M. Luján
ObjD
12
0
0
15 Oct 2025
DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search
DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search
Kartik Narayan
Yang Xu
Tian Cao
Kavya Nerella
Vishal M. Patel
Navid Shiee
Peter Grasch
Chao Jia
Yinfei Yang
Zhe Gan
ObjDKELMVLM
56
0
0
14 Oct 2025
Detect Anything via Next Point Prediction
Detect Anything via Next Point Prediction
Qing Jiang
Junan Huo
Xingyu Chen
Yuda Xiong
Zhaoyang Zeng
Yihao Chen
Tianhe Ren
Junzhi Yu
Lei Zhang
ObjD
67
0
0
14 Oct 2025
When Does Supervised Training Pay Off? The Hidden Economics of Object Detection in the Era of Vision-Language Models
When Does Supervised Training Pay Off? The Hidden Economics of Object Detection in the Era of Vision-Language Models
Samer Al-Hamadani
ObjDVLM
1
0
0
13 Oct 2025
A Simple and Better Baseline for Visual Grounding
A Simple and Better Baseline for Visual Grounding
Jingchao Wang
Wenlong Zhang
Dingjiang Huang
Hong Wang
Yefeng Zheng
ObjD
9
0
0
12 Oct 2025
A Multimodal Depth-Aware Method For Embodied Reference Understanding
A Multimodal Depth-Aware Method For Embodied Reference Understanding
Fevziye Irem Eyiokur
Dogucan Yaman
H. K. Ekenel
Alexander Waibel
ObjD
80
0
0
09 Oct 2025
Ultralytics YOLO Evolution: An Overview of YOLO26, YOLO11, YOLOv8 and YOLOv5 Object Detectors for Computer Vision and Pattern Recognition
Ultralytics YOLO Evolution: An Overview of YOLO26, YOLO11, YOLOv8 and YOLOv5 Object Detectors for Computer Vision and Pattern Recognition
Ranjan Sapkota
Manoj Karkee
ObjDMU
116
0
0
06 Oct 2025
Cross-View Open-Vocabulary Object Detection in Aerial Imagery
Cross-View Open-Vocabulary Object Detection in Aerial Imagery
Jyoti Kini
Rohit Gupta
Mubarak Shah
ObjDVLM
62
0
0
04 Oct 2025
Referring Expression Comprehension for Small Objects
Referring Expression Comprehension for Small Objects
Kanoko Goto
Takumi Hirose
Mahiro Ukai
Shuhei Kurita
Nakamasa Inoue
ObjD
42
0
0
04 Oct 2025
CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning
CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning
Qihua Dong
Luis Figueroa
Handong Zhao
Kushal Kafle
Jason Kuen
Zhihong Ding
Scott D. Cohen
Y. Fu
ObjDLRM
48
0
0
03 Oct 2025
Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
Nirmal Elamon
Rouzbeh Davoudi
ObjD
40
0
0
03 Oct 2025
Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation
Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation
Jinchang Zhang
Zijun Li
Jiakai Lin
Guoyu Lu
ObjDVLM
16
0
0
01 Oct 2025
VLOD-TTA: Test-Time Adaptation of Vision-Language Object Detectors
VLOD-TTA: Test-Time Adaptation of Vision-Language Object Detectors
Atif Belal
H. R. Medeiros
M. Pedersoli
Eric Granger
ObjDVLM
16
0
0
01 Oct 2025
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
Peng Liu
H. Shen
Chunxin Fang
Zhicheng Sun
Jiajia Liao
T. Zhao
MLLMObjDVLMLRM
68
0
0
30 Sep 2025
YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
Ranjan Sapkota
Rahul Harsha Cheppally
Ajay Sharda
Manoj Karkee
ObjD
83
2
0
29 Sep 2025
Geo-R1: Unlocking VLM Geospatial Reasoning with Cross-View Reinforcement Learning
Geo-R1: Unlocking VLM Geospatial Reasoning with Cross-View Reinforcement Learning
Chenhui Xu
F. Yu
Michael J. Bianco
Jacob Kovarskiy
Raphael Tang
...
Rupanjali Kukal
Mikael Figueroa
Rishi Madhok
Nikolaos Karianakis
Jinjun Xiong
ObjDReLMLRM
14
0
0
29 Sep 2025
GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning
GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning
Mustansar Fiaz
Hiyam Debary
P. Fraccaro
D. Paudel
Luc Van Gool
Fahad Shahbaz Khan
Salman Khan
ObjDOffRLVLMLRM
18
0
0
29 Sep 2025
C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection
C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection
Siheng Wang
Zhengdao Li
Yanshu Li
Canran Xiao
Haibo Zhan
...
Zhikang Dong
Jifeng Shen
Junhao Dong
Qiang Sun
Piotr Koniusz
ObjDVLM
44
0
0
27 Sep 2025
Geo-R1: Improving Few-Shot Geospatial Referring Expression Understanding with Reinforcement Fine-Tuning
Geo-R1: Improving Few-Shot Geospatial Referring Expression Understanding with Reinforcement Fine-Tuning
Zilun Zhang
Zian Guan
T. Zhao
H. Shen
Tianyu Li
Yuxiang Cai
Zhonggen Su
Zhaojun Liu
Jianwei Yin
Xiang Li
ObjDLRM
64
0
0
26 Sep 2025
HierLight-YOLO: A Hierarchical and Lightweight Object Detection Network for UAV Photography
HierLight-YOLO: A Hierarchical and Lightweight Object Detection Network for UAV Photography
Defan Chen
Yaohua Hu
Luchan Zhang
ObjD
80
0
0
26 Sep 2025
MIRG-RL: Multi-Image Reasoning and Grounding with Reinforcement Learning
MIRG-RL: Multi-Image Reasoning and Grounding with Reinforcement Learning
Lihao Zheng
Jiawei Chen
Xintian Shen
Hao Ma
Tao Wei
ObjDOffRLVLMLRM
36
0
0
26 Sep 2025
GeoRef: Referring Expressions in Geometry via Task Formulation, Synthetic Supervision, and Reinforced MLLM-based Solutions
GeoRef: Referring Expressions in Geometry via Task Formulation, Synthetic Supervision, and Reinforced MLLM-based Solutions
Bing Liu
Wenqiang Yv
X. J. Yang
S. Wang
Junzhuo Liu
Peng Wang
G. Wang
Yang Yang
H. Shen
ObjD
28
0
0
25 Sep 2025
Real-Time Object Detection Meets DINOv3
Real-Time Object Detection Meets DINOv3
Shihua Huang
Yongjie Hou
Longfei Liu
Xuanlong Yu
Xi Shen
ObjD3DHPINNVLM
100
0
0
25 Sep 2025
RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images
RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images
Ke Li
Di Wang
Ting Wang
Fuyu Dong
Yiming Zhang
L. Zhang
X. Wang
Shaofeng Li
Quan Wang
ObjDVGen
20
0
0
23 Sep 2025
MVP: Motion Vector Propagation for Zero-Shot Video Object Detection
MVP: Motion Vector Propagation for Zero-Shot Video Object Detection
Binhua Huang
Ni Wang
Wendong Yao
Soumyabrata Dev
ObjDVLM
36
0
0
22 Sep 2025
UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning
UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning
Ye Liu
Zongyang Ma
Junfu Pu
Zhongang Qi
Yang Wu
Mingyu Ding
Chang Wen Chen
MLLMObjDLRM
94
0
0
22 Sep 2025
MO R-CNN: Multispectral Oriented R-CNN for Object Detection in Remote Sensing Image
MO R-CNN: Multispectral Oriented R-CNN for Object Detection in Remote Sensing Image
Leiyu Wang
Biao Jin
Feng Huang
Liqiong Chen
Zhengyong Wang
X. He
Honggang Chen
ObjD
45
0
0
21 Sep 2025
Enhanced Detection of Tiny Objects in Aerial Images
Enhanced Detection of Tiny Objects in Aerial Images
Kihyun Kim
Michalis Lazarou
Tania Stathaki
ObjD
20
0
0
21 Sep 2025
Catching the Details: Self-Distilled RoI Predictors for Fine-Grained MLLM Perception
Catching the Details: Self-Distilled RoI Predictors for Fine-Grained MLLM Perception
Yuheng Shi
Xiaohuan Pei
Minjing Dong
Chang Xu
ObjD
77
0
0
21 Sep 2025
Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection
Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection
Wenhuan Lu
Xinyue Song
Wenjun Ke
Zhizhi Yu
Wenhao Yang
Jianguo Wei
ObjD
4
0
0
20 Sep 2025
MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment
MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment
Elena Camuffo
F. Barbato
Mete Ozay
Simone Milani
Umberto Michieli
ObjD
118
0
0
17 Sep 2025
Improving Generalized Visual Grounding with Instance-aware Joint Learning
Improving Generalized Visual Grounding with Instance-aware Joint LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Ming Dai
Wenxuan Cheng
Jiang-Jiang Liu
Lingfeng Yang
Zhenhua Feng
Wankou Yang
Jingdong Wang
ObjDISeg
72
2
0
17 Sep 2025
Performance Optimization of YOLO-FEDER FusionNet for Robust Drone Detection in Visually Complex Environments
Performance Optimization of YOLO-FEDER FusionNet for Robust Drone Detection in Visually Complex Environments
Tamara R. Lenhard
Andreas Weinmann
Tobias Koch
ObjD
39
0
0
17 Sep 2025
Recurrent Cross-View Object Geo-Localization
Recurrent Cross-View Object Geo-Localization
Xiaohan Zhang
S. Cao
Xiaokai Bai
Yiming Li
Zhangkai Shen
Zhe Wu
Xiaoxi Hu
Hui-Liang Shen
ObjD
36
0
0
16 Sep 2025
Towards Understanding Visual Grounding in Visual Language Models
Towards Understanding Visual Grounding in Visual Language Models
Georgios Pantazopoulos
Eda B. Özyiğit
ObjD
109
0
0
12 Sep 2025
Zero-Shot Referring Expression Comprehension via Visual-Language True/False Verification
Zero-Shot Referring Expression Comprehension via Visual-Language True/False Verification
Jeffrey Liu
Rongbin Hu
ObjD
18
0
0
12 Sep 2025
A Co-Training Semi-Supervised Framework Using Faster R-CNN and YOLO Networks for Object Detection in Densely Packed Retail Images
A Co-Training Semi-Supervised Framework Using Faster R-CNN and YOLO Networks for Object Detection in Densely Packed Retail Images
Hossein Yazdanjouei
Arash Mansouri
Mohammad Shokouhifar
ObjD
68
0
0
11 Sep 2025
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
Jiangnan Xie
Xiaolong Zheng
Liang Zheng
ObjD
56
0
0
08 Sep 2025
Light-Weight Cross-Modal Enhancement Method with Benchmark Construction for UAV-based Open-Vocabulary Object Detection
Light-Weight Cross-Modal Enhancement Method with Benchmark Construction for UAV-based Open-Vocabulary Object Detection
Zhenhai Weng
Xinjie Li
Can Wu
Weijie He
Jianfeng Lv
Dong Zhou
Zhongliang Yu
ObjDVLM
108
0
0
07 Sep 2025
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
Ming Dai
Wenxuan Cheng
Jiedong Zhuang
Jiang-Jiang Liu
Hongshen Zhao
Zhenhua Feng
Wankou Yang
ObjD
84
3
0
05 Sep 2025
A Data-Driven RetinaNet Model for Small Object Detection in Aerial Images
A Data-Driven RetinaNet Model for Small Object Detection in Aerial Images
Zhicheng Tang
Jinwen Tang
Yi Shang
ObjD
44
0
0
03 Sep 2025
Loading #Papers per Month with "ObjD"
Past speakers
Name (-)
Top Contributors
Name (-)
Top Organizations at ResearchTrend.AI
Name (-)
Social Events
DateLocationEvent
No social events available