ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.13682
  4. Cited By
HOTR: End-to-End Human-Object Interaction Detection with Transformers

HOTR: End-to-End Human-Object Interaction Detection with Transformers

28 April 2021
Bumsoo Kim
Junhyun Lee
Jaewoo Kang
Eun-Sol Kim
Hyunwoo J. Kim
    ViT
ArXivPDFHTML

Papers citing "HOTR: End-to-End Human-Object Interaction Detection with Transformers"

50 / 145 papers shown
Title
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
66
0
0
28 Apr 2025
Hoi2Anomaly: An Explainable Anomaly Detection Approach Guided by Human-Object Interaction
Hoi2Anomaly: An Explainable Anomaly Detection Approach Guided by Human-Object Interaction
Yuhan Wang
Cheng Liu
Daou Zhang
Weichao Wu
41
0
0
13 Mar 2025
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features
Ruochen Zhang
Hyeung-Sik Choi
Dongwook Jung
Phan Huy Nam Anh
Sang-Ki Jeong
Zihao Zhu
3DPC
MDE
32
0
0
08 Jan 2025
Orchestrating the Symphony of Prompt Distribution Learning for
  Human-Object Interaction Detection
Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection
Mingda Jia
Liming Zhao
Ge Li
Yun Zheng
VLM
73
0
0
11 Dec 2024
VLM-HOI: Vision Language Models for Interpretable Human-Object
  Interaction Analysis
VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis
Donggoo Kang
Dasol Jeong
Hyunmin Lee
Sangwoo Park
Hasil Park
Sunkyu Kwon
Yeongjoon Kim
Joonki Paik
MLLM
VLM
74
0
0
27 Nov 2024
Human-Object Interaction Detection Collaborated with Large
  Relation-driven Diffusion Models
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models
Liulei Li
Wenguan Wang
Y. Yang
42
7
0
26 Oct 2024
CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision
  Large Language Models
CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision Large Language Models
J. Gao
Chen Cai
Ruoyu Wang
Wenyang Liu
Kim-Hui Yap
Kratika Garg
Boon-Siew Han
VLM
23
0
0
21 Oct 2024
Visual-Geometric Collaborative Guidance for Affordance Learning
Visual-Geometric Collaborative Guidance for Affordance Learning
Hongchen Luo
Wei-dong Zhai
J. Wang
Yang Cao
Zheng-jun Zha
20
0
0
15 Oct 2024
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Minghan Chen
Guikun Chen
Wenguan Wang
Yi Yang
56
3
0
16 Sep 2024
Merging Multiple Datasets for Improved Appearance-Based Gaze Estimation
Merging Multiple Datasets for Improved Appearance-Based Gaze Estimation
Liang Wu
Bertram E. Shi
26
1
0
02 Sep 2024
A Review of Human-Object Interaction Detection
A Review of Human-Object Interaction Detection
Yuxiao Wang
Qiwei Xiong
Yu Lei
Weiying Xue
Qi Liu
Zhenao Wei
52
2
0
20 Aug 2024
Towards Flexible Visual Relationship Segmentation
Towards Flexible Visual Relationship Segmentation
Fangrui Zhu
Jianwei Yang
Huaizu Jiang
VOS
34
1
0
15 Aug 2024
UAHOI: Uncertainty-aware Robust Interaction Learning for HOI Detection
UAHOI: Uncertainty-aware Robust Interaction Learning for HOI Detection
Mu Chen
Minghan Chen
Yi Yang
54
4
0
14 Aug 2024
Efficient Human-Object-Interaction (EHOI) Detection via Interaction
  Label Coding and Conditional Decision
Efficient Human-Object-Interaction (EHOI) Detection via Interaction Label Coding and Conditional Decision
Tsung-Shan Yang
Yun Cheng Wang
Chengwei Wei
Suya You
C.-C. Jay Kuo
51
0
0
13 Aug 2024
An analysis of HOI: using a training-free method with multimodal visual
  foundation models when only the test set is available, without the training
  set
An analysis of HOI: using a training-free method with multimodal visual foundation models when only the test set is available, without the training set
Chaoyi Ai
VLM
33
0
0
11 Aug 2024
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
Ting Lei
Shaofeng Yin
Yuxin Peng
Yang Liu
VLM
29
5
0
05 Aug 2024
A Plug-and-Play Method for Rare Human-Object Interactions Detection by
  Bridging Domain Gap
A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap
Lijun Zhang
Wei Suo
Yiyan Qi
Yanning Zhang
27
2
0
31 Jul 2024
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
Peng Hao
Xiaobing Wang
Yingying Jiang
Hanchao Jia
Xiaoshuai Hao
Shaowei Cui
Junhang Wei
Xiaoshuai Hao
57
3
0
26 Jul 2024
CycleHOI: Improving Human-Object Interaction Detection with Cycle
  Consistency of Detection and Generation
CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation
Yisen Wang
Yao Teng
Limin Wang
DiffM
41
1
0
16 Jul 2024
Human-Centric Transformer for Domain Adaptive Action Recognition
Human-Centric Transformer for Domain Adaptive Action Recognition
Kun-Yu Lin
Jiaming Zhou
Wei-Shi Zheng
31
6
0
15 Jul 2024
A Fair Ranking and New Model for Panoptic Scene Graph Generation
A Fair Ranking and New Model for Panoptic Scene Graph Generation
Julian Lorenz
Alexander Pest
Daniel Kienzle
K. Ludwig
Rainer Lienhart
46
1
0
12 Jul 2024
Nonverbal Interaction Detection
Nonverbal Interaction Detection
Jianan Wei
Tianfei Zhou
Yi Yang
Wenguan Wang
38
4
0
11 Jul 2024
Geometric Features Enhanced Human-Object Interaction Detection
Geometric Features Enhanced Human-Object Interaction Detection
Manli Zhu
Edmond S. L. Ho
Shuang Chen
Longzhi Yang
Hubert P. H. Shum
37
1
0
26 Jun 2024
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie-jin Yang
Bingliang Li
Ailing Zeng
L. Zhang
Ruimao Zhang
VLM
32
8
0
11 Jun 2024
Active Object Detection with Knowledge Aggregation and Distillation from
  Large Models
Active Object Detection with Knowledge Aggregation and Distillation from Large Models
Dejie Yang
Yang Liu
37
3
0
21 May 2024
Learning from Observer Gaze:Zero-Shot Attention Prediction Oriented by
  Human-Object Interaction Recognition
Learning from Observer Gaze:Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition
Yuchen Zhou
Linkai Liu
Chao Gou
38
3
0
16 May 2024
Two in One Go: Single-stage Emotion Recognition with Decoupled
  Subject-context Transformer
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Xinpeng Li
Teng Wang
Jian Zhao
Shuyi Mao
Jinbao Wang
Feng Zheng
Xiaojiang Peng
Xuelong Li
28
1
0
26 Apr 2024
A Review and Efficient Implementation of Scene Graph Generation Metrics
A Review and Efficient Implementation of Scene Graph Generation Metrics
Julian Lorenz
Robin Schon
K. Ludwig
Rainer Lienhart
3DV
28
1
0
15 Apr 2024
Exploring the Potential of Large Foundation Models for Open-Vocabulary
  HOI Detection
Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection
Ting Lei
Shaofeng Yin
Yang Liu
VLM
47
9
0
09 Apr 2024
Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote
  Sensing Image Understanding
Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Run Shao
Zhaoyang Zhang
Chao Tao
Yunsheng Zhang
Chengli Peng
Haifeng Li
VLM
35
5
0
27 Mar 2024
Groupwise Query Specialization and Quality-Aware Multi-Assignment for
  Transformer-based Visual Relationship Detection
Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection
Jongha Kim
Jihwan Park
Jinyoung Park
Jinyoung Kim
Sehyung Kim
Hyunwoo J. Kim
44
4
0
26 Mar 2024
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship
  Detection
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
Tim Salzmann
Markus Ryll
Alex Bewley
Matthias Minderer
50
4
0
21 Mar 2024
Towards Zero-shot Human-Object Interaction Detection via Vision-Language
  Integration
Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration
Weiying Xue
Qi Liu
Qiwei Xiong
Yuxiao Wang
Zhenao Wei
Xiaofen Xing
Xiangmin Xu
VLM
42
3
0
12 Mar 2024
FreeA: Human-object Interaction Detection using Free Annotation Labels
FreeA: Human-object Interaction Detection using Free Annotation Labels
Yuxiao Wang
Yuxiao Wang
Xinyu Jiang
Yu Lei
Zhenao Wei
Jinxiu Liu
Qi Liu
Weiying Xue
VLM
32
1
0
04 Mar 2024
PBADet: A One-Stage Anchor-Free Approach for Part-Body Association
PBADet: A One-Stage Anchor-Free Approach for Part-Body Association
Zhongpai Gao
Huayi Zhou
Abhishek Sharma
Meng Zheng
Benjamin Planche
Terrence Chen
Ziyan Wu
37
1
0
12 Feb 2024
SGTR+: End-to-end Scene Graph Generation with Transformer
SGTR+: End-to-end Scene Graph Generation with Transformer
Rongjie Li
Songyang Zhang
Xuming He
ViT
34
2
0
23 Jan 2024
Exploring Self- and Cross-Triplet Correlations for Human-Object
  Interaction Detection
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection
Weibo Jiang
Weihong Ren
Jiandong Tian
Liangqiong Qu
Zhiyong Wang
Honghai Liu
22
4
0
11 Jan 2024
Expediting Contrastive Language-Image Pretraining via Self-distilled
  Encoders
Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders
Bumsoo Kim
Jinhyung Kim
Yeonsik Jo
S. Kim
VLM
23
3
0
19 Dec 2023
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
Yuhang Yang
Wei Zhai
Hongcheng Luo
Yang Cao
Zheng-Jun Zha
25
23
0
14 Dec 2023
InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models
InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models
Jiun Tian Hoe
Xudong Jiang
Chee Seng Chan
Yap-Peng Tan
Weipeng Hu
19
11
0
10 Dec 2023
Disentangled Interaction Representation for One-Stage Human-Object
  Interaction Detection
Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection
Xubin Zhong
Changxing Ding
Yupeng Hu
Dacheng Tao
23
0
0
04 Dec 2023
Zero-shot Referring Expression Comprehension via Structural Similarity
  Between Images and Captions
Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions
Zeyu Han
Fangrui Zhu
Qianru Lao
Huaizu Jiang
ObjD
27
11
0
28 Nov 2023
Neural-Logic Human-Object Interaction Detection
Neural-Logic Human-Object Interaction Detection
Liulei Li
Jianan Wei
Wenguan Wang
Yi Yang
43
16
0
16 Nov 2023
DRUformer: Enhancing the driving scene Important object detection with
  driving relationship self-understanding
DRUformer: Enhancing the driving scene Important object detection with driving relationship self-understanding
Yingjie Niu
Ming Ding
Keisuke Fujii
Kento Ohtani
Alexander Carballo
K. Takeda
ViT
38
0
0
11 Nov 2023
Detecting Any Human-Object Interaction Relationship: Universal HOI
  Detector with Spatial Prompt Learning on Foundation Models
Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models
Yichao Cao
Qingfei Tang
Xiu Su
Chen Song
Shan You
Xiaobo Lu
Chang Xu
30
21
0
07 Nov 2023
Towards a Unified Transformer-based Framework for Scene Graph Generation
  and Human-object Interaction Detection
Towards a Unified Transformer-based Framework for Scene Graph Generation and Human-object Interaction Detection
Tao He
Lianli Gao
Jingkuan Song
Yuan-Fang Li
ViT
24
11
0
03 Nov 2023
Sharingan: A Transformer-based Architecture for Gaze Following
Sharingan: A Transformer-based Architecture for Gaze Following
Samy Tafasca
Anshul Gupta
J. Odobez
ViT
21
3
0
01 Oct 2023
A Hierarchical Graph-based Approach for Recognition and Description
  Generation of Bimanual Actions in Videos
A Hierarchical Graph-based Approach for Recognition and Description Generation of Bimanual Actions in Videos
Fatemeh Ziaeetabar
Reza Safabakhsh
S. Momtazi
M. Tamosiunaite
F. Worgotter
17
1
0
01 Oct 2023
DECO: Dense Estimation of 3D Human-Scene Contact In The Wild
DECO: Dense Estimation of 3D Human-Scene Contact In The Wild
Shashank Tripathi
Agniv Chatterjee
Jean-Claude Passy
Hongwei Yi
Dimitrios Tzionas
Michael J. Black
3DH
27
21
0
26 Sep 2023
Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge
  Distillation at Multiple Levels
Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels
Bo Wan
Tinne Tuytelaars
VLM
24
3
0
10 Sep 2023
123
Next