Attend and Interact: Higher-Order Object Interactions for Video Understanding

16 November 2017

Papers citing "Attend and Interact: Higher-Order Object Interactions for Video Understanding"

29 / 29 papers shown

Title
Learning Higher-order Object Interactions for Keypoint-based Video Understanding Yi Huang Asim Kadav Farley Lai Deep Patel H. Graf 15 1 0 16 May 2023
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions Yong-Lu Li Hongwei Fan Zuoyu Qiu Yiming Dou Liang Xu ... Peiyang Guo Haisheng Su Dongliang Wang Wei Yu Wu Cewu Lu 35 7 0 14 Nov 2022
Holistic Interaction Transformer Network for Action Detection Gueter Josmy Faure Min-Hung Chen S. Lai 33 37 0 23 Oct 2022
Is an Object-Centric Video Representation Beneficial for Transfer? Chuhan Zhang Ankush Gupta Andrew Zisserman ViT 37 27 0 20 Jul 2022
Exponential Separations in Symmetric Neural Networks Aaron Zweig Joan Bruna 32 7 0 02 Jun 2022
Learning What and Where: Disentangling Location and Identity Tracking Without Supervision Manuel Traub S. Otte Tobias Menge Matthias Karlbauer Jannik Thummel Martin Volker Butz 34 20 0 26 May 2022
Hierarchical Self-supervised Representation Learning for Movie Understanding Fanyi Xiao Kaustav Kundu Joseph Tighe Davide Modolo SSL 44 24 0 06 Apr 2022
EAN: Event Adaptive Network for Enhanced Action Recognition Yuan Tian Yichao Yan Guangtao Zhai G. Guo Zhiyong Gao 35 41 0 22 Jul 2021
Aligning Correlation Information for Domain Adaptation in Action Recognition Yuecong Xu Jianfei Yang Haozhi Cao K. Mao Jianxiong Yin Simon See 24 38 0 11 Jul 2021
Towards Long-Form Video Understanding Chaoxia Wu Philipp Krahenbuhl VLM ViT 49 165 0 21 Jun 2021
Coarse-Fine Networks for Temporal Activity Detection in Videos Kumara Kahatapitiya Michael S. Ryoo AI4TS 53 38 0 01 Mar 2021
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries Swathikiran Sudhakaran Sergio Escalera Oswald Lanz EgoV 27 15 0 16 Feb 2021
Adversarial Bipartite Graph Learning for Video Domain Adaptation Yadan Luo Zi Huang Zijian Wang Zheng-Wei Zhang Mahsa Baktashmotlagh 24 51 0 31 Jul 2020
Long Short-Term Relation Networks for Video Action Detection Dong Li Ting Yao Zhaofan Qiu Houqiang Li Tao Mei 12 22 0 31 Mar 2020
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation Boxiao Pan Haoye Cai De-An Huang Kuan-Hui Lee Adrien Gaidon Ehsan Adeli Juan Carlos Niebles 31 235 0 31 Mar 2020
Multi-modal Dense Video Captioning Vladimir E. Iashin Esa Rahtu 22 164 0 17 Mar 2020
Delving Deeper into the Decoder for Video Captioning Haoran Chen Jianmin Li Xiaolin Hu 43 34 0 16 Jan 2020
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs Jingwei Ji Ranjay Krishna Li Fei-Fei Juan Carlos Niebles 39 335 0 15 Dec 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering Cheng Zhang Wei-Lun Chao D. Xuan 23 50 0 28 Jul 2019
Composition-Aware Image Aesthetics Assessment Dong Liu R. Puri Nagendra Kamath Subhabrata Bhattacharya CoGe 25 59 0 25 Jul 2019
Representation Learning on Visual-Symbolic Graphs for Video Understanding E. Mavroudi Benjamín Béjar Haro René Vidal 27 8 0 17 May 2019
Context-aware Human Motion Prediction Enric Corona Albert Pumarola Guillem Alenyà Francesc Moreno-Noguer 3DH 11 139 0 06 Apr 2019
Grounded Video Description Luowei Zhou Yannis Kalantidis Xinlei Chen Jason J. Corso Marcus Rohrbach 27 190 0 17 Dec 2018
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition Hao Huang Luowei Zhou Wei Zhang Jason J. Corso Chenliang Xu 21 3 0 13 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding Chao-Yuan Wu Christoph Feichtenhofer Haoqi Fan Kaiming He Philipp Krahenbuhl Ross B. Girshick 41 477 0 12 Dec 2018
A Structured Model For Action Detection Yubo Zhang P. Tokmakov M. Hebert Cordelia Schmid 28 101 0 09 Dec 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification Yang Du Chunfen Yuan Bing Li Lili Zhao Yangxi Li Weiming Hu 81 79 0 03 Aug 2018
TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition Chih-Yao Ma Min-Hung Chen Z. Kira G. Al-Regib AI4TS 32 241 0 30 Mar 2017
Visual Translation Embedding Network for Visual Relation Detection Hanwang Zhang Zawlin Kyaw Shih-Fu Chang Tat-Seng Chua ViT 154 560 0 27 Feb 2017