Action Genome: Actions as Composition of Spatio-temporal Scene Graphs

15 December 2019

Li Fei-Fei

Papers citing "Action Genome: Actions as Composition of Spatio-temporal Scene Graphs"

50 / 64 papers shown

Title
Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding Aaron Lohner Francesco Compagno Jonathan M Francis A. Oltramari 97 2 0 10 Jan 2025
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions Xiaoyang Liu Boran Wen Xinpeng Liu Zizheng Zhou Hongwei Fan Cewu Lu Lizhuang Ma Yulong Chen Yongqian Li 102 2 0 27 Dec 2024
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation Hang Zhang Zhuoling Li Jun Liu LRM 128 1 0 15 Dec 2024
HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation Trong-Thuan Nguyen Pha Nguyen J. Cothren Alper Yilmaz Khoa Luu 120 1 0 27 Nov 2024
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation Rohith Peddi Saurabh Ayush Abhay Shrivastava Parag Singla Vibhav Gogate 111 0 0 20 Nov 2024
Situational Scene Graph for Structured Human-centric Situation Understanding Chinthani Sugandhika Chen Li Deepu Rajan Basura Fernando 389 1 0 30 Oct 2024
Object-Attribute-Relation Representation Based Video Semantic Communication Qiyuan Du Yiping Duan Qianqian Yang Xiaoming Tao Mérouane Debbah 78 3 0 15 Jun 2024
3VL: Using Trees to Improve Vision-Language Models' Interpretability Nir Yellinek Leonid Karlinsky Raja Giryes CoGe VLM 171 4 0 28 Dec 2023
STUPD: A Synthetic Dataset for Spatial and Temporal Relation Reasoning Palaash Agrawal Haidi Azaman Cheston Tan 74 3 0 13 Sep 2023
DDS: Decoupled Dynamic Scene-Graph Generation Network A S M Iftekhar Raphael Ruschel Satish Kumar Suya You B. S. Manjunath 59 2 0 18 Jan 2023
ProtoGAN: Towards Few Shot Learning for Action Recognition Sai Kumar Dwivedi Vikram Gupta Rahul Mitra Shuaib Ahmed Arjun Jain 56 94 0 17 Sep 2019
Specifying Object Attributes and Relations in Interactive Scene Generation Oron Ashual Lior Wolf 130 179 0 11 Sep 2019
Explainable Video Action Reasoning via Prior Knowledge and State Transitions Tao Zhuo Zhiyong Cheng Peng Zhang Yongkang Wong Mohan Kankanhalli FAtt 47 61 0 28 Aug 2019
TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition M. Bishay Georgios Zoumpourlis Ioannis Patras ViT 49 155 0 21 Jul 2019
A Short Note on the Kinetics-700 Human Action Dataset João Carreira Eric Noland Chloe Hillier Andrew Zisserman 52 446 0 15 Jul 2019
Scene Graph Prediction with Limited Labels V. Chen P. Varma Ranjay Krishna Michael S. Bernstein Christopher Ré Li Fei-Fei 45 86 0 25 Apr 2019
Graphical Contrastive Losses for Scene Graph Parsing Ji Zhang Kevin J. Shih Ahmed Elgammal Andrew Tao Bryan Catanzaro 52 229 0 07 Mar 2019
Long-Term Feature Banks for Detailed Video Understanding Chao-Yuan Wu Christoph Feichtenhofer Haoqi Fan Kaiming He Philipp Krahenbuhl Ross B. Girshick 151 479 0 12 Dec 2018
SlowFast Networks for Video Recognition Christoph Feichtenhofer Haoqi Fan Jitendra Malik Kaiming He 146 3,244 0 10 Dec 2018
Video Action Transformer Network Rohit Girdhar João Carreira Carl Doersch Andrew Zisserman ViT 120 706 0 06 Dec 2018
Timeception for Complex Action Recognition Noureldien Hussein E. Gavves A. Smeulders 99 213 0 04 Dec 2018
TSM: Temporal Shift Module for Efficient Video Understanding Ji Lin Chuang Gan Song Han 78 1,677 0 20 Nov 2018
Graph R-CNN for Scene Graph Generation Jianwei Yang Jiasen Lu Stefan Lee Dhruv Batra Devi Parikh GNN 93 839 0 01 Aug 2018
Actor-Centric Relation Network Chen Sun Abhinav Shrivastava Carl Vondrick Kevin Patrick Murphy Rahul Sukthankar Cordelia Schmid 78 220 0 28 Jul 2018
A Better Baseline for AVA Rohit Girdhar João Carreira Carl Doersch Andrew Zisserman 50 67 0 26 Jul 2018
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation Yikang Li Wanli Ouyang Bolei Zhou Jianping Shi Yawen Cui Xiaogang Wang GNN 43 273 0 29 Jun 2018
Object Level Visual Reasoning in Videos Fabien Baradel Natalia Neverova Christian Wolf J. Mille Greg Mori 75 163 0 16 Jun 2018
Videos as Space-Time Region Graphs Xinyu Wang Abhinav Gupta 67 753 0 05 Jun 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset Dima Damen Hazel Doughty G. Farinella Sanja Fidler Antonino Furnari ... Davide Moltisanti Jonathan Munro Toby Perrett Will Price Michael Wray EgoV 72 1,011 0 08 Apr 2018
Image Generation from Scene Graphs Justin Johnson Agrim Gupta Li Fei-Fei GNN 280 818 0 04 Apr 2018
Referring Relationships Ranjay Krishna Ines Chami Michael S. Bernstein Li Fei-Fei 52 94 0 28 Mar 2018
Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction Roei Herzig Moshiko Raboh Gal Chechik Jonathan Berant Amir Globerson GNN OCL 56 133 0 15 Feb 2018
A Generative Approach to Zero-Shot and Few-Shot Action Recognition Ashish Mishra Vinay Kumar Verma M. K. Reddy Arulkumar Subramaniam Piyush Rai Anurag Mittal VLM GAN 58 133 0 27 Jan 2018
Temporal Relational Reasoning in Videos Bolei Zhou A. Andonian Aude Oliva Antonio Torralba NAI 78 1,035 0 22 Nov 2017
Non-local Neural Networks Xinyu Wang Ross B. Girshick Abhinav Gupta Kaiming He OffRL 215 8,867 0 21 Nov 2017
Neural Motifs: Scene Graph Parsing with Global Context Rowan Zellers Mark Yatskar Sam Thomson Yejin Choi GNN 71 992 0 17 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding Chih-Yao Ma Asim Kadav I. Melvin Z. Kira G. Al-Regib H. Graf 47 145 0 16 Nov 2017
Scene Graph Generation from Objects, Phrases and Region Captions Yikang Li Wanli Ouyang Bolei Zhou Kun Wang Xiaogang Wang 67 501 0 31 Jul 2017
Pixels to Graphs by Associative Embedding Alejandro Newell Jia Deng GNN VOS 61 232 0 22 Jun 2017
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions Chunhui Gu Chen Sun David A. Ross Carl Vondrick C. Pantofaru ... G. Toderici Susanna Ricco Rahul Sukthankar Cordelia Schmid Jitendra Malik VGen 87 1,021 0 23 May 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset João Carreira Andrew Zisserman 199 7,961 0 22 May 2017
The Kinetics Human Action Video Dataset W. Kay João Carreira Karen Simonyan Brian Zhang Chloe Hillier ... Tim Green T. Back Apostol Natsev Mustafa Suleyman Andrew Zisserman 200 3,771 0 19 May 2017
Inferring and Executing Programs for Visual Reasoning Justin Johnson B. Hariharan Laurens van der Maaten Judy Hoffman Li Fei-Fei C. L. Zitnick Ross B. Girshick NAI 61 543 0 10 May 2017
Dense-Captioning Events in Videos Ranjay Krishna Kenji Hata F. Ren Li Fei-Fei Juan Carlos Niebles 120 1,225 0 02 May 2017
Detecting Visual Relationships with Deep Relational Networks Bo Dai Yuqi Zhang Dahua Lin GNN 83 501 0 11 Apr 2017
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection Xiaodan Liang Lisa Lee Eric Xing 51 251 0 08 Mar 2017
Scene Graph Generation by Iterative Message Passing Danfei Xu Yuke Zhu Chris Choy Li Fei-Fei GNN 3DV 64 1,214 0 10 Jan 2017
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization Ramprasaath R. Selvaraju Michael Cogswell Abhishek Das Ramakrishna Vedantam Devi Parikh Dhruv Batra FAtt 216 19,796 0 07 Oct 2016
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao Dahua Lin Xiaoou Tang Luc Van Gool ViT 90 3,814 0 02 Aug 2016
Visual Relationship Detection with Language Priors Cewu Lu Ranjay Krishna Michael S. Bernstein Li Fei-Fei VLM 55 1,137 0 31 Jul 2016