Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition

22 October 2020

Papers citing "Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition"

50 / 65 papers shown

Accuracy and Performance Comparison of Video Action Recognition Approaches Matthew Hutchinson S. Samsi William Arcand David Bestor Bill Bergeron ... Andrew Prout Antonio Rosa Albert Reuther Charles Yee V. Gadepally 36 5 0 20 Aug 2020
Directional Temporal Modeling for Action Recognition Xinyu Li Bing Shuai Joseph Tighe 48 41 0 21 Jul 2020
Temporal Distinct Representation Learning for Action Recognition Junwu Weng Donghao Luo Yabiao Wang Ying Tai Chengjie Wang Jilin Li Feiyue Huang Xudong Jiang Junsong Yuan 55 26 0 15 Jul 2020
X3D: Expanding Architectures for Efficient Video Recognition Christoph Feichtenhofer 125 1,019 0 09 Apr 2020
Temporal Pyramid Network for Action Recognition Ceyuan Yang Yinghao Xu Jianping Shi Bo Dai Bolei Zhou 47 372 0 07 Apr 2020
Gate-Shift Networks for Video Action Recognition Swathikiran Sudhakaran Sergio Escalera Oswald Lanz 3DPC 55 155 0 01 Dec 2019
TEINet: Towards an Efficient Architecture for Video Recognition Zhaoyang Liu Donghao Luo Yabiao Wang Limin Wang Ying Tai Chengjie Wang Jilin Li Feiyue Huang Tong Lu ViT 77 240 0 21 Nov 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition Chenxu Luo Alan Yuille 151 151 0 28 Sep 2019
Action recognition with spatial-temporal discriminative filter banks Brais Martínez Davide Modolo Yuanjun Xiong Joseph Tighe 51 66 0 20 Aug 2019
STM: SpatioTemporal and Motion Encoding for Action Recognition Boyuan Jiang Mengmeng Wang Weihao Gan Wei Wu Junjie Yan 79 382 0 07 Aug 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling Laura Sevilla-Lara Shengxin Cindy Zha Zhicheng Yan Vedanuj Goswami Matt Feiszli Lorenzo Torresani 73 75 0 19 Jul 2019
Video Modeling with Correlation Networks Heng Wang Du Tran Lorenzo Torresani Matt Feiszli 57 128 0 07 Jun 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures Michael S. Ryoo A. Piergiovanni Mingxing Tan A. Angelova 50 102 0 30 May 2019
Large-scale weakly-supervised pre-training for video action recognition Deepti Ghadiyaram Matt Feiszli Du Tran Xueting Yan Heng Wang D. Mahajan 59 299 0 02 May 2019
Video Classification with Channel-Separated Convolutional Networks Du Tran Heng Wang Lorenzo Torresani Matt Feiszli 3DV 61 586 0 04 Apr 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition Chao Li Qiaoyong Zhong Di Xie Shiliang Pu 58 82 0 04 Mar 2019
DistInit: Learning Video Representations Without a Single Labeled Video Rohit Girdhar Du Tran Lorenzo Torresani Deva Ramanan 45 54 0 26 Jan 2019
SlowFast Networks for Video Recognition Christoph Feichtenhofer Haoqi Fan Jitendra Malik Kaiming He 164 3,272 0 10 Dec 2018
Video Action Transformer Network Rohit Girdhar João Carreira Carl Doersch Andrew Zisserman ViT 124 708 0 06 Dec 2018
Timeception for Complex Action Recognition Noureldien Hussein E. Gavves A. Smeulders 101 214 0 04 Dec 2018
Rethinking ImageNet Pre-training Kaiming He Ross B. Girshick Piotr Dollár VLM SSeg 125 1,084 0 21 Nov 2018
TSM: Temporal Shift Module for Efficient Video Understanding Ji Lin Chuang Gan Song Han 85 1,688 0 20 Nov 2018
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition Dongliang He Zhichao Zhou Chuang Gan Fu Li Xiao Liu Yandong Li Limin Wang Shilei Wen 73 133 0 05 Nov 2018
Representation Flow for Action Recognition A. Piergiovanni Michael S. Ryoo 75 147 0 02 Oct 2018
W-TALC: Weakly-supervised Temporal Activity Localization and Classification S. Paul Sourya Roy Amit K. Roy-Chowdhury 77 309 0 27 Jul 2018
Motion Feature Network: Fixed Motion Filter for Action Recognition Myunggi Lee Seungeui Lee S. Son Gyutae Park Nojun Kwak 75 122 0 26 Jul 2018
Spatio-Temporal Channel Correlation Networks for Action Classification Ali Diba Mohsen Fayyaz Vivek Sharma M. M. Arzani Rahman Yousefzadeh Juergen Gall Luc Van Gool 3DPC 65 181 0 19 Jun 2018
ECO: Efficient Convolutional Network for Online Video Understanding Mohammadreza Zolfaghari Kamaljeet Singh Thomas Brox 183 498 0 24 Apr 2018
Group Normalization Yuxin Wu Kaiming He 221 3,652 0 22 Mar 2018
Moments in Time Dataset: one million videos for event understanding Mathew Monfort A. Andonian Bolei Zhou K. Ramakrishnan Sarah Adel Bargal ... L. Brown Quanfu Fan Dan Gutfreund Carl Vondrick A. Oliva 92 548 0 09 Jan 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification Saining Xie Chen Sun Jonathan Huang Zhuowen Tu Kevin Patrick Murphy 3DH 137 1,328 0 13 Dec 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition Du Tran Heng Wang Lorenzo Torresani Jamie Ray Yann LeCun Manohar Paluri 200 3,029 0 30 Nov 2017
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition Shuyang Sun Zhanghui Kuang Wanli Ouyang Lu Sheng Wayne Zhang 74 296 0 29 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks Zhaofan Qiu Ting Yao Tao Mei 84 1,661 0 28 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? Kensho Hara Hirokatsu Kataoka Y. Satoh 3DPC 118 1,934 0 27 Nov 2017
Appearance-and-Relation Networks for Video Classification Limin Wang Wei Li Wen Li Luc Van Gool 65 351 0 24 Nov 2017
Temporal Relational Reasoning in Videos Bolei Zhou A. Andonian Aude Oliva Antonio Torralba NAI 93 1,039 0 22 Nov 2017
Non-local Neural Networks Xiaolong Wang Ross B. Girshick Abhinav Gupta Kaiming He OffRL 283 8,902 0 21 Nov 2017
Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition Kensho Hara Hirokatsu Kataoka Y. Satoh 3DPC 74 605 0 25 Aug 2017
What Actions are Needed for Understanding Human Actions in Videos? Gunnar Sigurdsson Olga Russakovsky Abhinav Gupta 47 133 0 09 Aug 2017
The "something something" video database for learning and evaluating visual common sense Raghav Goyal Samira Ebrahimi Kahou Vincent Michalski Joanna Materzynska S. Westphal ... Moritz Mueller-Freitag F. Hoppe Christian Thurau Ingo Bax Roland Memisevic VLM 82 1,530 0 13 Jun 2017
Collaborative Summarization of Topic-Related Videos Rameswar Panda Amit K. Roy-Chowdhury EgoV 47 79 0 09 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset João Carreira Andrew Zisserman 221 8,012 0 22 May 2017
The Kinetics Human Action Video Dataset W. Kay João Carreira Karen Simonyan Brian Zhang Chloe Hillier ... Tim Green T. Back Apostol Natsev Mustafa Suleyman Andrew Zisserman 235 3,801 0 19 May 2017
ActionVLAD: Learning spatio-temporal aggregation for action classification Rohit Girdhar Deva Ramanan Abhinav Gupta Josef Sivic Bryan C. Russell AI4TS 70 451 0 10 Apr 2017
UntrimmedNets for Weakly Supervised Action Recognition and Detection Limin Wang Yuanjun Xiong Dahua Lin Luc Van Gool 55 491 0 09 Mar 2017
Spatiotemporal Residual Networks for Video Action Recognition Christoph Feichtenhofer A. Pinz Richard P. Wildes 102 719 0 07 Nov 2016
Weakly supervised learning of actions from transcripts Hilde Kuehne Alexander Richard Juergen Gall 49 118 0 07 Oct 2016
YouTube-8M: A Large-Scale Video Classification Benchmark Sami Abu-El-Haija Nisarg Kothari Joonseok Lee Apostol Natsev G. Toderici Balakrishnan Varadarajan Sudheendra Vijayanarasimhan VLM 136 1,268 0 27 Sep 2016
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao Dahua Lin Xiaoou Tang Luc Van Gool ViT 100 3,831 0 02 Aug 2016