Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.11757
Cited By
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
22 October 2020
Chun-Fu Chen
Yikang Shen
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition"
50 / 65 papers shown
Title
Accuracy and Performance Comparison of Video Action Recognition Approaches
Matthew Hutchinson
S. Samsi
William Arcand
David Bestor
Bill Bergeron
...
Andrew Prout
Antonio Rosa
Albert Reuther
Charles Yee
V. Gadepally
36
5
0
20 Aug 2020
Directional Temporal Modeling for Action Recognition
Xinyu Li
Bing Shuai
Joseph Tighe
48
41
0
21 Jul 2020
Temporal Distinct Representation Learning for Action Recognition
Junwu Weng
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Xudong Jiang
Junsong Yuan
55
26
0
15 Jul 2020
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
125
1,019
0
09 Apr 2020
Temporal Pyramid Network for Action Recognition
Ceyuan Yang
Yinghao Xu
Jianping Shi
Bo Dai
Bolei Zhou
47
372
0
07 Apr 2020
Gate-Shift Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
3DPC
55
155
0
01 Dec 2019
TEINet: Towards an Efficient Architecture for Video Recognition
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
77
240
0
21 Nov 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Chenxu Luo
Alan Yuille
151
151
0
28 Sep 2019
Action recognition with spatial-temporal discriminative filter banks
Brais Martínez
Davide Modolo
Yuanjun Xiong
Joseph Tighe
51
66
0
20 Aug 2019
STM: SpatioTemporal and Motion Encoding for Action Recognition
Boyuan Jiang
Mengmeng Wang
Weihao Gan
Wei Wu
Junjie Yan
79
382
0
07 Aug 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
73
75
0
19 Jul 2019
Video Modeling with Correlation Networks
Heng Wang
Du Tran
Lorenzo Torresani
Matt Feiszli
57
128
0
07 Jun 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures
Michael S. Ryoo
A. Piergiovanni
Mingxing Tan
A. Angelova
50
102
0
30 May 2019
Large-scale weakly-supervised pre-training for video action recognition
Deepti Ghadiyaram
Matt Feiszli
Du Tran
Xueting Yan
Heng Wang
D. Mahajan
59
299
0
02 May 2019
Video Classification with Channel-Separated Convolutional Networks
Du Tran
Heng Wang
Lorenzo Torresani
Matt Feiszli
3DV
61
586
0
04 Apr 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
58
82
0
04 Mar 2019
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
45
54
0
26 Jan 2019
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
164
3,272
0
10 Dec 2018
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
124
708
0
06 Dec 2018
Timeception for Complex Action Recognition
Noureldien Hussein
E. Gavves
A. Smeulders
101
214
0
04 Dec 2018
Rethinking ImageNet Pre-training
Kaiming He
Ross B. Girshick
Piotr Dollár
VLM
SSeg
125
1,084
0
21 Nov 2018
TSM: Temporal Shift Module for Efficient Video Understanding
Ji Lin
Chuang Gan
Song Han
85
1,688
0
20 Nov 2018
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition
Dongliang He
Zhichao Zhou
Chuang Gan
Fu Li
Xiao-Chang Liu
Yandong Li
Limin Wang
Shilei Wen
73
133
0
05 Nov 2018
Representation Flow for Action Recognition
A. Piergiovanni
Michael S. Ryoo
75
147
0
02 Oct 2018
W-TALC: Weakly-supervised Temporal Activity Localization and Classification
S. Paul
Sourya Roy
Amit K. Roy-Chowdhury
77
309
0
27 Jul 2018
Motion Feature Network: Fixed Motion Filter for Action Recognition
Myunggi Lee
Seungeui Lee
S. Son
Gyutae Park
Nojun Kwak
75
122
0
26 Jul 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
65
181
0
19 Jun 2018
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
183
498
0
24 Apr 2018
Group Normalization
Yuxin Wu
Kaiming He
221
3,652
0
22 Mar 2018
Moments in Time Dataset: one million videos for event understanding
Mathew Monfort
A. Andonian
Bolei Zhou
K. Ramakrishnan
Sarah Adel Bargal
...
L. Brown
Quanfu Fan
Dan Gutfreund
Carl Vondrick
A. Oliva
92
548
0
09 Jan 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
137
1,328
0
13 Dec 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
200
3,029
0
30 Nov 2017
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
Shuyang Sun
Zhanghui Kuang
Wanli Ouyang
Lu Sheng
Wayne Zhang
74
296
0
29 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Zhaofan Qiu
Ting Yao
Tao Mei
84
1,661
0
28 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
118
1,934
0
27 Nov 2017
Appearance-and-Relation Networks for Video Classification
Limin Wang
Wei Li
Wen Li
Luc Van Gool
65
351
0
24 Nov 2017
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
93
1,039
0
22 Nov 2017
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
283
8,902
0
21 Nov 2017
Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
74
605
0
25 Aug 2017
What Actions are Needed for Understanding Human Actions in Videos?
Gunnar Sigurdsson
Olga Russakovsky
Abhinav Gupta
47
133
0
09 Aug 2017
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
82
1,530
0
13 Jun 2017
Collaborative Summarization of Topic-Related Videos
Yikang Shen
Amit K. Roy-Chowdhury
EgoV
47
79
0
09 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
221
8,012
0
22 May 2017
The Kinetics Human Action Video Dataset
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
...
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
235
3,801
0
19 May 2017
ActionVLAD: Learning spatio-temporal aggregation for action classification
Rohit Girdhar
Deva Ramanan
Abhinav Gupta
Josef Sivic
Bryan C. Russell
AI4TS
70
451
0
10 Apr 2017
UntrimmedNets for Weakly Supervised Action Recognition and Detection
Limin Wang
Yuanjun Xiong
Dahua Lin
Luc Van Gool
55
491
0
09 Mar 2017
Spatiotemporal Residual Networks for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
102
719
0
07 Nov 2016
Weakly supervised learning of actions from transcripts
Hilde Kuehne
Alexander Richard
Juergen Gall
49
118
0
07 Oct 2016
YouTube-8M: A Large-Scale Video Classification Benchmark
Sami Abu-El-Haija
Nisarg Kothari
Joonseok Lee
Apostol Natsev
G. Toderici
Balakrishnan Varadarajan
Sudheendra Vijayanarasimhan
VLM
136
1,268
0
27 Sep 2016
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
100
3,831
0
02 Aug 2016
1
2
Next