ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.06567
  4. Cited By
A Comprehensive Study of Deep Video Action Recognition

A Comprehensive Study of Deep Video Action Recognition

11 December 2020
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
    VLM
    AI4TS
ArXivPDFHTML

Papers citing "A Comprehensive Study of Deep Video Action Recognition"

50 / 213 papers shown
Title
Evolving Space-Time Neural Architectures for Videos
Evolving Space-Time Neural Architectures for Videos
A. Piergiovanni
A. Angelova
Alexander Toshev
Michael S. Ryoo
67
57
0
26 Nov 2018
Self-Supervised Video Representation Learning with Space-Time Cubic
  Puzzles
Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles
Dahun Kim
Donghyeon Cho
In So Kweon
SSL
65
347
0
24 Nov 2018
TSM: Temporal Shift Module for Efficient Video Understanding
TSM: Temporal Shift Module for Efficient Video Understanding
Ji Lin
Chuang Gan
Song Han
85
1,683
0
20 Nov 2018
Random Temporal Skipping for Multirate Video Analysis
Random Temporal Skipping for Multirate Video Analysis
Yi Zhu
Shawn D. Newsam
38
14
0
30 Oct 2018
Representation Flow for Action Recognition
Representation Flow for Action Recognition
A. Piergiovanni
Michael S. Ryoo
75
147
0
02 Oct 2018
A Short Note about Kinetics-600
A Short Note about Kinetics-600
João Carreira
Eric Noland
Andras Banki-Horvath
Chloe Hillier
Andrew Zisserman
82
524
0
03 Aug 2018
Multi-Fiber Networks for Video Recognition
Multi-Fiber Networks for Video Recognition
Yunpeng Chen
Yannis Kalantidis
Jianshu Li
Shuicheng Yan
Jiashi Feng
CVBM
100
217
0
30 Jul 2018
W-TALC: Weakly-supervised Temporal Activity Localization and
  Classification
W-TALC: Weakly-supervised Temporal Activity Localization and Classification
S. Paul
Sourya Roy
Amit K. Roy-Chowdhury
74
307
0
27 Jul 2018
Motion Feature Network: Fixed Motion Filter for Action Recognition
Motion Feature Network: Fixed Motion Filter for Action Recognition
Myunggi Lee
Seungeui Lee
S. Son
Gyutae Park
Nojun Kwak
72
122
0
26 Jul 2018
AutoLoc: Weakly-supervised Temporal Action Localization
AutoLoc: Weakly-supervised Temporal Action Localization
Zheng Shou
Hang Gao
Lei Zhang
K. Miyazawa
Shih-Fu Chang
82
259
0
22 Jul 2018
Representation Learning with Contrastive Predictive Coding
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
284
10,253
0
10 Jul 2018
Adversarial Perturbations Against Real-Time Video Classification Systems
Adversarial Perturbations Against Real-Time Video Classification Systems
Shasha Li
Ajaya Neupane
S. Paul
Chengyu Song
S. Krishnamurthy
Amit K. Roy-Chowdhury
A. Swami
AAML
61
119
0
02 Jul 2018
Cooperative Learning of Audio and Video Models from Self-Supervised
  Synchronization
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization
Bruno Korbar
Du Tran
Lorenzo Torresani
93
474
0
30 Jun 2018
Human Action Recognition and Prediction: A Survey
Human Action Recognition and Prediction: A Survey
Yu Kong
Y. Fu
75
619
0
28 Jun 2018
DARTS: Differentiable Architecture Search
DARTS: Differentiable Architecture Search
Hanxiao Liu
Karen Simonyan
Yiming Yang
185
4,345
0
24 Jun 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
59
181
0
19 Jun 2018
Modality Distillation with Multiple Stream Networks for Action
  Recognition
Modality Distillation with Multiple Stream Networks for Action Recognition
Nuno C. Garcia
Pietro Morerio
Vittorio Murino
61
182
0
19 Jun 2018
Videos as Space-Time Region Graphs
Videos as Space-Time Region Graphs
Xinyu Wang
Abhinav Gupta
83
755
0
05 Jun 2018
Unsupervised Feature Learning via Non-Parametric Instance-level
  Discrimination
Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination
Zhirong Wu
Yuanjun Xiong
Stella X. Yu
Dahua Lin
SSL
170
3,450
0
05 May 2018
ECO: Efficient Convolutional Network for Online Video Understanding
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
180
498
0
24 Apr 2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Antoine Miech
Ivan Laptev
Josef Sivic
61
234
0
07 Apr 2018
End-to-End Learning of Motion Representation for Video Understanding
End-to-End Learning of Motion Representation for Video Understanding
Lijie Fan
Wen-bing Huang
Chuang Gan
Stefano Ermon
Boqing Gong
Junzhou Huang
66
214
0
02 Apr 2018
Towards Universal Representation for Unseen Action Recognition
Towards Universal Representation for Unseen Action Recognition
Yi Zhu
Yang Long
Yu Guan
Shawn D. Newsam
Ling Shao
AI4TS
80
104
0
22 Mar 2018
Unsupervised Representation Learning by Predicting Image Rotations
Unsupervised Representation Learning by Predicting Image Rotations
Spyros Gidaris
Praveer Singh
N. Komodakis
OOD
SSL
DRL
231
3,283
0
21 Mar 2018
Efficient Neural Architecture Search via Parameter Sharing
Efficient Neural Architecture Search via Parameter Sharing
Hieu H. Pham
M. Guan
Barret Zoph
Quoc V. Le
J. Dean
101
2,761
0
09 Feb 2018
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action
  Recognition
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
Sijie Yan
Yuanjun Xiong
Dahua Lin
GNN
228
4,150
0
23 Jan 2018
Moments in Time Dataset: one million videos for event understanding
Moments in Time Dataset: one million videos for event understanding
Mathew Monfort
A. Andonian
Bolei Zhou
K. Ramakrishnan
Sarah Adel Bargal
...
L. Brown
Quanfu Fan
Dan Gutfreund
Carl Vondrick
A. Oliva
90
545
0
09 Jan 2018
What have we learned from deep representations for action recognition?
What have we learned from deep representations for action recognition?
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
Andrew Zisserman
SSL
53
47
0
04 Jan 2018
Weakly Supervised Action Localization by Sparse Temporal Pooling Network
Weakly Supervised Action Localization by Sparse Temporal Pooling Network
P. Nguyen
Ting Liu
Gautam Prasad
Bohyung Han
WSOL
143
349
0
14 Dec 2017
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in
  Video Classification
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
137
1,325
0
13 Dec 2017
Compressed Video Action Recognition
Compressed Video Action Recognition
Chao-Yuan Wu
Manzil Zaheer
Hexiang Hu
R. Manmatha
Alex Smola
Philipp Krahenbuhl
131
325
0
02 Dec 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
196
3,021
0
30 Nov 2017
Optical Flow Guided Feature: A Fast and Robust Motion Representation for
  Video Action Recognition
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
Shuyang Sun
Zhanghui Kuang
Wanli Ouyang
Lu Sheng
Wayne Zhang
74
296
0
29 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Zhaofan Qiu
Ting Yao
Tao Mei
84
1,659
0
28 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
118
1,931
0
27 Nov 2017
Appearance-and-Relation Networks for Video Classification
Appearance-and-Relation Networks for Video Classification
Limin Wang
Wei Li
Wen Li
Luc Van Gool
65
351
0
24 Nov 2017
Temporal Relational Reasoning in Videos
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
91
1,037
0
22 Nov 2017
Temporal 3D ConvNets: New Architecture and Transfer Learning for Video
  Classification
Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
A. Karami
M. M. Arzani
Rahman Yousefzadeh
Luc Van Gool
62
242
0
22 Nov 2017
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
Bichen Wu
Alvin Wan
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Noah Golmant
A. Gholaminejad
Joseph E. Gonzalez
Kurt Keutzer
3DPC
61
363
0
22 Nov 2017
Non-local Neural Networks
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
263
8,888
0
21 Nov 2017
mixup: Beyond Empirical Risk Minimization
mixup: Beyond Empirical Risk Minimization
Hongyi Zhang
Moustapha Cissé
Yann N. Dauphin
David Lopez-Paz
NoLa
271
9,743
0
25 Oct 2017
Squeeze-and-Excitation Networks
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
382
26,365
0
05 Sep 2017
Improved Regularization of Convolutional Neural Networks with Cutout
Improved Regularization of Convolutional Neural Networks with Cutout
Terrance Devries
Graham W. Taylor
107
3,758
0
15 Aug 2017
Lattice Long Short-Term Memory for Human Action Recognition
Lattice Long Short-Term Memory for Human Action Recognition
Lin Sun
Kui Jia
Kevin Chen
Dit-Yan Yeung
Bertram E. Shi
Silvio Savarese
59
155
0
13 Aug 2017
Unsupervised Representation Learning by Sorting Sequences
Unsupervised Representation Learning by Sorting Sequences
Hsin-Ying Lee
Jia-Bin Huang
Maneesh Kumar Singh
Ming-Hsuan Yang
SSL
DRL
69
534
0
03 Aug 2017
Spatial-Aware Object Embeddings for Zero-Shot Localization and
  Classification of Actions
Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions
Pascal Mettes
Cees G. M. Snoek
50
88
0
28 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for
  Mobile Devices
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
AI4TS
132
6,850
0
04 Jul 2017
The "something something" video database for learning and evaluating
  visual common sense
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
82
1,529
0
13 Jun 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
628
130,942
0
12 Jun 2017
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
120
3,675
0
08 Jun 2017
Previous
12345
Next