Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
v1
v2
v3 (latest)
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
45 / 3,645 papers shown
Title
Fully-Coupled Two-Stream Spatiotemporal Networks for Extremely Low Resolution Action Recognition
Mingze Xu
Aidean Sharghi
Xin Chen
David J. Crandall
EgoV
52
30
0
11 Jan 2018
Moments in Time Dataset: one million videos for event understanding
Mathew Monfort
A. Andonian
Bolei Zhou
K. Ramakrishnan
Sarah Adel Bargal
...
L. Brown
Quanfu Fan
Dan Gutfreund
Carl Vondrick
A. Oliva
122
553
0
09 Jan 2018
What have we learned from deep representations for action recognition?
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
Andrew Zisserman
SSL
88
47
0
04 Jan 2018
Detect-and-Track: Efficient Pose Estimation in Videos
Rohit Girdhar
Georgia Gkioxari
Lorenzo Torresani
Manohar Paluri
Du Tran
3DH
120
230
0
26 Dec 2017
On the Integration of Optical Flow and Action Recognition
Laura Sevilla-Lara
Yiyi Liao
Fatma Guney
Varun Jampani
Andreas Geiger
Michael J. Black
149
196
0
22 Dec 2017
Weakly Supervised Action Localization by Sparse Temporal Pooling Network
P. Nguyen
Ting Liu
Gautam Prasad
Bohyung Han
WSOL
205
351
0
14 Dec 2017
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
187
1,336
0
13 Dec 2017
From Lifestyle Vlogs to Everyday Interactions
David Fouhey
Weicheng Kuo
Alexei A. Efros
Jitendra Malik
84
125
0
06 Dec 2017
Learning Latent Super-Events to Detect Multiple Activities in Videos
A. Piergiovanni
Michael S. Ryoo
79
90
0
05 Dec 2017
Compressed Video Action Recognition
Chao-Yuan Wu
Manzil Zaheer
Hexiang Hu
R. Manmatha
Alex Smola
Philipp Krahenbuhl
184
325
0
02 Dec 2017
Graph Distillation for Action Detection with Privileged Modalities
Zelun Luo
Jun-Ting Hsieh
Lu Jiang
Juan Carlos Niebles
Li Fei-Fei
107
104
0
30 Nov 2017
Budget-Aware Activity Detection with A Recurrent Policy Network
Behrooz Mahasseni
Xiaodong Yang
Pavlo Molchanov
Jan Kautz
72
6
0
30 Nov 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
258
3,042
0
30 Nov 2017
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
Shuyang Sun
Zhanghui Kuang
Wanli Ouyang
Lu Sheng
Wayne Zhang
94
297
0
29 Nov 2017
Revisiting hand-crafted feature for action recognition: a set of improved dense trajectories
K. Matsui
Toru Tamaki
Gwladys Auffret
B. Raytchev
K. Kaneda
28
0
0
28 Nov 2017
Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture
Katsunori Ohnishi
Shohei Yamamoto
Yoshitaka Ushiku
Tatsuya Harada
VGen
GAN
81
60
0
27 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
135
1,937
0
27 Nov 2017
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
Xiang Long
Chuang Gan
Gerard de Melo
Jiajun Wu
Xiao-Chang Liu
Shilei Wen
104
209
0
27 Nov 2017
Appearance-and-Relation Networks for Video Classification
Limin Wang
Wei Li
Wen Li
Luc Van Gool
100
352
0
24 Nov 2017
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
149
1,043
0
22 Nov 2017
Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
A. Karami
M. M. Arzani
Rahman Yousefzadeh
Luc Van Gool
86
242
0
22 Nov 2017
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
366
8,940
0
21 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
83
145
0
16 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition
Jiagang Zhu
Wei Zou
Zheng Zhu
92
89
0
11 Nov 2017
Attentional Pooling for Action Recognition
Rohit Girdhar
Deva Ramanan
138
321
0
04 Nov 2017
Multi-modal Aggregation for Video Classification
Chen Chen
Xiaowei Zhao
Yang Liu
33
1
0
27 Oct 2017
PoseTrack: A Benchmark for Human Pose Estimation and Tracking
Mykhaylo Andriluka
Umar Iqbal
Eldar Insafutdinov
L. Pishchulin
Anton Milan
Juergen Gall
Bernt Schiele
126
461
0
27 Oct 2017
ActivityNet Challenge 2017 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Ranjay Krishna
Victor Escorcia
Kenji Hata
S. Buch
110
48
0
22 Oct 2017
Human Activity Recognition Using Robust Adaptive Privileged Probabilistic Learning
Michalis Vrigkas
Evangelos Kazakos
Christophoros Nikou
I. Kakadiaris
106
1
0
19 Sep 2017
Two-stream Flow-guided Convolutional Attention Networks for Action Recognition
An Tran
L. Cheong
54
54
0
30 Aug 2017
Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
91
607
0
25 Aug 2017
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Victor Campos
Brendan Jou
Xavier Giró-i-Nieto
Jordi Torres
Shih-Fu Chang
84
219
0
22 Aug 2017
ConvNet Architecture Search for Spatiotemporal Feature Learning
Du Tran
Jamie Ray
Zheng Shou
Shih-Fu Chang
Manohar Paluri
3DPC
126
385
0
16 Aug 2017
Learnable pooling with Context Gating for video classification
Antoine Miech
Ivan Laptev
Josef Sivic
87
327
0
21 Jun 2017
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Chunhui Gu
Chen Sun
David A. Ross
Carl Vondrick
C. Pantofaru
...
G. Toderici
Susanna Ricco
Rahul Sukthankar
Cordelia Schmid
Jitendra Malik
VGen
167
1,033
0
23 May 2017
The Kinetics Human Action Video Dataset
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
...
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
278
3,824
0
19 May 2017
Temporal Segment Networks for Action Recognition in Videos
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
131
816
0
08 May 2017
Second-order Temporal Pooling for Action Recognition
A. Cherian
Stephen Gould
EgoV
42
29
0
23 Apr 2017
Temporal Action Detection with Structured Segment Networks
Yue Zhao
Yuanjun Xiong
Limin Wang
Zhirong Wu
Xiaoou Tang
Dahua Lin
145
916
0
20 Apr 2017
Hidden Two-Stream Convolutional Networks for Action Recognition
Yi Zhu
Zhenzhong Lan
Shawn D. Newsam
Alexander G. Hauptmann
122
282
0
02 Apr 2017
Semantic Video Segmentation by Gated Recurrent Flow Propagation
David Nilsson
C. Sminchisescu
192
224
0
28 Dec 2016
ActionFlowNet: Learning Motion Representation for Action Recognition
Joe Yue-Hei Ng
Jonghyun Choi
J. Neumann
L. Davis
87
121
0
09 Dec 2016
Action Recognition with Dynamic Image Networks
Hakan Bilen
Basura Fernando
Efstratios Gavves
Andrea Vedaldi
FAtt
91
224
0
02 Dec 2016
Human Action Recognition without Human
Yun He
Soma Shirakabe
Y. Satoh
Hirokatsu Kataoka
83
43
0
29 Aug 2016
Discriminatively Trained Latent Ordinal Model for Video Classification
Karan Sikka
Gaurav Sharma
55
11
0
08 Aug 2016
Previous
1
2
3
...
71
72
73