1705.07750
Cited By
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,645 papers shown
Title
Unsupervised Learning of View-invariant Action Representations
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
SSL
97
100
0
06 Sep 2018
Hierarchical Video Understanding
F. Mahdisoltani
Roland Memisevic
David Fleet
15
1
0
04 Sep 2018
Top-down Attention Recurrent VLAD Encoding for Action Recognition in Videos
Swathikiran Sudhakaran
Oswald Lanz
41
6
0
29 Aug 2018
ARBEE: Towards Automated Recognition of Bodily Expression of Emotion In the Wild
Yu Luo
Jianbo Ye
Reginald B. Adams
Jia Li
M. Newman
Jianmin Wang
114
86
0
28 Aug 2018
Targeted Nonlinear Adversarial Perturbations in Images and Videos
R. Rey-de-Castro
H. Rabitz
AAML
81
10
0
27 Aug 2018
Predicting Action Tubes
Gurkirt Singh
Suman Saha
Fabio Cuzzolin
ViT
130
22
0
23 Aug 2018
Deep Adaptive Temporal Pooling for Activity Recognition
Sibo Song
Ngai-Man Cheung
V. Chandrasekhar
Bappaditya Mandal
81
16
0
22 Aug 2018
Video-to-Video Synthesis
Ting-Chun Wang
Ming-Yuan Liu
Jun-Yan Zhu
Guilin Liu
Andrew Tao
Jan Kautz
Bryan Catanzaro
GAN
VGen
137
992
0
20 Aug 2018
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos
Zhaoyang Zhang
Zhanghui Kuang
Ping Luo
Xue Jiang
Wayne Zhang
38
12
0
15 Aug 2018
Fast Video Shot Transition Localization with Deep Structured Models
Shitao Tang
Xue Jiang
Zhanghui Kuang
Yimin Chen
Wayne Zhang
57
46
0
13 Aug 2018
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Victor Escorcia
Ranjay Krishna
S. Buch
Cuong Duc Dao
107
65
0
11 Aug 2018
Dynamic Temporal Pyramid Network: A Closer Look at Multi-Scale Modeling for Activity Detection
Da Zhang
Xiyang Dai
Yuan-fang Wang
87
41
0
07 Aug 2018
A Short Note about Kinetics-600
João Carreira
Eric Noland
Andras Banki-Horvath
Chloe Hillier
Andrew Zisserman
108
529
0
03 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
186
79
0
03 Aug 2018
Learning Actionable Representations from Visual Observations
Debidatta Dwibedi
Jonathan Tompson
Corey Lynch
P. Sermanet
SSL
91
80
0
02 Aug 2018
TraMNet - Transition Matrix Network for Efficient Action Tube Proposals
Gurkirt Singh
Suman Saha
Fabio Cuzzolin
125
10
0
01 Aug 2018
Action Anticipation By Predicting Future Dynamic Images
Cristian Rodriguez-Opazo
Basura Fernando
Hongdong Li
58
65
0
01 Aug 2018
Analyzing Human-Human Interactions: A Survey
Alexandros Stergiou
R. Poppe
69
14
0
31 Jul 2018
Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition
Swathikiran Sudhakaran
Oswald Lanz
EgoV
68
82
0
31 Jul 2018
Multi-Fiber Networks for Video Recognition
Yunpeng Chen
Yannis Kalantidis
Jianshu Li
Shuicheng Yan
Jiashi Feng
CVBM
149
220
0
30 Jul 2018
Story Understanding in Video Advertisements
Keren Ye
Kyle Buettner
Adriana Kovashka
44
12
0
29 Jul 2018
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
141
221
0
28 Jul 2018
Diagnosing Error in Temporal Action Detectors
Humam Alwassel
Fabian Caba Heilbron
Victor Escorcia
Guohao Li
168
106
0
27 Jul 2018
W-TALC: Weakly-supervised Temporal Activity Localization and Classification
S. Paul
Sourya Roy
Amit K. Roy-Chowdhury
121
311
0
27 Jul 2018
A Better Baseline for AVA
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
85
67
0
26 Jul 2018
Motion Feature Network: Fixed Motion Filter for Action Recognition
Myunggi Lee
Seungeui Lee
S. Son
Gyutae Park
Nojun Kwak
103
124
0
26 Jul 2018
Contrastive Video Representation Learning via Adversarial Perturbations
Jue Wang
A. Cherian
21
1
0
24 Jul 2018
AutoLoc: Weakly-supervised Temporal Action Localization
Zheng Shou
Hang Gao
Lei Zhang
K. Miyazawa
Shih-Fu Chang
117
261
0
22 Jul 2018
Correlation Net: Spatiotemporal multimodal deep learning for action recognition
N. Yudistira
Takio Kurita
111
21
0
22 Jul 2018
S3D: Single Shot multi-Span Detector via Fully 3D Convolutional Networks
Da Zhang
Xiyang Dai
Xin Eric Wang
Yuan-fang Wang
3DPC
74
59
0
21 Jul 2018
Video Time: Properties, Encoders and Evaluation
Amir Ghodrati
E. Gavves
Cees G. M. Snoek
146
26
0
18 Jul 2018
Video-based Person Re-identification via 3D Convolutional Networks and Non-local Attention
Xingyu Liao
Lingxiao He
Zhouwang Yang
Chi Zhang
3DPC
80
73
0
12 Jul 2018
Sem-GAN: Semantically-Consistent Image-to-Image Translation
A. Cherian
Alan Sullivan
73
78
0
12 Jul 2018
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector
Jia-Xing Zhong
Nannan Li
Weijie Kong
Zhang Tao
Thomas H. Li
Ge Li
146
96
0
09 Jul 2018
Adversarial Perturbations Against Real-Time Video Classification Systems
Shasha Li
Ajaya Neupane
S. Paul
Chengyu Song
S. Krishnamurthy
Amit K. Roy-Chowdhury
A. Swami
AAML
93
121
0
02 Jul 2018
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization
Bruno Korbar
Du Tran
Lorenzo Torresani
107
477
0
30 Jun 2018
A flexible model for training action localization with varying levels of supervision
Guilhem Chéron
Jean-Baptiste Alayrac
Ivan Laptev
Cordelia Schmid
93
42
0
29 Jun 2018
Human Action Recognition and Prediction: A Survey
Yu Kong
Y. Fu
95
632
0
28 Jun 2018
Modeling Spatio-Temporal Human Track Structure for Action Localization
Guilhem Chéron
A. Osokin
Ivan Laptev
Cordelia Schmid
130
3
0
28 Jun 2018
Differentiable Learning-to-Normalize via Switchable Normalization
Ping Luo
Jiamin Ren
Zhanglin Peng
Ruimao Zhang
Jingyu Li
92
177
0
28 Jun 2018
Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition
Dongliang He
Fu Li
Qijie Zhao
Xiang Long
Yi Fu
Shilei Wen
75
18
0
27 Jun 2018
RUC+CMU: System Report for Dense Captioning Events in Videos
Shizhe Chen
Yuqing Song
Yida Zhao
Jiarong Qiu
Qin Jin
Alexander G. Hauptmann
33
7
0
22 Jun 2018
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features
Chiori Hori
Huda AlAmri
Jue Wang
Gordon Wichern
Takaaki Hori
...
Raphael Gontijo-Lopes
Abhishek Das
Irfan Essa
Dhruv Batra
Devi Parikh
VGen
81
125
0
21 Jun 2018
Learning Multimodal Representations for Unseen Activities
A. Piergiovanni
Michael S. Ryoo
SSL
50
4
0
21 Jun 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
99
183
0
19 Jun 2018
Modality Distillation with Multiple Stream Networks for Action Recognition
Nuno C. Garcia
Pietro Morerio
Vittorio Murino
89
184
0
19 Jun 2018
Deep Spatiotemporal Representation of the Face for Automatic Pain Intensity Estimation
M. Tavakolian
Abdenour Hadid
CVBM
MedIm
3DH
49
19
0
18 Jun 2018
Object Level Visual Reasoning in Videos
Fabien Baradel
Natalia Neverova
Christian Wolf
J. Mille
Greg Mori
101
164
0
16 Jun 2018
Qiniu Submission to ActivityNet Challenge 2018
Xiaoteng Zhang
Yixin Bao
Feiyun Zhang
Kaiqin Hu
Yicheng Wang
Liang Zhu
Qinzhu He
Yining Lin
Jie Shao
Yao Peng
3DPC
49
3
0
12 Jun 2018
Massively Parallel Video Networks
João Carreira
Viorica Patraucean
L. Mazaré
Andrew Zisserman
Simon Osindero
73
42
0
11 Jun 2018