Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 1,387 papers shown
Title
Learning to Steer by Mimicking Features from Heterogeneous Auxiliary Networks
Yuenan Hou
Zheng Ma
Slobodan Mitrovic
Chen Change Loy
26
64
0
07 Nov 2018
Learning with privileged information via adversarial discriminative modality distillation
Nuno C. Garcia
Pietro Morerio
Vittorio Murino
27
67
0
19 Oct 2018
Morph: Flexible Acceleration for 3D CNN-based Video Understanding
Kartik Hegde
R. Agrawal
Yulun Yao
Christopher W. Fletcher
30
71
0
16 Oct 2018
Towards High Resolution Video Generation with Progressive Growing of Sliced Wasserstein GANs
Dinesh Acharya
Zhiwu Huang
D. Paudel
Luc Van Gool
GAN
11
68
0
04 Oct 2018
Representation Flow for Action Recognition
A. Piergiovanni
Michael S. Ryoo
30
146
0
02 Oct 2018
Interpretable Spatio-temporal Attention for Video Action Recognition
Lili Meng
Bo Zhao
B. Chang
Gao Huang
Wei Sun
Fred Tung
Leonid Sigal
33
83
0
01 Oct 2018
Non-local NetVLAD Encoding for Video Classification
Yongyi Tang
Xing Zhang
Jingwen Wang
Shaoxiang Chen
Lin Ma
Yu-Gang Jiang
19
41
0
29 Sep 2018
Rate-Accuracy Trade-Off In Video Classification With Deep Convolutional Neural Networks
M. Jubran
Alhabib Abbas
Aaron Chadha
Y. Andreopoulos
10
12
0
27 Sep 2018
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification
Jinlai Liu
Zehuan Yuan
Changhu Wang
24
9
0
16 Sep 2018
FIVR: Fine-grained Incident Video Retrieval
Giorgos Kordopatis-Zilos
Symeon Papadopoulos
Ioannis Patras
I. Kompatsiaris
16
60
0
11 Sep 2018
Using phase instead of optical flow for action recognition
Omar Hommos
S. Pintea
Pascal Mettes
Jan van Gemert
45
13
0
10 Sep 2018
Unsupervised Learning of View-invariant Action Representations
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
SSL
32
99
0
06 Sep 2018
Hierarchical Video Understanding
F. Mahdisoltani
Roland Memisevic
David Fleet
8
1
0
04 Sep 2018
ARBEE: Towards Automated Recognition of Bodily Expression of Emotion In the Wild
Yu Luo
Jianbo Ye
Reginald B. Adams
Jia Li
M. Newman
Jianmin Wang
56
86
0
28 Aug 2018
Targeted Nonlinear Adversarial Perturbations in Images and Videos
R. Rey-de-Castro
H. Rabitz
AAML
16
10
0
27 Aug 2018
Predicting Action Tubes
Gurkirt Singh
Suman Saha
Fabio Cuzzolin
ViT
27
22
0
23 Aug 2018
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos
Zhaoyang Zhang
Zhanghui Kuang
Ping Luo
Xue Jiang
Wayne Zhang
19
12
0
15 Aug 2018
Fast Video Shot Transition Localization with Deep Structured Models
Shitao Tang
Xue Jiang
Zhanghui Kuang
Yimin Chen
Wayne Zhang
27
45
0
13 Aug 2018
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Victor Escorcia
Ranjay Krishna
S. Buch
Cuong Duc Dao
42
65
0
11 Aug 2018
Dynamic Temporal Pyramid Network: A Closer Look at Multi-Scale Modeling for Activity Detection
Da Zhang
Xiyang Dai
Yuan-fang Wang
24
41
0
07 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
81
79
0
03 Aug 2018
Learning Actionable Representations from Visual Observations
Debidatta Dwibedi
Jonathan Tompson
Corey Lynch
P. Sermanet
SSL
22
80
0
02 Aug 2018
Multi-Fiber Networks for Video Recognition
Yunpeng Chen
Yannis Kalantidis
Jianshu Li
Shuicheng Yan
Jiashi Feng
CVBM
19
216
0
30 Jul 2018
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
41
220
0
28 Jul 2018
Diagnosing Error in Temporal Action Detectors
Humam Alwassel
Fabian Caba Heilbron
Victor Escorcia
Guohao Li
43
106
0
27 Jul 2018
A Better Baseline for AVA
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
18
66
0
26 Jul 2018
Motion Feature Network: Fixed Motion Filter for Action Recognition
Myunggi Lee
Seungeui Lee
S. Son
Gyutae Park
Nojun Kwak
30
121
0
26 Jul 2018
Correlation Net: Spatiotemporal multimodal deep learning for action recognition
N. Yudistira
Takio Kurita
29
21
0
22 Jul 2018
Video-based Person Re-identification via 3D Convolutional Networks and Non-local Attention
Xingyu Liao
Lingxiao He
Zhouwang Yang
Chi Zhang
3DPC
27
72
0
12 Jul 2018
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector
Jia-Xing Zhong
Nannan Li
Weijie Kong
Zhang Tao
Thomas H. Li
Ge Li
14
93
0
09 Jul 2018
Differentiable Learning-to-Normalize via Switchable Normalization
Ping Luo
Jiamin Ren
Zhanglin Peng
Ruimao Zhang
Jingyu Li
11
176
0
28 Jun 2018
RUC+CMU: System Report for Dense Captioning Events in Videos
Shizhe Chen
Yuqing Song
Yida Zhao
Jiarong Qiu
Qin Jin
Alexander G. Hauptmann
19
7
0
22 Jun 2018
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features
Chiori Hori
Huda AlAmri
Jue Wang
Gordon Wichern
Takaaki Hori
...
Raphael Gontijo-Lopes
Abhishek Das
Irfan Essa
Dhruv Batra
Devi Parikh
VGen
18
125
0
21 Jun 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
26
181
0
19 Jun 2018
Modality Distillation with Multiple Stream Networks for Action Recognition
Nuno C. Garcia
Pietro Morerio
Vittorio Murino
30
181
0
19 Jun 2018
Deep Spatiotemporal Representation of the Face for Automatic Pain Intensity Estimation
M. Tavakolian
Abdenour Hadid
CVBM
MedIm
3DH
18
19
0
18 Jun 2018
Object Level Visual Reasoning in Videos
Fabien Baradel
Natalia Neverova
Christian Wolf
J. Mille
Greg Mori
24
163
0
16 Jun 2018
Videos as Space-Time Region Graphs
Xueliang Wang
Abhinav Gupta
36
752
0
05 Jun 2018
VideoCapsuleNet: A Simplified Network for Action Detection
Kevin Duarte
Yogesh S Rawat
M. Shah
MedIm
29
165
0
21 May 2018
DenseImage Network: Video Spatial-Temporal Evolution Encoding and Understanding
Xiaokai Chen
Ke Gao
VGen
21
5
0
19 May 2018
Exploring the Limits of Weakly Supervised Pretraining
D. Mahajan
Ross B. Girshick
Vignesh Ramanathan
Kaiming He
Manohar Paluri
Yixuan Li
Ashwin R. Bharambe
L. V. D. van der Maaten
VLM
98
1,356
0
02 May 2018
Actor and Observer: Joint Modeling of First and Third-Person Videos
Gunnar A. Sigurdsson
Abhinav Gupta
Cordelia Schmid
Ali Farhadi
Alahari Karteek
EgoV
27
154
0
25 Apr 2018
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
142
496
0
24 Apr 2018
Video Compression through Image Interpolation
Chao-Yuan Wu
Nayan Singhal
Philipp Krahenbuhl
VGen
34
317
0
18 Apr 2018
PM-GANs: Discriminative Representation Learning for Action Recognition Using Partial-modalities
Lan Wang
Chenqiang Gao
Luyu Yang
Yue Zhao
W. Zuo
Deyu Meng
GAN
23
21
0
17 Apr 2018
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos
Silvio Giancola
Mohieddine Amine
Tarek Dghaily
Guohao Li
AI4TS
21
194
0
12 Apr 2018
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Andrew Owens
Alexei A. Efros
SSL
51
745
0
10 Apr 2018
Fine-grained Activity Recognition in Baseball Videos
A. Piergiovanni
Michael S. Ryoo
27
74
0
09 Apr 2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Antoine Miech
Ivan Laptev
Josef Sivic
22
233
0
07 Apr 2018
When will you do what? - Anticipating Temporal Occurrences of Activities
Yazan Abu Farha
Alexander Richard
Juergen Gall
30
189
0
03 Apr 2018
Previous
1
2
3
...
26
27
28
Next