Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
v1
v2
v3 (latest)
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,645 papers shown
Title
Efficient Video Classification Using Fewer Frames
S. Bhardwaj
Mukundhan Srinivasan
Mitesh M. Khapra
81
88
0
27 Feb 2019
Equi-normalization of Neural Networks
Pierre Stock
Benjamin Graham
Rémi Gribonval
Hervé Jégou
ODL
46
18
0
27 Feb 2019
STAR-Net: Action Recognition using Spatio-Temporal Activation Reprojection
William J. McNally
A. Wong
J. McPhee
HAI
3DH
65
26
0
26 Feb 2019
IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition
Ke Yang
Peng Qiao
Dongsheng Li
Y. Dou
ViT
59
8
0
26 Feb 2019
Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey
Longlong Jing
Yingli Tian
SSL
235
1,706
0
16 Feb 2019
Exploring Frame Segmentation Networks for Temporal Action Localization
Ke Yang
Xiaolong Shen
Peng Qiao
Shijie Li
Dongsheng Li
Y. Dou
68
10
0
14 Feb 2019
MOTS: Multi-Object Tracking and Segmentation
P. Voigtlaender
Michael Krause
Aljosa Osep
Jonathon Luiten
Berin Balachandar Gnana Sekar
Andreas Geiger
Bastian Leibe
VOT
94
581
0
10 Feb 2019
Saliency Tubes: Visual Explanations for Spatio-Temporal Convolutions
Alexandros Stergiou
G. Kapidis
Grigorios Kalliatakis
C. Chrysoulas
R. Veltkamp
R. Poppe
FAtt
69
47
0
04 Feb 2019
Differentiable Grammars for Videos
A. Piergiovanni
A. Angelova
Michael S. Ryoo
86
6
0
01 Feb 2019
Anomaly Locality in Video Surveillance
Federico Landi
Cees G. M. Snoek
Rita Cucchiara
74
54
0
29 Jan 2019
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
68
54
0
26 Jan 2019
Audio-Visual Scene-Aware Dialog
Huda AlAmri
Vincent Cartillier
Abhishek Das
Jue Wang
A. Cherian
...
Tim K. Marks
Chiori Hori
Peter Anderson
Stefan Lee
Devi Parikh
VGen
61
195
0
25 Jan 2019
Skeleton-based Action Recognition of People Handling Objects
Sunoh Kim
Kimin Yun
Jongyoul Park
J. Choi
61
38
0
21 Jan 2019
Semantic Image Networks for Human Action Recognition
Sunder Ali Khowaja
Seok-Lyong Lee
41
33
0
21 Jan 2019
DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition
Zheng Shou
Xudong Lin
Yannis Kalantidis
Laura Sevilla-Lara
Marcus Rohrbach
Shih-Fu Chang
Zhicheng Yan
VGen
111
120
0
11 Jan 2019
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals
Lu Jin
Zechao Li
Jinhui Tang
46
72
0
09 Jan 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
C. Chang
De-An Huang
Yanan Sui
Li Fei-Fei
Juan Carlos Niebles
145
157
0
09 Jan 2019
Thinking Outside the Pool: Active Training Image Creation for Relative Attributes
Aron Yu
Kristen Grauman
51
23
0
08 Jan 2019
Dynamics are Important for the Recognition of Equine Pain in Video
Sofia Broomé
K. Gleerup
P. Andersen
Hedvig Kjellström
67
26
0
07 Jan 2019
Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions
Yifei Huang
Zhenqiang Li
Minjie Cai
Yoichi Sato
EgoV
82
64
0
07 Jan 2019
Action2Vec: A Crossmodal Embedding Approach to Action Learning
Meera Hahn
Andrew Silva
James M. Rehg
80
58
0
02 Jan 2019
Actor Conditioned Attention Maps for Video Action Detection
Oytun Ulutan
S. Rallapalli
Mudhakar Srivatsa
Carlos Torres
B. S. Manjunath
65
42
0
30 Dec 2018
A Multi-Stream Convolutional Neural Network Framework for Group Activity Recognition
Sina Mokhtarzadeh Azar
Mina Ghadimi Atigh
A. Nickabadi
40
22
0
26 Dec 2018
Coupled Recurrent Network (CRN)
Lin Sun
Kui Jia
Yuejia Shen
Silvio Savarese
Dit-Yan Yeung
Bertram E. Shi
40
4
0
25 Dec 2018
EgoReID Dataset: Person Re-identification in Videos Acquired by Mobile Devices with First-Person Point-of-View
Emrah Basaran
Yonatan Tariku Tesfaye
M. Shah
55
0
0
22 Dec 2018
Temporal Hockey Action Recognition via Pose and Optical Flows
Zixi Cai
H. Neher
Kanav Vats
David A Clausi
John S. Zelek
45
39
0
22 Dec 2018
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
Guoyun Tu
Yanwei Fu
Boyang Albert Li
Jiarui Gao
Yu-Gang Jiang
Xiangyang Xue
36
29
0
21 Dec 2018
D3D: Distilled 3D Networks for Video Action Recognition
Jonathan C. Stroud
David A. Ross
Chen Sun
Jia Deng
Rahul Sukthankar
3DPC
58
160
0
19 Dec 2018
From FiLM to Video: Multi-turn Question Answering with Multi-modal Context
T. Nguyen
Shikhar Sharma
Hannes Schulz
Layla El Asri
69
33
0
17 Dec 2018
TAN: Temporal Aggregation Network for Dense Multi-label Action Recognition
Xiyang Dai
Bharat Singh
Joe Yue-Hei Ng
L. Davis
ViT
81
25
0
14 Dec 2018
Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training
Mahdi Abavisani
Hamid Reza Vaezi Joze
Vishal M. Patel
78
131
0
14 Dec 2018
Action Machine: Rethinking Action Recognition in Trimmed Videos
Jiagang Zhu
Wei Zou
Liang Xu
Yiming Hu
Zheng Zhu
Manyu Chang
Junjie Huang
Guan Huang
Dalong Du
97
37
0
14 Dec 2018
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
Hao Huang
Luowei Zhou
Wei Zhang
Jason J. Corso
Chenliang Xu
59
3
0
13 Dec 2018
The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos
Hazel Doughty
W. Mayol-Cuevas
Dima Damen
82
140
0
13 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
236
481
0
12 Dec 2018
Learning Discriminative Motion Features Through Detection
Gedas Bertasius
Christoph Feichtenhofer
Du Tran
Jianbo Shi
Lorenzo Torresani
77
15
0
11 Dec 2018
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
201
3,300
0
10 Dec 2018
A Structured Model For Action Detection
Yubo Zhang
P. Tokmakov
M. Hebert
Cordelia Schmid
118
101
0
09 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
91
20
0
07 Dec 2018
Tri-axial Self-Attention for Concurrent Activity Recognition
Yanyi Zhang
Xinyu Li
Kaixiang Huang
Yehan Wang
Shuhong Chen
I. Marsic
HAI
27
0
0
06 Dec 2018
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
183
709
0
06 Dec 2018
An Empirical Study towards Understanding How Deep Convolutional Nets Recognize Falls
Yan Zhang
Heiko Neumann
76
5
0
05 Dec 2018
The Visual Centrifuge: Model-Free Layered Video Representations
Jean-Baptiste Alayrac
João Carreira
Andrew Zisserman
87
48
0
04 Dec 2018
Timeception for Complex Action Recognition
Noureldien Hussein
E. Gavves
A. Smeulders
147
215
0
04 Dec 2018
Spatio-Temporal Action Graph Networks
Roei Herzig
Elad Levi
Huijuan Xu
Hang Gao
Eli Brosh
Xiaolong Wang
Amir Globerson
Trevor Darrell
GNN
79
20
0
04 Dec 2018
MS-ASL: A Large-Scale Data Set and Benchmark for Understanding American Sign Language
Hamid Reza Vaezi Joze
Oscar Koller
SLR
87
254
0
03 Dec 2018
SUSiNet: See, Understand and Summarize it
Petros Koutras
Petros Maragos
52
26
0
03 Dec 2018
Towards Accurate Generative Models of Video: A New Metric & Challenges
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
101
748
0
03 Dec 2018
Pedestrian Detection with Autoregressive Network Phases
Garrick Brazil
Xiaoming Liu
83
72
0
02 Dec 2018
Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries
Bruce McIntosh
Kevin Duarte
Yogesh S Rawat
M. Shah
MedIm
64
17
0
02 Dec 2018
Previous
1
2
3
...
68
69
70
71
72
73
Next