1705.06950
Cited By
The Kinetics Human Action Video Dataset
19 May 2017
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
Papers citing
"The Kinetics Human Action Video Dataset"
50 / 2,016 papers shown
Title
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
24
173
0
20 Dec 2019
Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition
Konstantinos Papadopoulos
Enjie Ghorbel
Djamila Aouada
Björn E. Ottersten
GNN
93
42
0
20 Dec 2019
Segmentations-Leak: Membership Inference Attacks and Defenses in Semantic Image Segmentation
Yang He
Shadi Rahimian
Bernt Schiele
Mario Fritz
MIACV
21
49
0
20 Dec 2019
Self-Attention Network for Skeleton-based Human Action Recognition
Sangwoo Cho
M. H. Maqbool
Fei Liu
H. Foroosh
3DH
22
71
0
18 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context
Philippe Weinzaepfel
Grégory Rogez
19
71
0
16 Dec 2019
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs
Jingwei Ji
Ranjay Krishna
Li Fei-Fei
Juan Carlos Niebles
39
336
0
15 Dec 2019
Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
33
420
0
15 Dec 2019
Action Modifiers: Learning from Adverbs in Instructional Videos
Hazel Doughty
Ivan Laptev
W. Mayol-Cuevas
Dima Damen
27
30
0
13 Dec 2019
VIBE: Video Inference for Human Body Pose and Shape Estimation
Muhammed Kocabas
Nikos Athanasiou
Michael J. Black
3DH
28
917
0
11 Dec 2019
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition
Jinwoo Choi
Chen Gao
Joseph C.E. Messou
Jia-Bin Huang
24
177
0
11 Dec 2019
PuckNet: Estimating hockey puck location from broadcast video
Kanav Vats
William J. McNally
Chris Dulhanty
Z. Q. Lin
David A Clausi
John S. Zelek
8
7
0
11 Dec 2019
Context-Dependent Models for Predicting and Characterizing Facial Expressiveness
Victoria Lin
J. Girard
Louis-Philippe Morency
16
8
0
10 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
29
251
0
10 Dec 2019
HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN
Paritosh Parmar
B. Morris
3DPC
18
9
0
10 Dec 2019
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
24
9
0
09 Dec 2019
Synthetic Humans for Action Recognition from Unseen Viewpoints
Gül Varol
Ivan Laptev
Cordelia Schmid
Andrew Zisserman
33
96
0
09 Dec 2019
VideoDG: Generalizing Temporal Relations in Videos to Novel Domains
Zhiyu Yao
Yunbo Wang
Jianmin Wang
Philip S. Yu
Mingsheng Long
OOD
ViT
32
23
0
08 Dec 2019
ClusterFit: Improving Generalization of Visual Representations
Xueting Yan
Ishan Misra
Abhinav Gupta
Deepti Ghadiyaram
D. Mahajan
SSL
VLM
27
132
0
06 Dec 2019
A Multigrid Method for Efficiently Training Video Models
Chaoxia Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
32
94
0
02 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
32
126
0
02 Dec 2019
Probing the State of the Art: A Critical Look at Visual Representation Evaluation
Cinjon Resnick
Zeping Zhan
Joan Bruna
AI4TS
20
12
0
30 Nov 2019
Multimodal Machine Translation through Visuals and Speech
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
49
73
0
28 Nov 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Guohao Li
Du Tran
SSL
42
428
0
28 Nov 2019
Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
22
27
0
28 Nov 2019
Non-Autoregressive Coarse-to-Fine Video Captioning
Bang-ju Yang
Yuexian Zou
Fenglin Liu
Can Zhang
27
11
0
27 Nov 2019
G-TAD: Sub-Graph Localization for Temporal Action Detection
Mengmeng Xu
Chen Zhao
D. Rojas
Ali K. Thabet
Guohao Li
39
435
0
26 Nov 2019
Learning Efficient Video Representation with Video Shuffle Networks
Pingchuan Ma
Yao Zhou
Yu Lu
Wayne Zhang
27
7
0
26 Nov 2019
Oops! Predicting Unintentional Action in Video
Dave Epstein
Boyuan Chen
Carl Vondrick
27
99
0
25 Nov 2019
TEINet: Towards an Efficient Architecture for Video Recognition
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
36
236
0
21 Nov 2019
MMTM: Multimodal Transfer Module for CNN Fusion
Hamid Reza Vaezi Joze
Amirreza Shaban
Michael L. Iuzzolino
K. Koishida
18
277
0
20 Nov 2019
Mimic The Raw Domain: Accelerating Action Recognition in the Compressed Domain
Barak Battash
H. Barad
Hanlin Tang
Amit Bleiweiss
16
30
0
19 Nov 2019
SMART: Skeletal Motion Action Recognition aTtack
He Wang
Feixiang He
Zexi Peng
Yong-Liang Yang
Tianjia Shao
Kun Zhou
David C. Hogg
AAML
31
5
0
16 Nov 2019
Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism
Prashant Pandey
Prathosh A. P.
Manu Kohli
Joshua K. Pritchard
24
33
0
11 Nov 2019
Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural Searching
Wei Peng
Xiaopeng Hong
Haoyu Chen
Guoying Zhao
GNN
40
323
0
11 Nov 2019
Are we asking the right questions in MovieQA?
Bhavan A. Jasani
Rohit Girdhar
Deva Ramanan
11
15
0
08 Nov 2019
A Spectral Nonlocal Block for Neural Networks
Lei Zhu
Qi She
Lidan Zhang
Ping Guo
18
2
0
04 Nov 2019
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding
Mathew Monfort
Bowen Pan
K. Ramakrishnan
A. Andonian
Barry A. McNamara
A. Lascelles
Quanfu Fan
Dan Gutfreund
Rogerio Feris
A. Oliva
VLM
14
68
0
01 Nov 2019
Chirality Nets for Human Pose Regression
Raymond A. Yeh
Yuan-Ting Hu
Alex Schwing
3DH
22
48
0
31 Oct 2019
A Self Validation Network for Object-Level Human Attention Estimation
Zehua Zhang
Chen Yu
David J. Crandall
EgoV
30
10
0
31 Oct 2019
Comprehensive Video Understanding: Video summarization with content-based video recommender design
Yudong Jiang
Kaixu Cui
B. Peng
Changliang Xu
BDL
20
28
0
30 Oct 2019
Predictive Coding Networks Meet Action Recognition
Xia Huang
Hossein Mousavi
Gemma Roig
21
1
0
22 Oct 2019
Volterra Neural Networks (VNNs)
Siddharth Roheda
Hamid Krim
19
10
0
21 Oct 2019
Adaptive and Iteratively Improving Recurrent Lateral Connections
Barak Battash
Lior Wolf
25
2
0
16 Oct 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Rohit Girdhar
Deva Ramanan
22
176
0
10 Oct 2019
Human Action Sequence Classification
Yan Bin Ng
Basura Fernando
30
4
0
07 Oct 2019
Unsupervised Keypoint Learning for Guiding Class-Conditional Video Prediction
G. Zucatelli
Seonghyeon Nam
R. Coelho
Seon Joo Kim
16
59
0
04 Oct 2019
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
Kexin Yi
Yuta Saito
Yunzhu Li
Pushmeet Kohli
Jiajun Wu
Antonio Torralba
J. Tenenbaum
NAI
43
457
0
03 Oct 2019
Learning Temporal Action Proposals With Fewer Labels
Jingwei Ji
Kaidi Cao
Juan Carlos Niebles
6
36
0
03 Oct 2019
Training Kinetics in 15 Minutes: Large-scale Distributed Training on Videos
Ji Lin
Chuang Gan
Song Han
12
10
0
01 Oct 2019
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Alexandros Stergiou
R. Poppe
3DH
20
19
0
30 Sep 2019