Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
v1
v2
v3 (latest)
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,647 papers shown
Title
Learning Class Regularized Features for Action Recognition
Alexandros Stergiou
R. Poppe
R. Veltkamp
21
3
0
07 Feb 2020
Solving Raven's Progressive Matrices with Neural Networks
Tao Zhuo
Mohan S. Kankanhalli
105
26
0
05 Feb 2020
Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks
M. Rashid
Hedvig Kjellström
Yong Jae Lee
WSOL
GNN
138
46
0
04 Feb 2020
Neural Sign Language Translation by Learning Tokenization
Alptekin Orbay
L. Akarun
SLR
68
73
0
02 Feb 2020
Interpreting video features: a comparison of 3D convolutional networks and convolutional LSTM networks
Joonatan Mänttäri
Sofia Broomé
John Folkesson
Hedvig Kjellström
FAtt
67
28
0
02 Feb 2020
ERA: A Dataset and Deep Learning Benchmark for Event Recognition in Aerial Videos
Lichao Mou
Yuansheng Hua
P. Jin
Xiaoxiang Zhu
AI4TS
119
45
0
30 Jan 2020
Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
Jonathan Munro
Dima Damen
EgoV
84
196
0
27 Jan 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
224
288
0
24 Jan 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
282
209
0
23 Jan 2020
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
Maja Pantic
238
241
0
23 Jan 2020
Detecting Deficient Coverage in Colonoscopies
Daniel Freedman
Yochai Blau
L. Katzir
Amit Aides
I. Shimshoni
...
Tomer Golany
A. Gordon
Greg S. Corrado
Yossi Matias
Ehud Rivlin
78
55
0
23 Jan 2020
Zero-Shot Activity Recognition with Videos
Evin Pınar Örnek
24
1
0
22 Jan 2020
Weakly Supervised Temporal Action Localization Using Deep Metric Learning
Ashraful Islam
Richard J. Radke
69
46
0
21 Jan 2020
A Comprehensive Study on Temporal Modeling for Online Action Detection
Wen Wang
Xiaojiang Peng
Yu Qiao
Jian Cheng
48
2
0
21 Jan 2020
The benefits of synthetic data for action categorization
Mohamad Ballout
Mohammad Tuqan
Daniel C. Asmar
Elie A. Shammas
George E. Sakr
39
6
0
20 Jan 2020
MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion
Kaiyu Shan
Yongtao Wang
Zhuoying Wang
Tingting Liang
Zhi Tang
Ying-Cong Chen
Yangyan Li
AI4TS
40
4
0
19 Jan 2020
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video
Jie Wu
Guanbin Li
Si Liu
Liang Lin
OffRL
71
104
0
18 Jan 2020
Temporal Interlacing Network
Hao Shao
Shengju Qian
Yu Liu
60
94
0
17 Jan 2020
Sideways: Depth-Parallel Training of Video Models
Mateusz Malinowski
G. Swirszcz
João Carreira
Viorica Patraucean
MDE
94
15
0
17 Jan 2020
Multi-step Joint-Modality Attention Network for Scene-Aware Dialogue System
Yun-Wei Chu
Kuan-Yen Lin
Chao-Chun Hsu
Lun-Wei Ku
137
22
0
17 Jan 2020
Spatio-Temporal Ranked-Attention Networks for Video Captioning
A. Cherian
Jue Wang
Chiori Hori
Tim K. Marks
AI4TS
49
19
0
17 Jan 2020
Learning Spatiotemporal Features via Video and Text Pair Discrimination
Tianhao Li
Limin Wang
VGen
68
57
0
16 Jan 2020
Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action Recognition
Li Tao
Xueting Wang
T. Yamasaki
3DPC
69
24
0
16 Jan 2020
EEV: A Large-Scale Dataset for Studying Evoked Expressions from Video
Jennifer J. Sun
Ting Liu
Alan S. Cowen
Florian Schroff
Hartwig Adam
Gautam Prasad
38
7
0
15 Jan 2020
Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors
Lei Wang
Piotr Koniusz
93
51
0
14 Jan 2020
Actions as Moving Points
Yixuan Li
Zixu Wang
Limin Wang
Gangshan Wu
163
106
0
14 Jan 2020
A Novel Inspection System For Variable Data Printing Using Deep Learning
O. Haik
Oded Perry
Eli Chen
Peter J. Klammer
38
4
0
13 Jan 2020
Compressive sensing based privacy for fall detection
Ronak Gupta
Prashant Anand
S. Chaudhury
Brejesh Lall
Sanjay Singh
15
5
0
10 Jan 2020
STAViS: Spatio-Temporal AudioVisual Saliency Network
A. Tsiami
Petros Koutras
Petros Maragos
99
73
0
09 Jan 2020
DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection
Liming Jiang
Ren Li
Wayne Wu
Chao Qian
Chen Change Loy
CVBM
PICV
137
447
0
09 Jan 2020
Res3ATN -- Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos
Naina Dhingra
A. Kunz
3DPC
SLR
80
36
0
04 Jan 2020
DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection
Ruben Tolosana
R. Vera-Rodríguez
Julian Fierrez
Aythami Morales
J. Ortega-Garcia
3DPC
CVBM
120
801
0
01 Jan 2020
Short-Term Temporal Convolutional Networks for Dynamic Hand Gesture Recognition
Yi Zhang
Chong Wang
Ye Zheng
Jieyu Zhao
Yuqi Li
Xijiong Xie
3DH
23
4
0
31 Dec 2019
DMCL: Distillation Multiple Choice Learning for Multimodal Action Recognition
Nuno C. Garcia
Sarah Adel Bargal
Vitaly Ablavsky
Pietro Morerio
Vittorio Murino
Stan Sclaroff
66
49
0
23 Dec 2019
Adversarial Cross-Domain Action Recognition with Co-Attention
Boxiao Pan
Zhangjie Cao
Ehsan Adeli
Juan Carlos Niebles
ViT
80
106
0
22 Dec 2019
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
64
176
0
20 Dec 2019
Analysis of Video Feature Learning in Two-Stream CNNs on the Example of Zebrafish Swim Bout Classification
Bennet Breier
A. Onken
36
4
0
20 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context
Philippe Weinzaepfel
Grégory Rogez
84
72
0
16 Dec 2019
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs
Jingwei Ji
Ranjay Krishna
Li Fei-Fei
Juan Carlos Niebles
94
346
0
15 Dec 2019
Multi-task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition
D. Luvizon
Hedi Tabia
David Picard
3DH
95
121
0
15 Dec 2019
Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
86
432
0
15 Dec 2019
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong
S. Schwarcz
Peng Xu
Davide D‘Ambrosio
Juhana Kangaspunta
A. Angelova
Huong Phan
Navdeep Jaitly
65
7
0
13 Dec 2019
Action Modifiers: Learning from Adverbs in Instructional Videos
Hazel Doughty
Ivan Laptev
W. Mayol-Cuevas
Dima Damen
103
30
0
13 Dec 2019
End-to-End Learning of Visual Representations from Uncurated Instructional Videos
Antoine Miech
Jean-Baptiste Alayrac
Lucas Smaira
Ivan Laptev
Josef Sivic
Andrew Zisserman
VGen
SSL
156
713
0
13 Dec 2019
Identity Preserve Transform: Understand What Activity Classification Models Have Learnt
Jialing Lyu
Weichao Qiu
Xinyue Wei
Yi Zhang
Alan Yuille
Zhengjun Zha
VLM
51
3
0
13 Dec 2019
VIBE: Video Inference for Human Body Pose and Shape Estimation
Muhammed Kocabas
Nikos Athanasiou
Michael J. Black
3DH
145
932
0
11 Dec 2019
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition
Jinwoo Choi
Chen Gao
Joseph C.E. Messou
Jia-Bin Huang
145
182
0
11 Dec 2019
HyperCon: Image-To-Video Model Transfer for Video-To-Video Translation Tasks
Ryan Szeto
Mostafa El-Khamy
Jungwon Lee
Jason J. Corso
DiffM
63
7
0
10 Dec 2019
Forecasting future action sequences with attention: a new approach to weakly supervised action forecasting
Yan Bin Ng
Basura Fernando
AI4TS
43
34
0
10 Dec 2019
Appending Adversarial Frames for Universal Video Attack
Zhikai Chen
Lingxi Xie
Shanmin Pang
Yong He
Qi Tian
AAML
70
34
0
10 Dec 2019
Previous
1
2
3
...
61
62
63
...
71
72
73
Next