Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 1,387 papers shown
Title
BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames
Brent A. Griffin
Jason J. Corso
VOS
30
42
0
28 Mar 2019
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Yao-Hung Hubert Tsai
S. Divvala
Louis-Philippe Morency
Ruslan Salakhutdinov
Ali Farhadi
27
103
0
25 Mar 2019
StartNet: Online Detection of Action Start in Untrimmed Videos
M. Gao
Mingze Xu
L. Davis
R. Socher
Caiming Xiong
38
51
0
23 Mar 2019
On the Importance of Video Action Recognition for Visual Lipreading
Xinshuo Weng
25
3
0
22 Mar 2019
Forecasting Time-to-Collision from Monocular Video: Feasibility, Dataset, and Challenges
A. Manglik
Xinshuo Weng
Eshed Ohn-Bar
Kris Kitani
27
15
0
21 Mar 2019
Cross-task weakly supervised learning from instructional videos
Dimitri Zhukov
Jean-Baptiste Alayrac
R. G. Cinbis
David Fouhey
Ivan Laptev
Josef Sivic
SSL
25
243
0
19 Mar 2019
Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
Jia-Xing Zhong
Nannan Li
Weijie Kong
Shan Liu
Thomas H. Li
Ge Li
NoLa
SSL
24
397
0
18 Mar 2019
Investigation on Combining 3D Convolution of Image Data and Optical Flow to Generate Temporal Action Proposals
Patrick Schlosser
David Münch
Michael Arens
3DPC
18
3
0
11 Mar 2019
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Yansong Tang
Dajun Ding
Yongming Rao
Yu Zheng
Danyang Zhang
Lili Zhao
Jiwen Lu
Jie Zhou
18
305
0
07 Mar 2019
Video-based surgical skill assessment using 3D convolutional neural networks
Isabel Funke
S. T. Mees
Jürgen Weitz
Stefanie Speidel
16
173
0
06 Mar 2019
MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation
Yazan Abu Farha
Juergen Gall
28
651
0
05 Mar 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
27
82
0
04 Mar 2019
Less is More: Learning Highlight Detection from Video Duration
Bo Xiong
Yannis Kalantidis
Deepti Ghadiyaram
Kristen Grauman
14
108
0
03 Mar 2019
Efficient Video Classification Using Fewer Frames
S. Bhardwaj
Mukundhan Srinivasan
Mitesh M. Khapra
40
88
0
27 Feb 2019
IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition
Ke Yang
Peng Qiao
Dongsheng Li
Y. Dou
ViT
35
8
0
26 Feb 2019
Exploring Frame Segmentation Networks for Temporal Action Localization
Ke Yang
Xiaolong Shen
Peng Qiao
Shijie Li
Dongsheng Li
Y. Dou
41
10
0
14 Feb 2019
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
27
54
0
26 Jan 2019
Audio-Visual Scene-Aware Dialog
Huda AlAmri
Vincent Cartillier
Abhishek Das
Jue Wang
A. Cherian
...
Tim K. Marks
Chiori Hori
Peter Anderson
Stefan Lee
Devi Parikh
VGen
25
189
0
25 Jan 2019
Skeleton-based Action Recognition of People Handling Objects
Sunoh Kim
Kimin Yun
Jongyoul Park
J. Choi
29
37
0
21 Jan 2019
Semantic Image Networks for Human Action Recognition
Sunder Ali Khowaja
Seok-Lyong Lee
21
32
0
21 Jan 2019
DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition
Zheng Shou
Xudong Lin
Yannis Kalantidis
Laura Sevilla-Lara
Marcus Rohrbach
Shih-Fu Chang
Zhicheng Yan
VGen
37
120
0
11 Jan 2019
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals
Lu Jin
Zechao Li
Jinhui Tang
16
71
0
09 Jan 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
C. Chang
De-An Huang
Yanan Sui
Li Fei-Fei
Juan Carlos Niebles
22
156
0
09 Jan 2019
Dynamics are Important for the Recognition of Equine Pain in Video
Sofia Broomé
K. Gleerup
P. Andersen
Hedvig Kjellström
35
26
0
07 Jan 2019
EgoReID Dataset: Person Re-identification in Videos Acquired by Mobile Devices with First-Person Point-of-View
Emrah Basaran
Yonatan Tariku Tesfaye
M. Shah
27
0
0
22 Dec 2018
Temporal Hockey Action Recognition via Pose and Optical Flows
Zixi Cai
H. Neher
Kanav Vats
David A Clausi
John S. Zelek
11
38
0
22 Dec 2018
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
Guoyun Tu
Yanwei Fu
Boyang Albert Li
Jiarui Gao
Yu-Gang Jiang
Xiangyang Xue
11
29
0
21 Dec 2018
From FiLM to Video: Multi-turn Question Answering with Multi-modal Context
T. Nguyen
Shikhar Sharma
Hannes Schulz
Layla El Asri
15
33
0
17 Dec 2018
TAN: Temporal Aggregation Network for Dense Multi-label Action Recognition
Xiyang Dai
Bharat Singh
Joe Yue-Hei Ng
L. Davis
ViT
32
25
0
14 Dec 2018
Action Machine: Rethinking Action Recognition in Trimmed Videos
Jiagang Zhu
Wei Zou
Liang Xu
Yiming Hu
Zheng Zhu
Manyu Chang
Junjie Huang
Guan Huang
Dalong Du
35
37
0
14 Dec 2018
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
Hao Huang
Luowei Zhou
Wei Zhang
Jason J. Corso
Chenliang Xu
24
3
0
13 Dec 2018
The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos
Hazel Doughty
W. Mayol-Cuevas
Dima Damen
36
138
0
13 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
62
477
0
12 Dec 2018
A Structured Model For Action Detection
Yubo Zhang
P. Tokmakov
M. Hebert
Cordelia Schmid
28
101
0
09 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
36
20
0
07 Dec 2018
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
28
702
0
06 Dec 2018
An Empirical Study towards Understanding How Deep Convolutional Nets Recognize Falls
Yan Zhang
Heiko Neumann
26
5
0
05 Dec 2018
The Visual Centrifuge: Model-Free Layered Video Representations
Jean-Baptiste Alayrac
João Carreira
Andrew Zisserman
21
48
0
04 Dec 2018
Timeception for Complex Action Recognition
Noureldien Hussein
E. Gavves
A. Smeulders
21
212
0
04 Dec 2018
Towards Accurate Generative Models of Video: A New Metric & Challenges
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
27
691
0
03 Dec 2018
Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision
M. Joneidi
Alireza Zaeemzadeh
Nazanin Rahnavard
M. Shah
30
13
0
29 Nov 2018
A Coarse-to-fine Deep Convolutional Neural Network Framework for Frame Duplication Detection and Localization in Forged Videos
Chengjiang Long
Arslan Basharat
A. Hoogs
14
5
0
27 Nov 2018
Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles
Dahun Kim
Donghyeon Cho
In So Kweon
SSL
17
343
0
24 Nov 2018
Driver Behavior Recognition via Interwoven Deep Convolutional Neural Nets with Multi-stream Inputs
Chaoyun Zhang
Rui Li
Woojin Kim
Daesub Yoon
P. Patras
31
49
0
22 Nov 2018
Rethinking ImageNet Pre-training
Kaiming He
Ross B. Girshick
Piotr Dollár
VLM
SSeg
47
1,077
0
21 Nov 2018
Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection
Khoi-Nguyen C. Mac
D. Joshi
Raymond A. Yeh
Jinjun Xiong
Rogerio Feris
Minh Do
27
42
0
21 Nov 2018
Compressing Recurrent Neural Networks with Tensor Ring for Action Recognition
Yu Pan
Jing Xu
Maolin Wang
Jinmian Ye
Fei Wang
Kun Bai
Zenglin Xu
MQ
18
104
0
19 Nov 2018
Multi-scale 3D Convolution Network for Video Based Person Re-Identification
Jianing Li
Shiliang Zhang
Tiejun Huang
3DPC
25
163
0
19 Nov 2018
Temporal Recurrent Networks for Online Action Detection
Mingze Xu
M. Gao
Yi-Ting Chen
L. Davis
David J. Crandall
OffRL
31
162
0
18 Nov 2018
Relational Long Short-Term Memory for Video Action Recognition
Zexi Chen
B. Ramachandra
Tianfu Wu
Ranga Raju Vatsavai
24
5
0
16 Nov 2018
Previous
1
2
3
...
25
26
27
28
Next