Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
v1
v2
v3 (latest)
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,645 papers shown
Title
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
88
254
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
Alex Schwing
Tamir Hazan
87
71
0
11 Apr 2019
Recurrent Space-time Graph Neural Networks
Andrei Liviu Nicolicioiu
Iulia Duta
Marius Leordeanu
GNN
128
45
0
11 Apr 2019
Attentive Action and Context Factorization
Yanjie Wang
Vinh Tran
Gedas Bertasius
Lorenzo Torresani
Minh Hoai
31
6
0
10 Apr 2019
Black-box Adversarial Attacks on Video Recognition Models
Linxi Jiang
Xingjun Ma
Shaoxiang Chen
James Bailey
Yu-Gang Jiang
AAML
MLAU
76
147
0
10 Apr 2019
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
Yunpeng Chen
Haoqi Fan
Bing Xu
Zhicheng Yan
Yannis Kalantidis
Marcus Rohrbach
Shuicheng Yan
Jiashi Feng
109
565
0
10 Apr 2019
Dynamic Gesture Recognition by Using CNNs and Star RGB: a Temporal Information Condensation
Clebeson Canuto dos Santos
J. L. A. Samatelo
R. Vassallo
57
57
0
10 Apr 2019
Knowledge Distillation for Human Action Anticipation
Vinh Tran
Yang Wang
Minh Hoai
64
6
0
09 Apr 2019
Learning from Videos with Deep Convolutional LSTM Networks
Logan Courtney
R. Sreenivas
23
7
0
09 Apr 2019
Action Recognition from Single Timestamp Supervision in Untrimmed Videos
Davide Moltisanti
Sanja Fidler
Dima Damen
84
61
0
09 Apr 2019
SCSampler: Sampling Salient Clips from Video for Efficient Action Recognition
Bruno Korbar
Du Tran
Lorenzo Torresani
80
225
0
08 Apr 2019
Relational Action Forecasting
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Rahul Sukthankar
Kevin Patrick Murphy
Cordelia Schmid
87
81
0
08 Apr 2019
Unsupervised learning of action classes with continuous temporal embedding
Anna Kukleva
Hilde Kuehne
Fadime Sener
Juergen Gall
87
107
0
08 Apr 2019
Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
Jiangliu Wang
Jianbo Jiao
Linchao Bao
Shengfeng He
Yunhui Liu
Wen Liu
SSL
59
206
0
07 Apr 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
125
558
0
06 Apr 2019
Convolutional Relational Machine for Group Activity Recognition
Sina Mokhtarzadeh Azar
Mina Ghadimi Atigh
A. Nickabadi
Alexandre Alahi
BDL
72
106
0
05 Apr 2019
Attention Distillation for Learning Video Representations
Miao Liu
Xin Chen
Yun C. Zhang
Yin Li
James M. Rehg
57
2
0
05 Apr 2019
Fast Weakly Supervised Action Segmentation Using Mutual Consistency
Yaser Souri
Mohsen Fayyaz
Luca Minciullo
Gianpiero Francesca
Juergen Gall
91
52
0
05 Apr 2019
Deep Predictive Video Compression with Bi-directional Prediction
Woonsung Park
Munchurl Kim
62
7
0
05 Apr 2019
Video Classification with Channel-Separated Convolutional Networks
Du Tran
Heng Wang
Lorenzo Torresani
Matt Feiszli
3DV
135
591
0
04 Apr 2019
ExCL: Extractive Clip Localization Using Natural Language Descriptions
Soham Ghosh
Anuva Agarwal
Zarana Parekh
Alexander G. Hauptmann
CLIP
61
153
0
04 Apr 2019
Resource Efficient 3D Convolutional Neural Networks
Okan Kopuklu
Neslihan Köse
Ahmet Gunduz
Gerhard Rigoll
69
190
0
04 Apr 2019
Activity Driven Weakly Supervised Object Detection
Zhenheng Yang
D. Mahajan
Deepti Ghadiyaram
Ram Nevatia
Vignesh Ramanathan
WSOD
68
32
0
02 Apr 2019
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization
Alejandro Pardo
Humam Alwassel
Fabian Caba Heilbron
Ali K. Thabet
Guohao Li
130
52
0
30 Mar 2019
Local Aggregation for Unsupervised Learning of Visual Embeddings
Chengxu Zhuang
Alex Zhai
Daniel L. K. Yamins
SSL
105
448
0
29 Mar 2019
BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames
Brent A. Griffin
Jason J. Corso
VOS
113
43
0
28 Mar 2019
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Yao-Hung Hubert Tsai
S. Divvala
Louis-Philippe Morency
Ruslan Salakhutdinov
Ali Farhadi
89
103
0
25 Mar 2019
StartNet: Online Detection of Action Start in Untrimmed Videos
M. Gao
Mingze Xu
L. Davis
R. Socher
Caiming Xiong
67
52
0
23 Mar 2019
On the Importance of Video Action Recognition for Visual Lipreading
Xinshuo Weng
29
3
0
22 Mar 2019
Forecasting Time-to-Collision from Monocular Video: Feasibility, Dataset, and Challenges
A. Manglik
Xinshuo Weng
Eshed Ohn-Bar
Kris Kitani
105
15
0
21 Mar 2019
Value of Temporal Dynamics Information in Driving Scene Segmentation
Li Ding
Jack Terwilliger
Rini Sherony
B. Reimer
Alex Fridman
51
23
0
21 Mar 2019
Cross-task weakly supervised learning from instructional videos
Dimitri Zhukov
Jean-Baptiste Alayrac
R. G. Cinbis
David Fouhey
Ivan Laptev
Josef Sivic
SSL
183
250
0
19 Mar 2019
Human Activity Recognition for Edge Devices
Manjot Bilkhu
Hammababdullah Ayyubi
27
0
0
18 Mar 2019
Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
Jia-Xing Zhong
Nannan Li
Weijie Kong
Shan Liu
Thomas H. Li
Ge Li
NoLa
SSL
116
410
0
18 Mar 2019
Learning Super-resolution 3D Segmentation of Plant Root MRI Images from Few Examples
Ali Oguz Uzman
Jannis Horn
Sven Behnke
24
4
0
16 Mar 2019
GolfDB: A Video Database for Golf Swing Sequencing
William J. McNally
Kanav Vats
T. Pinto
Chris Dulhanty
J. McPhee
A. Wong
59
55
0
15 Mar 2019
Two-Stream Action Recognition-Oriented Video Super-Resolution
Haochen Zhang
Dong Liu
Zhiwei Xiong
SupR
64
46
0
13 Mar 2019
Asymmetric Residual Neural Network for Accurate Human Activity Recognition
J. Long
Wuqing Sun
Zhan Yang
Osolo Ian Raymond
HAI
57
21
0
13 Mar 2019
Video Generation from Single Semantic Label Map
Junting Pan
Chengyu Wang
Xu Jia
Jing Shao
Lu Sheng
Junjie Yan
Xiaogang Wang
VGen
55
104
0
11 Mar 2019
Investigation on Combining 3D Convolution of Image Data and Optical Flow to Generate Temporal Action Proposals
Patrick Schlosser
David Münch
Michael Arens
3DPC
20
3
0
11 Mar 2019
SSN: Learning Sparse Switchable Normalization via SparsestMax
Wenqi Shao
Jiamin Ren
Jingyu Li
Ruimao Zhang
Yudian Li
Xiaogang Wang
Ping Luo
69
56
0
09 Mar 2019
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Yansong Tang
Dajun Ding
Yongming Rao
Yu Zheng
Danyang Zhang
Lili Zhao
Jiwen Lu
Jie Zhou
145
317
0
07 Mar 2019
Video-based surgical skill assessment using 3D convolutional neural networks
Isabel Funke
S. T. Mees
Jürgen Weitz
Stefanie Speidel
85
177
0
06 Mar 2019
Semantic Adversarial Network with Multi-scale Pyramid Attention for Video Classification
De Xie
Cheng Deng
Hao Wang
Chao Li
Dapeng Tao
41
16
0
06 Mar 2019
MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation
Yazan Abu Farha
Juergen Gall
90
670
0
05 Mar 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
83
82
0
04 Mar 2019
Less is More: Learning Highlight Detection from Video Duration
Bo Xiong
Yannis Kalantidis
Deepti Ghadiyaram
Kristen Grauman
61
111
0
03 Mar 2019
Attention-Based Structural-Plasticity
Soheil Kolouri
Nicholas A. Ketz
Xinyun Zou
J. Krichmar
Praveen K. Pilly
CLL
30
12
0
02 Mar 2019
Unsupervised Traffic Accident Detection in First-Person Videos
Yu Yao
Mingze Xu
Yuchen Wang
David J. Crandall
E. Atkins
101
143
0
02 Mar 2019
Progress Regression RNN for Online Spatial-Temporal Action Localization in Unconstrained Videos
Bo Hu
Jianfei Cai
Tat-Jen Cham
Junsong Yuan
81
3
0
01 Mar 2019
Previous
1
2
3
...
67
68
69
...
71
72
73
Next