1705.07750
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,647 papers shown
Title
Comprehensive Instructional Video Analysis: The COIN Dataset and Performance Evaluation
Yansong Tang
Jiwen Lu
Jie Zhou
80
33
0
20 Mar 2020
Fully Automated Hand Hygiene Monitoring in Operating Room using 3D Convolutional Neural Network
Minjee Kim
Joonmyeong Choi
Namkug Kim
13
4
0
20 Mar 2020
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
203
192
0
19 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities
Noureldien Hussein
E. Gavves
A. Smeulders
VLM
74
13
0
18 Mar 2020
STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition
Xu Li
Jingwen Wang
Lin Ma
Kaihao Zhang
Fengzong Lian
Zhanhui Kang
Jinjun Wang
36
5
0
18 Mar 2020
Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification
Sanath Narayan
Akshita Gupta
Fahad Shahbaz Khan
Cees G. M. Snoek
Ling Shao
VLM
49
224
0
17 Mar 2020
Multi-modal Dense Video Captioning
Vladimir E. Iashin
Esa Rahtu
92
172
0
17 Mar 2020
Predictively Encoded Graph Convolutional Network for Noise-Robust Skeleton-based Action Recognition
Jongmin Yu
Yongsang Yoon
M. Jeon
194
47
0
17 Mar 2020
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
O. Kayhan
Jan van Gemert
341
237
0
16 Mar 2020
SF-Net: Single-Frame Supervision for Temporal Action Localization
Fan Ma
Linchao Zhu
Yi Yang
Shengxin Cindy Zha
Gourab Kundu
Matt Feiszli
Zheng Shou
141
142
0
15 Mar 2020
Interaction Graphs for Object Importance Estimation in On-road Driving Videos
Zehua Zhang
Ashish Tawari
Sujitha Martin
David J. Crandall
GNN
FAtt
138
23
0
12 Mar 2020
Top-1 Solution of Multi-Moments in Time Challenge 2019
Manyuan Zhang
Hao Shao
Guanglu Song
Yu Liu
Junjie Yan
40
3
0
12 Mar 2020
Beyond the Camera: Neural Networks in World Coordinates
Gunnar Sigurdsson
Abhinav Gupta
Cordelia Schmid
Alahari Karteek
41
2
0
12 Mar 2020
Visual Grounding in Video for Unsupervised Word Translation
Gunnar Sigurdsson
Jean-Baptiste Alayrac
Aida Nematzadeh
Lucas Smaira
Mateusz Malinowski
João Carreira
Phil Blunsom
Andrew Zisserman
VGen
105
50
0
11 Mar 2020
Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Jialin Gao
Zhixiang Shi
Jiani Li
Guanshuo Wang
Yufeng Yuan
Shiming Ge
Xiaoping Zhou
64
76
0
09 Mar 2020
Transformation-based Adversarial Video Prediction on Large-Scale Data
Pauline Luc
Aidan Clark
Sander Dieleman
Diego de Las Casas
Yotam Doron
Albin Cassirer
Karen Simonyan
VGen
336
89
0
09 Mar 2020
Better Captioning with Sequence-Level Exploration
Jia Chen
Qin Jin
66
12
0
08 Mar 2020
Transferring Cross-domain Knowledge for Video Sign Language Recognition
Dongxu Li
Xin Yu
Chenchen Xu
L. Petersson
Hongdong Li
SLR
112
105
0
08 Mar 2020
TTPP: Temporal Transformer with Progressive Prediction for Efficient Action Anticipation
Wen Wang
Xiaojiang Peng
Yanzhou Su
Yu Qiao
Jian Cheng
AI4TS
82
18
0
07 Mar 2020
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
Elad Amrani
Rami Ben-Ari
Daniel Rotman
A. Bronstein
134
126
0
06 Mar 2020
Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation
Min-Hung Chen
Baopu Li
Sid Ying-Ze Bao
G. Al-Regib
Z. Kira
TTA
158
122
0
05 Mar 2020
Self-Supervised Visual Learning by Variable Playback Speeds Prediction of a Video
Hyeon Cho
Taehoon Kim
H. Chang
Wonjun Hwang
58
20
0
05 Mar 2020
Detecting Attended Visual Targets in Video
Eunji Chong
Yongxin Wang
Nataniel Ruiz
James M. Rehg
259
116
0
05 Mar 2020
ETRI-Activity3D: A Large-Scale RGB-D Dataset for Robots to Recognize Daily Activities of the Elderly
Jinhyeok Jang
Dohyung Kim
Cheonshu Park
Minsu Jang
Jaeyeon Lee
Jaehong Kim
84
67
0
04 Mar 2020
MoVi: A Large Multipurpose Motion and Video Dataset
Saeed Ghorbani
Kimia Mahdaviani
A. Thaler
Konrad Paul Kording
D. Cook
Gunnar Blohm
N. Troje
92
73
0
04 Mar 2020
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
Biagio Brattoli
Joseph Tighe
Fedor Zhdanov
Pietro Perona
Krzysztof Chalupka
VLM
245
130
0
03 Mar 2020
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
Shizhe Chen
Yida Zhao
Qin Jin
Qi Wu
124
319
0
01 Mar 2020
Joint Wasserstein Distribution Matching
Jingyun Liang
Langyuan Mo
Qing Du
Yong Guo
P. Zhao
Junzhou Huang
Mingkui Tan
28
0
0
01 Mar 2020
VideoSSL: Semi-Supervised Learning for Video Classification
Longlong Jing
T. Parag
Zhe Wu
Yingli Tian
Hongcheng Wang
64
52
0
29 Feb 2020
Infrared and 3D skeleton feature fusion for RGB-D action recognition
Alban Main De Boissiere
R. Noumeir
99
38
0
28 Feb 2020
Joint 2D-3D Breast Cancer Classification
G. Liang
Xiaoqin Wang
Yu Zhang
Xin Xing
Hunter Blanton
Tawfiq Salem
Nathan Jacobs
57
39
0
27 Feb 2020
Evolving Losses for Unsupervised Video Representation Learning
A. Piergiovanni
A. Angelova
Michael S. Ryoo
SSL
89
140
0
26 Feb 2020
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Hung Le
Nancy F. Chen
57
9
0
25 Feb 2020
Fine-Grained Instance-Level Sketch-Based Video Retrieval
Peng Xu
Kun Liu
Tao Xiang
Timothy M. Hospedales
Zhanyu Ma
Jun Guo
Yi-Zhe Song
105
33
0
21 Feb 2020
Strength from Weakness: Fast Learning Using Weak Supervision
Joshua Robinson
Stefanie Jegelka
S. Sra
67
32
0
19 Feb 2020
SummaryNet: A Multi-Stage Deep Learning Model for Automatic Video Summarisation
Ziyad Jappie
David Torpey
Turgay Celik
23
3
0
19 Feb 2020
Human Action Recognition using Local Two-Stream Convolution Neural Network Features and Support Vector Machines
David Torpey
Turgay Celik
13
8
0
19 Feb 2020
Knowledge Integration Networks for Action Recognition
Shiwen Zhang
Sheng Guo
Limin Wang
Weilin Huang
Matthew R. Scott
123
18
0
18 Feb 2020
V4D:4D Convolutional Neural Networks for Video-level Representation Learning
Shiwen Zhang
Sheng Guo
Weilin Huang
Matthew R. Scott
Limin Wang
50
73
0
18 Feb 2020
Bottom-Up Temporal Action Localization with Mutual Regularization
Peisen Zhao
Lingxi Xie
Chen Ju
Ya Zhang
Yanfeng Wang
Qi Tian
21
1
0
18 Feb 2020
Over-the-Air Adversarial Flickering Attacks against Video Recognition Networks
Roi Pony
I. Naeh
Shie Mannor
AAML
80
54
0
12 Feb 2020
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
Sicheng Zhao
Yunsheng Ma
Yang Gu
Jufeng Yang
Tengfei Xing
Pengfei Xu
Runbo Hu
Hua Chai
Kurt Keutzer
69
100
0
12 Feb 2020
Learning spatio-temporal representations with temporal squeeze pooling
Guoxi Huang
A. Bors
ViT
53
12
0
11 Feb 2020
Dynamic Inference: A New Approach Toward Efficient Video Action Recognition
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Yi Yang
Shilei Wen
78
35
0
09 Feb 2020
FSD-10: A Dataset for Competitive Sports Content Analysis
Shenlan Liu
Xiang Liu
Gao Huang
Lin Feng
Lianyu Hu
Dong Jiang
Ai-Xuan Zhang
Yang Liu
Hong Qiao
AI4TS
57
19
0
09 Feb 2020
Weakly-Supervised Multi-Person Action Recognition in 360° Videos
Junnan Li
Jianquan Liu
Yongkang Wong
Shoji Nishimura
Mohan S. Kankanhalli
120
13
0
09 Feb 2020
CTM: Collaborative Temporal Modeling for Action Recognition
Li-Yu Daisy Liu
Tao Wang
Jie Liu
Yang Guan
Qi Bu
Longfei Yang
TTA
26
0
0
08 Feb 2020
Symbiotic Attention with Privileged Information for Egocentric Action Recognition
Xiaohan Wang
Yu Wu
Linchao Zhu
Yi Yang
74
63
0
08 Feb 2020
Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling
Mu Yuan
Lan Zhang
Xiangyang Li
Hui Xiong
VLM
56
17
0
08 Feb 2020
iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention
Li-Yu Daisy Liu
Dongyang Cai
Jie Liu
Nan Ding
Tao Wang
20
0
0
07 Feb 2020