Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
v1
v2
v3 (latest)
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,647 papers shown
Title
Sequential Density Ratio Estimation for Simultaneous Optimization of Speed and Accuracy
Akinori F. Ebihara
Taiki Miyagawa
K. Sakurai
Hitoshi Imaoka
69
0
0
10 Jun 2020
Open-Narrow-Synechiae Anterior Chamber Angle Classification in AS-OCT Sequences
Huaying Hao
Huazhu Fu
Yanwu Xu
Jianlong Yang
Fei Li
Xiulan Zhang
Jiang-Dong Liu
Yitian Zhao
233
8
0
09 Jun 2020
PNL: Efficient Long-Range Dependencies Extraction with Pyramid Non-Local Module for Action Recognition
Yuecong Xu
Haozhi Cao
Jianfei Yang
K. Mao
Jianxiong Yin
Simon See
56
5
0
09 Jun 2020
Action Recognition with Deep Multiple Aggregation Networks
A. Mazari
H. Sahbi
61
0
0
08 Jun 2020
ARID: A New Dataset for Recognizing Action in the Dark
Yuecong Xu
Jianfei Yang
Haozhi Cao
K. Mao
Jianxiong Yin
Simon See
77
73
0
06 Jun 2020
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos
M. Gao
Yingbo Zhou
Ran Xu
R. Socher
Caiming Xiong
102
42
0
05 Jun 2020
Egocentric Object Manipulation Graphs
Eadom Dessalene
Michael Maynord
Chinmaya Devaraj
Cornelia Fermuller
Yiannis Aloimonos
EgoV
81
19
0
05 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Lingyu Zhu
Esa Rahtu
110
23
0
04 Jun 2020
Temporal Aggregate Representations for Long-Range Video Understanding
Fadime Sener
Dipika Singhania
Angela Yao
AI4TS
69
7
0
01 Jun 2020
In the Eye of the Beholder: Gaze and Actions in First Person Video
Yin Li
Miao Liu
James M. Rehg
EgoV
179
71
0
31 May 2020
Complex Sequential Understanding through the Awareness of Spatial and Temporal Concepts
Bo Pang
Kaiwen Zha
Hanwen Cao
Jiajun Tang
Minghui Yu
Cewu Lu
77
25
0
30 May 2020
Automatic Diagnosis of Pulmonary Embolism Using an Attention-guided Framework: A Large-scale Study
Luyao Shi
Deepta Rajan
Shafiq Abedin
Srikar Yellapragada
David Beymer
E. Dehghan
65
18
0
29 May 2020
AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings
Pratik Mazumder
Pravendra Singh
Kranti K. Parida
Vinay P. Namboodiri
82
35
0
27 May 2020
A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews
Edison Marrese-Taylor
Cristian Rodriguez-Opazo
Jorge A. Balazs
Stephen Gould
Y. Matsuo
63
3
0
27 May 2020
Unifying Few- and Zero-Shot Egocentric Action Recognition
Tyler R. Scott
Michael Shvartsman
Karl Ridgeway
EgoV
52
1
0
27 May 2020
SpotFast Networks with Memory Augmented Lateral Transformers for Lipreading
Peratham Wiriyathammabhum
62
8
0
21 May 2020
Intra- and Inter-Action Understanding via Temporal Action Parsing
Dian Shao
Yue Zhao
Bo Dai
Dahua Lin
54
71
0
20 May 2020
On Evaluating Weakly Supervised Action Segmentation Methods
Yaser Souri
Alexander Richard
Luca Minciullo
Juergen Gall
47
7
0
19 May 2020
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Vladimir E. Iashin
Esa Rahtu
104
130
0
17 May 2020
Pedestrian Action Anticipation using Contextual Feature Fusion in Stacked RNNs
Amir Rasouli
Iuliia Kotseruba
John K. Tsotsos
103
112
0
13 May 2020
Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks
Ning Zhang
Jingen Liu
Ke Wang
Dan Zeng
Tao Mei
51
7
0
13 May 2020
Project RISE: Recognizing Industrial Smoke Emissions
Yen-Chia Hsu
Ting-Hao 'Kenneth' Huang
Ting-Yao Hu
P. Dille
Sean Prendi
Ryan N. Hoffman
Anastasia Tsuhlares
Jessica Pachuta
Randy Sargent
I. Nourbakhsh
60
19
0
13 May 2020
Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events
Weiyao Lin
Huabin Liu
Shizhan Liu
Yuxi Li
Rui Qian
Tao Wang
Ning Xu
H. Xiong
Guojun Qi
N. Sebe
84
15
0
09 May 2020
Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition
Miao Yin
Siyu Liao
Xiao-Yang Liu
Xiaodong Wang
Bo Yuan
82
25
0
09 May 2020
Condensed Movies: Story Based Retrieval with Contextual Embeddings
Max Bain
Arsha Nagrani
A. Brown
Andrew Zisserman
128
102
0
08 May 2020
Learning to Segment Actions from Observation and Narration
Daniel Fried
Jean-Baptiste Alayrac
Phil Blunsom
Chris Dyer
S. Clark
Aida Nematzadeh
124
32
0
07 May 2020
Exploiting Inter-Frame Regional Correlation for Efficient Action Recognition
Yuecong Xu
Jianfei Yang
K. Mao
Jianxiong Yin
Simon See
35
11
0
06 May 2020
Adaptive Interaction Modeling via Graph Operations Search
Haoxin Li
Weishi Zheng
Yu Tao
Haifeng Hu
Jianhuang Lai
68
5
0
05 May 2020
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video
Antonino Furnari
G. Farinella
EgoV
66
141
0
04 May 2020
Towards Visually Explaining Video Understanding Networks with Perturbation
Zhenqiang Li
Weimin Wang
Zuoyue Li
Yifei Huang
Yoichi Sato
FAtt
38
3
0
01 May 2020
Recognizing American Sign Language Nonmanual Signal Grammar Errors in Continuous Videos
Elahe Vahdani
Longlong Jing
Yingli Tian
Matt Huenerfauth
26
8
0
01 May 2020
Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images
Matthew Purri
Kristin J. Dana
36
16
0
29 Apr 2020
Skeleton Focused Human Activity Recognition in RGB Video
Bruce X. B. Yu
Yan Liu
Keith C. C. Chan
67
4
0
29 Apr 2020
Span-based Localizing Network for Natural Language Video Localization
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
113
316
0
29 Apr 2020
Inferring Temporal Compositions of Actions Using Probabilistic Automata
Rodrigo Santa Cruz
A. Cherian
Basura Fernando
Dylan Campbell
Stephen Gould
39
2
0
28 Apr 2020
AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement with Neural Searching
Zitong Yu
Xiaobai Li
Xuesong Niu
Jingang Shi
Guoying Zhao
49
132
0
26 Apr 2020
Low-latency hand gesture recognition with a low resolution thermal imager
Maarten Vandersteegen
Wouter Reusen
Kristof Van Beeck
38
17
0
24 Apr 2020
Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos
Mamshad Nayeem Rizve
Ugur Demir
Praveen Tirupattur
A. J. Rana
Kevin Duarte
Ishan R. Dave
Yogesh S Rawat
M. Shah
47
19
0
23 Apr 2020
Action recognition in real-world videos
Waqas Sultani
Qazi Ammar Arshad
Chen Chen
80
2
0
22 Apr 2020
Human and Machine Action Prediction Independent of Object Information
Fatemeh Ziaeetabar
Jennifer Pomp
Stefan Pfeiffer
Nadiya El-Sourani
R. Schubotz
M. Tamosiunaite
Florentin Wörgötter
6
0
0
22 Apr 2020
Group Activity Detection from Trajectory and Video Data in Soccer
Ryan Sanford
Siavash Gorji
L. G. Hafemann
B. Pourbabaee
Mehrsan Javan
61
34
0
21 Apr 2020
TAEN: Temporal Aware Embedding Network for Few-Shot Action Recognition
Rami Ben-Ari
Mor Shpigel
Ophir Azulai
Udi Barzelay
Daniel Rotman
ViT
72
25
0
21 Apr 2020
CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition
Zhengwei Wang
Qi She
Tejo Chalasani
A. Smolic
3DPC
SLR
70
15
0
20 Apr 2020
Motion and Region Aware Adversarial Learning for Fall Detection with Thermal Imaging
V. Mehta
Abhinav Dhall
Sujata Pal
Shehroz S. Khan
59
25
0
17 Apr 2020
Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence
Huy Manh Nguyen
Tomo Miyazaki
Yoshihiro Sugaya
S. Omachi
144
1
0
16 Apr 2020
Local-Global Video-Text Interactions for Temporal Grounding
Jonghwan Mun
Minsu Cho
Bohyung Han
103
270
0
16 Apr 2020
Asynchronous Interaction Aggregation for Action Detection
Jiajun Tang
Jinchao Xia
Xinzhi Mu
Bo Pang
Cewu Lu
89
121
0
16 Apr 2020
ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos
Guillaume Vaudaux-Ruth
Adrien Chan-Hon-Tong
Catherine Achard
BDL
89
7
0
15 Apr 2020
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding
Dian Shao
Yue Zhao
Bo Dai
Dahua Lin
78
331
0
14 Apr 2020
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning
Kangning Liu
Shuhang Gu
Andrés Romero
Radu Timofte
51
9
0
14 Apr 2020
Previous
1
2
3
...
58
59
60
...
71
72
73
Next