Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
v1
v2
v3 (latest)
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,647 papers shown
Title
Identity-Aware Multi-Sentence Video Description
J. S. Park
Trevor Darrell
Anna Rohrbach
71
17
0
22 Aug 2020
Biased Mixtures Of Experts: Enabling Computer Vision Inference Under Data Transfer Limitations
Alhabib Abbas
Y. Andreopoulos
MoE
108
18
0
21 Aug 2020
Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition
Zitong Yu
Benjia Zhou
Jun Wan
Pichao Wang
Haoyu Chen
Xin Liu
Stan Z. Li
Guoying Zhao
3DPC
91
91
0
21 Aug 2020
Learning to Abstract and Predict Human Actions
Romero Morais
Vuong Le
T. Tran
Svetha Venkatesh
AI4CE
44
6
0
20 Aug 2020
AWNet: Attentive Wavelet Network for Image ISP
Linhui Dai
Xiaohong Liu
Chengqi Li
Jun Chen
88
58
0
20 Aug 2020
Conditional Entropy Coding for Efficient Video Compression
Jerry Liu
Shenlong Wang
Wei-Chiu Ma
Meet Shah
Rui Hu
Pranaab Dhawan
R. Urtasun
64
67
0
20 Aug 2020
Accuracy and Performance Comparison of Video Action Recognition Approaches
Matthew Hutchinson
S. Samsi
William Arcand
David Bestor
Bill Bergeron
...
Andrew Prout
Antonio Rosa
Albert Reuther
Charles Yee
V. Gadepally
42
5
0
20 Aug 2020
Localizing Anomalies from Weakly-Labeled Videos
Hui Lv
Chuanwei Zhou
Chunyan Xu
Zhen Cui
Jian Yang
92
123
0
20 Aug 2020
Text-based Localization of Moments in a Video Corpus
Sudipta Paul
Niluthpol Chowdhury Mithun
Amit K. Roy-Chowdhury
46
15
0
20 Aug 2020
Learning Trailer Moments in Full-Length Movies
Lezi Wang
Dong Liu
R. Puri
Dimitris N. Metaxas
52
43
0
19 Aug 2020
SegCodeNet: Color-Coded Segmentation Masks for Activity Detection from Wearable Cameras
Asif Sushmit
Partho Ghosh
Md. Abrar Istiak
Nayeeb Rashid
Ahsan Habib Akash
Taufiq Hasan
25
3
0
19 Aug 2020
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization
Yuxi Li
Weiyao Lin
John See
N. Xu
Shugong Xu
Ke Yan
Cong Yang
360
17
0
19 Aug 2020
AssembleNet++: Assembling Modality Representations via Attention Connections
Michael S. Ryoo
A. Piergiovanni
Juhana Kangaspunta
A. Angelova
65
45
0
18 Aug 2020
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization
Tao Zhao
Junwei Han
Le Yang
Dingwen Zhang
90
19
0
18 Aug 2020
Contact Area Detector using Cross View Projection Consistency for COVID-19 Projects
Peiying Zhang
W. T. Calderon
Bokyung Lee
Alex Tessier
Jacky Bibliowicz
Liviu-Mihai Calin
Michael Lee
27
1
0
18 Aug 2020
AID: Pushing the Performance Boundary of Human Pose Estimation with Information Dropping Augmentation
Junjie Huang
Zheng Zhu
Guan Huang
Dalong Du
3DH
79
1
0
17 Aug 2020
HRVGAN: High Resolution Video Generation using Spatio-Temporal GAN
Abhinav Sagar
GAN
55
1
0
17 Aug 2020
Deep Learning Predicts Cardiovascular Disease Risks from Lung Cancer Screening Low Dose Computed Tomography
Hanqing Chao
Hongming Shan
F. Homayounieh
Ramandeep Singh
R. Khera
Hengtao Guo
Timothy Su
Ge Wang
Mannudeep K. Kalra
Pingkun Yan
39
74
0
16 Aug 2020
Deep Domain Adaptation for Ordinal Regression of Pain Intensity Estimation Using Weakly-Labelled Videos
R Gnana Praveen
Eric Granger
P. Cardinal
88
21
0
13 Aug 2020
Self-supervised Video Representation Learning by Pace Prediction
Jiangliu Wang
Jianbo Jiao
Yunhui Liu
SSL
AI4TS
84
237
0
13 Aug 2020
Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning
Ying Cheng
Ruize Wang
Zhihao Pan
Rui Feng
Yuejie Zhang
SSL
150
110
0
13 Aug 2020
Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition
Taeoh Kim
Hyeongmin Lee
Myeongah Cho
Hankook Lee
Dong Heon Cho
Sangyoun Lee
88
26
0
13 Aug 2020
We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos
A. Andonian
Camilo Luciano Fosco
Mathew Monfort
Allen Lee
Rogerio Feris
Carl Vondrick
A. Oliva
46
9
0
12 Aug 2020
Dynamic Object Removal and Spatio-Temporal RGB-D Inpainting via Geometry-Aware Adversarial Learning
Borna Bevsić
Abhinav Valada
56
36
0
12 Aug 2020
VI-Net: View-Invariant Quality of Human Movement Assessment
Faegheh Sardari
A. Paiement
S. Hannuna
Majid Mirmehdi
3DH
93
27
0
11 Aug 2020
Adversarial Generative Grammars for Human Activity Prediction
A. Piergiovanni
A. Angelova
Alexander Toshev
Michael S. Ryoo
GAN
92
31
0
11 Aug 2020
Sharp Multiple Instance Learning for DeepFake Video Detection
Xiaodan Li
Yining Lang
YueFeng Chen
Xiaofeng Mao
Yuan He
Shuhui Wang
Hui Xue
Quan Lu
AAML
111
175
0
11 Aug 2020
Vision Meets Wireless Positioning: Effective Person Re-identification with Recurrent Context Propagation
Yiheng Liu
Wen-gang Zhou
Mao Xi
Sanjing Shen
Houqiang Li
96
9
0
10 Aug 2020
2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors Challenges: An Efficient Optical Flow Stream Guided Framework
Haoyu Chen
Zitong Yu
Xin Liu
Wei Peng
Yoon Lee
Guoying Zhao
3DPC
53
5
0
10 Aug 2020
Spatiotemporal Contrastive Video Representation Learning
Rui Qian
Tianjian Meng
Boqing Gong
Ming-Hsuan Yang
Haoran Wang
Serge J. Belongie
Huayu Chen
SSL
AI4TS
157
502
0
09 Aug 2020
A Unified Framework for Shot Type Classification Based on Subject Centric Lens
Anyi Rao
Jiaze Wang
Linning Xu
Xuekun Jiang
Qingqiu Huang
Bolei Zhou
Dahua Lin
96
63
0
08 Aug 2020
PAN: Towards Fast Action Recognition via Learning Persistence of Appearance
Can Zhang
Yuexian Zou
Guang Chen
Lei Gan
89
39
0
08 Aug 2020
Multi-Level Temporal Pyramid Network for Action Detection
Xiang Wang
Changxin Gao
Shiwei Zhang
Nong Sang
45
14
0
07 Aug 2020
A Multi-Task Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment
Behnoosh Parsa
A. Banerjee
97
2
0
07 Aug 2020
Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework
Li Tao
Xueting Wang
T. Yamasaki
SSL
85
106
0
06 Aug 2020
Fine-grained Iterative Attention Network for TemporalLanguage Localization in Videos
Xiaoye Qu
Peng Tang
Zhikang Zhou
Yu Cheng
Jianfeng Dong
Pan Zhou
90
92
0
06 Aug 2020
Self-supervised Temporal Discriminative Learning for Video Representation Learning
Jinpeng Wang
Yiqi Lin
A. J. Ma
Pong C. Yuen
TTA
73
11
0
05 Aug 2020
Self-supervised learning using consistency regularization of spatio-temporal data augmentation for action recognition
Jinpeng Wang
Yiqi Lin
A. J. Ma
SSL
55
1
0
05 Aug 2020
Boundary Content Graph Neural Network for Temporal Action Proposal Generation
Y. Bai
Yingying Wang
Yunhai Tong
Yang Yang
Qiyue Liu
Junhui Liu
83
162
0
04 Aug 2020
Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
Daizong Liu
Xiaoye Qu
Xiao-Yang Liu
Jianfeng Dong
Pan Zhou
Zichuan Xu
92
129
0
04 Aug 2020
Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition
M. E. Kalfaoglu
Sinan Kalkan
A. Aydin Alatan
3DPC
93
143
0
03 Aug 2020
Memory-augmented Dense Predictive Coding for Video Representation Learning
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
124
242
0
03 Aug 2020
Residual Frames with Efficient Pseudo-3D CNN for Human Action Recognition
Jiawei Chen
Jenson Hsiao
C. Ho
58
5
0
03 Aug 2020
AUTSL: A Large Scale Multi-modal Turkish Sign Language Dataset and Baseline Methods
Ozge Mercanoglu Sincan
H. Keles
SLR
75
173
0
03 Aug 2020
The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)
Samuel Albanie
Yang Liu
Arsha Nagrani
Antoine Miech
Ernesto Coto
...
Kaixu Cui
Hui Liu
Chen Wang
Yudong Jiang
Xiaoshuai Hao
87
9
0
03 Aug 2020
A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises
S. Kevin Zhou
H. Greenspan
Christos Davatzikos
James S. Duncan
Bram van Ginneken
A. Madabhushi
Jerry L. Prince
Daniel Rueckert
Ronald M. Summers
220
650
0
02 Aug 2020
Estimating Motion Codes from Demonstration Videos
Maxat Alibayev
D. Paulius
Yu Sun
47
4
0
31 Jul 2020
AR-Net: Adaptive Frame Resolution for Efficient Action Recognition
Yue Meng
Chung-Ching Lin
Yikang Shen
P. Sattigeri
Leonid Karlinsky
A. Oliva
Kate Saenko
Rogerio Feris
97
146
0
31 Jul 2020
LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities
Baoxiong Jia
Yixin Chen
Siyuan Huang
Yixin Zhu
Song-Chun Zhu
42
54
0
31 Jul 2020
Hierarchical Action Classification with Network Pruning
Mahdi Davoodikakhki
KangKang Yin
78
20
0
30 Jul 2020
Previous
1
2
3
...
55
56
57
...
71
72
73
Next