Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1406.2199
Cited By
v1
v2 (latest)
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,289 papers shown
Title
Object Referring in Videos with Language and Human Gaze
A. Vasudevan
Dengxin Dai
Luc Van Gool
VOS
104
76
0
04 Jan 2018
What have we learned from deep representations for action recognition?
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
Andrew Zisserman
SSL
88
47
0
04 Jan 2018
A Unified Method for First and Third Person Action Recognition
Ali Javidani
Ahmad Mahmoudi-Aznaveh
50
6
0
30 Dec 2017
Learning Deep and Compact Models for Gesture Recognition
K. Mullick
A. Namboodiri
3DH
19
8
0
29 Dec 2017
Future Frame Prediction for Anomaly Detection -- A New Baseline
Wen Liu
Weixin Luo
Dongze Lian
Shenghua Gao
3DH
181
1,082
0
28 Dec 2017
HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization
Hang Zhao
Antonio Torralba
Lorenzo Torresani
Zhicheng Yan
VLM
AI4TS
87
29
0
26 Dec 2017
Detect-and-Track: Efficient Pose Estimation in Videos
Rohit Girdhar
Georgia Gkioxari
Lorenzo Torresani
Manohar Paluri
Du Tran
3DH
120
230
0
26 Dec 2017
Towards Structured Analysis of Broadcast Badminton Videos
Anurag Ghosh
Suriya Singh
C. V. Jawahar
48
46
0
23 Dec 2017
On the Integration of Optical Flow and Action Recognition
Laura Sevilla-Lara
Yiyi Liao
Fatma Guney
Varun Jampani
Andreas Geiger
Michael J. Black
149
197
0
22 Dec 2017
An Order Preserving Bilinear Model for Person Detection in Multi-Modal Data
Oytun Ulutan
B. Riggan
Nasser M. Nasrabadi
B. S. Manjunath
55
3
0
20 Dec 2017
Objects that Sound
Relja Arandjelović
Andrew Zisserman
ObjD
VOS
120
530
0
18 Dec 2017
Video Object Detection with an Aligned Spatial-Temporal Memory
Fanyi Xiao
Yong Jae Lee
92
189
0
18 Dec 2017
Probabilistic Semantic Retrieval for Surveillance Videos with Activity Graphs
Yuting Chen
Joseph Wang
Yannan Bai
Greg Castañón
Venkatesh Saligrama
82
15
0
17 Dec 2017
Weakly Supervised Action Localization by Sparse Temporal Pooling Network
P. Nguyen
Ting Liu
Gautam Prasad
Bohyung Han
WSOL
205
351
0
14 Dec 2017
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
210
1,336
0
13 Dec 2017
Im2Flow: Motion Hallucination from Static Images for Action Recognition
Ruohan Gao
Bo Xiong
Kristen Grauman
83
93
0
12 Dec 2017
Learning Latent Super-Events to Detect Multiple Activities in Videos
A. Piergiovanni
Michael S. Ryoo
79
90
0
05 Dec 2017
Cooperative Training of Deep Aggregation Networks for RGB-D Action Recognition
Pichao Wang
Wanqing Li
Jun Wan
P. Ogunbona
Xinwang Liu
100
72
0
05 Dec 2017
Visual to Sound: Generating Natural Sound for Videos in the Wild
Yipin Zhou
Zhaowen Wang
Chen Fang
Trung Bui
Tamara L. Berg
VGen
98
209
0
04 Dec 2017
Object Classification using Ensemble of Local and Deep Features
Siddharth Srivastava
Prerana Mukherjee
Brejesh Lall
Kamlesh Jaiswal
52
7
0
04 Dec 2017
Compressed Video Action Recognition
Chao-Yuan Wu
Manzil Zaheer
Hexiang Hu
R. Manmatha
Alex Smola
Philipp Krahenbuhl
184
325
0
02 Dec 2017
Learning to Segment Moving Objects
P. Tokmakov
Cordelia Schmid
Alahari Karteek
VOS
95
97
0
01 Dec 2017
Label Efficient Learning of Transferable Representations across Domains and Tasks
Zelun Luo
Yuliang Zou
Judy Hoffman
Li Fei-Fei
80
277
0
30 Nov 2017
Graph Distillation for Action Detection with Privileged Modalities
Zelun Luo
Jun-Ting Hsieh
Lu Jiang
Juan Carlos Niebles
Li Fei-Fei
107
104
0
30 Nov 2017
Budget-Aware Activity Detection with A Recurrent Policy Network
Behrooz Mahasseni
Xiaodong Yang
Pavlo Molchanov
Jan Kautz
72
6
0
30 Nov 2017
An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos
Rui Hou
Chen Chen
M. Shah
MedIm
67
61
0
30 Nov 2017
Relation Networks for Object Detection
Han Hu
Jiayuan Gu
Zheng Zhang
Jifeng Dai
Yichen Wei
ObjD
150
1,230
0
30 Nov 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
258
3,042
0
30 Nov 2017
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
Shuyang Sun
Zhanghui Kuang
Wanli Ouyang
Lu Sheng
Wayne Zhang
94
297
0
29 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Zhaofan Qiu
Ting Yao
Tao Mei
104
1,667
0
28 Nov 2017
Revisiting hand-crafted feature for action recognition: a set of improved dense trajectories
K. Matsui
Toru Tamaki
Gwladys Auffret
B. Raytchev
K. Kaneda
28
0
0
28 Nov 2017
Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture
Katsunori Ohnishi
Shohei Yamamoto
Yoshitaka Ushiku
Tatsuya Harada
VGen
GAN
81
60
0
27 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
135
1,937
0
27 Nov 2017
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
Xiang Long
Chuang Gan
Gerard de Melo
Jiajun Wu
Xiao-Chang Liu
Shilei Wen
104
209
0
27 Nov 2017
Predictive Learning: Using Future Representation Learning Variantial Autoencoder for Human Action Prediction
Runsheng Yu
Zhenyu Shi
Qiongxiong Ma
Laiyun Qing
3DH
DRL
49
4
0
25 Nov 2017
Appearance-and-Relation Networks for Video Classification
Limin Wang
Wei Li
Wen Li
Luc Van Gool
100
352
0
24 Nov 2017
Summarizing First-Person Videos from Third Persons' Points of Views
Hsuan-I Ho
Wei-Chen Chiu
Y. Wang
EgoV
3DH
53
30
0
24 Nov 2017
Deep Video Generation, Prediction and Completion of Human Action Sequences
Haoye Cai
Chunyan Bai
Yu-Wing Tai
Chi-Keung Tang
VGen
96
147
0
23 Nov 2017
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
151
1,043
0
22 Nov 2017
Three-Stream Convolutional Networks for Video-based Person Re-Identification
Zeng Yu
Tianrui Li
Ning Yu
Xun Gong
Ke Chen
Yi Pan
26
6
0
22 Nov 2017
Multi-Level Recurrent Residual Networks for Action Recognition
Zhenxing Zheng
Gaoyun An
Q. Ruan
41
12
0
22 Nov 2017
Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
A. Karami
M. M. Arzani
Rahman Yousefzadeh
Luc Van Gool
86
242
0
22 Nov 2017
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
366
8,940
0
21 Nov 2017
Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion
Weiyao Lin
Yang Mi
Jianxin Wu
K. Lu
H. Xiong
66
37
0
20 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
83
145
0
16 Nov 2017
Occlusion Aware Unsupervised Learning of Optical Flow
Yang Wang
Yezhou Yang
Zhenheng Yang
Liang Zhao
Peng Wang
Wei Xu
SSL
79
311
0
16 Nov 2017
A Correlation Based Feature Representation for First-Person Activity Recognition
R. Kahani
Alireza Talebpour
Ahmad Mahmoudi-Aznaveh
51
12
0
15 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition
Jiagang Zhu
Wei Zou
Zheng Zhu
97
89
0
11 Nov 2017
Two-stream Collaborative Learning with Spatial-Temporal Attention for Video Classification
Yuxin Peng
Yunzhen Zhao
Junchao Zhang
84
116
0
09 Nov 2017
Attentional Pooling for Action Recognition
Rohit Girdhar
Deva Ramanan
138
321
0
04 Nov 2017
Previous
1
2
3
...
37
38
39
...
44
45
46
Next