v1v2 (latest)

Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,289 papers shown

Title
Object Referring in Videos with Language and Human Gaze A. Vasudevan Dengxin Dai Luc Van Gool VOS 104 76 0 04 Jan 2018
What have we learned from deep representations for action recognition? Christoph Feichtenhofer A. Pinz Richard P. Wildes Andrew Zisserman SSL 88 47 0 04 Jan 2018
A Unified Method for First and Third Person Action Recognition Ali Javidani Ahmad Mahmoudi-Aznaveh 55 6 0 30 Dec 2017
Learning Deep and Compact Models for Gesture Recognition K. Mullick A. Namboodiri 3DH 19 8 0 29 Dec 2017
Future Frame Prediction for Anomaly Detection -- A New Baseline Wen Liu Weixin Luo Dongze Lian Shenghua Gao 3DH 181 1,082 0 28 Dec 2017
HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization Hang Zhao Antonio Torralba Lorenzo Torresani Zhicheng Yan VLM AI4TS 87 29 0 26 Dec 2017
Detect-and-Track: Efficient Pose Estimation in Videos Rohit Girdhar Georgia Gkioxari Lorenzo Torresani Manohar Paluri Du Tran 3DH 120 231 0 26 Dec 2017
Towards Structured Analysis of Broadcast Badminton Videos Anurag Ghosh Suriya Singh C. V. Jawahar 48 46 0 23 Dec 2017
On the Integration of Optical Flow and Action Recognition Laura Sevilla-Lara Yiyi Liao Fatma Guney Varun Jampani Andreas Geiger Michael J. Black 149 197 0 22 Dec 2017
An Order Preserving Bilinear Model for Person Detection in Multi-Modal Data Oytun Ulutan B. Riggan Nasser M. Nasrabadi B. S. Manjunath 55 3 0 20 Dec 2017
Objects that Sound Relja Arandjelović Andrew Zisserman ObjD VOS 120 530 0 18 Dec 2017
Video Object Detection with an Aligned Spatial-Temporal Memory Fanyi Xiao Yong Jae Lee 92 189 0 18 Dec 2017
Probabilistic Semantic Retrieval for Surveillance Videos with Activity Graphs Yuting Chen Joseph Wang Yannan Bai Greg Castañón Venkatesh Saligrama 82 15 0 17 Dec 2017
Weakly Supervised Action Localization by Sparse Temporal Pooling Network P. Nguyen Ting Liu Gautam Prasad Bohyung Han WSOL 205 352 0 14 Dec 2017
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification Saining Xie Chen Sun Jonathan Huang Zhuowen Tu Kevin Patrick Murphy 3DH 210 1,336 0 13 Dec 2017
Im2Flow: Motion Hallucination from Static Images for Action Recognition Ruohan Gao Bo Xiong Kristen Grauman 92 93 0 12 Dec 2017
Learning Latent Super-Events to Detect Multiple Activities in Videos A. Piergiovanni Michael S. Ryoo 79 90 0 05 Dec 2017
Cooperative Training of Deep Aggregation Networks for RGB-D Action Recognition Pichao Wang Wanqing Li Jun Wan P. Ogunbona Xinwang Liu 100 72 0 05 Dec 2017
Visual to Sound: Generating Natural Sound for Videos in the Wild Yipin Zhou Zhaowen Wang Chen Fang Trung Bui Tamara L. Berg VGen 98 209 0 04 Dec 2017
Object Classification using Ensemble of Local and Deep Features Siddharth Srivastava Prerana Mukherjee Brejesh Lall Kamlesh Jaiswal 52 7 0 04 Dec 2017
Compressed Video Action Recognition Chao-Yuan Wu Manzil Zaheer Hexiang Hu R. Manmatha Alex Smola Philipp Krahenbuhl 184 325 0 02 Dec 2017
Learning to Segment Moving Objects P. Tokmakov Cordelia Schmid Alahari Karteek VOS 95 97 0 01 Dec 2017
Label Efficient Learning of Transferable Representations across Domains and Tasks Zelun Luo Yuliang Zou Judy Hoffman Li Fei-Fei 80 277 0 30 Nov 2017
Graph Distillation for Action Detection with Privileged Modalities Zelun Luo Jun-Ting Hsieh Lu Jiang Juan Carlos Niebles Li Fei-Fei 110 104 0 30 Nov 2017
Budget-Aware Activity Detection with A Recurrent Policy Network Behrooz Mahasseni Xiaodong Yang Pavlo Molchanov Jan Kautz 72 6 0 30 Nov 2017
An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos Rui Hou Chen Chen M. Shah MedIm 67 61 0 30 Nov 2017
Relation Networks for Object Detection Han Hu Jiayuan Gu Zheng Zhang Jifeng Dai Yichen Wei ObjD 150 1,230 0 30 Nov 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition Du Tran Heng Wang Lorenzo Torresani Jamie Ray Yann LeCun Manohar Paluri 258 3,045 0 30 Nov 2017
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition Shuyang Sun Zhanghui Kuang Wanli Ouyang Lu Sheng Wayne Zhang 94 297 0 29 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks Zhaofan Qiu Ting Yao Tao Mei 104 1,668 0 28 Nov 2017
Revisiting hand-crafted feature for action recognition: a set of improved dense trajectories K. Matsui Toru Tamaki Gwladys Auffret B. Raytchev K. Kaneda 28 0 0 28 Nov 2017
Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture Katsunori Ohnishi Shohei Yamamoto Yoshitaka Ushiku Tatsuya Harada VGen GAN 81 60 0 27 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? Kensho Hara Hirokatsu Kataoka Y. Satoh 3DPC 135 1,937 0 27 Nov 2017
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification Xiang Long Chuang Gan Gerard de Melo Jiajun Wu Xiao-Chang Liu Shilei Wen 104 209 0 27 Nov 2017
Predictive Learning: Using Future Representation Learning Variantial Autoencoder for Human Action Prediction Runsheng Yu Zhenyu Shi Qiongxiong Ma Laiyun Qing 3DH DRL 49 4 0 25 Nov 2017
Appearance-and-Relation Networks for Video Classification Limin Wang Wei Li Wen Li Luc Van Gool 100 352 0 24 Nov 2017
Summarizing First-Person Videos from Third Persons' Points of Views Hsuan-I Ho Wei-Chen Chiu Y. Wang EgoV 3DH 53 30 0 24 Nov 2017
Deep Video Generation, Prediction and Completion of Human Action Sequences Haoye Cai Chunyan Bai Yu-Wing Tai Chi-Keung Tang VGen 96 147 0 23 Nov 2017
Temporal Relational Reasoning in Videos Bolei Zhou A. Andonian Aude Oliva Antonio Torralba NAI 151 1,043 0 22 Nov 2017
Three-Stream Convolutional Networks for Video-based Person Re-Identification Zeng Yu Tianrui Li Ning Yu Xun Gong Ke Chen Yi Pan 26 6 0 22 Nov 2017
Multi-Level Recurrent Residual Networks for Action Recognition Zhenxing Zheng Gaoyun An Q. Ruan 41 12 0 22 Nov 2017
Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification Ali Diba Mohsen Fayyaz Vivek Sharma A. Karami M. M. Arzani Rahman Yousefzadeh Luc Van Gool 86 242 0 22 Nov 2017
Non-local Neural Networks Xinyu Wang Ross B. Girshick Abhinav Gupta Kaiming He OffRL 366 8,948 0 21 Nov 2017
Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion Weiyao Lin Yang Mi Jianxin Wu K. Lu H. Xiong 66 37 0 20 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding Chih-Yao Ma Asim Kadav I. Melvin Z. Kira G. Al-Regib H. Graf 83 145 0 16 Nov 2017
Occlusion Aware Unsupervised Learning of Optical Flow Yang Wang Yezhou Yang Zhenheng Yang Liang Zhao Peng Wang Wei Xu SSL 79 311 0 16 Nov 2017
A Correlation Based Feature Representation for First-Person Activity Recognition R. Kahani Alireza Talebpour Ahmad Mahmoudi-Aznaveh 51 12 0 15 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition Jiagang Zhu Wei Zou Zheng Zhu 97 89 0 11 Nov 2017
Two-stream Collaborative Learning with Spatial-Temporal Attention for Video Classification Yuxin Peng Yunzhen Zhao Junchao Zhang 84 116 0 09 Nov 2017
Attentional Pooling for Action Recognition Rohit Girdhar Deva Ramanan 138 321 0 04 Nov 2017