Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014

Jeff Donahue

Lisa Anne Hendricks

Marcus Rohrbach

Subhashini Venugopalan

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 642 papers shown

Title
Dense-Captioning Events in Videos Ranjay Krishna Kenji Hata F. Ren Li Fei-Fei Juan Carlos Niebles 65 1,214 0 02 May 2017
Query-adaptive Video Summarization via Quality-aware Relevance Estimation A. Vasudevan Michael Gygli Anna Volokitin Luc Van Gool 30 93 0 01 May 2017
Inception Recurrent Convolutional Neural Network for Object Recognition Md. Zahangir Alom Mahmudul Hasan C. Yakopcic T. Taha 39 86 0 25 Apr 2017
Second-order Temporal Pooling for Action Recognition A. Cherian Stephen Gould EgoV 11 29 0 23 Apr 2017
Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation Ngan Le Kha Gia Quach Khoa Luu Marios Savvides Chenchen Zhu 19 71 0 12 Apr 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks Liwei Wang Yin Li Jing-ling Huang Svetlana Lazebnik VLM 27 494 0 11 Apr 2017
TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition Chih-Yao Ma Min-Hung Chen Z. Kira G. Al-Regib AI4TS 32 241 0 30 Mar 2017
Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images Rakshith Shetty Bernt Schiele Mario Fritz 35 223 0 30 Mar 2017
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation Albert Gatt E. Krahmer LM&MA ELM 27 810 0 29 Mar 2017
Learning and Refining of Privileged Information-based RNNs for Action Recognition from Depth Sequences Zhiyuan Shi Tae-Kyun Kim 14 80 0 28 Mar 2017
Where to put the Image in an Image Caption Generator Marc Tanti Albert Gatt K. Camilleri 47 96 0 27 Mar 2017
Visually grounded learning of keyword prediction from untranscribed speech Herman Kamper Shane Settle Gregory Shakhnarovich Karen Livescu 19 63 0 23 Mar 2017
Weakly Supervised Action Learning with RNN based Fine-to-coarse Modeling Alexander Richard Hilde Kuehne Juergen Gall 23 195 0 23 Mar 2017
Recurrent Multimodal Interaction for Referring Image Segmentation Chenxi Liu Zhe-nan Lin Xiaohui Shen Jimei Yang Xin Lu Alan Yuille EgoV 36 234 0 23 Mar 2017
An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning Fan Wu Zhongwen Xu Yi Yang ObjD 31 11 0 22 Mar 2017
Encouraging LSTMs to Anticipate Actions Very Early Mohammad Sadegh Ali Akbarian F. Saleh Mathieu Salzmann Basura Fernando L. Petersson Lars Andersson 34 169 0 21 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning Abhishek Das Satwik Kottur J. M. F. Moura Stefan Lee Dhruv Batra OffRL 31 423 0 20 Mar 2017
Multilevel Context Representation for Improving Object Recognition Andreas Kölsch Muhammad Zeshan Afzal Marcus Liwicki 27 3 0 19 Mar 2017
Recurrent Models for Situation Recognition Arun Mallya Svetlana Lazebnik 20 30 0 18 Mar 2017
UntrimmedNets for Weakly Supervised Action Recognition and Detection Limin Wang Yuanjun Xiong Dahua Lin Luc Van Gool 30 490 0 09 Mar 2017
A Pursuit of Temporal Accuracy in General Activity Detection Yuanjun Xiong Yue Zhao Limin Wang Dahua Lin Xiaoou Tang 14 132 0 08 Mar 2017
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos Zheng Shou Jonathan Chan Alireza Zareian K. Miyazawa Shih-Fu Chang 20 560 0 04 Mar 2017
The Statistical Recurrent Unit Junier B. Oliva Barnabás Póczós J. Schneider 16 50 0 01 Mar 2017
Scene Flow to Action Map: A New Representation for RGB-D based Action Recognition with Convolutional Neural Networks Pichao Wang W. Li Zhimin Gao Yuyao Zhang Chang-Fu Tang P. Ogunbona 3DPC 172 131 0 28 Feb 2017
MAT: A Multimodal Attentive Translator for Image Captioning Chang Liu F. Sun Changhu Wang Feng Wang Alan Yuille 17 58 0 18 Feb 2017
Deep Reinforcement Learning for Visual Object Tracking in Videos Da Zhang H. Maei Xin Eric Wang Yuan-fang Wang 20 115 0 31 Jan 2017
Incorporating Global Visual Features into Attention-Based Neural Machine Translation Iacer Calixto Qun Liu Nick Campbell 24 154 0 23 Jan 2017
Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN Teik Koon Cheang Yong Shean Chong Yong Haur Tay 16 56 0 23 Jan 2017
Person Re-Identification via Recurrent Feature Aggregation Yichao Yan Bingbing Ni Zhichao Song Chao Ma Yan Yan Xiaokang Yang 16 243 0 23 Jan 2017
Action Recognition: From Static Datasets to Moving Robots Fahimeh Rezazadegan S. Shirazi B. Upcroft Michael Milford 11 45 0 18 Jan 2017
Ordered Pooling of Optical Flow Sequences for Action Recognition Jue Wang A. Cherian Fatih Porikli 17 45 0 12 Jan 2017
Transforming Sensor Data to the Image Domain for Deep Learning - an Application to Footstep Detection Monit Shah Singh Vinaychandran Pondenkandath Bo Zhou P. Lukowicz Marcus Liwicki 22 75 0 04 Jan 2017
Learning Visual N-Grams from Web Data Ang Li Allan Jabri Armand Joulin L. V. D. van der Maaten VLM 20 136 0 29 Dec 2016
Structured Sequence Modeling with Graph Convolutional Recurrent Networks Youngjoo Seo M. Defferrard P. Vandergheynst Xavier Bresson GNN 36 757 0 22 Dec 2016
An Empirical Study of Language CNN for Image Captioning Jiuxiang Gu G. Wang Jianfei Cai Tsuhan Chen 25 132 0 21 Dec 2016
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale F. Iandola 3DV 26 18 0 20 Dec 2016
Asynchronous Temporal Fields for Action Recognition Gunnar A. Sigurdsson S. Divvala Ali Farhadi Abhinav Gupta BDL 16 170 0 19 Dec 2016
Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs Li Jing Yichen Shen T. Dubček J. Peurifoy S. Skirlo Yann LeCun Max Tegmark Marin Soljacic 15 176 0 15 Dec 2016
Attentive Explanations: Justifying Decisions and Pointing to the Evidence Dong Huk Park Lisa Anne Hendricks Zeynep Akata Bernt Schiele Trevor Darrell Marcus Rohrbach AAML 21 79 0 14 Dec 2016
End-to-end Learning of Driving Models from Large-scale Video Datasets Huazhe Xu Yang Gao F. I. F. Richard Yu Trevor Darrell 44 821 0 04 Dec 2016
Areas of Attention for Image Captioning M. Pedersoli Thomas Lucas Cordelia Schmid Jakob Verbeek 27 205 0 03 Dec 2016
Short-term traffic flow forecasting with spatial-temporal correlation in a hybrid deep learning framework Yuankai Wu Huachun Tan AI4TS 31 248 0 03 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering Yash Goyal Tejas Khot D. Summers-Stay Dhruv Batra Devi Parikh CoGe 104 3,120 0 02 Dec 2016
Action Recognition with Dynamic Image Networks Hakan Bilen Basura Fernando Efstratios Gavves Andrea Vedaldi FAtt 21 221 0 02 Dec 2016
Guided Open Vocabulary Image Captioning with Constrained Beam Search Peter Anderson Basura Fernando Mark Johnson Stephen Gould 21 232 0 02 Dec 2016
Video Captioning with Multi-Faceted Attention Xiang Long Chuang Gan Gerard de Melo 22 88 0 01 Dec 2016
Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures Gaurav Mittal Tanya Marwah V. Balasubramanian VGen DiffM 38 67 0 30 Nov 2016
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model Marcella Cornia Lorenzo Baraldi G. Serra Rita Cucchiara 31 548 0 29 Nov 2016
Social Behavior Prediction from First Person Videos Shan Su J. Hong Jianbo Shi H. Park EgoV 34 12 0 29 Nov 2016
Visual Dialog Abhishek Das Satwik Kottur Khushi Gupta Avi Singh Deshraj Yadav José M. F. Moura Devi Parikh Dhruv Batra 54 990 0 26 Nov 2016