Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014

Jeff Donahue

Lisa Anne Hendricks

Marcus Rohrbach

Subhashini Venugopalan

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 642 papers shown

Title
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos Amlan Kar Nishant Rai Karan Sikka Gaurav Sharma 24 152 0 24 Nov 2016
Image-based localization using LSTMs for structured feature correlation F. Walch C. Hazirbas Laura Leal-Taixé Torsten Sattler S. Hilsenbeck Daniel Cremers 27 494 0 23 Nov 2016
Adaptive Feature Abstraction for Translating Video to Text Yunchen Pu Martin Renqiang Min Zhe Gan Lawrence Carin 39 14 0 23 Nov 2016
Dense Captioning with Joint Inference and Visual Context L. Yang K. Tang Jianchao Yang Li-Jia Li VLM 21 169 0 21 Nov 2016
Deep Temporal Linear Encoding Networks Ali Diba Vivek Sharma Luc Van Gool 19 227 0 21 Nov 2016
A Hierarchical Approach for Generating Descriptive Image Paragraphs J. Krause Justin Johnson Ranjay Krishna Li Fei-Fei VLM 25 373 0 20 Nov 2016
Fast Video Classification via Adaptive Cascading of Deep Models Haichen Shen Seungyeop Han Matthai Philipose Arvind Krishnamurthy 26 78 0 20 Nov 2016
Instance-aware Image and Sentence Matching with Selective Multimodal LSTM Yan Huang Wei Wang Liang Wang 26 222 0 17 Nov 2016
Convolutional Gated Recurrent Networks for Video Segmentation Mennatullah Siam Sepehr Valipour Martin Jägersand Nilanjan Ray VOS 19 98 0 16 Nov 2016
Learning long-term dependencies for action recognition with a biologically-inspired deep network Yemin Shi Yonghong Tian Yaowei Wang Tiejun Huang 26 63 0 16 Nov 2016
DeepSense: A Unified Deep Learning Framework for Time-Series Mobile Sensing Data Processing Shuochao Yao Shaohan Hu Yiran Zhao Aston Zhang Tarek F. Abdelzaher HAI AI4TS 22 616 0 07 Nov 2016
Boosting Image Captioning with Attributes Ting Yao Yingwei Pan Yehao Li Zhaofan Qiu Tao Mei VLM 36 620 0 05 Nov 2016
Clinical Text Prediction with Numerically Grounded Conditional Language Models Georgios P. Spithourakis S. Petersen Sebastian Riedel 22 7 0 20 Oct 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering Youngjae Yu Hyungjin Ko Jongwook Choi Gunhee Kim 14 229 0 10 Oct 2016
Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition Zecheng Xie Zenghui Sun Lianwen Jin Hao Ni Terry Lyons 35 122 0 09 Oct 2016
A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection Nian Liu Junwei Han 27 189 0 06 Oct 2016
Prediction of Manipulation Actions Cornelia Fermuller Fang Wang Yezhou Yang Konstantinos Zampogiannis Yi Zhang Francisco Barranco Michael Pfeiffer 16 51 0 03 Oct 2016
A Survey of Multi-View Representation Learning Yingming Li Ming Yang Zhongfei Zhang AI4TS 3DV 30 509 0 03 Oct 2016
Learning Language-Visual Embedding for Movie Understanding with Natural-Language Atousa Torabi Niket Tandon Leonid Sigal 14 97 0 26 Sep 2016
Sequential Deep Trajectory Descriptor for Action Recognition with Three-stream CNN Yemin Shi Yonghong Tian Yaowei Wang Tiejun Huang 11 191 0 10 Sep 2016
DAiSEE: Towards User Engagement Recognition in the Wild Abhay Gupta Arjun DĆunha Kamal N. Awasthi V. Balasubramanian 13 141 0 07 Sep 2016
Making a Case for Learning Motion Representations with Phase S. Pintea J. C. V. Gemert 26 11 0 06 Sep 2016
Autonomous driving challenge: To Infer the property of a dynamic object based on its motion pattern using recurrent neural network Mona Fathollahi R. Kasturi 16 15 0 01 Sep 2016
Efficient Two-Stream Motion and Appearance 3D CNNs for Video Classification Ali Diba A. Pazandeh Luc Van Gool 24 75 0 31 Aug 2016
What makes ImageNet good for transfer learning? Minyoung Huh Pulkit Agrawal Alexei A. Efros OOD SSeg VLM SSL 36 670 0 30 Aug 2016
Learning to generalize to new compositions in image understanding Y. Atzmon Jonathan Berant Vahid Kezami Amir Globerson Gal Chechik 18 67 0 27 Aug 2016
Sympathy for the Details: Dense Trajectories and Hybrid Classification Architectures for Action Recognition César Roberto de Souza Adrien Gaidon E. Vig A. Peña 19 44 0 25 Aug 2016
Title Generation for User Generated Videos Kuo-Hao Zeng Tseng-Hung Chen Juan Carlos Niebles Min Sun 32 69 0 25 Aug 2016
Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks Pichao Wang W. Li Song Liu Yuyao Zhang Zhimin Gao P. Ogunbona SLR 37 50 0 22 Aug 2016
STFCN: Spatio-Temporal FCN for Semantic Video Segmentation Mohsen Fayyaz M. H. Saffar Mohammad Sabokrou M. Fathy Reinhard Klette Fay Huang 11 56 0 21 Aug 2016
phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning Y. Tan Chee Seng Chan VLM 22 29 0 20 Aug 2016
Leveraging Structural Context Models and Ranking Score Fusion for Human Interaction Prediction Qiuhong Ke Bennamoun Senjian An F. Boussaïd Ferdous Sohel 14 38 0 18 Aug 2016
Seeing with Humans: Gaze-Assisted Neural Image Captioning Yusuke Sugano Andreas Bulling 21 68 0 18 Aug 2016
Clockwork Convnets for Video Semantic Segmentation Evan Shelhamer Kate Rakelly Judy Hoffman Trevor Darrell 29 199 0 11 Aug 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task Ashkan Mokarian Mateusz Malinowski Mario Fritz 27 5 0 09 Aug 2016
Discriminatively Trained Latent Ordinal Model for Video Classification Karan Sikka Gaurav Sharma 19 11 0 08 Aug 2016
Modeling Context in Referring Expressions Licheng Yu Patrick Poirson Shan Yang Alexander C. Berg Tamara L. Berg 28 1,227 0 31 Jul 2016
SPICE: Semantic Propositional Image Caption Evaluation Peter Anderson Basura Fernando Mark Johnson Stephen Gould EGVM 34 1,884 0 29 Jul 2016
Connectionist Temporal Modeling for Weakly Supervised Action Labeling De-An Huang Li Fei-Fei Juan Carlos Niebles 16 237 0 28 Jul 2016
An Actor-Critic Algorithm for Sequence Prediction Dzmitry Bahdanau Philemon Brakel Kelvin Xu Anirudh Goyal Ryan J. Lowe Joelle Pineau Aaron Courville Yoshua Bengio 31 635 0 24 Jul 2016
Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition Jun Liu Amir Shahroudy Dong Xu Gang Wang 35 1,099 0 24 Jul 2016
Hierarchical Attention Network for Action Recognition in Videos Yilin Wang Suhang Wang Jiliang Tang Neil O'Hare Yi-Ju Chang Baoxin Li BDL 22 82 0 21 Jul 2016
Hierarchical Deep Temporal Models for Group Activity Recognition Mostafa S. Ibrahim S. Muralidharan Zhiwei Deng Arash Vahdat Greg Mori 77 445 0 09 Jul 2016
VideoLSTM Convolves, Attends and Flows for Action Recognition Zhenyang Li E. Gavves Mihir Jain Cees G. M. Snoek 33 463 0 06 Jul 2016
Captioning Images with Diverse Objects Subhashini Venugopalan Lisa Anne Hendricks Marcus Rohrbach Raymond J. Mooney Trevor Darrell Kate Saenko VLM 22 178 0 24 Jun 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions Arijit Ray Gordon A. Christie Joey Tianyi Zhou Dhruv Batra Devi Parikh 19 56 0 21 Jun 2016
Convolutional Residual Memory Networks Joel Ruben Antony Moniz C. Pal 23 23 0 16 Jun 2016
Bidirectional Long-Short Term Memory for Video Description Yi Bin Yang Yang Zi Huang Fumin Shen Xing Xu Heng Tao Shen 36 60 0 15 Jun 2016
Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach Lin Wu Chunhua Shen Anton Van Den Hengel 29 66 0 06 Jun 2016
Recurrent Fully Convolutional Networks for Video Segmentation Sepehr Valipour Mennatullah Siam Martin Jägersand Nilanjan Ray VOS 19 89 0 01 Jun 2016