Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4389
Cited By
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Long-term Recurrent Convolutional Networks for Visual Recognition and Description"
50 / 642 papers shown
Title
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos
Amlan Kar
Nishant Rai
Karan Sikka
Gaurav Sharma
24
152
0
24 Nov 2016
Image-based localization using LSTMs for structured feature correlation
F. Walch
C. Hazirbas
Laura Leal-Taixé
Torsten Sattler
S. Hilsenbeck
Daniel Cremers
27
494
0
23 Nov 2016
Adaptive Feature Abstraction for Translating Video to Text
Yunchen Pu
Martin Renqiang Min
Zhe Gan
Lawrence Carin
39
14
0
23 Nov 2016
Dense Captioning with Joint Inference and Visual Context
L. Yang
K. Tang
Jianchao Yang
Li-Jia Li
VLM
21
169
0
21 Nov 2016
Deep Temporal Linear Encoding Networks
Ali Diba
Vivek Sharma
Luc Van Gool
19
227
0
21 Nov 2016
A Hierarchical Approach for Generating Descriptive Image Paragraphs
J. Krause
Justin Johnson
Ranjay Krishna
Li Fei-Fei
VLM
25
373
0
20 Nov 2016
Fast Video Classification via Adaptive Cascading of Deep Models
Haichen Shen
Seungyeop Han
Matthai Philipose
Arvind Krishnamurthy
26
78
0
20 Nov 2016
Instance-aware Image and Sentence Matching with Selective Multimodal LSTM
Yan Huang
Wei Wang
Liang Wang
26
222
0
17 Nov 2016
Convolutional Gated Recurrent Networks for Video Segmentation
Mennatullah Siam
Sepehr Valipour
Martin Jägersand
Nilanjan Ray
VOS
19
98
0
16 Nov 2016
Learning long-term dependencies for action recognition with a biologically-inspired deep network
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
26
63
0
16 Nov 2016
DeepSense: A Unified Deep Learning Framework for Time-Series Mobile Sensing Data Processing
Shuochao Yao
Shaohan Hu
Yiran Zhao
Aston Zhang
Tarek F. Abdelzaher
HAI
AI4TS
22
616
0
07 Nov 2016
Boosting Image Captioning with Attributes
Ting Yao
Yingwei Pan
Yehao Li
Zhaofan Qiu
Tao Mei
VLM
36
620
0
05 Nov 2016
Clinical Text Prediction with Numerically Grounded Conditional Language Models
Georgios P. Spithourakis
S. Petersen
Sebastian Riedel
22
7
0
20 Oct 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering
Youngjae Yu
Hyungjin Ko
Jongwook Choi
Gunhee Kim
14
229
0
10 Oct 2016
Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition
Zecheng Xie
Zenghui Sun
Lianwen Jin
Hao Ni
Terry Lyons
35
122
0
09 Oct 2016
A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection
Nian Liu
Junwei Han
27
189
0
06 Oct 2016
Prediction of Manipulation Actions
Cornelia Fermuller
Fang Wang
Yezhou Yang
Konstantinos Zampogiannis
Yi Zhang
Francisco Barranco
Michael Pfeiffer
16
51
0
03 Oct 2016
A Survey of Multi-View Representation Learning
Yingming Li
Ming Yang
Zhongfei Zhang
AI4TS
3DV
30
509
0
03 Oct 2016
Learning Language-Visual Embedding for Movie Understanding with Natural-Language
Atousa Torabi
Niket Tandon
Leonid Sigal
14
97
0
26 Sep 2016
Sequential Deep Trajectory Descriptor for Action Recognition with Three-stream CNN
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
11
191
0
10 Sep 2016
DAiSEE: Towards User Engagement Recognition in the Wild
Abhay Gupta
Arjun DĆunha
Kamal N. Awasthi
V. Balasubramanian
13
141
0
07 Sep 2016
Making a Case for Learning Motion Representations with Phase
S. Pintea
J. C. V. Gemert
26
11
0
06 Sep 2016
Autonomous driving challenge: To Infer the property of a dynamic object based on its motion pattern using recurrent neural network
Mona Fathollahi
R. Kasturi
16
15
0
01 Sep 2016
Efficient Two-Stream Motion and Appearance 3D CNNs for Video Classification
Ali Diba
A. Pazandeh
Luc Van Gool
24
75
0
31 Aug 2016
What makes ImageNet good for transfer learning?
Minyoung Huh
Pulkit Agrawal
Alexei A. Efros
OOD
SSeg
VLM
SSL
36
670
0
30 Aug 2016
Learning to generalize to new compositions in image understanding
Y. Atzmon
Jonathan Berant
Vahid Kezami
Amir Globerson
Gal Chechik
18
67
0
27 Aug 2016
Sympathy for the Details: Dense Trajectories and Hybrid Classification Architectures for Action Recognition
César Roberto de Souza
Adrien Gaidon
E. Vig
A. Peña
19
44
0
25 Aug 2016
Title Generation for User Generated Videos
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
32
69
0
25 Aug 2016
Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks
Pichao Wang
W. Li
Song Liu
Yuyao Zhang
Zhimin Gao
P. Ogunbona
SLR
37
50
0
22 Aug 2016
STFCN: Spatio-Temporal FCN for Semantic Video Segmentation
Mohsen Fayyaz
M. H. Saffar
Mohammad Sabokrou
M. Fathy
Reinhard Klette
Fay Huang
11
56
0
21 Aug 2016
phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning
Y. Tan
Chee Seng Chan
VLM
22
29
0
20 Aug 2016
Leveraging Structural Context Models and Ranking Score Fusion for Human Interaction Prediction
Qiuhong Ke
Bennamoun
Senjian An
F. Boussaïd
Ferdous Sohel
14
38
0
18 Aug 2016
Seeing with Humans: Gaze-Assisted Neural Image Captioning
Yusuke Sugano
Andreas Bulling
21
68
0
18 Aug 2016
Clockwork Convnets for Video Semantic Segmentation
Evan Shelhamer
Kate Rakelly
Judy Hoffman
Trevor Darrell
29
199
0
11 Aug 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task
Ashkan Mokarian
Mateusz Malinowski
Mario Fritz
27
5
0
09 Aug 2016
Discriminatively Trained Latent Ordinal Model for Video Classification
Karan Sikka
Gaurav Sharma
19
11
0
08 Aug 2016
Modeling Context in Referring Expressions
Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
28
1,227
0
31 Jul 2016
SPICE: Semantic Propositional Image Caption Evaluation
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
34
1,884
0
29 Jul 2016
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
De-An Huang
Li Fei-Fei
Juan Carlos Niebles
16
237
0
28 Jul 2016
An Actor-Critic Algorithm for Sequence Prediction
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
31
635
0
24 Jul 2016
Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition
Jun Liu
Amir Shahroudy
Dong Xu
Gang Wang
35
1,099
0
24 Jul 2016
Hierarchical Attention Network for Action Recognition in Videos
Yilin Wang
Suhang Wang
Jiliang Tang
Neil O'Hare
Yi-Ju Chang
Baoxin Li
BDL
22
82
0
21 Jul 2016
Hierarchical Deep Temporal Models for Group Activity Recognition
Mostafa S. Ibrahim
S. Muralidharan
Zhiwei Deng
Arash Vahdat
Greg Mori
77
445
0
09 Jul 2016
VideoLSTM Convolves, Attends and Flows for Action Recognition
Zhenyang Li
E. Gavves
Mihir Jain
Cees G. M. Snoek
33
463
0
06 Jul 2016
Captioning Images with Diverse Objects
Subhashini Venugopalan
Lisa Anne Hendricks
Marcus Rohrbach
Raymond J. Mooney
Trevor Darrell
Kate Saenko
VLM
22
178
0
24 Jun 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
Arijit Ray
Gordon A. Christie
Joey Tianyi Zhou
Dhruv Batra
Devi Parikh
19
56
0
21 Jun 2016
Convolutional Residual Memory Networks
Joel Ruben Antony Moniz
C. Pal
23
23
0
16 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
36
60
0
15 Jun 2016
Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach
Lin Wu
Chunhua Shen
Anton Van Den Hengel
29
66
0
06 Jun 2016
Recurrent Fully Convolutional Networks for Video Segmentation
Sepehr Valipour
Mennatullah Siam
Martin Jägersand
Nilanjan Ray
VOS
19
89
0
01 Jun 2016
Previous
1
2
3
...
10
11
12
13
Next