ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4389
  4. Cited By
Long-term Recurrent Convolutional Networks for Visual Recognition and
  Description

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
    VLM
ArXivPDFHTML

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 642 papers shown
Title
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for
  Human Action Recognition in Videos
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos
Amlan Kar
Nishant Rai
Karan Sikka
Gaurav Sharma
24
152
0
24 Nov 2016
Image-based localization using LSTMs for structured feature correlation
Image-based localization using LSTMs for structured feature correlation
F. Walch
C. Hazirbas
Laura Leal-Taixé
Torsten Sattler
S. Hilsenbeck
Daniel Cremers
27
494
0
23 Nov 2016
Adaptive Feature Abstraction for Translating Video to Text
Adaptive Feature Abstraction for Translating Video to Text
Yunchen Pu
Martin Renqiang Min
Zhe Gan
Lawrence Carin
39
14
0
23 Nov 2016
Dense Captioning with Joint Inference and Visual Context
Dense Captioning with Joint Inference and Visual Context
L. Yang
K. Tang
Jianchao Yang
Li-Jia Li
VLM
21
169
0
21 Nov 2016
Deep Temporal Linear Encoding Networks
Deep Temporal Linear Encoding Networks
Ali Diba
Vivek Sharma
Luc Van Gool
19
227
0
21 Nov 2016
A Hierarchical Approach for Generating Descriptive Image Paragraphs
A Hierarchical Approach for Generating Descriptive Image Paragraphs
J. Krause
Justin Johnson
Ranjay Krishna
Li Fei-Fei
VLM
25
373
0
20 Nov 2016
Fast Video Classification via Adaptive Cascading of Deep Models
Fast Video Classification via Adaptive Cascading of Deep Models
Haichen Shen
Seungyeop Han
Matthai Philipose
Arvind Krishnamurthy
26
78
0
20 Nov 2016
Instance-aware Image and Sentence Matching with Selective Multimodal
  LSTM
Instance-aware Image and Sentence Matching with Selective Multimodal LSTM
Yan Huang
Wei Wang
Liang Wang
26
222
0
17 Nov 2016
Convolutional Gated Recurrent Networks for Video Segmentation
Convolutional Gated Recurrent Networks for Video Segmentation
Mennatullah Siam
Sepehr Valipour
Martin Jägersand
Nilanjan Ray
VOS
19
98
0
16 Nov 2016
Learning long-term dependencies for action recognition with a
  biologically-inspired deep network
Learning long-term dependencies for action recognition with a biologically-inspired deep network
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
26
63
0
16 Nov 2016
DeepSense: A Unified Deep Learning Framework for Time-Series Mobile
  Sensing Data Processing
DeepSense: A Unified Deep Learning Framework for Time-Series Mobile Sensing Data Processing
Shuochao Yao
Shaohan Hu
Yiran Zhao
Aston Zhang
Tarek F. Abdelzaher
HAI
AI4TS
22
616
0
07 Nov 2016
Boosting Image Captioning with Attributes
Boosting Image Captioning with Attributes
Ting Yao
Yingwei Pan
Yehao Li
Zhaofan Qiu
Tao Mei
VLM
36
620
0
05 Nov 2016
Clinical Text Prediction with Numerically Grounded Conditional Language
  Models
Clinical Text Prediction with Numerically Grounded Conditional Language Models
Georgios P. Spithourakis
S. Petersen
Sebastian Riedel
22
7
0
20 Oct 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and
  Question Answering
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering
Youngjae Yu
Hyungjin Ko
Jongwook Choi
Gunhee Kim
14
229
0
10 Oct 2016
Learning Spatial-Semantic Context with Fully Convolutional Recurrent
  Network for Online Handwritten Chinese Text Recognition
Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition
Zecheng Xie
Zenghui Sun
Lianwen Jin
Hao Ni
Terry Lyons
35
122
0
09 Oct 2016
A Deep Spatial Contextual Long-term Recurrent Convolutional Network for
  Saliency Detection
A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection
Nian Liu
Junwei Han
27
189
0
06 Oct 2016
Prediction of Manipulation Actions
Prediction of Manipulation Actions
Cornelia Fermuller
Fang Wang
Yezhou Yang
Konstantinos Zampogiannis
Yi Zhang
Francisco Barranco
Michael Pfeiffer
16
51
0
03 Oct 2016
A Survey of Multi-View Representation Learning
A Survey of Multi-View Representation Learning
Yingming Li
Ming Yang
Zhongfei Zhang
AI4TS
3DV
30
509
0
03 Oct 2016
Learning Language-Visual Embedding for Movie Understanding with
  Natural-Language
Learning Language-Visual Embedding for Movie Understanding with Natural-Language
Atousa Torabi
Niket Tandon
Leonid Sigal
14
97
0
26 Sep 2016
Sequential Deep Trajectory Descriptor for Action Recognition with
  Three-stream CNN
Sequential Deep Trajectory Descriptor for Action Recognition with Three-stream CNN
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
11
191
0
10 Sep 2016
DAiSEE: Towards User Engagement Recognition in the Wild
DAiSEE: Towards User Engagement Recognition in the Wild
Abhay Gupta
Arjun DĆunha
Kamal N. Awasthi
V. Balasubramanian
13
141
0
07 Sep 2016
Making a Case for Learning Motion Representations with Phase
Making a Case for Learning Motion Representations with Phase
S. Pintea
J. C. V. Gemert
26
11
0
06 Sep 2016
Autonomous driving challenge: To Infer the property of a dynamic object
  based on its motion pattern using recurrent neural network
Autonomous driving challenge: To Infer the property of a dynamic object based on its motion pattern using recurrent neural network
Mona Fathollahi
R. Kasturi
16
15
0
01 Sep 2016
Efficient Two-Stream Motion and Appearance 3D CNNs for Video
  Classification
Efficient Two-Stream Motion and Appearance 3D CNNs for Video Classification
Ali Diba
A. Pazandeh
Luc Van Gool
24
75
0
31 Aug 2016
What makes ImageNet good for transfer learning?
What makes ImageNet good for transfer learning?
Minyoung Huh
Pulkit Agrawal
Alexei A. Efros
OOD
SSeg
VLM
SSL
36
670
0
30 Aug 2016
Learning to generalize to new compositions in image understanding
Learning to generalize to new compositions in image understanding
Y. Atzmon
Jonathan Berant
Vahid Kezami
Amir Globerson
Gal Chechik
18
67
0
27 Aug 2016
Sympathy for the Details: Dense Trajectories and Hybrid Classification
  Architectures for Action Recognition
Sympathy for the Details: Dense Trajectories and Hybrid Classification Architectures for Action Recognition
César Roberto de Souza
Adrien Gaidon
E. Vig
A. Peña
19
44
0
25 Aug 2016
Title Generation for User Generated Videos
Title Generation for User Generated Videos
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
32
69
0
25 Aug 2016
Large-scale Continuous Gesture Recognition Using Convolutional Neural
  Networks
Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks
Pichao Wang
W. Li
Song Liu
Yuyao Zhang
Zhimin Gao
P. Ogunbona
SLR
37
50
0
22 Aug 2016
STFCN: Spatio-Temporal FCN for Semantic Video Segmentation
STFCN: Spatio-Temporal FCN for Semantic Video Segmentation
Mohsen Fayyaz
M. H. Saffar
Mohammad Sabokrou
M. Fathy
Reinhard Klette
Fay Huang
11
56
0
21 Aug 2016
phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning
phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning
Y. Tan
Chee Seng Chan
VLM
22
29
0
20 Aug 2016
Leveraging Structural Context Models and Ranking Score Fusion for Human
  Interaction Prediction
Leveraging Structural Context Models and Ranking Score Fusion for Human Interaction Prediction
Qiuhong Ke
Bennamoun
Senjian An
F. Boussaïd
Ferdous Sohel
14
38
0
18 Aug 2016
Seeing with Humans: Gaze-Assisted Neural Image Captioning
Seeing with Humans: Gaze-Assisted Neural Image Captioning
Yusuke Sugano
Andreas Bulling
21
68
0
18 Aug 2016
Clockwork Convnets for Video Semantic Segmentation
Clockwork Convnets for Video Semantic Segmentation
Evan Shelhamer
Kate Rakelly
Judy Hoffman
Trevor Darrell
29
199
0
11 Aug 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for
  the Visual Madlibs Task
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task
Ashkan Mokarian
Mateusz Malinowski
Mario Fritz
27
5
0
09 Aug 2016
Discriminatively Trained Latent Ordinal Model for Video Classification
Discriminatively Trained Latent Ordinal Model for Video Classification
Karan Sikka
Gaurav Sharma
19
11
0
08 Aug 2016
Modeling Context in Referring Expressions
Modeling Context in Referring Expressions
Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
28
1,227
0
31 Jul 2016
SPICE: Semantic Propositional Image Caption Evaluation
SPICE: Semantic Propositional Image Caption Evaluation
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
34
1,884
0
29 Jul 2016
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
De-An Huang
Li Fei-Fei
Juan Carlos Niebles
16
237
0
28 Jul 2016
An Actor-Critic Algorithm for Sequence Prediction
An Actor-Critic Algorithm for Sequence Prediction
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
31
635
0
24 Jul 2016
Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition
Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition
Jun Liu
Amir Shahroudy
Dong Xu
Gang Wang
35
1,099
0
24 Jul 2016
Hierarchical Attention Network for Action Recognition in Videos
Hierarchical Attention Network for Action Recognition in Videos
Yilin Wang
Suhang Wang
Jiliang Tang
Neil O'Hare
Yi-Ju Chang
Baoxin Li
BDL
22
82
0
21 Jul 2016
Hierarchical Deep Temporal Models for Group Activity Recognition
Hierarchical Deep Temporal Models for Group Activity Recognition
Mostafa S. Ibrahim
S. Muralidharan
Zhiwei Deng
Arash Vahdat
Greg Mori
77
445
0
09 Jul 2016
VideoLSTM Convolves, Attends and Flows for Action Recognition
VideoLSTM Convolves, Attends and Flows for Action Recognition
Zhenyang Li
E. Gavves
Mihir Jain
Cees G. M. Snoek
33
463
0
06 Jul 2016
Captioning Images with Diverse Objects
Captioning Images with Diverse Objects
Subhashini Venugopalan
Lisa Anne Hendricks
Marcus Rohrbach
Raymond J. Mooney
Trevor Darrell
Kate Saenko
VLM
22
178
0
24 Jun 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise
  Questions
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
Arijit Ray
Gordon A. Christie
Joey Tianyi Zhou
Dhruv Batra
Devi Parikh
19
56
0
21 Jun 2016
Convolutional Residual Memory Networks
Convolutional Residual Memory Networks
Joel Ruben Antony Moniz
C. Pal
23
23
0
16 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
36
60
0
15 Jun 2016
Deep Recurrent Convolutional Networks for Video-based Person
  Re-identification: An End-to-End Approach
Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach
Lin Wu
Chunhua Shen
Anton Van Den Hengel
29
66
0
06 Jun 2016
Recurrent Fully Convolutional Networks for Video Segmentation
Recurrent Fully Convolutional Networks for Video Segmentation
Sepehr Valipour
Mennatullah Siam
Martin Jägersand
Nilanjan Ray
VOS
19
89
0
01 Jun 2016
Previous
123...10111213
Next