ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4389
  4. Cited By
Long-term Recurrent Convolutional Networks for Visual Recognition and
  Description

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
    VLM
ArXivPDFHTML

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

42 / 642 papers shown
Title
Guiding Long-Short Term Memory for Image Caption Generation
Guiding Long-Short Term Memory for Image Caption Generation
Xu Jia
E. Gavves
Basura Fernando
Tinne Tuytelaars
VLM
22
101
0
16 Sep 2015
Learning Contextual Dependencies with Convolutional Hierarchical
  Recurrent Neural Networks
Learning Contextual Dependencies with Convolutional Hierarchical Recurrent Neural Networks
Zhen Zuo
Bing Shuai
G. Wang
Xiao Liu
Xingxing Wang
B. Wang
11
93
0
13 Sep 2015
Recurrent Network Models for Human Dynamics
Recurrent Network Models for Human Dynamics
Katerina Fragkiadaki
Sergey Levine
Panna Felsen
Jitendra Malik
28
30
0
02 Aug 2015
Every Moment Counts: Dense Detailed Labeling of Actions in Complex
  Videos
Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos
Serena Yeung
Olga Russakovsky
Ning Jin
Mykhaylo Andriluka
Greg Mori
Li Fei-Fei
VLM
36
436
0
21 Jul 2015
Describing Multimedia Content using Attention-based Encoder--Decoder
  Networks
Describing Multimedia Content using Attention-based Encoder--Decoder Networks
Kyunghyun Cho
Aaron Courville
Yoshua Bengio
32
411
0
04 Jul 2015
Aligning where to see and what to tell: image caption with region-based
  attention and scene factorization
Aligning where to see and what to tell: image caption with region-based attention and scene factorization
Junqi Jin
Kun Fu
Runpeng Cui
Fei Sha
Changshui Zhang
26
117
0
20 Jun 2015
Reading Scene Text in Deep Convolutional Sequences
Reading Scene Text in Deep Convolutional Sequences
Pan He
Weilin Huang
Yu Qiao
Chen Change Loy
Xiaoou Tang
21
307
0
14 Jun 2015
Convolutional LSTM Network: A Machine Learning Approach for
  Precipitation Nowcasting
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
233
7,904
0
13 Jun 2015
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to
  Action Sequences
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences
Hongyuan Mei
Joey Tianyi Zhou
Matthew R. Walter
LM&Ro
21
242
0
12 Jun 2015
Learning language through pictures
Learning language through pictures
Grzegorz Chrupała
Ákos Kádár
A. Alishahi
VLM
SSL
29
65
0
11 Jun 2015
P-CNN: Pose-based CNN Features for Action Recognition
P-CNN: Pose-based CNN Features for Action Recognition
Guilhem Chéron
Ivan Laptev
Cordelia Schmid
20
607
0
11 Jun 2015
Pointer Networks
Pointer Networks
Oriol Vinyals
Meire Fortunato
Navdeep Jaitly
48
3,015
0
09 Jun 2015
Scheduled Sampling for Sequence Prediction with Recurrent Neural
  Networks
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Samy Bengio
Oriol Vinyals
Navdeep Jaitly
Noam M. Shazeer
51
2,017
0
09 Jun 2015
Visualizing and Understanding Recurrent Networks
Visualizing and Understanding Recurrent Networks
A. Karpathy
Justin Johnson
Li Fei-Fei
HAI
21
1,096
0
05 Jun 2015
Learning to track for spatio-temporal action localization
Learning to track for spatio-temporal action localization
Philippe Weinzaepfel
Zaïd Harchaoui
Cordelia Schmid
34
338
0
05 Jun 2015
Beyond Temporal Pooling: Recurrence and Temporal Convolutions for
  Gesture Recognition in Video
Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video
Lionel Pigou
Aaron van den Oord
Sander Dieleman
Mieke Van Herreweghe
J. Dambre
25
254
0
05 Jun 2015
Learning to Answer Questions From Image Using Convolutional Neural
  Network
Learning to Answer Questions From Image Using Convolutional Neural Network
Lin Ma
Zhengdong Lu
Hang Li
27
261
0
01 Jun 2015
Visual Madlibs: Fill in the blank Image Generation and Question
  Answering
Visual Madlibs: Fill in the blank Image Generation and Question Answering
Licheng Yu
Eunbyung Park
Alexander C. Berg
Tamara L. Berg
VLM
MLLM
32
97
0
31 May 2015
Weakly-Supervised Alignment of Video With Text
Weakly-Supervised Alignment of Video With Text
Piotr Bojanowski
Rémi Lajugie
Edouard Grave
Francis R. Bach
Ivan Laptev
Jean Ponce
Cordelia Schmid
41
134
0
22 May 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image
  Question Answering
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
Wenyuan Xu
32
496
0
21 May 2015
Jointly Modeling Embedding and Translation to Bridge Video and Language
Jointly Modeling Embedding and Translation to Bridge Video and Language
Yingwei Pan
Tao Mei
Ting Yao
Houqiang Li
Y. Rui
41
535
0
07 May 2015
Language Models for Image Captioning: The Quirks and What Works
Language Models for Image Captioning: The Quirks and What Works
Jacob Devlin
Hao Cheng
Hao Fang
Saurabh Gupta
Li Deng
Xiaodong He
Geoffrey Zweig
Margaret Mitchell
32
281
0
07 May 2015
Ask Your Neurons: A Neural-based Approach to Answering Questions about
  Images
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
35
595
0
05 May 2015
VQA: Visual Question Answering
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
64
5,365
0
03 May 2015
Dense Optical Flow Prediction from a Static Image
Dense Optical Flow Prediction from a Static Image
Jacob Walker
Abhinav Gupta
M. Hebert
34
210
0
02 May 2015
Anticipating Visual Representations from Unlabeled Video
Anticipating Visual Representations from Unlabeled Video
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
24
145
0
29 Apr 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence
  Descriptions of Images
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images
Junhua Mao
Xu Wei
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
25
154
0
25 Apr 2015
Differential Recurrent Neural Networks for Action Recognition
Differential Recurrent Neural Networks for Action Recognition
Vivek Veeriah
Naifan Zhuang
Guo-Jun Qi
30
462
0
25 Apr 2015
Multimodal Convolutional Neural Networks for Matching Image and Sentence
Multimodal Convolutional Neural Networks for Matching Image and Sentence
Lin Ma
Zhengdong Lu
Lifeng Shang
Hang Li
38
337
0
23 Apr 2015
Temporal Localization of Fine-Grained Actions in Videos by Domain
  Transfer from Web Images
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images
Chen Sun
Sanketh Shetty
Rahul Sukthankar
Ram Nevatia
18
136
0
04 Apr 2015
Microsoft COCO Captions: Data Collection and Evaluation Server
Microsoft COCO Captions: Data Collection and Evaluation Server
Xinlei Chen
Hao Fang
Nayeon Lee
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollar
C. L. Zitnick
62
2,428
0
01 Apr 2015
Fully Connected Deep Structured Networks
Fully Connected Deep Structured Networks
A. Schwing
R. Urtasun
SSeg
57
308
0
09 Mar 2015
Using Descriptive Video Services to Create a Large Data Source for Video
  Annotation Research
Using Descriptive Video Services to Create a Large Data Source for Video Annotation Research
Atousa Torabi
C. Pal
Hugo Larochelle
Aaron Courville
VGen
31
204
0
03 Mar 2015
Image Specificity
Image Specificity
M. Jas
Devi Parikh
24
40
0
16 Feb 2015
Phrase-based Image Captioning
Phrase-based Image Captioning
R. Lebret
Pedro H. O. Pinheiro
R. Collobert
VLM
28
120
0
12 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
100
10,006
0
10 Feb 2015
A Dataset for Movie Description
A Dataset for Movie Description
Anna Rohrbach
Marcus Rohrbach
Niket Tandon
Bernt Schiele
VGen
31
498
0
12 Jan 2015
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
Junhua Mao
Wenyuan Xu
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
VLM
62
1,235
0
20 Dec 2014
Translating Videos to Natural Language Using Deep Recurrent Neural
  Networks
Translating Videos to Natural Language Using Deep Recurrent Neural Networks
Subhashini Venugopalan
Huijuan Xu
Jeff Donahue
Marcus Rohrbach
Raymond J. Mooney
Kate Saenko
47
950
0
15 Dec 2014
Deep Visual-Semantic Alignments for Generating Image Descriptions
Deep Visual-Semantic Alignments for Generating Image Descriptions
A. Karpathy
Li Fei-Fei
21
5,556
0
07 Dec 2014
CIDEr: Consensus-based Image Description Evaluation
CIDEr: Consensus-based Image Description Evaluation
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
55
4,401
0
20 Nov 2014
Transfer Learning for Video Recognition with Scarce Training Data for
  Deep Convolutional Neural Network
Transfer Learning for Video Recognition with Scarce Training Data for Deep Convolutional Neural Network
Yu-Chuan Su
Tzu-Hsuan Chiu
Chun-Yen Yeh
Hsinfu Huang
Winston H. Hsu
22
27
0
15 Sep 2014
Previous
123...111213