ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4389
  4. Cited By
Long-term Recurrent Convolutional Networks for Visual Recognition and
  Description

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
    VLM
ArXivPDFHTML

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 642 papers shown
Title
Dense-Captioning Events in Videos
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
65
1,214
0
02 May 2017
Query-adaptive Video Summarization via Quality-aware Relevance
  Estimation
Query-adaptive Video Summarization via Quality-aware Relevance Estimation
A. Vasudevan
Michael Gygli
Anna Volokitin
Luc Van Gool
30
93
0
01 May 2017
Inception Recurrent Convolutional Neural Network for Object Recognition
Inception Recurrent Convolutional Neural Network for Object Recognition
Md. Zahangir Alom
Mahmudul Hasan
C. Yakopcic
T. Taha
39
86
0
25 Apr 2017
Second-order Temporal Pooling for Action Recognition
Second-order Temporal Pooling for Action Recognition
A. Cherian
Stephen Gould
EgoV
11
29
0
23 Apr 2017
Reformulating Level Sets as Deep Recurrent Neural Network Approach to
  Semantic Segmentation
Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation
Ngan Le
Kha Gia Quach
Khoa Luu
Marios Savvides
Chenchen Zhu
19
71
0
12 Apr 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Liwei Wang
Yin Li
Jing-ling Huang
Svetlana Lazebnik
VLM
27
494
0
11 Apr 2017
TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for
  Activity Recognition
TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition
Chih-Yao Ma
Min-Hung Chen
Z. Kira
G. Al-Regib
AI4TS
32
241
0
30 Mar 2017
Towards a Visual Privacy Advisor: Understanding and Predicting Privacy
  Risks in Images
Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images
Rakshith Shetty
Bernt Schiele
Mario Fritz
35
223
0
30 Mar 2017
Survey of the State of the Art in Natural Language Generation: Core
  tasks, applications and evaluation
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation
Albert Gatt
E. Krahmer
LM&MA
ELM
27
810
0
29 Mar 2017
Learning and Refining of Privileged Information-based RNNs for Action
  Recognition from Depth Sequences
Learning and Refining of Privileged Information-based RNNs for Action Recognition from Depth Sequences
Zhiyuan Shi
Tae-Kyun Kim
14
80
0
28 Mar 2017
Where to put the Image in an Image Caption Generator
Where to put the Image in an Image Caption Generator
Marc Tanti
Albert Gatt
K. Camilleri
47
96
0
27 Mar 2017
Visually grounded learning of keyword prediction from untranscribed
  speech
Visually grounded learning of keyword prediction from untranscribed speech
Herman Kamper
Shane Settle
Gregory Shakhnarovich
Karen Livescu
19
63
0
23 Mar 2017
Weakly Supervised Action Learning with RNN based Fine-to-coarse Modeling
Weakly Supervised Action Learning with RNN based Fine-to-coarse Modeling
Alexander Richard
Hilde Kuehne
Juergen Gall
23
195
0
23 Mar 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
Recurrent Multimodal Interaction for Referring Image Segmentation
Chenxi Liu
Zhe-nan Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Alan Yuille
EgoV
36
234
0
23 Mar 2017
An End-to-End Approach to Natural Language Object Retrieval via
  Context-Aware Deep Reinforcement Learning
An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning
Fan Wu
Zhongwen Xu
Yi Yang
ObjD
31
11
0
22 Mar 2017
Encouraging LSTMs to Anticipate Actions Very Early
Encouraging LSTMs to Anticipate Actions Very Early
Mohammad Sadegh Ali Akbarian
F. Saleh
Mathieu Salzmann
Basura Fernando
L. Petersson
Lars Andersson
34
169
0
21 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement
  Learning
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
31
423
0
20 Mar 2017
Multilevel Context Representation for Improving Object Recognition
Multilevel Context Representation for Improving Object Recognition
Andreas Kölsch
Muhammad Zeshan Afzal
Marcus Liwicki
27
3
0
19 Mar 2017
Recurrent Models for Situation Recognition
Recurrent Models for Situation Recognition
Arun Mallya
Svetlana Lazebnik
20
30
0
18 Mar 2017
UntrimmedNets for Weakly Supervised Action Recognition and Detection
UntrimmedNets for Weakly Supervised Action Recognition and Detection
Limin Wang
Yuanjun Xiong
Dahua Lin
Luc Van Gool
30
490
0
09 Mar 2017
A Pursuit of Temporal Accuracy in General Activity Detection
A Pursuit of Temporal Accuracy in General Activity Detection
Yuanjun Xiong
Yue Zhao
Limin Wang
Dahua Lin
Xiaoou Tang
14
132
0
08 Mar 2017
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action
  Localization in Untrimmed Videos
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos
Zheng Shou
Jonathan Chan
Alireza Zareian
K. Miyazawa
Shih-Fu Chang
20
560
0
04 Mar 2017
The Statistical Recurrent Unit
The Statistical Recurrent Unit
Junier B. Oliva
Barnabás Póczós
J. Schneider
16
50
0
01 Mar 2017
Scene Flow to Action Map: A New Representation for RGB-D based Action
  Recognition with Convolutional Neural Networks
Scene Flow to Action Map: A New Representation for RGB-D based Action Recognition with Convolutional Neural Networks
Pichao Wang
W. Li
Zhimin Gao
Yuyao Zhang
Chang-Fu Tang
P. Ogunbona
3DPC
172
131
0
28 Feb 2017
MAT: A Multimodal Attentive Translator for Image Captioning
MAT: A Multimodal Attentive Translator for Image Captioning
Chang Liu
F. Sun
Changhu Wang
Feng Wang
Alan Yuille
17
58
0
18 Feb 2017
Deep Reinforcement Learning for Visual Object Tracking in Videos
Deep Reinforcement Learning for Visual Object Tracking in Videos
Da Zhang
H. Maei
Xin Eric Wang
Yuan-fang Wang
20
115
0
31 Jan 2017
Incorporating Global Visual Features into Attention-Based Neural Machine
  Translation
Incorporating Global Visual Features into Attention-Based Neural Machine Translation
Iacer Calixto
Qun Liu
Nick Campbell
24
154
0
23 Jan 2017
Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN
Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN
Teik Koon Cheang
Yong Shean Chong
Yong Haur Tay
16
56
0
23 Jan 2017
Person Re-Identification via Recurrent Feature Aggregation
Person Re-Identification via Recurrent Feature Aggregation
Yichao Yan
Bingbing Ni
Zhichao Song
Chao Ma
Yan Yan
Xiaokang Yang
16
243
0
23 Jan 2017
Action Recognition: From Static Datasets to Moving Robots
Action Recognition: From Static Datasets to Moving Robots
Fahimeh Rezazadegan
S. Shirazi
B. Upcroft
Michael Milford
11
45
0
18 Jan 2017
Ordered Pooling of Optical Flow Sequences for Action Recognition
Ordered Pooling of Optical Flow Sequences for Action Recognition
Jue Wang
A. Cherian
Fatih Porikli
17
45
0
12 Jan 2017
Transforming Sensor Data to the Image Domain for Deep Learning - an
  Application to Footstep Detection
Transforming Sensor Data to the Image Domain for Deep Learning - an Application to Footstep Detection
Monit Shah Singh
Vinaychandran Pondenkandath
Bo Zhou
P. Lukowicz
Marcus Liwicki
22
75
0
04 Jan 2017
Learning Visual N-Grams from Web Data
Learning Visual N-Grams from Web Data
Ang Li
Allan Jabri
Armand Joulin
L. V. D. van der Maaten
VLM
20
136
0
29 Dec 2016
Structured Sequence Modeling with Graph Convolutional Recurrent Networks
Structured Sequence Modeling with Graph Convolutional Recurrent Networks
Youngjoo Seo
M. Defferrard
P. Vandergheynst
Xavier Bresson
GNN
36
757
0
22 Dec 2016
An Empirical Study of Language CNN for Image Captioning
An Empirical Study of Language CNN for Image Captioning
Jiuxiang Gu
G. Wang
Jianfei Cai
Tsuhan Chen
25
132
0
21 Dec 2016
Exploring the Design Space of Deep Convolutional Neural Networks at
  Large Scale
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
26
18
0
20 Dec 2016
Asynchronous Temporal Fields for Action Recognition
Asynchronous Temporal Fields for Action Recognition
Gunnar A. Sigurdsson
S. Divvala
Ali Farhadi
Abhinav Gupta
BDL
16
170
0
19 Dec 2016
Tunable Efficient Unitary Neural Networks (EUNN) and their application
  to RNNs
Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs
Li Jing
Yichen Shen
T. Dubček
J. Peurifoy
S. Skirlo
Yann LeCun
Max Tegmark
Marin Soljacic
15
176
0
15 Dec 2016
Attentive Explanations: Justifying Decisions and Pointing to the
  Evidence
Attentive Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
AAML
21
79
0
14 Dec 2016
End-to-end Learning of Driving Models from Large-scale Video Datasets
End-to-end Learning of Driving Models from Large-scale Video Datasets
Huazhe Xu
Yang Gao
F. I. F. Richard Yu
Trevor Darrell
44
821
0
04 Dec 2016
Areas of Attention for Image Captioning
Areas of Attention for Image Captioning
M. Pedersoli
Thomas Lucas
Cordelia Schmid
Jakob Verbeek
27
205
0
03 Dec 2016
Short-term traffic flow forecasting with spatial-temporal correlation in
  a hybrid deep learning framework
Short-term traffic flow forecasting with spatial-temporal correlation in a hybrid deep learning framework
Yuankai Wu
Huachun Tan
AI4TS
31
248
0
03 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in
  Visual Question Answering
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
104
3,120
0
02 Dec 2016
Action Recognition with Dynamic Image Networks
Action Recognition with Dynamic Image Networks
Hakan Bilen
Basura Fernando
Efstratios Gavves
Andrea Vedaldi
FAtt
21
221
0
02 Dec 2016
Guided Open Vocabulary Image Captioning with Constrained Beam Search
Guided Open Vocabulary Image Captioning with Constrained Beam Search
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
21
232
0
02 Dec 2016
Video Captioning with Multi-Faceted Attention
Video Captioning with Multi-Faceted Attention
Xiang Long
Chuang Gan
Gerard de Melo
22
88
0
01 Dec 2016
Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive
  Architectures
Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures
Gaurav Mittal
Tanya Marwah
V. Balasubramanian
VGen
DiffM
38
67
0
30 Nov 2016
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive
  Model
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model
Marcella Cornia
Lorenzo Baraldi
G. Serra
Rita Cucchiara
31
548
0
29 Nov 2016
Social Behavior Prediction from First Person Videos
Social Behavior Prediction from First Person Videos
Shan Su
J. Hong
Jianbo Shi
H. Park
EgoV
34
12
0
29 Nov 2016
Visual Dialog
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
54
990
0
26 Nov 2016
Previous
123...101112139
Next