ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4389
  4. Cited By
Long-term Recurrent Convolutional Networks for Visual Recognition and
  Description

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
    VLM
ArXivPDFHTML

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 688 papers shown
Title
Unsupervised Learning of View-invariant Action Representations
Unsupervised Learning of View-invariant Action Representations
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
SSL
26
99
0
06 Sep 2018
YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark
YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark
N. Xu
L. Yang
Yuchen Fan
Dingcheng Yue
Yuchen Liang
Jianchao Yang
Thomas Huang
VOS
31
522
0
06 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
A. Schwing
19
66
0
03 Sep 2018
LUCSS: Language-based User-customized Colourization of Scene Sketches
LUCSS: Language-based User-customized Colourization of Scene Sketches
C. Zou
Haoran Mo
Ruofei Du
Xing Wu
Chengying Gao
Hongbo Fu
30
8
0
30 Aug 2018
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Victor Escorcia
Ranjay Krishna
S. Buch
Cuong Duc Dao
42
65
0
11 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action
  Classification
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
81
79
0
03 Aug 2018
Learning Actionable Representations from Visual Observations
Learning Actionable Representations from Visual Observations
Debidatta Dwibedi
Jonathan Tompson
Corey Lynch
P. Sermanet
SSL
22
80
0
02 Aug 2018
Actor-Centric Relation Network
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
41
220
0
28 Jul 2018
Distinctive-attribute Extraction for Image Captioning
Distinctive-attribute Extraction for Image Captioning
Boeun Kim
Young Han Lee
Hyedong Jung
C. Cho
17
6
0
25 Jul 2018
Equal But Not The Same: Understanding the Implicit Relationship Between
  Persuasive Images and Text
Equal But Not The Same: Understanding the Implicit Relationship Between Persuasive Images and Text
Ruotong Wang
R. Hwa
Adriana Kovashka
18
54
0
21 Jul 2018
"Factual" or "Emotional": Stylized Image Captioning with Adaptive
  Learning and Attention
"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention
Tianlang Chen
Zhongping Zhang
Quanzeng You
Chen Fang
Zhaowen Wang
Hailin Jin
Jiebo Luo
24
86
0
10 Jul 2018
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised
  Temporal Action Detector
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector
Jia-Xing Zhong
Nannan Li
Weijie Kong
Zhang Tao
Thomas H. Li
Ge Li
14
93
0
09 Jul 2018
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting
  Text with Arbitrary Shapes
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
Pengyuan Lyu
Minghui Liao
Cong Yao
Wenhao Wu
X. Bai
39
592
0
06 Jul 2018
Deep Spatio-Temporal Random Fields for Efficient Video Segmentation
Deep Spatio-Temporal Random Fields for Efficient Video Segmentation
Siddhartha Chandra
Camille Couprie
Iasonas Kokkinos
18
60
0
03 Jul 2018
Long Activity Video Understanding using Functional Object-Oriented
  Network
Long Activity Video Understanding using Functional Object-Oriented Network
Ahmad Babaeian Jelodar
D. Paulius
Yu Sun
23
35
0
03 Jul 2018
Women also Snowboard: Overcoming Bias in Captioning Models (Extended
  Abstract)
Women also Snowboard: Overcoming Bias in Captioning Models (Extended Abstract)
Lisa Anne Hendricks
Kaylee Burns
Kate Saenko
Trevor Darrell
Anna Rohrbach
39
479
0
02 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
30
212
0
20 Jun 2018
RISE: Randomized Input Sampling for Explanation of Black-box Models
RISE: Randomized Input Sampling for Explanation of Black-box Models
Vitali Petsiuk
Abir Das
Kate Saenko
FAtt
35
1,151
0
19 Jun 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
23
181
0
19 Jun 2018
Multimodal feature fusion for CNN-based gait recognition: an empirical
  comparison
Multimodal feature fusion for CNN-based gait recognition: an empirical comparison
F. M. Castro
M. Marín-Jiménez
Nicolás Guil Mata
N. P. D. L. Blanca
CVBM
21
60
0
19 Jun 2018
Object Level Visual Reasoning in Videos
Object Level Visual Reasoning in Videos
Fabien Baradel
Natalia Neverova
Christian Wolf
J. Mille
Greg Mori
24
163
0
16 Jun 2018
From Trailers to Storylines: An Efficient Way to Learn from Movies
From Trailers to Storylines: An Efficient Way to Learn from Movies
Qingqiu Huang
Yuanjun Xiong
Yu Xiong
Yuqi Zhang
Dahua Lin
28
26
0
14 Jun 2018
Understanding Patch-Based Learning by Explaining Predictions
Understanding Patch-Based Learning by Explaining Predictions
Christopher J. Anders
G. Montavon
Wojciech Samek
K. Müller
UQCV
FAtt
33
6
0
11 Jun 2018
In Ictu Oculi: Exposing AI Generated Fake Face Videos by Detecting Eye
  Blinking
In Ictu Oculi: Exposing AI Generated Fake Face Videos by Detecting Eye Blinking
Yuezun Li
Ming-Ching Chang
Siwei Lyu
CVBM
13
225
0
07 Jun 2018
Mining for meaning: from vision to language through multiple networks
  consensus
Mining for meaning: from vision to language through multiple networks consensus
Iulia Duta
Andrei Liviu Nicolicioiu
Simion-Vlad Bogolin
Marius Leordeanu
18
3
0
05 Jun 2018
Context-aware Cascade Attention-based RNN for Video Emotion Recognition
Context-aware Cascade Attention-based RNN for Video Emotion Recognition
Man-Chin Sun
Shih-Huan Hsu
Min Yang
Jen-Hsien Chien
28
18
0
30 May 2018
Needle Tip Force Estimation using an OCT Fiber and a Fused convGRU-CNN
  Architecture
Needle Tip Force Estimation using an OCT Fiber and a Fused convGRU-CNN Architecture
N. Gessert
Torben Priegnitz
T. Saathoff
Sven-Thomas Antoni
David Meyer
M. Hamann
K. Jünemann
Christoph Otte
Alexander Schlaefer
24
10
0
30 May 2018
Pointly-Supervised Action Localization
Pointly-Supervised Action Localization
Pascal Mettes
Cees G. M. Snoek
3DPC
19
26
0
29 May 2018
Less is More: Surgical Phase Recognition with Less Annotations through
  Self-Supervised Pre-training of CNN-LSTM Networks
Less is More: Surgical Phase Recognition with Less Annotations through Self-Supervised Pre-training of CNN-LSTM Networks
Gaurav Yengera
Didier Mutter
J. Marescaux
N. Padoy
40
71
0
22 May 2018
Enriched Long-term Recurrent Convolutional Network for Facial
  Micro-Expression Recognition
Enriched Long-term Recurrent Convolutional Network for Facial Micro-Expression Recognition
Huai-Qian Khor
John See
Raphaël C.-W. Phan
Weiyao Lin
17
165
0
22 May 2018
Joint Image Captioning and Question Answering
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
24
12
0
22 May 2018
An Evaluation of Trajectory Prediction Approaches and Notes on the
  TrajNet Benchmark
An Evaluation of Trajectory Prediction Approaches and Notes on the TrajNet Benchmark
S. Becker
Ronny Hug
Wolfgang Hubner
Michael Arens
21
70
0
19 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned
  Text
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text
A. Mathews
Lexing Xie
Xuming He
VLM
24
115
0
18 May 2018
Identifying Object States in Cooking-Related Images
Identifying Object States in Cooking-Related Images
Ahmad Babaeian Jelodar
Md Sirajus Salekin
Yu Sun
22
37
0
17 May 2018
Omega: An Architecture for AI Unification
Omega: An Architecture for AI Unification
Eray Özkural
AI4CE
16
1
0
16 May 2018
Graph Edge Convolutional Neural Networks for Skeleton Based Action
  Recognition
Graph Edge Convolutional Neural Networks for Skeleton Based Action Recognition
Xikun Zhang
Chang Xu
Xinmei Tian
Dacheng Tao
3DH
GNN
25
157
0
16 May 2018
I Have Seen Enough: A Teacher Student Network for Video Classification
  Using Fewer Frames
I Have Seen Enough: A Teacher Student Network for Video Classification Using Fewer Frames
S. Bhardwaj
Mitesh M. Khapra
23
3
0
12 May 2018
Rethinking Diversified and Discriminative Proposal Generation for Visual
  Grounding
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding
Zhou Yu
Jun-chen Yu
Chenchao Xiang
Zhou Zhao
Q. Tian
Dacheng Tao
ObjD
18
138
0
09 May 2018
Remote Detection of Idling Cars Using Infrared Imaging and Deep Networks
Remote Detection of Idling Cars Using Infrared Imaging and Deep Networks
M. Bastan
Kim-Hui Yap
Lap-Pui Chau
37
6
0
28 Apr 2018
ECO: Efficient Convolutional Network for Online Video Understanding
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
142
496
0
24 Apr 2018
Object Counts! Bringing Explicit Detections Back into Image Captioning
Object Counts! Bringing Explicit Detections Back into Image Captioning
Josiah Wang
Pranava Madhyastha
Lucia Specia
ObjD
19
37
0
23 Apr 2018
Deep Facial Expression Recognition: A Survey
Deep Facial Expression Recognition: A Survey
Shan Li
Weihong Deng
151
1,280
0
23 Apr 2018
Modelling customer online behaviours with neural networks: applications
  to conversion prediction and advertising retargeting
Modelling customer online behaviours with neural networks: applications to conversion prediction and advertising retargeting
Yanwei Cui
Rogatien Tobossi
Olivia Vigouroux
MU
36
11
0
20 Apr 2018
Automatic Stance Detection Using End-to-End Memory Networks
Automatic Stance Detection Using End-to-End Memory Networks
Mitra Mohtarami
R. Baly
James R. Glass
Preslav Nakov
Lluís Màrquez i Villodre
Alessandro Moschitti
19
122
0
20 Apr 2018
Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture
  Recognition
Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition
Okan Kopuklu
Neslihan Köse
Gerhard Rigoll
17
111
0
19 Apr 2018
Deep Multimodal Subspace Clustering Networks
Deep Multimodal Subspace Clustering Networks
Mahdi Abavisani
Vishal M. Patel
28
163
0
17 Apr 2018
Particle-based pedestrian path prediction using LSTM-MDL models
Particle-based pedestrian path prediction using LSTM-MDL models
Ronny Hug
S. Becker
Wolfgang Hubner
Michael Arens
41
32
0
16 Apr 2018
AFA-PredNet: The action modulation within predictive coding
AFA-PredNet: The action modulation within predictive coding
Junpei Zhong
Angelo Cangelosi
Xinzheng Zhang
T. Ogata
27
10
0
11 Apr 2018
Fine-grained Activity Recognition in Baseball Videos
Fine-grained Activity Recognition in Baseball Videos
A. Piergiovanni
Michael S. Ryoo
24
74
0
09 Apr 2018
Deep Spatiotemporal Models for Robust Proprioceptive Terrain
  Classification
Deep Spatiotemporal Models for Robust Proprioceptive Terrain Classification
Abhinav Valada
Wolfram Burgard
27
61
0
02 Apr 2018
Previous
123...789...121314
Next