Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
    VLM

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 665 papers shown
Cascaded Revision Network for Novel Object Captioning
Qianyu Feng
Yu Wu
Hehe Fan
C. Yan
Yezhou Yang
29
35
0
06 Aug 2019
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
Jing Wang
Yingwei Pan
Ting Yao
Jinhui Tang
Tao Mei
VLM
BDL
DiffM
19
36
0
01 Aug 2019
Dynamic Facial Expression Generation on Hilbert Hypersphere with Conditional Wasserstein Generative Adversarial Nets
N. Otberdout
Mohamed Daoudi
Anis Kacem
Lahoucine Ballihi
Stefano Berretti
GAN
32
52
0
23 Jul 2019
Compact Global Descriptor for Neural Networks
Xiangyu He
Ke Cheng
Qiang Chen
Qinghao Hu
Peisong Wang
Jian Cheng
31
8
0
23 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
20
132
0
22 Jul 2019
TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
M. Bishay
Georgios Zoumpourlis
Ioannis Patras
ViT
27
155
0
21 Jul 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
35
75
0
19 Jul 2019
Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis
Yueming Jin
Huaxia Li
Qi Dou
Hao Chen
J. Qin
Chi-Wing Fu
Pheng-Ann Heng
24
173
0
13 Jul 2019
Aerial Animal Biometrics: Individual Friesian Cattle Recovery and Visual Identification via an Autonomous UAV with Onboard Deep Inference
William Andrew
C. Greatwood
T. Burghardt
14
52
0
11 Jul 2019
Aesthetic Attributes Assessment of Images
Xin Jin
Le Wu
Geng Zhao
Xiaodong Li
Xiaokun Zhang
Shiming Ge
Dongqing Zou
Bin Zhou
Xinghui Zhou
22
36
0
11 Jul 2019
Simple vs complex temporal recurrences for video saliency prediction
Panagiotis Linardos
Eva Mohedano
J. Nieto
Noel E. O'Connor
Xavier Giró-i-Nieto
Kevin McGuinness
FAtt
23
86
0
03 Jul 2019
Deformable Tube Network for Action Detection in Videos
Wei Li
Zehuan Yuan
Dashan Guo
Lei Huang
Xiangzhong Fang
Changhu Wang
ViT
MedIm
28
5
0
03 Jul 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
36
796
0
25 Jun 2019
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
28
462
0
14 Jun 2019
Vispi: Automatic Visual Perception and Interpretation of Chest X-rays
X. Li
Rui Cao
D. Zhu
13
20
0
12 Jun 2019
Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach
Ognjen Rudovic
Meiru Zhang
Bjorn Schuller
Rosalind W. Picard
OffRL
36
44
0
07 Jun 2019
Early detection of sepsis utilizing deep learning on electronic health record event sequences
S. Lauritsen
M. E. Kalør
Emil Lund Kongsgaard
K. M. Lauritsen
Marianne Johansson Jørgensen
Jeppe Lange
B. Thiesson
19
135
0
07 Jun 2019
Natural Vocabulary Emerges from Free-Form Annotations
Jordi Pont-Tuset
Michael Gygli
V. Ferrari
VLM
26
3
0
04 Jun 2019
Relational Reasoning using Prior Knowledge for Visual Captioning
Jingyi Hou
Xinxiao Wu
Yayun Qi
Wentian Zhao
Jiebo Luo
Yunde Jia
17
14
0
04 Jun 2019
A Hybrid RNN-HMM Approach for Weakly Supervised Temporal Action Segmentation
Hilde Kuehne
Alexander Richard
Juergen Gall
24
82
0
03 Jun 2019
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning
Wei Zhang
Bairui Wang
Lin Ma
Wei Liu
20
67
0
03 Jun 2019
Hallucinating Optical Flow Features for Video Classification
Yongyi Tang
Lin Ma
Lianqiang Zhou
19
19
0
28 May 2019
Lightweight Network Architecture for Real-Time Action Recognition
Alexander Kozlov
Vadim Andronov
Y. Gritsenko
ViT
25
33
0
21 May 2019
Multitask Learning of Temporal Connectionism in Convolutional Networks using a Joint Distribution Loss Function to Simultaneously Identify Tools and Phase in Surgical Videos
S. S. Mondal
R. Sathish
Debdoot Sheet
35
17
0
20 May 2019
Learning Video Representations from Correspondence Proposals
Xingyu Liu
Joon-Young Lee
Hailin Jin
27
63
0
20 May 2019
Talking With Your Hands: Scaling Hand Gestures and Recognition With CNNs
Okan Kopuklu
Yao Rong
Gerhard Rigoll
SLR
11
5
0
10 May 2019
Machine Learning Cryptanalysis of a Quantum Random Number Generator
N. D. Truong
J. Y. Haw
S. Assad
P. Lam
O. Kavehei
AAML
31
39
0
07 May 2019
Learning Spatio-Temporal Features with Two-Stream Deep 3D CNNs for Lipreading
Xinshuo Weng
Kris M. Kitani
16
71
0
04 May 2019
Recurrent Convolutional Strategies for Face Manipulation Detection in Videos
Ekraam Sabir
Jiaxin Cheng
Ayush Jaiswal
Wael AbdAlmageed
I. Masi
Premkumar Natarajan
AAML
CVBM
28
451
0
02 May 2019
Large Scale Holistic Video Understanding
Ali Diba
Mohsen Fayyaz
Vivek Sharma
Manohar Paluri
Jurgen Gall
Rainer Stiefelhagen
Luc Van Gool
26
35
0
25 Apr 2019
DynamoNet: Dynamic Action and Motion Network
Ali Diba
Vivek Sharma
Luc Van Gool
Rainer Stiefelhagen
30
110
0
25 Apr 2019
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
33
69
0
25 Apr 2019
Detecting inter-sectional accuracy differences in driver drowsiness detection algorithms
Mkhuseli Ngxande
J. Tapamo
Michael G. Burke
22
12
0
23 Apr 2019
3G structure for image caption generation
Aihong Yuan
Xuelong Li
Xiaoqiang Lu
13
34
0
21 Apr 2019
Multi-modal gated recurrent units for image description
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
GAN
21
26
0
20 Apr 2019
EV-Action: Electromyography-Vision Multi-Modal Action Dataset
Lichen Wang
Bin Sun
Joseph P. Robinson
Taotao Jing
Y. Fu
27
28
0
20 Apr 2019
EmbraceNet: A robust deep learning architecture for multimodal classification
Jun-Ho Choi
Jong-Seok Lee
23
123
0
19 Apr 2019
Understanding Neural Networks via Feature Visualization: A survey
Anh Nguyen
J. Yosinski
Jeff Clune
FAtt
11
160
0
18 Apr 2019
DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition
Toby Perrett
Dima Damen
22
24
0
18 Apr 2019
AirPen: A Touchless Fingertip Based Gestural Interface for Smartphones and Head-Mounted Devices
Varun Jain
R. Hebbalaguppe
21
4
0
12 Apr 2019
Multi-View Region Adaptive Multi-temporal DMM and RGB Action Recognition
Mahmoud Al-Faris
J. Chiverton
Yanyan Yang
D. Ndzi
22
13
0
12 Apr 2019
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
17
251
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
24
69
0
11 Apr 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
32
540
0
06 Apr 2019
Recurrent Back-Projection Network for Video Super-Resolution
Muhammad Haris
Gregory Shakhnarovich
Norimichi Ukita
SupR
28
431
0
25 Mar 2019
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Yansong Tang
Dajun Ding
Yongming Rao
Yu Zheng
Danyang Zhang
Lili Zhao
Jiwen Lu
Jie Zhou
16
304
0
07 Mar 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
19
82
0
04 Mar 2019
Spatiotemporal Pyramid Network for Video Action Recognition
Yunbo Wang
Mingsheng Long
Jianmin Wang
Philip S. Yu
24
227
0
04 Mar 2019
Learning To Follow Directions in Street View
Karl Moritz Hermann
Mateusz Malinowski
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
R. Hadsell
SSL
16
66
0
01 Mar 2019
Extended Gaze Following: Detecting Objects in Videos Beyond the Camera Field of View
Benoit Massé
Stéphane Lathuilière
Pablo Mesejo
Radu Horaud
11
14
0
28 Feb 2019