Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.08029
Cited By
Describing Videos by Exploiting Temporal Structure
27 February 2015
L. Yao
Atousa Torabi
Kyunghyun Cho
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Describing Videos by Exploiting Temporal Structure"
50 / 372 papers shown
Title
A Hierarchical Approach for Generating Descriptive Image Paragraphs
J. Krause
Justin Johnson
Ranjay Krishna
Li Fei-Fei
VLM
36
373
0
20 Nov 2016
Recurrent Memory Addressing for describing videos
A. Jain
Abhinav Agarwalla
Kumar Krishna Agrawal
Pabitra Mitra
38
10
0
20 Nov 2016
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
21
1,650
0
17 Nov 2016
Multimodal Memory Modelling for Video Captioning
Junbo Wang
Wei Wang
Yan Huang
Liang Wang
Tieniu Tan
32
142
0
17 Nov 2016
Learning long-term dependencies for action recognition with a biologically-inspired deep network
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
29
63
0
16 Nov 2016
Diversity encouraged learning of unsupervised LSTM ensemble for neural activity video prediction
Yilin Song
J. Viventi
Yao Wang
AI4TS
30
2
0
15 Nov 2016
Leveraging Video Descriptions to Learn Video Question Answering
Kuo-Hao Zeng
Tseng-Hung Chen
Ching-Yao Chuang
Yuan-Hong Liao
Juan Carlos Niebles
Min Sun
32
175
0
12 Nov 2016
Memory-augmented Attention Modelling for Videos
Rasool Fakoor
Abdel-rahman Mohamed
Margaret Mitchell
S. B. Kang
Pushmeet Kohli
43
20
0
07 Nov 2016
Clinical Text Prediction with Numerically Grounded Conditional Language Models
Georgios P. Spithourakis
S. Petersen
Sebastian Riedel
30
7
0
20 Oct 2016
Spatio-Temporal Attention Models for Grounded Video Captioning
M. Zanfir
Elisabeta Marinoiu
C. Sminchisescu
29
50
0
17 Oct 2016
Video Fill in the Blank with Merging LSTMs
Amir Mazaheri
Dong-Ming Zhang
M. Shah
24
18
0
13 Oct 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering
Youngjae Yu
Hyungjin Ko
Jongwook Choi
Gunhee Kim
14
230
0
10 Oct 2016
Can Ground Truth Label Propagation from Video help Semantic Segmentation?
Siva Karthik Mustikovela
M. Yang
Carsten Rother
19
33
0
03 Oct 2016
Video Summarization using Deep Semantic Features
Mayu Otani
Yuta Nakashima
Esa Rahtu
J. Heikkilä
N. Yokoya
9
112
0
28 Sep 2016
Learning Language-Visual Embedding for Movie Understanding with Natural-Language
Atousa Torabi
Niket Tandon
Leonid Sigal
22
97
0
26 Sep 2016
Deep Learning for Video Classification and Captioning
Zuxuan Wu
Ting Yao
Yanwei Fu
Yu-Gang Jiang
3DV
VLM
24
123
0
22 Sep 2016
GeThR-Net: A Generalized Temporally Hybrid Recurrent Neural Network for Multimodal Information Fusion
Ankit Gandhi
Arjun Sharma
Arijit Biswas
Om Deshmukh
AI4TS
19
12
0
17 Sep 2016
Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks
Alberto Montes
Amaia Salvador
Santiago Pascual
Xavier Giró-i-Nieto
25
108
0
29 Aug 2016
Title Generation for User Generated Videos
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
35
69
0
25 Aug 2016
A Recurrent Encoder-Decoder Network for Sequential Face Alignment
Xi Peng
Rogerio Feris
Xiaoyu Wang
Dimitris N. Metaxas
CVBM
180
140
0
19 Aug 2016
Learning Joint Representations of Videos and Sentences with Web Image Search
Mayu Otani
Yuta Nakashima
Esa Rahtu
J. Heikkilä
N. Yokoya
18
94
0
08 Aug 2016
Visual Question Answering: A Survey of Methods and Datasets
Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
A. Dick
Anton Van Den Hengel
32
413
0
20 Jul 2016
Action Recognition with Joint Attention on Multi-Level Deep Features
Jialin Wu
Gu Wang
Wukui Yang
Xiangyang Ji
23
15
0
09 Jul 2016
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes
Çağlar Gülçehre
A. Chandar
Kyunghyun Cho
Yoshua Bengio
14
64
0
30 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
39
60
0
15 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data
Shikhar Sharma
Jing He
Kaheer Suleman
Hannes Schulz
Philip Bachman
16
29
0
11 Jun 2016
Deep Learning Convolutional Networks for Multiphoton Microscopy Vasculature Segmentation
Petteri Teikari
Marc A. Santos
Charissa Poon
K. Hynynen
3DV
16
48
0
08 Jun 2016
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations
David M. Krueger
Tegan Maharaj
János Kramár
Mohammad Pezeshki
Nicolas Ballas
Nan Rosemary Ke
Anirudh Goyal
Yoshua Bengio
Aaron Courville
C. Pal
10
317
0
03 Jun 2016
Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network
Yu Liu
Jianlong Fu
Tao Mei
C. Chen
13
4
0
02 Jun 2016
Video Summarization with Long Short-term Memory
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
38
682
0
26 May 2016
With Whom Do I Interact? Detecting Social Interactions in Egocentric Photo-streams
Maedeh Aghaei
Mariella Dimiccoli
Petia Radeva
EgoV
22
35
0
13 May 2016
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
32
353
0
12 May 2016
Theano: A Python framework for fast computation of mathematical expressions
The Theano Development Team
Rami Al-Rfou
Guillaume Alain
Amjad Almahairi
Christof Angermüller
...
Kelvin Xu
Lijun Xue
Li Yao
Saizheng Zhang
Ying Zhang
22
2,335
0
09 May 2016
Fast Object Localization Using a CNN Feature Map Based Multi-Scale Search
Hyungtae Lee
H. Kwon
Archith J. Bency
W. Nothwang
ObjD
17
4
0
12 Apr 2016
Video Description using Bidirectional Recurrent Neural Networks
Álvaro Peris
Marc Bolaños
Petia Radeva
F. Casacuberta
20
33
0
12 Apr 2016
Attributes as Semantic Units between Natural Language and Visual Recognition
Marcus Rohrbach
VLM
16
3
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
25
270
0
10 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
28
117
0
06 Apr 2016
Recurrent Batch Normalization
Tim Cooijmans
Nicolas Ballas
César Laurent
Çağlar Gülçehre
Aaron Courville
ODL
19
410
0
30 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation
Hoo-Chang Shin
Kirk Roberts
Le Lu
Dina Demner-Fushman
Jianhua Yao
Ronald M. Summers
18
347
0
28 Mar 2016
Recurrent Mixture Density Network for Spatiotemporal Visual Attention
Loris Bazzani
Hugo Larochelle
Lorenzo Torresani
26
133
0
27 Mar 2016
Attentive Contexts for Object Detection
Jianan Li
Yunchao Wei
Xiaodan Liang
Jian Dong
Tingfa Xu
Jiashi Feng
Shuicheng Yan
ObjD
17
221
0
24 Mar 2016
Deep Learning in Bioinformatics
Seonwoo Min
Byunghan Lee
Sungroh Yoon
AI4CE
3DV
36
1,351
0
21 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
Qi Wu
Chunhua Shen
Anton Van Den Hengel
Peng Wang
A. Dick
27
360
0
09 Mar 2016
Noisy Activation Functions
Çağlar Gülçehre
Marcin Moczulski
Misha Denil
Yoshua Bengio
9
283
0
01 Mar 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures
Raffaella Bernardi
Ruken Cakici
Desmond Elliott
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
Frank Keller
A. Muscat
Barbara Plank
EGVM
VLM
27
363
0
15 Jan 2016
Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources
Qi Wu
Peng Wang
Chunhua Shen
A. Dick
Anton Van Den Hengel
16
370
0
22 Nov 2015
Stories in the Eye: Contextual Visual Interactions for Efficient Video to Language Translation
Anirudh Goyal
Marius Leordeanu
21
1
0
20 Nov 2015
Delving Deeper into Convolutional Networks for Learning Video Representations
Nicolas Ballas
L. Yao
C. Pal
Aaron Courville
MDE
37
692
0
19 Nov 2015
First Step toward Model-Free, Anonymous Object Tracking with Recurrent Neural Networks
Quan Gan
Qipeng Guo
Zheng-Wei Zhang
Kyunghyun Cho
VOT
34
51
0
19 Nov 2015
Previous
1
2
3
4
5
6
7
8
Next