Describing Videos by Exploiting Temporal Structure

27 February 2015

Aaron Courville

Papers citing "Describing Videos by Exploiting Temporal Structure"

50 / 372 papers shown

Title
A Hierarchical Approach for Generating Descriptive Image Paragraphs J. Krause Justin Johnson Ranjay Krishna Li Fei-Fei VLM 36 373 0 20 Nov 2016
Recurrent Memory Addressing for describing videos A. Jain Abhinav Agarwalla Kumar Krishna Agrawal Pabitra Mitra 38 10 0 20 Nov 2016
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning Long Chen Hanwang Zhang Jun Xiao Liqiang Nie Jian Shao Wei Liu Tat-Seng Chua 21 1,650 0 17 Nov 2016
Multimodal Memory Modelling for Video Captioning Junbo Wang Wei Wang Yan Huang Liang Wang Tieniu Tan 32 142 0 17 Nov 2016
Learning long-term dependencies for action recognition with a biologically-inspired deep network Yemin Shi Yonghong Tian Yaowei Wang Tiejun Huang 29 63 0 16 Nov 2016
Diversity encouraged learning of unsupervised LSTM ensemble for neural activity video prediction Yilin Song J. Viventi Yao Wang AI4TS 30 2 0 15 Nov 2016
Leveraging Video Descriptions to Learn Video Question Answering Kuo-Hao Zeng Tseng-Hung Chen Ching-Yao Chuang Yuan-Hong Liao Juan Carlos Niebles Min Sun 32 175 0 12 Nov 2016
Memory-augmented Attention Modelling for Videos Rasool Fakoor Abdel-rahman Mohamed Margaret Mitchell S. B. Kang Pushmeet Kohli 43 20 0 07 Nov 2016
Clinical Text Prediction with Numerically Grounded Conditional Language Models Georgios P. Spithourakis S. Petersen Sebastian Riedel 30 7 0 20 Oct 2016
Spatio-Temporal Attention Models for Grounded Video Captioning M. Zanfir Elisabeta Marinoiu C. Sminchisescu 29 50 0 17 Oct 2016
Video Fill in the Blank with Merging LSTMs Amir Mazaheri Dong-Ming Zhang M. Shah 24 18 0 13 Oct 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering Youngjae Yu Hyungjin Ko Jongwook Choi Gunhee Kim 14 230 0 10 Oct 2016
Can Ground Truth Label Propagation from Video help Semantic Segmentation? Siva Karthik Mustikovela M. Yang Carsten Rother 19 33 0 03 Oct 2016
Video Summarization using Deep Semantic Features Mayu Otani Yuta Nakashima Esa Rahtu J. Heikkilä N. Yokoya 9 112 0 28 Sep 2016
Learning Language-Visual Embedding for Movie Understanding with Natural-Language Atousa Torabi Niket Tandon Leonid Sigal 22 97 0 26 Sep 2016
Deep Learning for Video Classification and Captioning Zuxuan Wu Ting Yao Yanwei Fu Yu-Gang Jiang 3DV VLM 24 123 0 22 Sep 2016
GeThR-Net: A Generalized Temporally Hybrid Recurrent Neural Network for Multimodal Information Fusion Ankit Gandhi Arjun Sharma Arijit Biswas Om Deshmukh AI4TS 19 12 0 17 Sep 2016
Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks Alberto Montes Amaia Salvador Santiago Pascual Xavier Giró-i-Nieto 25 108 0 29 Aug 2016
Title Generation for User Generated Videos Kuo-Hao Zeng Tseng-Hung Chen Juan Carlos Niebles Min Sun 35 69 0 25 Aug 2016
A Recurrent Encoder-Decoder Network for Sequential Face Alignment Xi Peng Rogerio Feris Xiaoyu Wang Dimitris N. Metaxas CVBM 180 140 0 19 Aug 2016
Learning Joint Representations of Videos and Sentences with Web Image Search Mayu Otani Yuta Nakashima Esa Rahtu J. Heikkilä N. Yokoya 18 94 0 08 Aug 2016
Visual Question Answering: A Survey of Methods and Datasets Qi Wu Damien Teney Peng Wang Chunhua Shen A. Dick Anton Van Den Hengel 32 413 0 20 Jul 2016
Action Recognition with Joint Attention on Multi-Level Deep Features Jialin Wu Gu Wang Wukui Yang Xiangyang Ji 23 15 0 09 Jul 2016
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes Çağlar Gülçehre A. Chandar Kyunghyun Cho Yoshua Bengio 14 64 0 30 Jun 2016
Bidirectional Long-Short Term Memory for Video Description Yi Bin Yang Yang Zi Huang Fumin Shen Xing Xu Heng Tao Shen 39 60 0 15 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data Shikhar Sharma Jing He Kaheer Suleman Hannes Schulz Philip Bachman 16 29 0 11 Jun 2016
Deep Learning Convolutional Networks for Multiphoton Microscopy Vasculature Segmentation Petteri Teikari Marc A. Santos Charissa Poon K. Hynynen 3DV 16 48 0 08 Jun 2016
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations David M. Krueger Tegan Maharaj János Kramár Mohammad Pezeshki Nicolas Ballas Nan Rosemary Ke Anirudh Goyal Yoshua Bengio Aaron Courville C. Pal 10 317 0 03 Jun 2016
Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network Yu Liu Jianlong Fu Tao Mei C. Chen 13 4 0 02 Jun 2016
Video Summarization with Long Short-term Memory Ke Zhang Wei-Lun Chao Fei Sha Kristen Grauman 38 682 0 26 May 2016
With Whom Do I Interact? Detecting Social Interactions in Egocentric Photo-streams Maedeh Aghaei Mariella Dimiccoli Petia Radeva EgoV 22 35 0 13 May 2016
Movie Description Anna Rohrbach Atousa Torabi Marcus Rohrbach Niket Tandon C. Pal Hugo Larochelle Aaron Courville Bernt Schiele 3DV VGen 32 353 0 12 May 2016
Theano: A Python framework for fast computation of mathematical expressions The Theano Development Team Rami Al-Rfou Guillaume Alain Amjad Almahairi Christof Angermüller ... Kelvin Xu Lijun Xue Li Yao Saizheng Zhang Ying Zhang 22 2,335 0 09 May 2016
Fast Object Localization Using a CNN Feature Map Based Multi-Scale Search Hyungtae Lee H. Kwon Archith J. Bency W. Nothwang ObjD 17 4 0 12 Apr 2016
Video Description using Bidirectional Recurrent Neural Networks Álvaro Peris Marc Bolaños Petia Radeva F. Casacuberta 20 33 0 12 Apr 2016
Attributes as Semantic Units between Natural Language and Visual Recognition Marcus Rohrbach VLM 16 3 0 12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description Yuncheng Li Yale Song Liangliang Cao Joel R. Tetreault Larry Goldberg A. Jaimes Jiebo Luo 25 270 0 10 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text Subhashini Venugopalan Lisa Anne Hendricks Raymond J. Mooney Kate Saenko VLM 28 117 0 06 Apr 2016
Recurrent Batch Normalization Tim Cooijmans Nicolas Ballas César Laurent Çağlar Gülçehre Aaron Courville ODL 19 410 0 30 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation Hoo-Chang Shin Kirk Roberts Le Lu Dina Demner-Fushman Jianhua Yao Ronald M. Summers 18 347 0 28 Mar 2016
Recurrent Mixture Density Network for Spatiotemporal Visual Attention Loris Bazzani Hugo Larochelle Lorenzo Torresani 26 133 0 27 Mar 2016
Attentive Contexts for Object Detection Jianan Li Yunchao Wei Xiaodan Liang Jian Dong Tingfa Xu Jiashi Feng Shuicheng Yan ObjD 17 221 0 24 Mar 2016
Deep Learning in Bioinformatics Seonwoo Min Byunghan Lee Sungroh Yoon AI4CE 3DV 36 1,351 0 21 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge Qi Wu Chunhua Shen Anton Van Den Hengel Peng Wang A. Dick 27 360 0 09 Mar 2016
Noisy Activation Functions Çağlar Gülçehre Marcin Moczulski Misha Denil Yoshua Bengio 9 283 0 01 Mar 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures Raffaella Bernardi Ruken Cakici Desmond Elliott Aykut Erdem Erkut Erdem Nazli Ikizler-Cinbis Frank Keller A. Muscat Barbara Plank EGVM VLM 27 363 0 15 Jan 2016
Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources Qi Wu Peng Wang Chunhua Shen A. Dick Anton Van Den Hengel 16 370 0 22 Nov 2015
Stories in the Eye: Contextual Visual Interactions for Efficient Video to Language Translation Anirudh Goyal Marius Leordeanu 21 1 0 20 Nov 2015
Delving Deeper into Convolutional Networks for Learning Video Representations Nicolas Ballas L. Yao C. Pal Aaron Courville MDE 37 692 0 19 Nov 2015
First Step toward Model-Free, Anonymous Object Tracking with Recurrent Neural Networks Quan Gan Qipeng Guo Zheng-Wei Zhang Kyunghyun Cho VOT 34 51 0 19 Nov 2015