Multimodal Memory Modelling for Video Captioning

17 November 2016

Liang Wang

Papers citing "Multimodal Memory Modelling for Video Captioning"

32 / 32 papers shown

Title
Global2Local: A Joint-Hierarchical Attention for Video Captioning Chengpeng Dai Fuhai Chen Xiaoshuai Sun Rongrong Ji QiXiang Ye Yongjian Wu 71 1 0 13 Mar 2022
Hierarchical Boundary-Aware Neural Encoder for Video Captioning Lorenzo Baraldi C. Grana Rita Cucchiara 68 192 0 28 Nov 2016
Recurrent Memory Addressing for describing videos A. Jain Abhinav Agarwalla Kumar Krishna Agrawal Pabitra Mitra 60 10 0 20 Nov 2016
Memory-augmented Attention Modelling for Videos Rasool Fakoor Abdel-rahman Mohamed Margaret Mitchell S. B. Kang Pushmeet Kohli 113 20 0 07 Nov 2016
Using Fast Weights to Attend to the Recent Past Jimmy Ba Geoffrey E. Hinton Volodymyr Mnih Joel Z Leibo Catalin Ionescu 83 273 0 20 Oct 2016
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge Oriol Vinyals Alexander Toshev Samy Bengio D. Erhan 117 854 0 21 Sep 2016
Dynamic Memory Networks for Visual and Textual Question Answering Caiming Xiong Stephen Merity R. Socher 86 756 0 04 Mar 2016
Deep Residual Learning for Image Recognition Kaiming He Xinming Zhang Shaoqing Ren Jian Sun MedIm 2.3K 194,641 0 10 Dec 2015
Rethinking the Inception Architecture for Computer Vision Christian Szegedy Vincent Vanhoucke Sergey Ioffe Jonathon Shlens Z. Wojna 3DV BDL 886 27,444 0 02 Dec 2015
Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems Jesse Dodge Andreea Gane Xiang Zhang Antoine Bordes S. Chopra Alexander H. Miller Arthur Szlam Jason Weston ELM 102 198 0 21 Nov 2015
Delving Deeper into Convolutional Networks for Learning Video Representations Nicolas Ballas L. Yao C. Pal Aaron Courville MDE 95 701 0 19 Nov 2015
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning Pingbo Pan Zhongwen Xu Yi Yang Leilei Gan Yueting Zhuang 59 385 0 11 Nov 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks Haonan Yu Jiang Wang Zhiheng Huang Yi Yang Wenyuan Xu 95 560 0 26 Oct 2015
Teaching Machines to Read and Comprehend Karl Moritz Hermann Tomás Kociský Edward Grefenstette L. Espeholt W. Kay Mustafa Suleyman Phil Blunsom 355 3,555 0 10 Jun 2015
Learning to Transduce with Unbounded Memory Edward Grefenstette Karl Moritz Hermann Mustafa Suleyman Phil Blunsom 98 297 0 08 Jun 2015
Large-scale Simple Question Answering with Memory Networks Antoine Bordes Nicolas Usunier S. Chopra Jason Weston 119 701 0 05 Jun 2015
Jointly Modeling Embedding and Translation to Bridge Video and Language Yingwei Pan Tao Mei Ting Yao Houqiang Li Y. Rui 83 534 0 07 May 2015
Sequence to Sequence -- Video to Text Subhashini Venugopalan Marcus Rohrbach Jeff Donahue Raymond J. Mooney Trevor Darrell Kate Saenko 146 1,419 0 03 May 2015
Microsoft COCO Captions: Data Collection and Evaluation Server Xinlei Chen Hao Fang Nayeon Lee Ramakrishna Vedantam Saurabh Gupta Piotr Dollar C. L. Zitnick 230 2,497 0 01 Apr 2015
Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets Armand Joulin Tomas Mikolov TPM 144 412 0 03 Mar 2015
Describing Videos by Exploiting Temporal Structure L. Yao Atousa Torabi Kyunghyun Cho Nicolas Ballas C. Pal Hugo Larochelle Aaron Courville 151 1,064 0 27 Feb 2015
Translating Videos to Natural Language Using Deep Recurrent Neural Networks Subhashini Venugopalan Huijuan Xu Jeff Donahue Marcus Rohrbach Raymond J. Mooney Kate Saenko 165 952 0 15 Dec 2014
Show and Tell: A Neural Image Caption Generator Oriol Vinyals Alexander Toshev Samy Bengio D. Erhan 3DV 265 6,042 0 17 Nov 2014
Neural Turing Machines Alex Graves Greg Wayne Ivo Danihelka 115 2,333 0 20 Oct 2014
Memory Networks Jason Weston S. Chopra Antoine Bordes GNN KELM 162 1,709 0 15 Oct 2014
Going Deeper with Convolutions Christian Szegedy Wei Liu Yangqing Jia P. Sermanet Scott E. Reed Dragomir Anguelov D. Erhan Vincent Vanhoucke Andrew Rabinovich 496 43,717 0 17 Sep 2014
Recurrent Neural Network Regularization Wojciech Zaremba Ilya Sutskever Oriol Vinyals ODL 160 2,778 0 08 Sep 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition Karen Simonyan Andrew Zisserman FAtt MDE 1.7K 100,575 0 04 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate Dzmitry Bahdanau Kyunghyun Cho Yoshua Bengio AIMat 584 27,345 0 01 Sep 2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation Kyunghyun Cho B. V. Merrienboer Çağlar Gülçehre Dzmitry Bahdanau Fethi Bougares Holger Schwenk Yoshua Bengio AIMat 1.1K 23,414 0 03 Jun 2014
Rich feature hierarchies for accurate object detection and semantic segmentation Ross B. Girshick Jeff Donahue Trevor Darrell Jitendra Malik ObjD 301 26,223 0 11 Nov 2013
ADADELTA: An Adaptive Learning Rate Method Matthew D. Zeiler ODL 165 6,635 0 22 Dec 2012