Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
Progressively Diffused Networks for Semantic Image Segmentation Ruimao Zhang Wei Yang Zhanglin Peng Xiaogang Wang Liang Lin SSeg 23 3 0 20 Feb 2017
Person Search with Natural Language Description Shuang Li Tong Xiao Hongsheng Li Bolei Zhou Dayu Yue Xiaogang Wang 109 397 0 19 Feb 2017
MAT: A Multimodal Attentive Translator for Image Captioning Chang Liu F. Sun Changhu Wang Feng Wang Alan Yuille 93 59 0 18 Feb 2017
Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection Tharindu Fernando Simon Denman Sridha Sridharan Clinton Fookes HAI 84 336 0 18 Feb 2017
Experiment Segmentation in Scientific Discourse as Clause-level Structured Prediction using Recurrent Neural Networks Pradeep Dasigi Gully A. Burns Eduard H. Hovy A. Waard 35 27 0 17 Feb 2017
Frustratingly Short Attention Spans in Neural Language Modeling Michal Daniluk Tim Rocktaschel Johannes Welbl Sebastian Riedel 113 112 0 15 Feb 2017
Gated Multimodal Units for Information Fusion John Arevalo Thamar Solorio Manuel Montes-y-Gómez Fabio Gonzalez 108 382 0 07 Feb 2017
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation Iacer Calixto Qun Liu N. Campbell 174 183 0 04 Feb 2017
Structured Attention Networks Yoon Kim Carl Denton Luong Hoang Alexander M. Rush 152 463 0 03 Feb 2017
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey L. Ferrone Fabio Massimo Zanzotto 49 38 0 02 Feb 2017
Deep Reinforcement Learning for Visual Object Tracking in Videos Da Zhang H. Maei Xin Eric Wang Yuan-fang Wang 151 117 0 31 Jan 2017
Memory Augmented Neural Networks with Wormhole Connections Çağlar Gülçehre A. Chandar Yoshua Bengio 102 63 0 30 Jan 2017
Supervised Deep Sparse Coding Networks Xiaoxia Sun Nasser M. Nasrabadi T. Tran BDL 94 15 0 29 Jan 2017
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation N. Mostafazadeh Chris Brockett W. Dolan Michel Galley Jianfeng Gao Georgios P. Spithourakis Lucy Vanderwende 111 183 0 28 Jan 2017
Deep Reinforcement Learning: An Overview Yuxi Li OffRL VLM 346 1,550 0 25 Jan 2017
Incorporating Global Visual Features into Attention-Based Neural Machine Translation Iacer Calixto Qun Liu Nick Campbell 136 156 0 23 Jan 2017
Understanding the Effective Receptive Field in Deep Convolutional Neural Networks Wenjie Luo Yujia Li R. Urtasun R. Zemel HAI 106 1,813 0 15 Jan 2017
Simplified Gating in Long Short-term Memory (LSTM) Recurrent Neural Networks Yuzhen Lu F. Salem 39 39 0 12 Jan 2017
Comprehension-guided referring expressions Ruotian Luo Gregory Shakhnarovich ObjD 107 171 0 12 Jan 2017
Attention-Based Multimodal Fusion for Video Description Chiori Hori Takaaki Hori Teng-Yok Lee Kazuhiro Sumi J. Hershey Tim K. Marks 95 361 0 11 Jan 2017
Context-aware Captions from Context-agnostic Supervision Ramakrishna Vedantam Samy Bengio Kevin Patrick Murphy Devi Parikh Gal Chechik 96 152 0 11 Jan 2017
Towards Decoding as Continuous Optimization in Neural Machine Translation Cong Duy Vu Hoang Gholamreza Haffari Trevor Cohn AI4CE 89 42 0 11 Jan 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation Guillaume Klein Yoon Kim Yuntian Deng Jean Senellart Alexander M. Rush 356 1,900 0 10 Jan 2017
Textual Entailment with Structured Attentions and Composition Kai Zhao Liang Huang Mingbo Ma 87 28 0 04 Jan 2017
Dynamic Deep Neural Networks: Optimizing Accuracy-Efficiency Trade-offs by Selective Execution Lanlan Liu Jia Deng 119 206 0 02 Jan 2017
Aspect-augmented Adversarial Networks for Domain Adaptation Yuan Zhang Regina Barzilay Tommi Jaakkola 119 96 0 01 Jan 2017
Feedback Networks Amir Zamir Te-Lin Wu Lin Sun Bokui (William) Shen Jitendra Malik Silvio Savarese 95 211 0 30 Dec 2016
FastMask: Segment Multi-scale Object Candidates in One Shot Hexiang Hu Shiyi Lan Yuning Jiang Zhimin Cao Fei Sha SSeg 3DPC 86 28 0 28 Dec 2016
Robust LSTM-Autoencoders for Face De-Occlusion in the Wild F. Zhao Jiashi Feng Jian-jun Zhao Wenhan Yang Shuicheng Yan CVBM 71 140 0 27 Dec 2016
Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: the LUNA16 challenge A. Setio A. Traverso Thomas de Bel Moira S. N. Berens C. V. D. Bogaard ... Jef Vandemeulebroucke N. Walasek G. Zuidhof Bram van Ginneken Colin Jacobs 142 1,093 0 23 Dec 2016
Understanding Image and Text Simultaneously: a Dual Vision-Language Machine Comprehension Task Nan Ding Sebastian Goodman Fei Sha Radu Soricut VLM 85 9 0 22 Dec 2016
Re-evaluating Automatic Metrics for Image Captioning Mert Kilickaya Aykut Erdem Nazli Ikizler-Cinbis Erkut Erdem 66 181 0 22 Dec 2016
A Context-aware Attention Network for Interactive Question Answering Huayu Li Martin Renqiang Min Yong Ge Asim Kadav 65 69 0 22 Dec 2016
Top-down Visual Saliency Guided by Captions Vasili Ramanishka Abir Das Jianming Zhang Kate Saenko 87 143 0 21 Dec 2016
Multi-Agent Cooperation and the Emergence of (Natural) Language Angeliki Lazaridou A. Peysakhovich Marco Baroni LLMAG 167 434 0 21 Dec 2016
An Empirical Study of Language CNN for Image Captioning Jiuxiang Gu G. Wang Jianfei Cai Tsuhan Chen 95 134 0 21 Dec 2016
Action-Driven Object Detection with Top-Down Visual Attentions Donggeun Yoo Sunggyun Park K. Paeng Joon-Young Lee In So Kweon ObjD 48 6 0 20 Dec 2016
Automatic Generation of Grounded Visual Questions Shijie Zhang Zhuang Li Shaodi You Zhenglu Yang Jiawan Zhang OOD 79 79 0 20 Dec 2016
Large-Scale Image Retrieval with Attentive Deep Local Features Hyeonwoo Noh A. Araújo Jack Sim Tobias Weyand Bohyung Han 3DV 145 777 0 19 Dec 2016
Few-Shot Object Recognition from Machine-Labeled Web Images Zhongwen Xu Linchao Zhu Yi Yang VLM 86 66 0 19 Dec 2016
Learning to predict where to look in interactive environments using deep recurrent q-learning Seyed Sajad Mousavi Michael Schukat Enda Howley Ali Borji N. Mozayani 59 31 0 17 Dec 2016
Delta Networks for Optimized Recurrent Network Computation Daniel Neil Junhaeng Lee T. Delbruck Shih-Chii Liu 106 66 0 16 Dec 2016
CSVideoNet: A Real-time End-to-end Learning Framework for High-frame-rate Video Compressive Sensing Kai Xu Fengbo Ren 64 8 0 15 Dec 2016
Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering Hao Liu Yang Yang Fumin Shen Lixin Duan Heng Tao Shen 65 9 0 15 Dec 2016
Single Image Action Recognition using Semantic Body Part Actions Zhichen Zhao Huimin Ma Shaodi You 75 74 0 14 Dec 2016
End-to-End Deep Reinforcement Learning for Lane Keeping Assist Ahmad El-Sallab Mohammed Abdou E. Perot S. Yogamani 77 176 0 13 Dec 2016
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer Sergey Zagoruyko N. Komodakis 152 2,598 0 12 Dec 2016
Empirical Evaluation of A New Approach to Simplifying Long Short-term Memory (LSTM) Yuzhen Lu 24 2 0 12 Dec 2016
VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering Marc Bolaños Álvaro Peris F. Casacuberta Petia Radeva 68 6 0 12 Dec 2016
Text-guided Attention Model for Image Captioning Jonghwan Mun Minsu Cho Bohyung Han VLM 59 93 0 12 Dec 2016