
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

| Title | Authors | Tags | | | | Date |
|---|---|---|---|---|---|---|
| Lessons learned in multilingual grounded language learning | Ákos Kádár, Desmond Elliott, Marc-Alexandre Côté, Grzegorz Chrupała, Afra Alishahi | VLM | 119 | 24 | 0 | 20 Sep 2018 |
| C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis | K. J. Joseph, Arghya Pal, Sailaja Rajanala, V. Balasubramanian | DiffM | 87 | 26 | 0 | 20 Sep 2018 |
| Exploring Visual Relationship for Image Captioning | Ting Yao, Yingwei Pan, Yehao Li, Tao Mei | | 147 | 837 | 0 | 19 Sep 2018 |
| Quantum Statistics-Inspired Neural Attention | Aristotelis Charalampous, S. Chatzis | | 35 | 0 | 0 | 17 Sep 2018 |
| Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing | Zhuo Chen, Weisi Lin, Shiqi Wang, Ling-yu Duan, Alex C. Kot | | 88 | 17 | 0 | 17 Sep 2018 |
| Integrative Analysis of Patient Health Records and Neuroimages via Memory-based Graph Convolutional Network | Xi Sheryl Zhang, Jingyuan Chou, Fei Wang | | 69 | 15 | 0 | 17 Sep 2018 |
| CADP: A Novel Dataset for CCTV Traffic Camera based Accident Analysis | Ankit Parag Shah, Jean-Baptiste Lamare, Tuan Nguyen-Anh, Alexander G. Hauptmann | | 79 | 106 | 0 | 16 Sep 2018 |
| Attention as a Perspective for Learning Tempo-invariant Audio Queries | Matthias Dorfer, Jan Hajic, Gerhard Widmer | | 25 | 2 | 0 | 15 Sep 2018 |
| A Deep Learning and Gamification Approach to Energy Conservation at Nanyang Technological University | Ioannis C. Konstantakopoulos, Andrew R. Barkan, Shiying He, Tanya Veeravalli, Huihan Liu, C. Spanos | AI4CE | 46 | 7 | 0 | 13 Sep 2018 |
| Improving Reinforcement Learning Based Image Captioning with Natural Language Prior | Tszhang Guo, Shiyu Chang, Mo Yu, Kun Bai | | 70 | 15 | 0 | 13 Sep 2018 |
| IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles | Tianze Shi, Kedar Tatwawadi, K. Chakrabarti, Yi Mao, Oleksandr Polozov, Weizhu Chen | | 78 | 64 | 0 | 13 Sep 2018 |
| LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts | Shuming Ma, Lei Cui, Damai Dai, Furu Wei, Xu Sun | VGen | 81 | 63 | 0 | 13 Sep 2018 |
| Image Captioning based on Deep Reinforcement Learning | Haichao Shi, Peng Li, Bo Wang, Zhenyu Wang | | 31 | 25 | 0 | 13 Sep 2018 |
| Higher-order Graph Convolutional Networks | J. B. Lee, Ryan A. Rossi, Xiangnan Kong, Sungchul Kim, Eunyee Koh, Anup B. Rao | GNN | 74 | 36 | 0 | 12 Sep 2018 |
| End-to-end Image Captioning Exploits Multimodal Distributional Similarity | Pranava Madhyastha, Josiah Wang, Lucia Specia | CoGe | 63 | 7 | 0 | 11 Sep 2018 |
| SPASS: Scientific Prominence Active Search System with Deep Image Captioning Network | D. Qiu | | 26 | 2 | 0 | 10 Sep 2018 |
| Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers | Zhen He, Jian Li, Daxue Liu, Hangen He, David Barber | VOT | 77 | 54 | 0 | 10 Sep 2018 |
| Dual Attention Network for Scene Segmentation | J. Fu, Qingbin Liu, Haijie Tian, Yong Li, Yongjun Bao, Zhiwei Fang, Hanqing Lu | SSeg | 333 | 5,134 | 0 | 09 Sep 2018 |
| Exploration on Grounded Word Embedding: Matching Words and Images with Image-Enhanced Skip-Gram Model | Ruixuan Luo | | 36 | 0 | 0 | 08 Sep 2018 |
| Object Hallucination in Image Captioning | Anna Rohrbach, Lisa Anne Hendricks, Kaylee Burns, Trevor Darrell, Kate Saenko | | 236 | 445 | 0 | 06 Sep 2018 |
| Bimodal network architectures for automatic generation of image annotation from text | Mehdi Moradi, Ali Madani, Yaniv Gur, Yufan Guo, Tanveer Syeda-Mahmood | | 50 | 20 | 0 | 05 Sep 2018 |
| Dynamically Context-Sensitive Time-Decay Attention for Dialogue Modeling | Shang-Yu Su, Pei-Chieh Yuan, Yun-Nung Chen | | 51 | 7 | 0 | 05 Sep 2018 |
| Text2Scene: Generating Compositional Scenes from Textual Descriptions | Fuwen Tan, Song Feng, Vicente Ordonez | | 135 | 18 | 0 | 04 Sep 2018 |
| Diverse and Coherent Paragraph Generation from Images | Moitreya Chatterjee, Alex Schwing | | 78 | 67 | 0 | 03 Sep 2018 |
| VoxSegNet: Volumetric CNNs for Semantic Part Segmentation of 3D Shapes | Zongji Wang, Feng Lu | 3DPC | 68 | 113 | 0 | 01 Sep 2018 |
| Improving Visual Relationship Detection using Semantic Modeling of Scene Descriptions | S. Baier, Yunpu Ma, Volker Tresp | | 102 | 59 | 0 | 01 Sep 2018 |
| LIUM-CVC Submissions for WMT18 Multimodal Translation Task | Ozan Caglayan, Adrien Bardet, Fethi Bougares, Loïc Barrault, M. García-Martínez, Marc Masana, Luis Herranz, Joost van de Weijer | | 68 | 41 | 0 | 01 Sep 2018 |
| When to Finish? Optimal Beam Search for Neural Text Generation (modulo beam size) | Liang Huang, Kai Zhao, Mingbo Ma | | 96 | 54 | 0 | 31 Aug 2018 |
| A Deep Neural Network Sentence Level Classification Method with Context Information | Xingyi Song, Johann Petrak, A. Roberts | | 49 | 21 | 0 | 31 Aug 2018 |
| An Adaptive Locally Connected Neuron Model: Focusing Neuron | F. Boray Tek | | 38 | 6 | 0 | 31 Aug 2018 |
| Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report | Renjie Zheng, Yilin Yang, Mingbo Ma, Liang Huang | | 86 | 8 | 0 | 31 Aug 2018 |
| Learning to Describe Differences Between Pairs of Similar Images | Harsh Jhamtani, Taylor Berg-Kirkpatrick | | 92 | 155 | 0 | 31 Aug 2018 |
| LUCSS: Language-based User-customized Colourization of Scene Sketches | C. Zou, Haoran Mo, Ruofei Du, Xing Wu, Chengying Gao, Hongbo Fu | | 47 | 8 | 0 | 30 Aug 2018 |
| End-to-end Speech Recognition with Adaptive Computation Steps | Mohan Li, Min Liu, Masanori Hattori | | 44 | 34 | 0 | 30 Aug 2018 |
| Hard Non-Monotonic Attention for Character-Level Transduction | Shijie Wu, Pamela Shapiro, Ryan Cotterell | | 90 | 42 | 0 | 29 Aug 2018 |
| Attention-based Neural Text Segmentation | Pinkesh Badjatiya, Litton J. Kurisinkel, Manish Gupta, Vasudeva Varma | | 53 | 68 | 0 | 29 Aug 2018 |
| Top-down Attention Recurrent VLAD Encoding for Action Recognition in Videos | Swathikiran Sudhakaran, Oswald Lanz | | 48 | 6 | 0 | 29 Aug 2018 |
| Interact as You Intend: Intention-Driven Human-Object Interaction Detection | Bingjie Xu, Junnan Li, Yongkang Wong, Mohan Kankanhalli, Qi Zhao | | 103 | 100 | 0 | 29 Aug 2018 |
| Notes on Deep Learning for NLP | A. Tixier | VLM | 66 | 15 | 0 | 29 Aug 2018 |
| Multi-Reference Training with Pseudo-References for Neural Translation and Text Generation | Renjie Zheng, Mingbo Ma, Liang Huang | | 80 | 35 | 0 | 28 Aug 2018 |
| Natural Language Generation with Neural Variational Models | Hareesh Bahuleyan | DRL | 49 | 6 | 0 | 27 Aug 2018 |
| A neural attention model for speech command recognition | Douglas Coimbra de Andrade, Sabato Leo, M. Viana, Christoph Bernkopf | | 57 | 145 | 0 | 27 Aug 2018 |
| simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions | Fenglin Liu, Xuancheng Ren, Yuanxin Liu, Houfeng Wang, Xu Sun | | 132 | 66 | 0 | 27 Aug 2018 |
| Learning End-to-End Goal-Oriented Dialog with Multiple Answers | Janarthanan Rajendran, Jatin Ganhotra, Satinder Singh, L. Polymenakos | | 61 | 37 | 0 | 24 Aug 2018 |
| Approximate Distribution Matching for Sequence-to-Sequence Learning | Wenhu Chen, Guanlin Li, Shujie Liu, Zhirui Zhang, Mu Li, M. Zhou | OOD, BDL | 38 | 0 | 0 | 24 Aug 2018 |
| Sarcasm Analysis using Conversation Context | Debanjan Ghosh, Alexander R. Fabbri, Smaranda Muresan | | 72 | 85 | 0 | 22 Aug 2018 |
| Attention Gated Networks: Learning to Leverage Salient Regions in Medical Images | Jo Schlemper, Ozan Oktay, M. Schaap, M. Heinrich, Bernhard Kainz, Ben Glocker, Daniel Rueckert | MedIm | 142 | 1,488 | 0 | 22 Aug 2018 |
| Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents | Ganbin Zhou, Rongyu Cao, Xiang Ao, Ping Luo, Fen Lin, Leyu Lin, Qing He | | 48 | 0 | 0 | 22 Aug 2018 |
| Exploring a Unified Attention-Based Pooling Framework for Speaker Verification | Yi Y. Liu, Liang He, Weiwei Liu, Jia-Wei Liu | | 43 | 8 | 0 | 21 Aug 2018 |
| VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification | Song-Le Chen, Lintao Zheng, Yan Zhang, Zhixin Sun, Kai Xu | 3DPC, 3DV | 79 | 75 | 0 | 20 Aug 2018 |