v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
DeepPicker: a Deep Learning Approach for Fully Automated Particle Picking in Cryo-EM Feng Wang Huichao Gong Gaochao liu Meijing Li Chuangye Yan Tian Xia Xueming Li Jianyang Zeng 53 172 0 06 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking Xiaoyu Lin Devi Parikh CoGe 112 84 0 04 May 2016
Multi30K: Multilingual English-German Image Descriptions Desmond Elliott Stella Frank K. Simaán Lucia Specia VLM 140 590 0 02 May 2016
Look-ahead before you leap: end-to-end active recognition by forecasting the effect of motion Dinesh Jayaraman Kristen Grauman 80 91 0 30 Apr 2016
Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition Théodore Bluche AI4TS 167 189 0 28 Apr 2016
Dialog-based Language Learning Jason Weston LLMAG 141 109 0 20 Apr 2016
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging Jiren Jin Hideki Nakayama 3DV VLM 109 69 0 18 Apr 2016
Parallelizing Word2Vec in Shared and Distributed Memory Shihao Ji N. Satish Sheng Li Pradeep Dubey VLM MoE 64 72 0 15 Apr 2016
Learning Visual Storylines with Skipping Recurrent Neural Networks Gunnar Sigurdsson Xinlei Chen Abhinav Gupta 80 39 0 14 Apr 2016
Filling in the details: Perceiving from low fidelity images F. Wick Michael L. Wick M. Pomplun 3DH 21 1 0 14 Apr 2016
Visual Storytelling Ting-Hao 'Kenneth' Huang Huang Francis Ferraro N. Mostafazadeh Ishan Misra ... C. L. Zitnick Devi Parikh Lucy Vanderwende Michel Galley Margaret Mitchell VGen 99 480 0 13 Apr 2016
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention Théodore Bluche J. Louradour Ronaldo O. Messina VLM 100 170 0 12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description Yuncheng Li Yale Song Liangliang Cao Joel R. Tetreault Larry Goldberg A. Jaimes Jiebo Luo 83 274 0 10 Apr 2016
Optimizing Performance of Recurrent Neural Networks on GPUs J. Appleyard Tomás Kociský Phil Blunsom 93 93 0 07 Apr 2016
Advances in Very Deep Convolutional Neural Networks for LVCSR Tom Sercu Vaibhava Goel 74 44 0 06 Apr 2016
Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition Ziyan Wang Jiwen Lu Ruogu Lin Jianjiang Feng Jie zhou 100 29 0 06 Apr 2016
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project Guntis Barzdins Steve Renals D. Gosko 31 6 0 05 Apr 2016
Image Captioning with Deep Bidirectional LSTMs Cheng Wang Haojin Yang Christian Bartz Christoph Meinel VLM 92 280 0 04 Apr 2016
Character-Level Question Answering with Attention David Golub Xiaodong He 92 185 0 04 Apr 2016
Reasoning About Pragmatics with Neural Listeners and Speakers Jacob Andreas Dan Klein ReLM LRM 111 175 0 02 Apr 2016
Automatic Annotation of Structured Facts in Images Mohamed Elhoseiny Scott D. Cohen W. Chang Brian L. Price Ahmed Elgammal 61 9 0 02 Apr 2016
AttSum: Joint Learning of Focusing and Summarization with Neural Attention Ziqiang Cao Wenjie Li Sujian Li Furu Wei Yanran Li 93 117 0 01 Apr 2016
Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection Sheng-syun Shen Hung-yi Lee 89 66 0 31 Mar 2016
Minimal Gated Unit for Recurrent Neural Networks Guoxiang Zhou Jianxin Wu Chen-Da Liu-Zhang Zhi Zhou 83 333 0 31 Mar 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning Andrew Shin Masataka Yamaguchi Katsunori Ohnishi Tatsuya Harada 86 8 0 30 Mar 2016
Recurrent Batch Normalization Tim Cooijmans Nicolas Ballas César Laurent Çağlar Gülçehre Aaron Courville ODL 114 411 0 30 Mar 2016
Rich Image Captioning in the Wild Kenneth Tran Xiaodong He Lei Zhang Jian Sun Cornelia Carapcea Chris Thrasher Chris Buehler Chris Sienkiewicz VLM 60 124 0 30 Mar 2016
Generating Visual Explanations Lisa Anne Hendricks Zeynep Akata Marcus Rohrbach Jeff Donahue Bernt Schiele Trevor Darrell VLM FAtt 110 622 0 28 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation Hoo-Chang Shin Kirk Roberts Le Lu Dina Demner-Fushman Jianhua Yao Ronald M. Summers 77 351 0 28 Mar 2016
Audio Visual Emotion Recognition with Temporal Alignment and Perception Attention Linlin Chao J. Tao Minghao Yang Ya Li Zhengqi Wen 51 30 0 28 Mar 2016
Recurrent Mixture Density Network for Spatiotemporal Visual Attention Loris Bazzani Hugo Larochelle Lorenzo Torresani 103 135 0 27 Mar 2016
Neural Text Generation from Structured Data with Application to the Biography Domain R. Lebret David Grangier Michael Auli 77 46 0 24 Mar 2016
Attentive Contexts for Object Detection Jianan Li Yunchao Wei Xiaodan Liang Jian Dong Tingfa Xu Jiashi Feng Shuicheng Yan ObjD 79 222 0 24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing Arnau Ramisa F. Yan Francesc Moreno-Noguer K. Mikolajczyk 72 106 0 23 Mar 2016
Semantic Object Parsing with Graph LSTM Xiaodan Liang Xiaohui Shen Jiashi Feng Liang Lin Shuicheng Yan 204 356 0 23 Mar 2016
Deep Learning in Bioinformatics Seonwoo Min Byunghan Lee Sungroh Yoon AI4CE 3DV 112 1,365 0 21 Mar 2016
Segmentation from Natural Language Expressions Ronghang Hu Marcus Rohrbach Trevor Darrell VLM EgoV 86 439 0 20 Mar 2016
One-Shot Generalization in Deep Generative Models Danilo Jimenez Rezende S. Mohamed Ivo Danihelka Karol Gregor Daan Wierstra BDL VLM DRL LRM 133 254 0 16 Mar 2016
Image Captioning with Semantic Attention Quanzeng You Hailin Jin Zhaowen Wang Chen Fang Jiebo Luo VLM 232 1,666 0 12 Mar 2016
Neural Discourse Relation Recognition with Semantic Memory Biao Zhang Deyi Xiong Jinsong Su 37 16 0 12 Mar 2016
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild Chen-Yu Lee Simon Osindero VLM 95 460 0 09 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge Qi Wu Chunhua Shen Anton Van Den Hengel Peng Wang A. Dick 91 362 0 09 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering Caiming Xiong Stephen Merity R. Socher 92 756 0 04 Mar 2016
Noisy Activation Functions Çağlar Gülçehre Marcin Moczulski Misha Denil Yoshua Bengio 57 284 0 01 Mar 2016
Recurrent Neural Network Grammars Chris Dyer A. Kuncoro Miguel Ballesteros Noah A. Smith GNN 121 527 0 25 Feb 2016
Learning to Generate with Memory Chongxuan Li Jun Zhu Bo Zhang BDL 130 42 0 24 Feb 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations Ranjay Krishna Yuke Zhu Oliver Groth Justin Johnson Kenji Hata ... Yannis Kalantidis Li Li David A. Shamma Michael S. Bernstein Fei-Fei Li 487 5,779 0 23 Feb 2016
Contextual LSTM (CLSTM) models for Large scale NLP tasks Shalini Ghosh Oriol Vinyals B. Strope Scott Roy Tom Dean Larry Heck 75 213 0 19 Feb 2016
"Why Should I Trust You?": Explaining the Predictions of Any Classifier Marco Tulio Ribeiro Sameer Singh Carlos Guestrin FAtt FaML 1.3K 17,237 0 16 Feb 2016
Look, Listen and Learn - A Multimodal LSTM for Speaker Identification Jimmy S. J. Ren Yongtao Hu Yu-Wing Tai Chuan Wang Li Xu Wenxiu Sun Qiong Yan 84 108 0 13 Feb 2016