v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
Saliency-Guided Attention Network for Image-Sentence Matching Zhong Ji Haoran Wang Jiawei Han Yanwei Pang 71 89 0 20 Apr 2019
Salient Object Detection in the Deep Learning Era: An In-Depth Survey Wenguan Wang Qiuxia Lai Huazhu Fu Jianbing Shen Haibin Ling Ruigang Yang 108 617 0 19 Apr 2019
Emergence of Compositional Language with Deep Generational Transmission Michael Cogswell Jiasen Lu Stefan Lee Devi Parikh Dhruv Batra 117 49 0 19 Apr 2019
Attentive Single-Tasking of Multiple Tasks Kevis-Kokitsi Maninis Ilija Radosavovic Iasonas Kokkinos 204 251 0 18 Apr 2019
Learning to Collocate Neural Modules for Image Captioning Xu Yang Hanwang Zhang Jianfei Cai 71 78 0 18 Apr 2019
DeepNovoV2: Better de novo peptide sequencing with deep learning Rui Qiao Ngoc Hieu Tran L. Xin B. Shan Ming Li A. Ghodsi 37 17 0 17 Apr 2019
Aggregation Cross-Entropy for Sequence Recognition Zecheng Xie Yaoxiong Huang Yuanzhi Zhu Lianwen Jin Yuliang Liu Lele Xie 95 92 0 17 Apr 2019
BS-Nets: An End-to-End Framework For Band Selection of Hyperspectral Image Yaoming Cai Xiaobo Liu Z. Cai 49 192 0 17 Apr 2019
CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing Xin Jin Cuiling Lan Wenjun Zeng Zhizheng Zhang Zhibo Chen 74 7 0 17 Apr 2019
Explainability in Human-Agent Systems A. Rosenfeld A. Richardson XAI 86 207 0 17 Apr 2019
Neural Message Passing for Multi-Label Classification Jack Lanchantin Arshdeep Sekhon Yanjun Qi 66 38 0 17 Apr 2019
Real Image Denoising with Feature Attention Saeed Anwar Nick Barnes 110 513 0 16 Apr 2019
Latent Code and Text-based Generative Adversarial Networks for Soft-text Generation Md. Akmal Haidar Mehdi Rezagholizadeh Alan Do-Omri Ahmad Rashid GAN 68 15 0 15 Apr 2019
Self-critical n-step Training for Image Captioning Junlong Gao Shiqi Wang Shanshe Wang Siwei Ma Wen Gao 97 55 0 15 Apr 2019
An Empirical Investigation of Global and Local Normalization for Recurrent Neural Sequence Models Using a Continuous Relaxation to Beam Search Kartik Goyal Chris Dyer Taylor Berg-Kirkpatrick 65 16 0 15 Apr 2019
IIT (BHU) Varanasi at MSR-SRST 2018: A Language Model Based Approach for Natural Language Generation Shreyansh Singh Avi Chawla Ayush Sharma Anil Kumar Singh 21 3 0 12 Apr 2019
Factor Graph Attention Idan Schwartz Seunghak Yu Tamir Hazan Alex Schwing 132 110 0 11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog Idan Schwartz Alex Schwing Tamir Hazan 89 71 0 11 Apr 2019
An Empirical Study of Spatial Attention Mechanisms in Deep Networks Xizhou Zhu Dazhi Cheng Zheng Zhang Stephen Lin Jifeng Dai 94 420 0 11 Apr 2019
FTGAN: A Fully-trained Generative Adversarial Networks for Text to Face Generation Xiang Chen Lingbo Qing Xiaohai He Xiaodong Luo Yining Xu GAN CVBM 65 34 0 11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations Zilong Zheng Wenguan Wang Siyuan Qi Song-Chun Zhu 128 117 0 11 Apr 2019
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic Representations Hao Wu Jiayuan Mao Yufeng Zhang Yuning Jiang Lei Li Weiwei Sun Wei-Ying Ma 33 8 0 11 Apr 2019
Knowledge Squeezed Adversarial Network Compression Changyong Shu Li Peng Xie Yuan Yanyun Qu Longquan Dai Lizhuang Ma GAN 75 11 0 10 Apr 2019
Identifying Sub-Phenotypes of Acute Kidney Injury using Structured and Unstructured Electronic Health Record Data with Memory Networks Zhenxing Xu Jingyuan Chou Xi Sheryl Zhang Yuan Luo T. Isakova ... Richard C. Kiefer J. Pacheco Luke Rasmussen Jyotishman Pathak Fei Wang 74 54 0 10 Apr 2019
Context-Aware Embeddings for Automatic Art Analysis Noa Garcia B. Renoust Yuta Nakashima 47 52 0 10 Apr 2019
Cross-Modal Self-Attention Network for Referring Image Segmentation Linwei Ye Mrigank Rochan Zhi Liu Yang Wang EgoV 87 478 0 09 Apr 2019
Attention-based Multi-instance Neural Network for Medical Diagnosis from Incomplete and Low Quality Data Zeyuan Wang Josiah Poon Shiding Sun S. Poon 93 26 0 09 Apr 2019
Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection Vicky Zayats Mari Ostendorf 57 30 0 08 Apr 2019
L2AE-D: Learning to Aggregate Embeddings for Few-shot Learning with Meta-level Dropout Heda Song M. Torres Ender Ozcan I. Triguero 57 8 0 08 Apr 2019
Streamlined Dense Video Captioning Jonghwan Mun L. Yang Zhou Ren N. Xu Bohyung Han 94 144 0 08 Apr 2019
SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression Christos Baziotis Ion Androutsopoulos Ioannis Konstas Alexandros Potamianos 76 83 0 07 Apr 2019
Learning to Learn Relation for Important People Detection in Still Images Wei-Hong Li Fa-Ting Hong Weishi Zheng 3DPC 3DH 57 27 0 07 Apr 2019
Doodle to Search: Practical Zero-Shot Sketch-based Image Retrieval S. Dey Pau Riba Anjan Dutta Josep Llados Yi-Zhe Song 94 181 0 06 Apr 2019
Modeling Point Clouds with Self-Attention and Gumbel Subset Sampling Jiancheng Yang Qiang Zhang Bingbing Ni Linguo Li Jinxian Liu Mengdie Zhou Qi Tian 3DPC 95 382 0 06 Apr 2019
Attention Distillation for Learning Video Representations Miao Liu Xin Chen Yun C. Zhang Yin Li James M. Rehg 66 2 0 05 Apr 2019
Information Aggregation for Multi-Head Attention with Routing-by-Agreement Jian Li Baosong Yang Zi-Yi Dou Xing Wang Michael R. Lyu Zhaopeng Tu 82 46 0 05 Apr 2019
Relation-Aware Global Attention for Person Re-identification Zhizheng Zhang Cuiling Lan Wenjun Zeng Xin Jin Zhibo Chen 3DPC 118 485 0 05 Apr 2019
Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval Yadan Luo Ziwei Wang Zi Huang Yang Yang Huimin Lu 44 7 0 05 Apr 2019
An Attentive Survey of Attention Models S. Chaudhari Varun Mithal Gungor Polatkan R. Ramanath 200 666 0 05 Apr 2019
Clinically Accurate Chest X-Ray Report Generation Guanxiong Liu T. Hsu Matthew B. A. McDermott Willie Boag W. Weng Peter Szolovits Marzyeh Ghassemi MedIm 134 279 0 04 Apr 2019
End-to-End Video Captioning Silvio Olivastri Gurkirt Singh Fabio Cuzzolin 70 18 0 04 Apr 2019
A Simple Joint Model for Improved Contextual Neural Lemmatization Chaitanya Malaviya Shijie Wu Ryan Cotterell 99 28 0 04 Apr 2019
Revisiting Visual Grounding E. Conser Kennedy Hahn Chandler M. Watson Melanie Mitchell 49 5 0 03 Apr 2019
Medical device surveillance with electronic health records A. Callahan Jason Alan Fries Christopher Ré J. Huddleston N. Giori Scott L. Delp N. Shah 79 54 0 03 Apr 2019
Good News, Everyone! Context driven entity-aware captioning for news images Ali Furkan Biten Lluís Gómez Marçal Rusiñol Dimosthenis Karatzas 89 141 0 02 Apr 2019
Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition Omer Arshad I. Gallo Shah Nawaz Alessandro Calefati 44 43 0 02 Apr 2019
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents Christian Rupprecht Cyril Ibrahim C. Pal 96 32 0 02 Apr 2019
Learning Good Representation via Continuous Attention Liang Zhao Wenyuan Xu 27 0 0 29 Mar 2019
Counting with Focus for Free Zenglin Shi Pascal Mettes Cees G. M. Snoek 3DV 3DPC 84 109 0 28 Mar 2019
Describing like humans: on diversity in image captioning Qingzhong Wang Antoni B. Chan 104 99 0 28 Mar 2019