v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
BSDAR: Beam Search Decoding with Attention Reward in Neural Keyphrase Generation Iftitahu Ni'mah Vlado Menkovski Mykola Pechenizkiy 44 2 0 17 Sep 2019
Learning to Deceive with Attention-Based Explanations Danish Pruthi Mansi Gupta Bhuwan Dhingra Graham Neubig Zachary Chase Lipton 118 194 0 17 Sep 2019
Inverse Visual Question Answering with Multi-Level Attentions Yaser Alwatter Yuhong Guo BDL 39 1 0 17 Sep 2019
Controllable Text-to-Image Generation Bowen Li Xiaojuan Qi Thomas Lukasiewicz Philip Torr GAN 152 357 0 16 Sep 2019
Motion Guided Attention for Video Salient Object Detection Haofeng Li Guanqi Chen Guanbin Li Yizhou Yu 128 167 0 16 Sep 2019
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer Wentao Jiang Si Liu Chen Gao Jie Cao Ran He Jiashi Feng Shuicheng Yan CVBM 76 130 0 16 Sep 2019
Deep Collaborative Filtering with Multi-Aspect Information in Heterogeneous Networks C. Shi Xiaotian Han Li Song Tianlin Li Senzhang Wang Junping Du Philip S. Yu 144 101 0 14 Sep 2019
SANVis: Visual Analytics for Understanding Self-Attention Networks Cheonbok Park Inyoup Na Yongjang Jo Sungbok Shin J. Yoo Bum Chul Kwon Jian Zhao Hyungjong Noh Yeonsoo Lee Jaegul Choo HAI 80 40 0 13 Sep 2019
Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent Neural Networks R. C. Staudemeyer Eric Rothstein Morris 67 498 0 12 Sep 2019
Speculative Beam Search for Simultaneous Translation Renjie Zheng Mingbo Ma Baigong Zheng Liang Huang 98 24 0 12 Sep 2019
Human Visual Attention Prediction Boosts Learning & Performance of Autonomous Driving Agents Alexander Makrigiorgos A. Shafti Alex Harston Julien Gérard A. Faisal 55 14 0 11 Sep 2019
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression Sicheng Zhao Zizhou Jia Hui Chen Leida Li Guiguang Ding Kurt Keutzer 94 62 0 11 Sep 2019
Dual-attention Focused Module for Weakly Supervised Object Localization Yukun Zhou Zailiang Chen Hai-lan Shen Qing Liu Rongchang Zhao Yixiong Liang WSOL 57 4 0 11 Sep 2019
Select and Attend: Towards Controllable Content Selection in Text Generation Xiaoyu Shen Jun Suzuki Kentaro Inui Hui Su Dietrich Klakow Satoshi Sekine 76 29 0 10 Sep 2019
Compositional Generalization in Image Captioning Mitja Nikolaus Mostafa Abdou Matthew Lamm Rahul Aralikatte Desmond Elliott CoGe 98 49 0 10 Sep 2019
FDA: Feature Disruptive Attack Aditya Ganeshan S. VivekB. R. Venkatesh Babu AAML 124 105 0 10 Sep 2019
Multimodal Attention Branch Network for Perspective-Free Sentence Generation A. Magassouba K. Sugiura Hisashi Kawai 45 17 0 10 Sep 2019
Neural Naturalist: Generating Fine-Grained Image Comparisons Maxwell Forbes Christine Kaeser-Chen Piyush Sharma Serge J. Belongie VLM 141 58 0 09 Sep 2019
Hierarchy Parsing for Image Captioning Ting Yao Yingwei Pan Yehao Li Tao Mei VLM 96 166 0 09 Sep 2019
Picture What you Read I. Gallo Shah Nawaz Alessandro Calefati Riccardo La Grassa Nicola Landro DiffM 66 0 0 09 Sep 2019
Improving Neural Question Generation using World Knowledge D. Gupta Kaheer Suleman Mahmoud Adada Andrew McNamara Justin Harris MedIm 82 7 0 09 Sep 2019
Transfer Reward Learning for Policy Gradient-Based Text Generation James OÑeill Danushka Bollegala 25 1 0 09 Sep 2019
AtLoc: Attention Guided Camera Localization Bing Wang Changhao Chen Chris Xiaoxuan Lu Peijun Zhao A. Trigoni Andrew Markham 102 158 0 08 Sep 2019
Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks Chen Zhang Qiuchi Li D. Song GNN 66 445 0 08 Sep 2019
Conditional Text Generation for Harmonious Human-Machine Interaction Bin Guo Hao Wang Yasan Ding Wei Wu Shaoyang Hao Yueqi Sun Zhiwen Yu 103 4 0 08 Sep 2019
Look and Modify: Modification Networks for Image Captioning Fawaz Sammani Mahmoud Elsayed 52 22 0 07 Sep 2019
What can computational models learn from human selective attention? A review from an audiovisual crossmodal perspective Di Fu C. Weber Guochun Yang Matthias Kerzel Weizhi Nan Pablo V. A. Barros Haiyan Wu Xun Liu S. Wermter 35 0 0 05 Sep 2019
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation Wei Wei Ling Cheng Xian-Ling Mao Guangyou Zhou Feida Zhu DiffM 79 19 0 05 Sep 2019
Semantic-Aware Scene Recognition Alejandro López-Cifuentes Marcos Escudero-Viñolo Jesús Bescós Álvaro García-Martín 86 106 0 05 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question Answering Hongyang Xue Wenqing Chu Zhou Zhao Deng Cai 62 33 0 05 Sep 2019
Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach Dong-Jin Kim Jinsoo Choi Tae-Hyun Oh In So Kweon SSL VLM 89 56 0 05 Sep 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering Soravit Changpinyo Bo Pang Piyush Sharma Radu Soricut ObjD 65 20 0 04 Sep 2019
Do Cross Modal Systems Leverage Semantic Relationships? Shah Nawaz Muhammad Kamran Janjua I. Gallo Arif Mahmood Alessandro Calefati Faisal Shafait 56 8 0 03 Sep 2019
Encode, Tag, Realize: High-Precision Text Editing Eric Malmi Sebastian Krause S. Rothe Daniil Mirylenka Aliaksei Severyn 3DV 112 171 0 03 Sep 2019
A Geometry-Sensitive Approach for Photographic Style Classification Koustav Ghosal Mukta Prasad A. Smolic GAN 61 6 0 03 Sep 2019
EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks Pengfei Zhang Jianru Xue Cuiling Lan Wenjun Zeng Zhanning Gao Nanning Zheng 72 85 0 03 Sep 2019
Story-oriented Image Selection and Placement Sreyasi Nag Chowdhury Simon Razniewski Gerhard Weikum 27 1 0 02 Sep 2019
SumQE: a BERT-based Summary Quality Estimation Model Stratos Xenouleas Prodromos Malakasiotis Marianna Apidianaki Ion Androutsopoulos 71 37 0 02 Sep 2019
What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues Xintong Yu Hongming Zhang Yangqiu Song Yan Song Changshui Zhang 42 28 0 01 Sep 2019
Phrase Grounding by Soft-Label Chain Conditional Random Field Jiacheng Liu Julia Hockenmaier 50 10 0 01 Sep 2019
Humor Detection: A Transformer Gets the Last Laugh Orion Weller Kevin Seppi 138 123 0 31 Aug 2019
A Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Haoran Chen Ke Lin A. Maye Jianmin Li Xiaoling Hu 64 48 0 31 Aug 2019
Rethinking Irregular Scene Text Recognition Shangbang Long Yushuo Guan Bingxuan Wang Kaigui Bian Cong Yao 67 8 0 30 Aug 2019
Reflective Decoding Network for Image Captioning Lei Ke Wenjie Pei Ruiyu Li Xiaoyong Shen Yu-Wing Tai ObjD 75 94 0 30 Aug 2019
Translating Math Formula Images to LaTeX Sequences Using Deep Neural Networks with Sequence-level Training Zelun Wang Jyh-Charn S. Liu 29 7 0 29 Aug 2019
Aesthetic Image Captioning From Weakly-Labelled Photographs Koustav Ghosal A. Rana A. Smolic 67 25 0 29 Aug 2019
DFPENet-geology: A Deep Learning Framework for High Precision Recognition and Segmentation of Co-seismic Landslides Qingsong Xu Chaojun Ouyang Tianhai Jiang Xuanmei Fan Duoxiang Cheng AI4CE 46 13 0 28 Aug 2019
Image Captioning with Sparse Recurrent Neural Network J. Tan Chee Seng Chan Joon Huang Chuah VLM 56 6 0 28 Aug 2019
Fingerspelling recognition in the wild with iterative visual attention Bowen Shi Aurora Martinez Del Rio J. Keane D. Brentari G. Shakhnarovich Karen Livescu 68 63 0 28 Aug 2019
Attention-based Dropout Layer for Weakly Supervised Object Localization Junsuk Choe Hyunjung Shim WSOL 155 368 0 27 Aug 2019