v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
Mimic and Fool: A Task Agnostic Adversarial Attack Akshay Chaturvedi Utpal Garain AAML 57 27 0 11 Jun 2019
Relationship-Embedded Representation Learning for Grounding Referring Expressions Sibei Yang Guanbin Li Yizhou Yu ObjD 97 55 0 11 Jun 2019
Bag of Color Features For Color Constancy Firas Laakom Nikolaos Passalis Jenni Raitoharju Jarno Nikkanen Anastasios Tefas Alexandros Iosifidis Moncef Gabbouj 46 33 0 11 Jun 2019
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval Yale Song M. Soleymani 84 247 0 11 Jun 2019
Improving Neural Language Modeling via Adversarial Training Dilin Wang Chengyue Gong Qiang Liu AAML 124 119 0 10 Jun 2019
An Attention-based Recurrent Convolutional Network for Vehicle Taillight Recognition Kuan-Hui Lee Takaaki Tagawa Jia Pan Adrien Gaidon B. Douillard ViT 37 15 0 09 Jun 2019
Attention-based Conditioning Methods for External Knowledge Integration Katerina Margatina Christos Baziotis Alexandros Potamianos 51 30 0 09 Jun 2019
Attending to Discriminative Certainty for Domain Adaptation V. Kurmi Shanu Kumar Vinay P. Namboodiri OOD 98 108 0 08 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training Charles C. Chen Ruiyi Zhang Eunyee Koh Sungchul Kim Scott D. Cohen Tong Yu Ryan Rossi Razvan Bunescu AIMat 69 39 0 07 Jun 2019
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video Zhenfang Chen Lin Ma Wenhan Luo Kwan-Yee K. Wong 105 103 0 06 Jun 2019
Towards Interpretable Reinforcement Learning Using Attention Augmented Agents Alex Mott Daniel Zoran Mike Chrzanowski Daan Wierstra Danilo Jimenez Rezende 74 192 0 06 Jun 2019
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering Zhou Yu D. Xu Jun-chen Yu Ting Yu Zhou Zhao Yueting Zhuang Dacheng Tao 157 478 0 06 Jun 2019
Context-Aware Visual Policy Network for Fine-Grained Image Captioning Zhengjun Zha Daqing Liu Hanwang Zhang Yongdong Zhang Feng Wu 66 122 0 06 Jun 2019
Neural Legal Judgment Prediction in English Ilias Chalkidis Ion Androutsopoulos Nikolaos Aletras AILaw ELM 190 342 0 05 Jun 2019
Large-Scale Multi-Label Text Classification on EU Legislation Ilias Chalkidis Manos Fergadiotis Prodromos Malakasiotis Ion Androutsopoulos AILaw 66 217 0 05 Jun 2019
Machine Learning and System Identification for Estimation in Physical Systems Fredrik Bagge Carlson OOD 56 5 0 05 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences William Chan Nikita Kitaev Kelvin Guu Mitchell Stern Jakob Uszkoreit VLM 96 65 0 04 Jun 2019
Natural Vocabulary Emerges from Free-Form Annotations Jordi Pont-Tuset Michael Gygli V. Ferrari VLM 90 3 0 04 Jun 2019
Masked Non-Autoregressive Image Captioning Junlong Gao Xi Meng Shiqi Wang Xia Li Shanshe Wang Siwei Ma Wen Gao 80 39 0 03 Jun 2019
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS Mutian He Yan Deng Lei He 97 81 0 03 Jun 2019
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain Johanes Effendi Andros Tjandra S. Sakti Satoshi Nakamura 68 3 0 03 Jun 2019
A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions Sashank Santhanam Samira Shaikh 3DV 84 52 0 02 Jun 2019
Unsupervised Bilingual Lexicon Induction from Mono-lingual Multimodal Data Shizhe Chen Qin Jin Alexander G. Hauptmann SSL 46 9 0 02 Jun 2019
Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video Jian Liu Naveed Akhtar Ajmal Mian 3DH 66 10 0 01 Jun 2019
Do Human Rationales Improve Machine Explanations? Julia Strout Ye Zhang Raymond J. Mooney 89 58 0 31 May 2019
Audio Caption in a Car Setting with a Sentence-Level Loss Xuenan Xu Heinrich Dinkel Mengyue Wu Kai Yu 31 2 0 31 May 2019
Interactive-predictive neural multimodal systems Álvaro Peris F. Casacuberta KELM HAI 47 2 0 30 May 2019
Meta Dropout: Learning to Perturb Features for Generalization Haebeom Lee Taewook Nam Eunho Yang Sung Ju Hwang OOD 68 3 0 30 May 2019
Adversarial Sub-sequence for Text Generation Xingyuan Chen Yanzhe Li Peng Jin Jiuhua Zhang Xinyu Dai Jiajun Chen Gang Song GAN 55 5 0 30 May 2019
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback Hui Wu Yupeng Gao Xiaoxiao Guo Ziad Al-Halah Steven J. Rennie Kristen Grauman Rogerio Feris EgoV 167 68 0 30 May 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism Xuelong Li Aihong Yuan Xiaoqiang Lu 79 37 0 29 May 2019
Recurrent Existence Determination Through Policy Optimization Baoxiang Wang 47 1 0 29 May 2019
Semantic Fisher Scores for Task Transfer: Using Objects to Classify Scenes Mandar Dixit Yunsheng Li Nuno Vasconcelos 86 14 0 27 May 2019
Audio2Face: Generating Speech/Face Animation from Single Audio with Attention-Based Bidirectional LSTM Networks Guanzhong Tian Yi Yuan Yang Liu CVBM 86 45 0 27 May 2019
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models Linfeng Zhang Zhanhong Tan Jiebo Song Jingwei Chen Chenglong Bao Kaisheng Ma 55 71 0 27 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence Jeff Clune 148 122 0 27 May 2019
Transcribing Content from Structural Images with Spotlight Mechanism Yu Yin Zhenya Huang Enhong Chen Qi Liu Fuzheng Zhang Xing Xie Guoping Hu 48 22 0 27 May 2019
Extreme Multi-Label Legal Text Classification: A case study in EU Legislation Ilias Chalkidis Manos Fergadiotis Prodromos Malakasiotis Nikolaos Aletras Ion Androutsopoulos AILaw 86 75 0 26 May 2019
Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives Yi Tay Shuohang Wang Anh Tuan Luu Jie Fu Minh C. Phan Xingdi Yuan J. Rao S. Hui Aston Zhang 118 110 0 26 May 2019
A Survey on Biomedical Image Captioning Vasiliki Kougia John Pavlopoulos Ion Androutsopoulos MedIm 94 83 0 26 May 2019
Path Ranking with Attention to Type Hierarchies Weiyu Liu A. Daruna Z. Kira Sonia Chernova AIMat 70 13 0 26 May 2019
DIANet: Dense-and-Implicit Attention Network Zhongzhan Huang Senwei Liang Mingfu Liang Haizhao Yang CVBM 82 57 0 25 May 2019
Bivariate Beta-LSTM Kyungwoo Song Joonho Jang Seung-Jae Shin Il-Chul Moon 51 6 0 25 May 2019
Pose-adaptive Hierarchical Attention Network for Facial Expression Recognition Yuanyuan Liu Jiyao Peng Jiabei Zeng Shiguang Shan CVBM 69 16 0 24 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment Jivitesh Sharma Per-Arne Andersen Ole-Christoffer Granmo M. G. Olsen AI4CE 78 70 0 23 May 2019
AttentionRNN: A Structured Spatial Attention Mechanism Siddhesh Khandelwal Leonid Sigal 71 3 0 22 May 2019
What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention Antonino Furnari G. Farinella EgoV 141 175 0 22 May 2019
A Neural, Interactive-predictive System for Multimodal Sequence to Sequence Tasks Álvaro Peris F. Casacuberta 48 4 0 20 May 2019
Image Captioning based on Deep Learning Methods: A Survey Yiyu Wang Jungang Xu Yingfei Sun Xianpei Han VLM 44 7 0 20 May 2019
Less Memory, Faster Speed: Refining Self-Attention Module for Image Reconstruction Zheng Wang Jianwu Li Ge Song Tieling Li 28 2 0 20 May 2019