v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning Austin R. Clyde Thomas Brettin A. Partin Maulik Shaulik H. Yoo Yvonne A. Evrard Yitan Zhu Fangfang Xia Rick L. Stevens 124 7 0 30 Apr 2020
Towards Embodied Scene Description Sinan Tan Huaping Liu Di Guo Xinyu Zhang F. Sun LM&Ro 54 9 0 30 Apr 2020
memeBot: Towards Automatic Image Meme Generation Aadhavan Sadasivam K. Gunasekar H. Davulcu Yezhou Yang 42 10 0 30 Apr 2020
WT5?! Training Text-to-Text Models to Explain their Predictions Sharan Narang Colin Raffel Katherine Lee Adam Roberts Noah Fiedel Karishma Malkan 102 201 0 30 Apr 2020
Explainable Deep Learning: A Field Guide for the Uninitiated Gabrielle Ras Ning Xie Marcel van Gerven Derek Doran AAML XAI 122 382 0 30 Apr 2020
Image Captioning through Image Transformer Sen He Wentong Liao Hamed R. Tavakoli M. Yang Bodo Rosenhahn N. Pugeault ViT 97 94 0 29 Apr 2020
Valid Explanations for Learning to Rank Models Jaspreet Singh Zhenye Wang Megha Khosla Avishek Anand LRM FAtt 43 8 0 29 Apr 2020
The Explanation Game: Towards Prediction Explainability through Sparse Communication Marcos Vinícius Treviso André F. T. Martins FAtt 72 3 0 28 Apr 2020
Exploring Self-attention for Image Recognition Hengshuang Zhao Jiaya Jia V. Koltun SSL 105 792 0 28 Apr 2020
Local Lipschitz Bounds of Deep Neural Networks Calypso Herrera Florian Krach Josef Teichmann 40 3 0 27 Apr 2020
Self-Supervised Attention Learning for Depth and Ego-motion Estimation Assem Sadek Boris Chidlovskii MDE 77 6 0 27 Apr 2020
Sequential Interpretability: Methods, Applications, and Future Direction for Understanding Deep Learning Models in the Context of Sequential Data B. Shickel Parisa Rashidi AI4TS 72 18 0 27 Apr 2020
Attention Based Real Image Restoration Saeed Anwar Nick Barnes L. Petersson 63 0 0 26 Apr 2020
Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports Baoyu Jing Zeya Wang Eric Xing 107 142 0 26 Apr 2020
Quantifying the Contextualization of Word Representations with Semantic Class Probing Mengjie Zhao Philipp Dufter Yadollah Yaghoobzadeh Hinrich Schütze 83 27 0 25 Apr 2020
Detective: An Attentive Recurrent Model for Sparse Object Detection A. Kechaou Manuel Martínez Monica Haurilet Rainer Stiefelhagen ObjD 39 3 0 25 Apr 2020
Deep Multimodal Neural Architecture Search Zhou Yu Yuhao Cui Jun-chen Yu Meng Wang Dacheng Tao Qi Tian 77 100 0 25 Apr 2020
The Variational Bandwidth Bottleneck: Stochastic Evaluation on an Information Budget Anirudh Goyal Yoshua Bengio M. Botvinick Sergey Levine 70 24 0 24 Apr 2020
Survey on Visual Sentiment Analysis A. Ortis G. Farinella Sebastiano Battiato 45 77 0 24 Apr 2020
Why an Android App is Classified as Malware? Towards Malware Classification Interpretation Bozhi Wu Sen Chen Cuiyun Gao Lingling Fan Yang Liu W. Wen Michael R. Lyu 107 59 0 24 Apr 2020
Efficient Neural Architecture for Text-to-Image Synthesis Douglas M. Souza Jonatas Wehrmann D. Ruiz 53 24 0 23 Apr 2020
Visual Question Answering Using Semantic Information from Image Descriptions Tasmia Tasrin Md Sultan al Nahian Brent Harrison 32 0 0 23 Apr 2020
Textual Visual Semantic Dataset for Text Spotting Ahmed Sabir Francesc Moreno-Noguer Lluís Padró 43 3 0 21 Apr 2020
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs Shiyang Yan Yang Hua N. Robertson 81 7 0 21 Apr 2020
Transform and Tell: Entity-Aware News Image Captioning Alasdair Tran A. Mathews Lexing Xie VLM 60 97 0 17 Apr 2020
Knowledge-Based Visual Question Answering in Videos Noa Garcia Mayu Otani Chenhui Chu Yuta Nakashima 23 0 0 17 Apr 2020
Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence Huy Manh Nguyen Tomo Miyazaki Yoshihiro Sugaya S. Omachi 149 1 0 16 Apr 2020
Top-Down Networks: A coarse-to-fine reimagination of CNNs Ioannis Lelekas Nergis Tomen S. Pintea Jan van Gemert 29 6 0 16 Apr 2020
Destination Prediction Based on Partial Trajectory Data Patrick Ebel Ibrahim Emre Göl Christoph Lingenfelder Andreas Vogelsang 41 33 0 16 Apr 2020
Hybrid Attention Networks for Flow and Pressure Forecasting in Water Distribution Systems Ziqing Ma Shuming Liu Guancheng Guo Xipeng Yu AI4TS 15 4 0 13 Apr 2020
Sequential Weakly Labeled Multi-Activity Localization and Recognition on Wearable Sensors using Recurrent Attention Networks Kun Wang Jun He Lefei Zhang HAI 50 39 0 13 Apr 2020
Attend and Decode: 4D fMRI Task State Decoding Using Attention Models Sam Nguyen Brenda Ng Alan Kaplan Priyadip Ray 63 25 0 10 Apr 2020
S2A: Wasserstein GAN with Spatio-Spectral Laplacian Attention for Multi-Spectral Band Synthesis Litu Rout Indranil Misra Manthira Moorthi Subbiah D. Dhar 53 7 0 08 Apr 2020
Survey for Trust-aware Recommender Systems: A Deep Learning Perspective Manqing Dong Feng Yuan Lina Yao Xianzhi Wang Xiwei Xu Liming Zhu 73 8 0 08 Apr 2020
e-SNLI-VE: Corrected Visual-Textual Entailment with Natural Language Explanations Virginie Do Oana-Maria Camburu Zeynep Akata Thomas Lukasiewicz LRM 99 30 0 07 Apr 2020
Context-Aware Group Captioning via Self-Attention and Contrastive Features Zhuowan Li Quan Hung Tran Long Mai Zhe Lin Alan Yuille VLM 81 44 0 07 Apr 2020
Hierarchical Opacity Propagation for Image Matting Yaoyi Li Qin Xu Hongtao Lu 71 13 0 07 Apr 2020
Deep Attentive Generative Adversarial Network for Photo-Realistic Image De-Quantization Yang Zhang Changhui Hu Xiaobo Lu GAN 52 1 0 07 Apr 2020
Scenario-Transferable Semantic Graph Reasoning for Interaction-Aware Probabilistic Prediction Yeping Hu Wei Zhan Masayoshi Tomizuka 141 38 0 07 Apr 2020
Character-level Japanese Text Generation with Attention Mechanism for Chest Radiography Diagnosis Kenya Sakka Kotaro Nakayama Nisei Kimura Taiki Inoue Yusuke Iwasawa Ryohei Yamaguchi Yosimasa Kawazoe K. Ohe Y. Matsuo 16 2 0 06 Apr 2020
Guiding Monocular Depth Estimation Using Depth-Attention Volume Lam Huynh Phong Nguyen-Ha Jirí Matas Esa Rahtu J. Heikkilä MDE 97 156 0 06 Apr 2020
Sub-Instruction Aware Vision-and-Language Navigation Yicong Hong Cristian Rodriguez-Opazo Qi Wu Stephen Gould 136 72 0 06 Apr 2020
B-SCST: Bayesian Self-Critical Sequence Training for Image Captioning Shashank Bujimalla Mahesh Subedar Omesh Tickoo BDL UQCV 37 10 0 06 Apr 2020
Iterative Context-Aware Graph Inference for Visual Dialog Dan Guo Haibo Wang Hanwang Zhang Zhengjun Zha Meng Wang 89 49 0 05 Apr 2020
Towards Relevance and Sequence Modeling in Language Recognition Bharat Padi Anand Mohan Sriram Ganapathy 27 15 0 02 Apr 2020
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers Zhicheng Huang Zhaoyang Zeng Bei Liu Dongmei Fu Jianlong Fu ViT 214 440 0 02 Apr 2020
More Grounded Image Captioning by Distilling Image-Text Matching Model Yuanen Zhou Meng Wang Daqing Liu Zhenzhen Hu Hanwang Zhang 101 126 0 01 Apr 2020
X-Linear Attention Networks for Image Captioning Yingwei Pan Ting Yao Yehao Li Tao Mei 146 519 0 31 Mar 2020
Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters .Ilker Kesen Ozan Arkan Can Erkut Erdem Aykut Erdem Deniz Yuret VLM 64 1 0 28 Mar 2020
Actor-Transformers for Group Activity Recognition Kirill Gavrilyuk Ryan Sanford Mehrsan Javan Cees G. M. Snoek ViT 73 182 0 28 Mar 2020