v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
Position Focused Attention Network for Image-Text Matching Yaxiong Wang Hao-Hsiang Yang Xueming Qian Lin Ma Jing Lu Biao Li Xin Fan 58 172 0 23 Jul 2019
MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models Boyuan Pan Yazheng Yang Hao Li Zhou Zhao Yueting Zhuang Deng Cai Xiaofei He 64 18 0 23 Jul 2019
Compact Global Descriptor for Neural Networks Xiangyu He Ke Cheng Qiang Chen Qinghao Hu Peisong Wang Jian Cheng 96 8 0 23 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods Aditya Mogadala M. Kalimuthu Dietrich Klakow VLM 143 136 0 22 Jul 2019
Deep Learning for Time Series Forecasting: The Electric Load Case Alberto Gasparin S. Lukovic Cesare Alippi AI4TS 80 231 0 22 Jul 2019
Automatic Radiology Report Generation based on Multi-view Image Fusion and Medical Concept Enrichment Jianbo Yuan Haofu Liao R. Luo Jiebo Luo MedIm 88 196 0 22 Jul 2019
A study on the Interpretability of Neural Retrieval Models using DeepSHAP Zeon Trevor Fernando Jaspreet Singh Avishek Anand FAtt AAML 65 68 0 15 Jul 2019
Extracting Interpretable Physical Parameters from Spatiotemporal Systems using Unsupervised Learning Peter Y. Lu Samuel Kim Marin Soljacic AI4CE 67 60 0 13 Jul 2019
A Survey of Deep Learning-based Object Detection L. Jiao Fan Zhang Fang Liu Shuyuan Yang Lingling Li Zhixi Feng Rong Qu ObjD 131 974 0 11 Jul 2019
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions Yulei Niu Hanwang Zhang Zhiwu Lu Shih-Fu Chang ObjD BDL 101 26 0 08 Jul 2019
Informative Visual Storytelling with Cross-modal Rules Jiacheng Li Haizhou Shi Siliang Tang Leilei Gan Yueting Zhuang 52 24 0 07 Jul 2019
EPNAS: Efficient Progressive Neural Architecture Search Yanqi Zhou Peng Wang Sercan O. Arik Haonan Yu Syed Zawad Feng Yan G. Diamos 47 5 0 07 Jul 2019
Graph Representation Learning via Hard and Channel-Wise Attention Networks Hongyang Gao Shuiwang Ji GNN 70 57 0 05 Jul 2019
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters Federico Landi Lorenzo Baraldi M. Corsini Rita Cucchiara LM&Ro 98 27 0 05 Jul 2019
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks V. Kosaraju Amir Sadeghian Roberto Martín-Martín Ian Reid S. Hamid Rezatofighi Silvio Savarese 95 613 0 04 Jul 2019
ACNe: Attentive Context Normalization for Robust Permutation-Equivariant Learning Weiwei Sun Wei Jiang Eduard Trulls Andrea Tagliasacchi K. M. Yi 3DPC 90 20 0 04 Jul 2019
Learning Blended, Precise Semantic Program Embeddings Ke Wang Z. Su NAI 62 27 0 03 Jul 2019
Neural Image Captioning E. Tan Lakshay Sharma VLM 55 3 0 02 Jul 2019
Generative Models for Automatic Chemical Design Daniel Schwalbe-Koda Rafael Gómez-Bombarelli MedIm AI4CE 89 81 0 02 Jul 2019
Augmenting Self-attention with Persistent Memory Sainbayar Sukhbaatar Edouard Grave Guillaume Lample Hervé Jégou Armand Joulin RALM KELM 77 139 0 02 Jul 2019
Kite: Automatic speech recognition for unmanned aerial vehicles Dan Oneaţă H. Cucu 40 13 0 02 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems Hung Le Doyen Sahoo Nancy F. Chen Guosheng Lin 70 112 0 02 Jul 2019
Inter and Intra Document Attention for Depression Risk Assessment Diego Maupomé Marc Queudot Marie-Jean Meurs 14 7 0 30 Jun 2019
Machine Reading Comprehension: a Literature Review Xin Zhang An Yang Sujian Li Yizhong Wang 91 33 0 30 Jun 2019
A Neural Attention Model for Adaptive Learning of Social Friends' Preferences Dimitrios Rafailidis Gerhard Weiss GNN FedML 50 2 0 29 Jun 2019
A Deep Decoder Structure Based on WordEmbedding Regression for An Encoder-Decoder Based Model for Image Captioning A. Asadi Reza Safabakhsh 26 3 0 26 Jun 2019
Creating A Neural Pedagogical Agent by Jointly Learning to Review and Assess Youngnam Lee Youngduck Choi Junghyun Cho Alexander R. Fabbri Hyunbin Loh Chanyou Hwang Yongku Lee Sang-Wook Kim Dragomir R. Radev 35 18 0 26 Jun 2019
Deep Modular Co-Attention Networks for Visual Question Answering Zhou Yu Jun Yu Yuhao Cui Dacheng Tao Q. Tian 150 811 0 25 Jun 2019
Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation Maximilian Spliethover Jonas Klaff Hendrik Heuer 54 10 0 24 Jun 2019
CORAL8: Concurrent Object Regression for Area Localization in Medical Image Panels Sam Maksoud Arnold Wiliem Kun-li Zhao Teng Zhang Lin Wu Brian C. Lovell MedIm 51 11 0 24 Jun 2019
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments K. Niu Y. Huang Wanli Ouyang Liang Wang 58 144 0 23 Jun 2019
Sequence Generation: From Both Sides to the Middle Long Zhou Jiajun Zhang Chengqing Zong Heng Yu 76 22 0 23 Jun 2019
Informative Image Captioning with External Sources of Information Sanqiang Zhao Piyush Sharma Tomer Levinboim Radu Soricut 65 46 0 20 Jun 2019
Understanding More about Human and Machine Attention in Deep Neural Networks Qiuxia Lai Salman Khan Wenguan Wang Jianbing Shen Hanqiu Sun Ling Shao HAI XAI 52 7 0 20 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors G. Lambard Ekaterina Gracheva 71 21 0 20 Jun 2019
A simple and effective postprocessing method for image classification Yan Liu Yun Li Yunhao Yuan Jipeng Qiang 21 1 0 19 Jun 2019
VizADS-B: Analyzing Sequences of ADS-B Images Using Explainable Convolutional LSTM Encoder-Decoder to Detect Cyber Attacks Sefi Akerman Edan Habler A. Shabtai 82 18 0 19 Jun 2019
Distilling Translations with Visual Awareness Julia Ive Pranava Madhyastha Lucia Specia VLM 154 76 0 18 Jun 2019
Expressing Visual Relationships via Language Hao Tan Franck Dernoncourt Zhe Lin Trung Bui Joey Tianyi Zhou 93 68 0 18 Jun 2019
Attention Guided Graph Convolutional Networks for Relation Extraction Zhijiang Guo Yan Zhang Wei Lu GNN 97 413 0 18 Jun 2019
ASAC: Active Sensing using Actor-Critic models Chang Jo Kim James Jordon M. Schaar CML 63 16 0 16 Jun 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding Jian Zheng S. Krishnamurthy Ruxin Chen Min-Hung Chen Zhenhao Ge Xiaohua Li 85 4 0 16 Jun 2019
Generating Diverse and Informative Natural Language Fashion Feedback Gil Sadeh L. Fritz Gabi Shalev Eduard Oks 58 5 0 15 Jun 2019
Connecting Touch and Vision via Cross-Modal Prediction Yunzhu Li Jun-Yan Zhu Russ Tedrake Antonio Torralba 80 139 0 14 Jun 2019
Image Captioning: Transforming Objects into Words Simão Herdade Armin Kappeler K. Boakye Joao Soares ViT 175 476 0 14 Jun 2019
Multigrid Neural Memory T. Huynh Michael Maire Matthew R. Walter 64 10 0 13 Jun 2019
Stand-Alone Self-Attention in Vision Models Prajit Ramachandran Niki Parmar Ashish Vaswani Irwan Bello Anselm Levskaya Jonathon Shlens VLM SLR ViT 193 1,218 0 13 Jun 2019
Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training William Harvey Michael Teng Frank Wood 50 4 0 13 Jun 2019
Vispi: Automatic Visual Perception and Interpretation of Chest X-rays X. Li Rui Cao D. Zhu 79 20 0 12 Jun 2019
Pay Attention to Convolution Filters: Towards Fast and Accurate Fine-Grained Transfer Learning Xiangxi Mo Ruizhe Cheng Tianyi Fang 35 3 0 12 Jun 2019