v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI Alejandro Barredo Arrieta Natalia Díaz Rodríguez Javier Del Ser Adrien Bennetot Siham Tabik ... S. Gil-Lopez Daniel Molina Richard Benjamins Raja Chatila Francisco Herrera XAI 351 6,391 0 22 Oct 2019
Weakly-Supervised Completion Moment Detection using Temporal Attention Farnoosh Heidarivincheh Majid Mirmehdi Dima Damen 40 9 0 22 Oct 2019
Fixed Pattern Noise Reduction for Infrared Images Based on Cascade Residual Attention CNN Juntao Guan R. Lai Ai Xiong Zesheng Liu Lin Gu 53 75 0 22 Oct 2019
Drivers Drowsiness Detection using Condition-Adaptive Representation Learning Framework Jongmin Yu Sangwook Park Sangwook Lee M. Jeon MedIm 83 86 0 22 Oct 2019
Multi-Resolution Weak Supervision for Sequential Data Frederic Sala P. Varma Jason Alan Fries Daniel Y. Fu Shiori Sagawa ... A. Ramamoorthy K. Xiao Kayvon Fatahalian J. Priest Christopher Ré NoLa 166 30 0 21 Oct 2019
A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image Synthesis Jorge Agnese Jonathan Herrera Haicheng Tao Xingquan Zhu EGVM 95 103 0 21 Oct 2019
Attention Enriched Deep Learning Model for Breast Tumor Segmentation in Ultrasound Images Aleksandar Vakanski Min Xian Phoebe E. Freer 90 144 0 20 Oct 2019
Endowing Deep 3D Models with Rotation Invariance Based on Principal Component Analysis Zelin Xiao Hongxin Lin Renjie Li Hongyang Chao Shengyong Ding 46 31 0 20 Oct 2019
Learning to Answer Subjective, Specific Product-Related Queries using Customer Reviews by Adversarial Domain Adaptation Manirupa Das Zhen Wang Evan Jaffe Madhuja Chattopadhyay Eric Fosler-Lussier R. Ramnath AAML 80 2 0 18 Oct 2019
Cross Attention Network for Few-shot Classification Rui Hou Hong Chang Bingpeng Ma Shiguang Shan Xilin Chen 282 647 0 17 Oct 2019
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style Hongwei Ge Zehang Yan Kai Zhang Mingde Zhao Liang Sun 59 25 0 15 Oct 2019
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee Shuangjie Xu Feng Xu Yu Cheng Pan Zhou 35 2 0 14 Oct 2019
Dynamic Attention Networks for Task Oriented Grounding S. Dasgupta Badri N. Patro Vinay P. Namboodiri 86 1 0 14 Oct 2019
Snow avalanche segmentation in SAR images with Fully Convolutional Neural Networks F. Bianchi J. Grahn M. Eckerstorfer E. Malnes H. Vickers 40 48 0 11 Oct 2019
Finding Interpretable Concept Spaces in Node Embeddings using Knowledge Bases Maximilian Idahl Megha Khosla Avishek Anand 33 10 0 11 Oct 2019
Multi-modal Deep Analysis for Multimedia Wenwu Zhu Xin Eric Wang Hongzhi Li 76 43 0 11 Oct 2019
Referring Expression Object Segmentation with Caption-Aware Consistency Yi-Wen Chen Yi-Hsuan Tsai Tiantian Wang Yen-Yu Lin Ming-Hsuan Yang EgoV 71 87 0 10 Oct 2019
Semantic-aware Image Deblurring Fuhai Chen Rongrong Ji Chengpeng Dai Xiaoshuai Sun Chia-Wen Lin Jiayi Ji Baochang Zhang Feiyue Huang Liujuan Cao BDL VLM 113 6 0 09 Oct 2019
Improved Res2Net model for Person re-identification Zongjing Cao H. Lee 116 2 0 08 Oct 2019
Modulated Self-attention Convolutional Network for VQA Jean-Benoit Delbrouck Antoine Maiorca Nathan Hubens Stéphane Dupont 29 1 0 08 Oct 2019
Graph Few-shot Learning via Knowledge Transfer Huaxiu Yao Chuxu Zhang Ying Wei Meng Jiang Suhang Wang Junzhou Huang Nitesh Chawla Z. Li 135 168 0 07 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability Marcella Cornia Lorenzo Baraldi Rita Cucchiara 164 29 0 07 Oct 2019
Adversarial reconstruction for Multi-modal Machine Translation Jean-Benoit Delbrouck Stéphane Dupont GAN 158 2 0 07 Oct 2019
On Leveraging the Visual Modality for Neural Machine Translation Vikas Raunak Sang Keun Choe Quanyang Lu Yi Xu Florian Metze 38 11 0 07 Oct 2019
Compositional Generalization for Primitive Substitutions Yuanpeng Li Liang Zhao Jianyu Wang Joel Hestness 77 87 0 07 Oct 2019
MASTER: Multi-Aspect Non-local Network for Scene Text Recognition Ning Lu Wenwen Yu Xianbiao Qi Yihao Chen Ping Gong Rong Xiao Xiang Bai 70 158 0 07 Oct 2019
Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory A. Vasudevan Ahmed K. Farahat Chetan Gupta LM&Ro 81 2 0 04 Oct 2019
Graph Analysis and Graph Pooling in the Spatial Domain M. Rahmani Maria Liakata GNN 61 3 0 03 Oct 2019
Residual Attention Graph Convolutional Network for Geometric 3D Scene Classification Albert Mosella-Montoro Javier Ruiz-Hidalgo 3DPC 128 8 0 30 Sep 2019
DeepUSPS: Deep Robust Unsupervised Saliency Prediction With Self-Supervision D. Nguyen Maximilian Dax Chaithanya Kumar Mummadi Thi Phuong Nhung Ngo T. Nguyen Zhongyu Lou Thomas Brox 114 70 0 28 Sep 2019
The Detection of Distributional Discrepancy for Text Generation Xingyuan Chen Ping Cai Peng Jin Haokun Du Hongjun Wang Xingyu Dai Jiajun Chen 43 0 0 28 Sep 2019
Imitation Learning Based on Bilateral Control for Human-Robot Cooperation Ayumu Sasagawa K. Fujimoto S. Sakaino T. Tsuji 53 2 0 28 Sep 2019
Learning Category Correlations for Multi-label Image Recognition with Graph Networks Qing Li Xiaojiang Peng Yu Qiao Qiang Peng 51 22 0 28 Sep 2019
Interpreting Undesirable Pixels for Image Classification on Black-Box Models Sin-Han Kang Hong G Jung Seong-Whan Lee FAtt 60 3 0 27 Sep 2019
Video-Based Convolutional Attention for Person Re-Identification Marco Zamprogno Marco Passon N. Martinel G. Serra G. Lancioni C. Micheloni C. Tasso G. Foresti 130 1 0 26 Sep 2019
Multi-grained Attention Networks for Single Image Super-Resolution Huapeng Wu Zhengxia Zou Jie Gui W. Zeng Jieping Ye Jun Zhang Hongyi Liu Zhihui Wei SupR 56 60 0 26 Sep 2019
Gated Channel Transformation for Visual Recognition Zongxin Yang Linchao Zhu Yu Wu Yezhou Yang ViT 67 212 0 25 Sep 2019
Attention Interpretability Across NLP Tasks Shikhar Vashishth Shyam Upadhyay Gaurav Singh Tomar Manaal Faruqui XAI MILM 97 176 0 24 Sep 2019
Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model Yanpei Shi Qiang Huang Thomas Hain 105 1 0 24 Sep 2019
Accept Synthetic Objects as Real: End-to-End Training of Attentive Deep Visuomotor Policies for Manipulation in Clutter P. Abolghasemi Ladislau Bölöni OffRL 85 10 0 24 Sep 2019
Paying Attention to Function Words Shane Steinert-Threlkeld 31 3 0 24 Sep 2019
Where to Look Next: Unsupervised Active Visual Exploration on 360° Input Soroush Seifi Tinne Tuytelaars 70 10 0 23 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators Kuang-Huei Lee Hamid Palangi Xi Chen Houdong Hu Jianfeng Gao VLM 67 37 0 22 Sep 2019
NeuroVectorizer: End-to-End Vectorization with Deep Reinforcement Learning Ameer Haj-Ali Nesreen Ahmed Theodore L. Willke Sophia Shao Krste Asanović Ion Stoica 92 101 0 20 Sep 2019
Goal-Embedded Dual Hierarchical Model for Task-Oriented Dialogue Generation Yi-An Lai Arshit Gupta Yi Zhang 49 1 0 19 Sep 2019
Adaptively Aligned Image Captioning via Adaptive Attention Time Lun Huang Wenmin Wang Yaxian Xia Jie Chen 83 63 0 19 Sep 2019
RUN through the Streets: A New Dataset and Baseline Models for Realistic Urban Navigation Tzuf Paz-Argaman Reut Tsarfaty 70 20 0 19 Sep 2019
Large-scale representation learning from visually grounded untranscribed speech Gabriel Ilharco Yuan Zhang Jason Baldridge SSL 87 61 0 19 Sep 2019
ContCap: A scalable framework for continual image captioning Giang Nguyen Tae Joon Jun T. Tran Tolcha Yalew Daeyoung Kim VLM CLL 73 10 0 19 Sep 2019
Pose-aware Multi-level Feature Network for Human Object Interaction Detection Bo Wan Desen Zhou Yongfei Liu Rongjie Li Xuming He 76 200 0 18 Sep 2019