Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
Memory-augmented Dense Predictive Coding for Video Representation Learning Tengda Han Weidi Xie Andrew Zisserman SSL 126 242 0 03 Aug 2020
AUTSL: A Large Scale Multi-modal Turkish Sign Language Dataset and Baseline Methods Ozge Mercanoglu Sincan H. Keles SLR 77 173 0 03 Aug 2020
Efficient Urdu Caption Generation using Attention based LSTM Inaam Ilahi Hafiz Muhammad Abdullah Zia Ahtazaz Ehsan Rauf Tabassam Armaghan Ahmed VLM 67 3 0 02 Aug 2020
A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises S. Kevin Zhou H. Greenspan Christos Davatzikos James S. Duncan Bram van Ginneken A. Madabhushi Jerry L. Prince Daniel Rueckert Ronald M. Summers 220 650 0 02 Aug 2020
SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space Liu Yang VLM 60 5 0 02 Aug 2020
Improving Skeleton-based Action Recognition with Robust Spatial and Temporal Features Zeshi Yang KangKang Yin 3DPC 65 3 0 01 Aug 2020
Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge Report Jing Shi Zhiheng Li Haitian Zheng Yihang Xu Tianyou Xiao ... R. Magnotti A. Sexton Jeet Thaker Oscar Su Chenliang Xu 47 1 0 01 Aug 2020
Learning to Rank for Active Learning: A Listwise Approach Minghan Li Xialei Liu Joost van de Weijer Bogdan Raducanu 91 22 0 31 Jul 2020
Neural Language Generation: Formulation, Methods, and Evaluation Cristina Garbacea Qiaozhu Mei 165 30 0 31 Jul 2020
Foveation for Segmentation of Ultra-High Resolution Images Chen Jin Ryutaro Tanno Moucheng Xu T. Mertzanidou Daniel C. Alexander AI4TS 53 4 0 29 Jul 2020
Enriching Video Captions With Contextual Text Philipp Rimle Pelin Dogan Markus Gross 59 3 0 29 Jul 2020
Improving Recurrent Neural Network Responsiveness to Acute Clinical Events D. Ledbetter Eugene Laksana M. Aczon R. Wetzel OOD 32 3 0 28 Jul 2020
AiR: Attention with Reasoning Capability Shi Chen Ming Jiang Jinhui Yang Qi Zhao LRM 56 36 0 28 Jul 2020
Chest X-ray Report Generation through Fine-Grained Label Learning Tanveer Syeda-Mahmood Ken C. L. Wong Yaniv Gur Joy T. Wu A. Jadhav ... A. Pillai Arjun Sharma A. Syed Orest Boyko Mehdi Moradi 95 47 0 27 Jul 2020
RANDOM MASK: Towards Robust Convolutional Neural Networks Tiange Luo Tianle Cai Mengxiao Zhang Siyu Chen Liwei Wang AAML OOD 100 17 0 27 Jul 2020
Contrastive Visual-Linguistic Pretraining Lei Shi Kai Shuang Shijie Geng Peng Su Zhengkai Jiang Peng Gao Zuohui Fu Gerard de Melo Sen Su VLM SSL CLIP 105 29 0 26 Jul 2020
Dynamically Extracting Outcome-Specific Problem Lists from Clinical Notes with Guided Multi-Headed Attention Justin Lovelace N. Hurley A. Haimovich B. Mortazavi 69 4 0 25 Jul 2020
Deep Inverse Reinforcement Learning for Structural Evolution of Small Molecules Brighter Agyemang Wei-Ping Wu Daniel Addo Michael Y. Kpiebaareh Ebenezer Nanor C. R. Haruna 38 7 0 24 Jul 2020
Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection Xianyu Chen Ming Jiang Qi Zhao ObjD 42 14 0 23 Jul 2020
HCMS at SemEval-2020 Task 9: A Neural Approach to Sentiment Analysis for Code-Mixed Texts Aditya Srivastava V. H. Vardhan 77 5 0 23 Jul 2020
Comprehensive Image Captioning via Scene Graph Decomposition Yiwu Zhong Liwei Wang Jianshu Chen Dong Yu Yin Li 137 128 0 23 Jul 2020
Integrating Image Captioning with Rule-based Entity Masking Aditya Mogadala Xiaoyu Shen Dietrich Klakow 34 7 0 22 Jul 2020
Attend and Segment: Attention Guided Active Semantic Segmentation Soroush Seifi Tinne Tuytelaars 71 13 0 22 Jul 2020
BAKSA at SemEval-2020 Task 9: Bolstering CNN with Self-Attention for Sentiment Analysis of Code Mixed Text Ayush Kumar Harsh Agarwal Keshav Bansal Ashutosh Modi 35 12 0 21 Jul 2020
Fine-Grained Image Captioning with Global-Local Discriminative Objective Jie Wu Tianshui Chen Hefeng Wu Zhi Yang Guangchun Luo Liang Lin 70 59 0 21 Jul 2020
A Generic Visualization Approach for Convolutional Neural Networks Ahmed Taha Xitong Yang Abhinav Shrivastava L. Davis 49 8 0 19 Jul 2020
Length-Controllable Image Captioning Chaorui Deng Ning Ding Mingkui Tan Qi Wu VLM 81 57 0 19 Jul 2020
Understanding Spatial Relations through Multiple Modalities Soham Dan Hangfeng He Dan Roth 36 6 0 19 Jul 2020
Deep Learning Based Brain Tumor Segmentation: A Survey Zhihua Liu Lei Tong Zheheng Jiang Long Chen Feixiang Zhou Qianni Zhang Xiangrong Zhang Ling Li Huiyu Zhou 3DV 110 238 0 18 Jul 2020
Volumetric Transformer Networks Seungryong Kim Sabine Süsstrunk Mathieu Salzmann ViT 107 5 0 18 Jul 2020
Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification Mang Ye Jianbing Shen David J. Crandall Ling Shao Jiebo Luo 93 324 0 18 Jul 2020
Kronecker Attention Networks Hongyang Gao Zhengyang Wang Shuiwang Ji 55 33 0 16 Jul 2020
Active Visual Information Gathering for Vision-Language Navigation Hanqing Wang Wenguan Wang Tianmin Shu Wei Liang Jianbing Shen 145 73 0 15 Jul 2020
RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition Xiaoyu Yue Zhanghui Kuang Chenhao Lin Hongbin Sun Wayne Zhang 94 162 0 15 Jul 2020
Explore and Explain: Self-supervised Navigation and Recounting Roberto Bigazzi Federico Landi Marcella Cornia S. Cascianelli Lorenzo Baraldi Rita Cucchiara EgoV LM&Ro 78 17 0 14 Jul 2020
Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets Jiuniu Wang Wenjia Xu Qingzhong Wang Antoni B. Chan 70 45 0 14 Jul 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning Riccardo Del Chiaro Bartlomiej Twardowski Andrew D. Bagdanov Joost van de Weijer CLL VLM 79 41 0 13 Jul 2020
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation Aditya Mogadala Marius Mosbach Dietrich Klakow VLM 370 0 0 12 Jul 2020
Applying recent advances in Visual Question Answering to Record Linkage Marko Smilevski 22 0 0 12 Jul 2020
Image Captioning with Compositional Neural Module Networks Junjiao Tian Jean Oh 44 11 0 10 Jul 2020
Attention or memory? Neurointerpretable agents in space and time Lennart Bramlage A. Cortese 53 1 0 09 Jul 2020
Fast Transformers with Clustered Attention Apoorv Vyas Angelos Katharopoulos François Fleuret 96 156 0 09 Jul 2020
Graph-Based Continual Learning Binh Tang David S. Matteson BDL CLL 76 37 0 09 Jul 2020
Learning to Reweight with Deep Interactions Yang Fan Yingce Xia Lijun Wu Shufang Xie Weiqing Liu Jiang Bian Tao Qin Xiang-Yang Li 81 9 0 09 Jul 2020
PathGAN: Local Path Planning with Attentive Generative Adversarial Networks Dooseop Choi Seung-Jun Han Kyoung‐Wook Min Jeongdan Choi GAN 59 5 0 08 Jul 2020
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers Shijie Geng Peng Gao Moitreya Chatterjee Chiori Hori Jonathan Le Roux Yongfeng Zhang Hongsheng Li A. Cherian 101 11 0 08 Jul 2020
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent Experts Marzi Heidari M. Ghatee A. Nickabadi Arash Pourhasan Nezhad DiffM MoE 84 1 0 07 Jul 2020
RGBT Salient Object Detection: A Large-scale Dataset and Benchmark Zhengzheng Tu Yan Ma Zhun Li Chenglong Li Jieming Xu Yongtao Liu 3DV 84 165 0 07 Jul 2020
EDSL: An Encoder-Decoder Architecture with Symbol-Level Features for Printed Mathematical Expression Recognition Yingnan Fu Tingting Liu Ming Gao Aoying Zhou 100 7 0 06 Jul 2020
Automatically Generating Codes from Graphical Screenshots Based on Deep Autocoder Xiaoling Huang Feng Liao 144 0 0 05 Jul 2020