
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
Modality Shifting Attention Network for Multi-modal Video Question Answering Junyeong Kim Minuk Ma T. Pham Kyungsu Kim Chang D. Yoo 91 72 0 04 Jul 2020
A Few-Shot Sequential Approach for Object Counting Negin Sokhandan Pegah Kamousi Alejandro Posada Eniola Alese Negar Rostamzadeh 63 3 0 03 Jul 2020
Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition Bin-Bin Gao Hong-Yu Zhou 71 115 0 03 Jul 2020
Synergistic saliency and depth prediction for RGB-D saliency detection Yue Wang Yuke Li J. Elder Huchuan Lu Runmin Wu Lu Zhang MDE 106 8 0 03 Jul 2020
Balanced Symmetric Cross Entropy for Large Scale Imbalanced and Noisy Data Feifei Huang Jie Li Xuelin Zhu 25 10 0 03 Jul 2020
Modality-Agnostic Attention Fusion for visual search with text feedback Eric Dodds Jack Culpepper Simão Herdade Yang Zhang K. Boakye EgoV 109 74 0 30 Jun 2020
AdaSGD: Bridging the gap between SGD and Adam Jiaxuan Wang Jenna Wiens 77 10 0 30 Jun 2020
Vehicle Attribute Recognition by Appearance: Computer Vision Methods for Vehicle Type, Make and Model Classification Xingyang Ni H. Huttunen CVBM 43 20 0 29 Jun 2020
Self-Attention Networks for Intent Detection Sevinj Yolchuyeva Géza Németh Bálint Gyires-Tóth 27 13 0 28 Jun 2020
Listen carefully and tell: an audio captioning system based on residual learning and gammatone audio representation Sergi Perez-Castanos Javier Naranjo-Alcazar P. Zuccarello M. Cobos 70 11 0 27 Jun 2020
Modeling Long-Term and Short-Term Interests with Parallel Attentions for Session-based Recommendation Jing Zhu Yanan Xu Yanmin Zhu HAI 20 11 0 27 Jun 2020
ULSAM: Ultra-Lightweight Subspace Attention Module for Compact Convolutional Neural Networks Rajat Saini N. Jha B. K. Das Sparsh Mittal C.Krishna Mohan 71 83 0 26 Jun 2020
Graph Optimal Transport for Cross-Domain Alignment Liqun Chen Zhe Gan Yu Cheng Linjie Li Lawrence Carin Jingjing Liu OT 129 152 0 26 Jun 2020
Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering C. Sur 30 9 0 25 Jun 2020
AReLU: Attention-based Rectified Linear Unit Dengsheng Chen Jun Li Kai Xu 87 20 0 24 Jun 2020
Differentiable Window for Dynamic Local Attention Thanh-Tung Nguyen Xuan-Phi Nguyen Shafiq Joty Xiaoli Li 56 13 0 24 Jun 2020
Robot Object Retrieval with Contextual Natural Language Queries Thao Nguyen N. Gopalan Roma Patel Matt Corsaro Ellie Pavlick Stefanie Tellex LM&Ro 81 53 0 23 Jun 2020
Neural Cellular Automata Manifold Alejandro Hernandez Ruiz Armand Vilalta Francesc Moreno-Noguer 57 9 0 22 Jun 2020
Improving Image Captioning with Better Use of Captions Zhan Shi Xu Zhou Xipeng Qiu Xiao-Dan Zhu 68 128 0 21 Jun 2020
Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation Shiyang Yan Yang Hua N. Robertson OffRL 52 0 0 21 Jun 2020
A3T-GCN: Attention Temporal Graph Convolutional Network for Traffic Forecasting Jiawei Zhu Yujiao Song Ling Zhao Haifeng Li AI4TS 72 278 0 20 Jun 2020
Predicting Temporal Sets with Deep Neural Networks Le Yu Leilei Sun Bowen Du Chuanren Liu Hui Xiong Weifeng Lv 89 45 0 20 Jun 2020
Concatenated Attention Neural Network for Image Restoration Ying-jie Tian Yiqi Wang LinRui Yang Zhiquan Qi 57 11 0 19 Jun 2020
Adversarial Attacks for Multi-view Deep Models Xuli Sun Shiliang Sun AAML 39 0 0 19 Jun 2020
Hyperparameter Analysis for Image Captioning Amish Patel Aravind Varier 73 2 0 19 Jun 2020
Automated Radiological Report Generation For Chest X-Rays With Weakly-Supervised End-to-End Deep Learning Shuai Zhang Xiaoyan Xin Yang Wang Yachong Guo Q. Hao Xianfeng Yang Jun Wang Jian Zhang Bing Zhang Wei Wang MedIm 43 1 0 18 Jun 2020
Category-Specific CNN for Visual-aware CTR Prediction at JD.com Hu Liu Jing Lu Hao Yang Xiwei Zhao Sulong Xu ... Zehua Zhang Wenjie Niu Xiaokun Zhu Yongjun Bao Weipeng P. Yan 71 32 0 18 Jun 2020
XRayGAN: Consistency-preserving Generation of X-ray Images from Radiology Reports Xingyi Yang Nandiraju Gireesh Eric Xing P. Xie MedIm 51 3 0 17 Jun 2020
Visual Attention for Musical Instrument Recognition Karn N. Watcharasupat Siddharth Gururani Alexander Lerch 49 3 0 17 Jun 2020
Cross-Correlated Attention Networks for Person Re-Identification Jieming Zhou S. Roy Pengfei Fang Mehrtash Harandi L. Petersson 56 16 0 17 Jun 2020
A generalizable saliency map-based interpretation of model outcome Shailja Thakur S. Fischmeister AAML FAtt MILM 41 2 0 16 Jun 2020
Visualization for Histopathology Images using Graph Convolutional Neural Networks M. Sureka Abhijeet Patil Deepak Anand A. Sethi FAtt GNN MedIm 68 36 0 16 Jun 2020
Unsupervised Pansharpening Based on Self-Attention Mechanism Ying Qu Razieh Kaviani Baghbaderani Hairong Qi C. Kwan 81 69 0 16 Jun 2020
Global Feature Aggregation for Accident Anticipation Mishal Fatima Muhammad Umar Karim Khan C. Kyung 83 19 0 16 Jun 2020
SD-RSIC: Summarization Driven Deep Remote Sensing Image Captioning Gencer Sumbul Sonali Nayak Begüm Demir 63 77 0 15 Jun 2020
ORD: Object Relationship Discovery for Visual Dialogue Generation Ziwei Wang Zi Huang Yadan Luo Huimin Lu 57 4 0 15 Jun 2020
Mitigating Gender Bias in Captioning Systems Ruixiang Tang Mengnan Du Yuening Li Zirui Liu Na Zou Helen Zhou FaML 142 66 0 15 Jun 2020
AMENet: Attentive Maps Encoder Network for Trajectory Prediction Hao Cheng Wentong Liao M. Yang Bodo Rosenhahn Monika Sester 90 46 0 15 Jun 2020
Towards Robust Pattern Recognition: A Review Xu-Yao Zhang Cheng-Lin Liu C. Suen OOD HAI 73 110 0 12 Jun 2020
Incorporating User Micro-behaviors and Item Knowledge into Multi-task Learning for Session-based Recommendation Wenjing Meng Deqing Yang Yanghua Xiao 70 110 0 12 Jun 2020
RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams Vasiliki Kougia John Pavlopoulos P. Papapetrou Max Gordon 50 0 0 11 Jun 2020
Dance Revolution: Long-Term Dance Generation with Music via Curriculum Learning Ruozi Huang Huang Hu Wei Wu Kei Sawada Mi Zhang Daxin Jiang 125 122 0 11 Jun 2020
Report from the NSF Future Directions Workshop, Toward User-Oriented Agents: Research Directions and Challenges M. Eskénazi Tiancheng Zhao LLMAG AI4TS AI4CE 93 9 0 10 Jun 2020
MultiResolution Attention Extractor for Small Object Detection Fan Zhang L. Jiao Lingling Li Fang Liu Xu Liu ObjD 49 11 0 10 Jun 2020
Toward Building Safer Smart Homes for the People with Disabilities Shahinur Alam M. Mahmud M. Yeasin 30 4 0 10 Jun 2020
Why Attentions May Not Be Interpretable? Bing Bai Jian Liang Guanhua Zhang Hao Li Kun Bai Fei Wang FAtt 100 61 0 10 Jun 2020
Cost-effective Interactive Attention Learning with Neural Attention Processes Jay Heo Junhyeong Park Hyewon Jeong Kwang Joon Kim Juho Lee Eunho Yang Sung Ju Hwang 50 8 0 09 Jun 2020
Physically constrained short-term vehicle trajectory forecasting with naive semantic maps Albert Dulian J. Murray 33 0 0 09 Jun 2020
Text Detection and Recognition in the Wild: A Review Z. Raisi Mohamed A. Naiel Paul Fieguth Steven Wardell John S. Zelek 90 35 0 08 Jun 2020
FMA-ETA: Estimating Travel Time Entirely Based on FFN With Attention Yiwen Sun Yulu Wang Kun Fu Zheng Wang Ziang Yan Changshui Zhang Jieping Ye AI4TS 46 16 0 07 Jun 2020