v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
Multimodal Transformer with Multi-View Visual Representation for Image Captioning Jun-chen Yu Jing Li Zhou Yu Qingming Huang ViT 70 387 0 20 May 2019
Conversion Prediction Using Multi-task Conditional Attention Networks to Support the Creation of Effective Ad Creative Shunsuke Kitada Hitoshi Iyatomi Yoshifumi Seki 26 8 0 17 May 2019
Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks Karan Sikka Lucas Van Bramer Ajay Divakaran 94 2 0 17 May 2019
Inductive Guided Filter: Real-time Deep Image Matting with Weakly Annotated Masks on Mobile Devices Yaoyi Li Jianfu Zhang Weijie Zhao Hongtao Lu 46 5 0 16 May 2019
Incorporating Sememes into Chinese Definition Modeling Liner Yang Cunliang Kong Yun Chen Yang Liu Qinan Fan Erhong Yang 61 31 0 16 May 2019
Exact Hard Monotonic Attention for Character-Level Transduction Shijie Wu Ryan Cotterell 71 60 0 15 May 2019
Embeddings and Representation Learning for Structured Data Benjamin Paassen Claudio Gallicchio Alessio Micheli A. Sperduti 53 7 0 15 May 2019
Sparse Sequence-to-Sequence Models Ben Peters Vlad Niculae André F. T. Martins TPM 219 215 0 14 May 2019
A human-inspired recognition system for premodern Japanese historical documents A. D. Le Tarin Clanuwat A. Kitamoto AI4TS 106 14 0 14 May 2019
Hierarchically Structured Meta-learning Huaxiu Yao Ying Wei Junzhou Huang Z. Li 80 205 0 13 May 2019
Federated Multi-task Hierarchical Attention Model for Sensor Analytics Yujing Chen Yue Ning Zheng Chai Huzefa Rangwala 54 6 0 13 May 2019
What Clinicians Want: Contextualizing Explainable Machine Learning for Clinical End Use S. Tonekaboni Shalmali Joshi M. Mccradden Anna Goldenberg 106 403 0 13 May 2019
Object Detection in 20 Years: A Survey Zhengxia Zou Keyan Chen Zhenwei Shi Yuhong Guo Jieping Ye VLM ObjD AI4TS 169 2,418 0 13 May 2019
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards Yuhang Song Jianyi Wang Thomas Lukasiewicz Zhenghua Xu Shangtong Zhang Andrzej Wojcicki Mai Xu LRM 87 15 0 12 May 2019
Follow the Attention: Combining Partial Pose and Object Motion for Fine-Grained Action Detection M. M. K. Moghaddam Ehsan Abbasnejad Javen Qinfeng Shi 51 2 0 11 May 2019
Few-Shot Learning with Embedded Class Models and Shot-Free Meta Training Avinash Ravichandran Rahul Bhotika Stefano Soatto 81 170 0 10 May 2019
Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables Yan Xu Baoyuan Wu Fumin Shen Yanbo Fan Yong Zhang Heng Tao Shen Wei Liu AAML 80 56 0 10 May 2019
Memory-Attended Recurrent Network for Video Captioning Wenjie Pei Jiyuan Zhang Xiangrong Wang Lei Ke Xiaoyong Shen Yu-Wing Tai 111 204 0 10 May 2019
Embedding Human Knowledge into Deep Neural Network via Attention Map Masahiro Mitsuhara Hiroshi Fukui Yusuke Sakashita Takanori Ogata Tsubasa Hirakawa Takayoshi Yamashita H. Fujiyoshi 102 73 0 09 May 2019
Multimodal Semantic Attention Network for Video Captioning Liang Sun Bing Li Chunfen Yuan Zhengjun Zha Weiming Hu 62 11 0 08 May 2019
ShapeGlot: Learning Language for Shape Differentiation Panos Achlioptas Judy Fan Robert D. Hawkins Noah D. Goodman Leonidas Guibas 132 83 0 08 May 2019
Frame-Recurrent Video Inpainting by Robust Optical Flow Inference Yifan Ding Chuan Wang Haibin Huang Jiaming Liu Jue Wang Liqiang Wang 58 12 0 08 May 2019
Object Exchangeability in Reinforcement Learning: Extended Abstract John Mern Dorsa Sadigh Mykel Kochenderfer OCL 51 1 0 07 May 2019
Conditional Generative Neural System for Probabilistic Trajectory Prediction Jiachen Li Hengbo Ma Masayoshi Tomizuka 102 176 0 05 May 2019
Face Hallucination by Attentive Sequence Optimization with Reinforcement Learning Yukai Shi Guanbin Li Qingxing Cao Keze Wang Liang Lin CVBM SupR 68 32 0 04 May 2019
DeepSignals: Predicting Intent of Drivers Through Visual Signals Davi Frossard Eric Kee R. Urtasun ViT 38 17 0 03 May 2019
Processing Megapixel Images with Deep Attention-Sampling Models Angelos Katharopoulos Franccois Fleuret 87 65 0 03 May 2019
Weight Map Layer for Noise and Adversarial Attack Robustness Mohammed Amer Tomás Maul 99 4 0 02 May 2019
Signed Distance-based Deep Memory Recommender Thanh-Binh Tran Xinyue Liu Kyumin Lee Xiangnan Kong FedML HAI 62 20 0 01 May 2019
PR Product: A Substitute for Inner Product in Neural Networks Zhennan Wang Wenbin Zou Chen Xu 50 6 0 30 Apr 2019
A scalable saliency-based Feature selection method with instance level information Brais Cancela V. Bolón-Canedo Amparo Alonso-Betanzos João Gama FAtt 62 13 0 30 Apr 2019
A self-attention based deep learning method for lesion attribute detection from CT reports Yifan Peng Ke Yan V. Sandfort Ronald M. Summers Zhiyong Lu MedIm 42 18 0 30 Apr 2019
Relational Collaborative Filtering:Modeling Multiple Item Relations for Recommendation Xin Xin Xiangnan He Yongfeng Zhang Yongdong Zhang J. Jose 89 167 0 29 Apr 2019
Human-Centered Emotion Recognition in Animated GIFs Zhengyuan Yang Yixuan Zhang Jiebo Luo 54 22 0 27 Apr 2019
Using Context Information to Enhance Simple Question Answering Lin Li Mengjing Zhang Zhaohui Chao Jianwen Xiang 33 11 0 27 Apr 2019
Knowing When to Stop: Evaluation and Verification of Conformity to Output-size Specifications Chenglong Wang Rudy Bunel Krishnamurthy Dvijotham Po-Sen Huang Edward Grefenstette Pushmeet Kohli 60 5 0 26 Apr 2019
Evaluating Recurrent Neural Network Explanations L. Arras Ahmed Osman K. Müller Wojciech Samek XAI FAtt 117 88 0 26 Apr 2019
Box-driven Class-wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation Chunfeng Song Yan Huang Wanli Ouyang Liang Wang 118 218 0 26 Apr 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering Jie Lei Licheng Yu Tamara L. Berg Joey Tianyi Zhou 83 230 0 25 Apr 2019
Pointing Novel Objects in Image Captioning Yehao Li Ting Yao Yingwei Pan Hongyang Chao Tao Mei 93 70 0 25 Apr 2019
Attention-based Transfer Learning for Brain-computer Interface Chuanqi Tan F. Sun Tao Kong Bin Fang Wenchang Zhang OOD 45 9 0 25 Apr 2019
HAR-Net: Joint Learning of Hybrid Attention for Single-stage Object Detection Yali Li Shengjin Wang 74 35 0 25 Apr 2019
A Self-Attentive Emotion Recognition Network Harris Partaourides Kostantinos Papadamou N. Kourtellis Ilias Leontiadis S. Chatzis 31 7 0 24 Apr 2019
Generating Token-Level Explanations for Natural Language Inference James Thorne Andreas Vlachos Christos Christodoulopoulos Arpit Mittal LRM 95 57 0 24 Apr 2019
Latent Variable Algorithms for Multimodal Learning and Sensor Fusion Lijiang Guo DRL 31 1 0 23 Apr 2019
Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting Tianran Ouyang Ling Shao OOD 51 8 0 23 Apr 2019
End-to-End Spoken Language Translation Michelle Guo Albert Haque Prateek Verma 58 8 0 23 Apr 2019
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels Rami Al-Rfou Dustin Zelle Bryan Perozzi 57 57 0 21 Apr 2019
3G structure for image caption generation Aihong Yuan Xuelong Li Xiaoqiang Lu 38 34 0 21 Apr 2019
Compression and Localization in Reinforcement Learning for ATARI Games Joel Ruben Antony Moniz Barun Patra Sarthak Garg AI4CE 51 2 0 20 Apr 2019