v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown

Title
FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos Sanchita Ghose John J. Prevost GAN 67 26 0 20 Jul 2021
Class dependency based learning using Bi-LSTM coupled with the transfer learning of VGG16 for the diagnosis of Tuberculosis from chest x-rays G. Jignesh Chowdary G. Suganya M. Premalatha K. Karunamurthy 61 6 0 19 Jul 2021
Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences Ikko Yamane Junya Honda Florian Yger Masashi Sugiyama SSL FedML OOD 59 1 0 16 Jul 2021
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning Paul Pu Liang Yiwei Lyu Xiang Fan Zetian Wu Yun Cheng ... Peter Wu Michelle A. Lee Yuke Zhu Ruslan Salakhutdinov Louis-Philippe Morency VLM 111 172 0 15 Jul 2021
Variational Topic Inference for Chest X-Ray Report Generation Ivona Najdenkoska Xiantong Zhen M. Worring Ling Shao MedIm 88 29 0 15 Jul 2021
An Overview and Experimental Study of Learning-based Optimization Algorithms for Vehicle Routing Problem Bingjie Li Guohua Wu Yongming He Mingfeng Fan Witold Pedrycz 114 70 0 15 Jul 2021
Passive Attention in Artificial Neural Networks Predicts Human Visual Selectivity Thomas A. Langlois H. C. Zhao Erin Grant Ishita Dasgupta Thomas Griffiths Nori Jacoby 89 16 0 14 Jul 2021
Surgical Instruction Generation with Transformers Jinglu Zhang Y. Nie Jian Chang Jiangning Zhang MedIm 94 13 0 14 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning Matteo Stefanini Marcella Cornia Lorenzo Baraldi S. Cascianelli G. Fiameni Rita Cucchiara 3DV VLM MLLM 162 270 0 14 Jul 2021
Multi-Scale Label Relation Learning for Multi-Label Classification Using 1-Dimensional Convolutional Neural Networks Junhyung Lyle Kim Byungyoon Park Charmgil Hong 31 0 0 13 Jul 2021
Human Attention during Goal-directed Reading Comprehension Relies on Task Optimization Jiajie Zou Yuran Zhang Jialu Li Xing Tian Nai Ding AIMat 92 2 0 13 Jul 2021
Split, embed and merge: An accurate table structure recognizer Zhenrong Zhang Jianshu Zhang Jun Du LMTD 187 62 0 12 Jul 2021
Legal Judgment Prediction with Multi-Stage CaseRepresentation Learning in the Real Court Setting Luyao Ma Yating Zhang Tianyi Wang Xiaozhong Liu Wei Ye Changlong Sun Shikun Zhang ELM AILaw 99 59 0 12 Jul 2021
Levels of explainable artificial intelligence for human-aligned conversational explanations Richard Dazeley Peter Vamplew Cameron Foale Charlotte Young Sunil Aryal F. Cruz 65 93 0 07 Jul 2021
Controlled Caption Generation for Images Through Adversarial Attacks Nayyer Aafaq Naveed Akhtar Wei Liu M. Shah Ajmal Mian AAML 59 10 0 07 Jul 2021
Self-Adversarial Training incorporating Forgery Attention for Image Forgery Localization Longhao Zhuo Shunquan Tan Bin Li Jiwu Huang AAML 57 74 0 06 Jul 2021
Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling Qingyong Hu Bo Yang Linhai Xie Stefano Rosa Yulan Guo Zhihua Wang Niki Trigoni Andrew Markham 3DPC 88 184 0 06 Jul 2021
RATCHET: Medical Transformer for Chest X-ray Diagnosis and Reporting Benjamin Hou Georgios Kaissis Ronald M. Summers Bernhard Kainz ViT LM&MA MedIm 93 53 0 05 Jul 2021
Gradient Importance Learning for Incomplete Observations Qitong Gao Dong Wang Joshua D. Amason Siyang Yuan Chenyang Tao Ricardo Henao M. Hadziahmetovic Lawrence Carin Miroslav Pajic 50 10 0 05 Jul 2021
Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model Zhiqi Huang Fenglin Liu Xian Wu Shen Ge Helin Wang Wei Fan Yuexian Zou AuLLM 57 2 0 04 Jul 2021
Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions Motonari Kambara K. Sugiura ViT 62 6 0 02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python Yiheng Wang Yao Zhang Yanzhang Wang Yan Wan Jiao Wang Zhongyuan Wu Yuhao Yang Bowen She 169 101 0 01 Jul 2021
VideoLightFormer: Lightweight Action Recognition using Transformers Raivo Koot Haiping Lu ViT 135 6 0 01 Jul 2021
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring Jianing Qiu Frank P.-W. Lo Xiao Gu M. Jobarteh Wenyan Jia ... M. McCrory Edward Sazonov Mingui Sun Gary Frost Benny Lo EgoV 66 19 0 01 Jul 2021
MissFormer: (In-)attention-based handling of missing observations for trajectory filtering and prediction S. Becker Ronny Hug Wolfgang Hubner Michael Arens B. Morris 69 4 0 30 Jun 2021
Attention Aware Wavelet-based Detection of Morphed Face Images Poorya Aghdaie Baaria Chaudhary Sobhan Soleymani J. Dawson Nasser M. Nasrabadi CVBM 76 30 0 29 Jun 2021
Contrastive Semantic Similarity Learning for Image Captioning Evaluation with Intrinsic Auto-encoder Chao Zeng Tiesong Zhao Sam Kwong 94 2 0 29 Jun 2021
SALYPATH: A Deep-Based Architecture for visual attention prediction M. A. Kerkouri Marouane Tliba A. Chetouani R. Harba FAtt MDE 59 9 0 29 Jun 2021
Saying the Unseen: Video Descriptions via Dialog Agents Ye Zhu Yu Wu Yi Yang Yan Yan 73 6 0 26 Jun 2021
Neural Fashion Image Captioning : Accounting for Data Diversity Gilles Hacheme Nouréini Sayouti 69 13 0 23 Jun 2021
Probabilistic Attention for Interactive Segmentation Prasad Gabbur Manjot Bilkhu J. Movellan 103 13 0 23 Jun 2021
Interventional Video Grounding with Dual Contrastive Learning Guoshun Nan Rui Qiao Yao Xiao Jun Liu Sicong Leng H. Zhang Wei Lu 98 145 0 21 Jun 2021
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation Yixin Wang Zihao Lin Zhe Xu Haoyu Dong Jiang Tian Jie Luo Zhongchao Shi Yang Zhang Jianping Fan Zhiqiang He UQCV MedIm 122 12 0 21 Jun 2021
Exploring Semantic Relationships for Unpaired Image Captioning Fenglin Liu Meng Gao Tianhao Zhang Yuexian Zou 142 7 0 20 Jun 2021
Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ? Prasanna Parthasarathi J. Pineau Sarath Chandar 64 2 0 20 Jun 2021
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering Ahjeong Seo Gi-Cheon Kang J. Park Byoung-Tak Zhang 82 54 0 19 Jun 2021
Learning to Predict Visual Attributes in the Wild Khoi Pham Kushal Kafle Zhe Lin Zhi Ding Scott D. Cohen Q. Tran Abhinav Shrivastava 54 114 0 17 Jun 2021
Semi-Autoregressive Transformer for Image Captioning Yuanen Zhou Yong Zhang Zhenzhen Hu Meng Wang VLM 78 25 0 17 Jun 2021
Invertible Attention Jiajun Zha Yiran Zhong Jing Zhang Leonid Sigal Liang Zheng 82 7 0 16 Jun 2021
Soft Attention: Does it Actually Help to Learn Social Interactions in Pedestrian Trajectory Prediction? L. Boucaud Daniel Aloise Nicolas Saunier HAI 41 0 0 16 Jun 2021
Kernel Identification Through Transformers F. Simpson Ian Davies V. Lalchand A. Vullo N. Durrande C. Rasmussen 63 11 0 15 Jun 2021
Contrastive Attention for Automatic Chest X-ray Report Generation Fenglin Liu Changchang Yin Xian Wu Shen Ge Yuexian Zou Ping Zhang Yuexian Zou Xu Sun MedIm 137 153 0 13 Jun 2021
Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation Fenglin Liu Xian Wu Shen Ge Wei Fan Yuexian Zou MedIm 120 262 0 13 Jun 2021
Bayesian Attention Belief Networks Shujian Zhang Xinjie Fan Bo Chen Mingyuan Zhou BDL 114 32 0 09 Jun 2021
Salient Object Ranking with Position-Preserved Attention Haoyang Fang Daoxin Zhang Yi Zhang Minghao Chen Jiawei Li Yao Hu Deng Cai Xiaofei He 71 21 0 09 Jun 2021
Object Based Attention Through Internal Gating Jordan Lei Ari S. Benjamin Konrad Paul Kording OCL 40 4 0 08 Jun 2021
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models Chenfeng Xu Shijia Yang Tomer Galanti Bichen Wu Xiangyu Yue Bohan Zhai Wei Zhan Peter Vajda Kurt Keutzer Masayoshi Tomizuka 3DPC 62 55 0 08 Jun 2021
Lessons learned developing and using a machine learning model to automatically transcribe 2.3 million handwritten occupation codes Bjorn-Richard Pedersen Einar J. Holsbø Trygve Andersen N. Shvetsov Johan Ravn H. Sommerseth L. A. Bongo AI4TS 39 6 0 07 Jun 2021
Relative Importance in Sentence Processing Nora Hollenstein Lisa Beinborn FAtt 82 32 0 07 Jun 2021
Adversarially Regularized Graph Attention Networks for Inductive Learning on Partially Labeled Graphs Jiaren Xiao Quanyu Dai Xiaochen Xie J. Lam Ka-Wai Kwok GNN 78 7 0 07 Jun 2021