Image Captioning with Semantic Attention

12 March 2016

Papers citing "Image Captioning with Semantic Attention"

50 / 562 papers shown

Title
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching Tianlang Chen Jiebo Luo 11 69 0 20 Feb 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework C. Sur 27 7 0 16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC) C. Sur 25 16 0 15 Feb 2020
CBAG: Conditional Biomedical Abstract Generation Justin Sybrandt Ilya Safro MedIm AI4CE 19 8 0 13 Feb 2020
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos Sicheng Zhao Yunsheng Ma Yang Gu Jufeng Yang Tengfei Xing Pengfei Xu Runbo Hu Hua Chai Kurt Keutzer 11 98 0 12 Feb 2020
Vision-based Fight Detection from Surveillance Cameras Seymanur Akti G. A. Tataroglu H. K. Ekenel 19 77 0 11 Feb 2020
The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word Embeddings Binny Mathew Sandipan Sikdar Florian Lemmerich M. Strohmaier 6 35 0 27 Jan 2020
aiTPR: Attribute Interaction-Tensor Product Representation for Image Caption C. Sur 18 8 0 27 Jan 2020
Show, Recall, and Tell: Image Captioning with Recall Mechanism Li Wang Zechen Bai Yonghua Zhang Hongtao Lu 27 67 0 15 Jan 2020
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features Andrés Mafla S. Dey Ali Furkan Biten Lluís Gómez Dimosthenis Karatzas 13 26 0 14 Jan 2020
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning Models Jiamei Sun Sebastian Lapuschkin Wojciech Samek Alexander Binder FAtt 42 29 0 04 Jan 2020
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation Xinjie Fan Yizhe Zhang Zhendong Wang Mingyuan Zhou BDL 9 4 0 31 Dec 2019
Vision and Language: from Visual Perception to Content Creation Tao Mei Wei Zhang Ting Yao VLM 17 8 0 26 Dec 2019
Meshed-Memory Transformer for Image Captioning Marcella Cornia Matteo Stefanini Lorenzo Baraldi Rita Cucchiara 14 868 0 17 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey Shervin Minaee AmirAli Abdolrashidi Hang Su Bennamoun David C. Zhang 21 84 0 30 Nov 2019
CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and Context Capture for Language Representation -- A Generalization of Bi Directional LSTM C. Sur BDL 9 6 0 22 Nov 2019
Improving Non-Intrusive Load Disaggregation through an Attention-Based Deep Neural Network V. Piccialli A. M. Sudoso 14 10 0 15 Nov 2019
Conditionally Learn to Pay Attention for Sequential Visual Task Jun He Quan-Jie Cao Lei Zhang 21 0 0 11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications Chao Zhang Zichao Yang Xiaodong He Li Deng HAI AI4TS 35 322 0 10 Nov 2019
On Architectures for Including Visual Information in Neural Language Models for Image Description Marc Tanti Albert Gatt K. Camilleri VLM 30 2 0 09 Nov 2019
Assisting human experts in the interpretation of their visual process: A case study on assessing copper surface adhesive potency T. Hascoet Xuejiao Deng Daniela Mihai Mari Sugiyama Yuji Adachi Sachiko Nakamura Jonathon S. Hare Tomoko Hayashi T. Takiguchi 9 1 0 24 Oct 2019
Imperial College London Submission to VATEX Video Captioning Task Ozan Caglayan Zixiu "Alex" Wu Pranava Madhyastha Josiah Wang Lucia Specia 12 0 0 16 Oct 2019
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style Hongwei Ge Zehang Yan Kai Zhang Mingde Zhao Liang Sun 30 24 0 15 Oct 2019
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee Shuangjie Xu Feng Xu Yu Cheng Pan Zhou 21 2 0 14 Oct 2019
Semantic-aware Image Deblurring Fuhai Chen Rongrong Ji Chengpeng Dai Xiaoshuai Sun Chia-Wen Lin Jiayi Ji Baochang Zhang Feiyue Huang Liujuan Cao BDL VLM 25 6 0 09 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability Marcella Cornia Lorenzo Baraldi Rita Cucchiara 14 27 0 07 Oct 2019
Controlled Text Generation for Data Augmentation in Intelligent Artificial Agents Nikolaos Malandrakis Minmin Shen Anuj Kumar Goyal Shuyang Gao Abhishek Sethi A. Metallinou 26 54 0 04 Oct 2019
ALCNN: Attention-based Model for Fine-grained Demand Inference of Dock-less Shared Bike in New Cities Chang-rui Liu Yanan Xu Yanmin Zhu 13 0 0 25 Sep 2019
Accept Synthetic Objects as Real: End-to-End Training of Attentive Deep Visuomotor Policies for Manipulation in Clutter P. Abolghasemi Ladislau Bölöni OffRL 17 10 0 24 Sep 2019
Pose-aware Multi-level Feature Network for Human Object Interaction Detection Bo Wan Desen Zhou Yongfei Liu Rongjie Li Xuming He 26 197 0 18 Sep 2019
Inverse Visual Question Answering with Multi-Level Attentions Yaser Alwatter Yuhong Guo BDL 21 1 0 17 Sep 2019
Automatically Extracting Challenge Sets for Non local Phenomena in Neural Machine Translation Leshem Choshen Omri Abend 19 18 0 15 Sep 2019
Deep Collaborative Filtering with Multi-Aspect Information in Heterogeneous Networks C. Shi Xiaotian Han Li Song Tianlin Li Senzhang Wang Junping Du Philip S. Yu 99 98 0 14 Sep 2019
What Makes A Good Story? Designing Composite Rewards for Visual Storytelling Junjie Hu Yu Cheng Zhe Gan Jingjing Liu Jianfeng Gao Graham Neubig 8 67 0 11 Sep 2019
Human Visual Attention Prediction Boosts Learning & Performance of Autonomous Driving Agents Alexander Makrigiorgos A. Shafti Alex Harston Julien Gérard A. Faisal 14 14 0 11 Sep 2019
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression Sicheng Zhao Zizhou Jia Hui Chen Leida Li Guiguang Ding Kurt Keutzer 33 62 0 11 Sep 2019
Compositional Generalization in Image Captioning Mitja Nikolaus Mostafa Abdou Matthew Lamm Rahul Aralikatte Desmond Elliott CoGe 27 49 0 10 Sep 2019
Hierarchy Parsing for Image Captioning Ting Yao Yingwei Pan Yehao Li Tao Mei VLM 22 164 0 09 Sep 2019
Look and Modify: Modification Networks for Image Captioning Fawaz Sammani Mahmoud Elsayed 22 22 0 07 Sep 2019
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation Wei Wei Ling Cheng Xian-Ling Mao Guangyou Zhou Feida Zhu DiffM 22 19 0 05 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question Answering Hongyang Xue Wenqing Chu Zhou Zhao Deng Cai 25 33 0 05 Sep 2019
Large-scale Tag-based Font Retrieval with Generative Feature Learning Tianlang Chen Zhaowen Wang N. Xu Hailin Jin Jiebo Luo 3DV VLM 12 28 0 04 Sep 2019
A Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Haoran Chen Ke Lin A. Maye Jianmin Li Xiaoling Hu 25 47 0 31 Aug 2019
Reflective Decoding Network for Image Captioning Lei Ke Wenjie Pei Ruiyu Li Xiaoyong Shen Yu-Wing Tai ObjD 8 91 0 30 Aug 2019
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings Iro Laina Christian Rupprecht Nassir Navab SSL 21 103 0 25 Aug 2019
Saccader: Improving Accuracy of Hard Attention Models for Vision Gamaleldin F. Elsayed Simon Kornblith Quoc V. Le VLM 29 71 0 20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck Shuang Ma Daniel J. McDuff Yale Song 25 22 0 19 Aug 2019
A Fast and Accurate One-Stage Approach to Visual Grounding Zhengyuan Yang Boqing Gong Liwei Wang Wenbing Huang Dong Yu Jiebo Luo ObjD 14 360 0 18 Aug 2019
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards Yuqing Song Shizhe Chen Yida Zhao Qin Jin SSL 23 40 0 15 Aug 2019
Efficient Inference of CNNs via Channel Pruning Boyu Zhang A. Davoodi Y. Hu CVBM 16 6 0 08 Aug 2019