v1v2 (latest)

Areas of Attention for Image Captioning

3 December 2016

Papers citing "Areas of Attention for Image Captioning"

46 / 46 papers shown

Title
Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation Israa A. Albadarneh Bassam Hammo Omar Al-Kadi VLM 29 0 0 03 Jun 2025
An Ensemble Model with Attention Based Mechanism for Image Captioning Israa Al Badarneh Bassam Hammo Omar Al-Kadi 198 6 0 28 Jan 2025
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning Mozhgan Pourkeshavarz Shahabedin Nabavi Mohsen Moghaddam M. Shamsfard 84 4 0 08 Feb 2023
How to Describe Images in a More Funny Way? Towards a Modular Approach to Cross-Modal Sarcasm Generation Jie Ruan Yue Wu Xiaojun Wan Yuesheng Zhu 64 1 0 20 Nov 2022
M^4I: Multi-modal Models Membership Inference Pingyi Hu Zihan Wang Ruoxi Sun Hu Wang Minhui Xue 97 27 0 15 Sep 2022
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM THANH VAN NGUYEN Long H. Nguyen Nhat Truong Pham Liu Tai Nguyen Van Huong Do Hai Nguyen Ngoc Duy Nguyen VLM ViT 43 1 0 03 Sep 2022
PIXEL: Physics-Informed Cell Representations for Fast and Accurate PDE Solvers Namgyu Kang Byeonghyeon Lee Youngjoon Hong S. Yun Eunbyung Park PINN AI4CE 63 16 0 26 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics Othón González-Chávez Guillermo Ruiz Daniela Moctezuma Tania A. Ramirez-delreal 73 9 0 04 Jul 2022
Image Captioning based on Feature Refinement and Reflective Decoding G. Alabduljabbar Hafida Benhidour Said Kerrache 3DV 26 3 0 16 Jun 2022
Neural Attention for Image Captioning: Review of Outstanding Methods Zanyar Zohourianshahzadi Jugal Kalita VLM 86 47 0 29 Nov 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation G. O. D. Santos Esther Luna Colombini Sandra Avila 81 30 0 28 Sep 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning Matteo Stefanini Marcella Cornia Lorenzo Baraldi S. Cascianelli G. Fiameni Rita Cucchiara 3DV VLM MLLM 153 270 0 14 Jul 2021
Attention, please! A survey of Neural Attention Models in Deep Learning Alana de Santana Correia Esther Luna Colombini HAI 128 197 0 31 Mar 2021
Generalizing Face Forgery Detection with High-frequency Features Yucheng Luo Yong Zhang Junchi Yan Wei Liu CVBM 78 347 0 23 Mar 2021
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation Jungjun Kim Dong-Gyu Lee Jialin Wu Hong G Jung Seong-Whan Lee ObjD 91 22 0 22 Jan 2021
Boost Image Captioning with Knowledge Reasoning Feicheng Huang Zhixin Li Haiyang Wei Canlong Zhang Huifang Ma 38 25 0 02 Nov 2020
Image Captioning with Attention for Smart Local Tourism using EfficientNet D. H. Fudholi Yurio Windiatmoko Nurdi Afrianto Prastyo Eko Susanto Magfirah Suyuti A. Hidayatullah R. Rahmadi 3DH 18 11 0 18 Sep 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning Riccardo Del Chiaro Bartlomiej Twardowski Andrew D. Bagdanov Joost van de Weijer CLL VLM 77 41 0 13 Jul 2020
Adaptive Offline Quintuplet Loss for Image-Text Matching Tianlang Chen Jiajun Deng Jiebo Luo 232 70 0 07 Mar 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching Tianlang Chen Jiebo Luo 67 69 0 20 Feb 2020
Meshed-Memory Transformer for Image Captioning Marcella Cornia Matteo Stefanini Lorenzo Baraldi Rita Cucchiara 110 888 0 17 Dec 2019
Predicting the Politics of an Image Using Webly Supervised Data Christopher Thomas Adriana Kovashka SSL 84 21 0 31 Oct 2019
Cross Attention Network for Few-shot Classification Rui Hou Hong Chang Bingpeng Ma Shiguang Shan Xilin Chen 280 647 0 17 Oct 2019
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style Hongwei Ge Zehang Yan Kai Zhang Mingde Zhao Liang Sun 54 25 0 15 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability Marcella Cornia Lorenzo Baraldi Rita Cucchiara 162 29 0 07 Oct 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering Soravit Changpinyo Bo Pang Piyush Sharma Radu Soricut ObjD 58 20 0 04 Sep 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods Aditya Mogadala M. Kalimuthu Dietrich Klakow VLM 141 136 0 22 Jul 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding Jian Zheng S. Krishnamurthy Ruxin Chen Min-Hung Chen Zhenhao Ge Xiaohua Li 77 4 0 16 Jun 2019
Multi-scale self-guided attention for medical image segmentation Ashish Sinha Jose Dolz SSeg 80 420 0 07 Jun 2019
Generating Question Relevant Captions to Aid Visual Question Answering Jialin Wu Zeyuan Hu Raymond J. Mooney 112 43 0 03 Jun 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding Ning Xie Farley Lai Derek Doran Asim Kadav CoGe 127 327 0 20 Jan 2019
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions Marcella Cornia Lorenzo Baraldi Rita Cucchiara DiffM 109 176 0 26 Nov 2018
Gated Hierarchical Attention for Image Captioning Qingzhong Wang Antoni B. Chan 80 18 0 30 Oct 2018
Area Attention Yang Li Lukasz Kaiser Samy Bengio Si Si 156 19 0 23 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning Md Zakir Hossain Ferdous Sohel M. Shiratuddin Hamid Laga VLM 3DV 179 780 0 06 Oct 2018
Facial Action Unit Detection Using Attention and Relation Learning Zhiwen Shao Zhilei Liu Jianfei Cai Yunsheng Wu Lizhuang Ma ViT 69 118 0 10 Aug 2018
Joint Image Captioning and Question Answering Jialin Wu Zeyuan Hu Raymond J. Mooney 52 13 0 22 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning YunLong Yu Zhong Ji Yanwei Fu Jichang Guo Yanwei Pang Zhongfei Zhang VLM 81 27 0 21 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text A. Mathews Lexing Xie Xuming He VLM 75 115 0 18 May 2018
Token-level and sequence-level loss smoothing for RNN language models Maha Elbayad Laurent Besacier Jakob Verbeek 67 19 0 14 May 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data Xihui Liu Hongsheng Li Jing Shao Dapeng Chen Xiaogang Wang 93 133 0 22 Mar 2018
TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-rays Xiaosong Wang Yifan Peng Le Lu Zhiyong Lu Ronald M. Summers MedIm 76 469 0 12 Jan 2018
ADVISE: Symbolism and External Knowledge for Decoding Advertisements Keren Ye Adriana Kovashka 79 51 0 17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries Bohan Zhuang Qi Wu Chunhua Shen Ian Reid Anton Van Den Hengel ObjD 82 135 0 17 Nov 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould Lei Zhang AIMat 218 4,231 0 25 Jul 2017
Paying Attention to Descriptions Generated by Image Captioning Models Hamed R. Tavakoli Rakshith Shetty Ali Borji Jorma T. Laaksonen 80 79 0 24 Apr 2017