ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.01033
  4. Cited By
Areas of Attention for Image Captioning
v1v2 (latest)

Areas of Attention for Image Captioning

3 December 2016
M. Pedersoli
Thomas Lucas
Cordelia Schmid
Jakob Verbeek
ArXiv (abs)PDFHTML

Papers citing "Areas of Attention for Image Captioning"

46 / 46 papers shown
Title
Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation
Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation
Israa A. Albadarneh
Bassam Hammo
Omar Al-Kadi
VLM
29
0
0
03 Jun 2025
An Ensemble Model with Attention Based Mechanism for Image Captioning
Israa Al Badarneh
Bassam Hammo
Omar Al-Kadi
198
6
0
28 Jan 2025
Stacked Cross-modal Feature Consolidation Attention Networks for Image
  Captioning
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Mozhgan Pourkeshavarz
Shahabedin Nabavi
Mohsen Moghaddam
M. Shamsfard
84
4
0
08 Feb 2023
How to Describe Images in a More Funny Way? Towards a Modular Approach
  to Cross-Modal Sarcasm Generation
How to Describe Images in a More Funny Way? Towards a Modular Approach to Cross-Modal Sarcasm Generation
Jie Ruan
Yue Wu
Xiaojun Wan
Yuesheng Zhu
64
1
0
20 Nov 2022
M^4I: Multi-modal Models Membership Inference
M^4I: Multi-modal Models Membership Inference
Pingyi Hu
Zihan Wang
Ruoxi Sun
Hu Wang
Minhui Xue
97
27
0
15 Sep 2022
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain
  using Swin Transformer and Attention-based LSTM
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM
THANH VAN NGUYEN
Long H. Nguyen
Nhat Truong Pham
Liu Tai Nguyen
Van Huong Do
Hai Nguyen
Ngoc Duy Nguyen
VLMViT
43
1
0
03 Sep 2022
PIXEL: Physics-Informed Cell Representations for Fast and Accurate PDE
  Solvers
PIXEL: Physics-Informed Cell Representations for Fast and Accurate PDE Solvers
Namgyu Kang
Byeonghyeon Lee
Youngjoon Hong
S. Yun
Eunbyung Park
PINNAI4CE
63
16
0
26 Jul 2022
Are metrics measuring what they should? An evaluation of image
  captioning task metrics
Are metrics measuring what they should? An evaluation of image captioning task metrics
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
73
9
0
04 Jul 2022
Image Captioning based on Feature Refinement and Reflective Decoding
Image Captioning based on Feature Refinement and Reflective Decoding
G. Alabduljabbar
Hafida Benhidour
Said Kerrache
3DV
26
3
0
16 Jun 2022
Neural Attention for Image Captioning: Review of Outstanding Methods
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
86
47
0
29 Nov 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
81
30
0
28 Sep 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DVVLMMLLM
153
270
0
14 Jul 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
128
197
0
31 Mar 2021
Generalizing Face Forgery Detection with High-frequency Features
Generalizing Face Forgery Detection with High-frequency Features
Yucheng Luo
Yong Zhang
Junchi Yan
Wei Liu
CVBM
78
347
0
23 Mar 2021
Visual Question Answering based on Local-Scene-Aware Referring
  Expression Generation
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Jungjun Kim
Dong-Gyu Lee
Jialin Wu
Hong G Jung
Seong-Whan Lee
ObjD
91
22
0
22 Jan 2021
Boost Image Captioning with Knowledge Reasoning
Boost Image Captioning with Knowledge Reasoning
Feicheng Huang
Zhixin Li
Haiyang Wei
Canlong Zhang
Huifang Ma
38
25
0
02 Nov 2020
Image Captioning with Attention for Smart Local Tourism using
  EfficientNet
Image Captioning with Attention for Smart Local Tourism using EfficientNet
D. H. Fudholi
Yurio Windiatmoko
Nurdi Afrianto
Prastyo Eko Susanto
Magfirah Suyuti
A. Hidayatullah
R. Rahmadi
3DH
18
11
0
18 Sep 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image
  Captioning
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning
Riccardo Del Chiaro
Bartlomiej Twardowski
Andrew D. Bagdanov
Joost van de Weijer
CLLVLM
77
41
0
13 Jul 2020
Adaptive Offline Quintuplet Loss for Image-Text Matching
Adaptive Offline Quintuplet Loss for Image-Text Matching
Tianlang Chen
Jiajun Deng
Jiebo Luo
232
70
0
07 Mar 2020
Expressing Objects just like Words: Recurrent Visual Embedding for
  Image-Text Matching
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
Tianlang Chen
Jiebo Luo
67
69
0
20 Feb 2020
Meshed-Memory Transformer for Image Captioning
Meshed-Memory Transformer for Image Captioning
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
110
888
0
17 Dec 2019
Predicting the Politics of an Image Using Webly Supervised Data
Predicting the Politics of an Image Using Webly Supervised Data
Christopher Thomas
Adriana Kovashka
SSL
84
21
0
31 Oct 2019
Cross Attention Network for Few-shot Classification
Cross Attention Network for Few-shot Classification
Rui Hou
Hong Chang
Bingpeng Ma
Shiguang Shan
Xilin Chen
280
647
0
17 Oct 2019
Exploring Overall Contextual Information for Image Captioning in
  Human-Like Cognitive Style
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style
Hongwei Ge
Zehang Yan
Kai Zhang
Mingde Zhao
Liang Sun
54
25
0
15 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic
  Explainability
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
162
29
0
07 Oct 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic
  Labels Improve Image Captioning and Visual Question Answering
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Soravit Changpinyo
Bo Pang
Piyush Sharma
Radu Soricut
ObjD
58
20
0
04 Sep 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
141
136
0
22 Jul 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual
  Top-Down Attention for Game Scene Understanding
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding
Jian Zheng
S. Krishnamurthy
Ruxin Chen
Min-Hung Chen
Zhenhao Ge
Xiaohua Li
77
4
0
16 Jun 2019
Multi-scale self-guided attention for medical image segmentation
Multi-scale self-guided attention for medical image segmentation
Ashish Sinha
Jose Dolz
SSeg
80
420
0
07 Jun 2019
Generating Question Relevant Captions to Aid Visual Question Answering
Generating Question Relevant Captions to Aid Visual Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
112
43
0
03 Jun 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
127
327
0
20 Jan 2019
Show, Control and Tell: A Framework for Generating Controllable and
  Grounded Captions
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
DiffM
109
176
0
26 Nov 2018
Gated Hierarchical Attention for Image Captioning
Gated Hierarchical Attention for Image Captioning
Qingzhong Wang
Antoni B. Chan
80
18
0
30 Oct 2018
Area Attention
Area Attention
Yang Li
Lukasz Kaiser
Samy Bengio
Si Si
156
19
0
23 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM3DV
179
780
0
06 Oct 2018
Facial Action Unit Detection Using Attention and Relation Learning
Facial Action Unit Detection Using Attention and Relation Learning
Zhiwen Shao
Zhilei Liu
Jianfei Cai
Yunsheng Wu
Lizhuang Ma
ViT
69
118
0
10 Aug 2018
Joint Image Captioning and Question Answering
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
52
13
0
22 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot
  Learning
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
YunLong Yu
Zhong Ji
Yanwei Fu
Jichang Guo
Yanwei Pang
Zhongfei Zhang
VLM
81
27
0
21 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned
  Text
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text
A. Mathews
Lexing Xie
Xuming He
VLM
75
115
0
18 May 2018
Token-level and sequence-level loss smoothing for RNN language models
Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad
Laurent Besacier
Jakob Verbeek
67
19
0
14 May 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with
  Partially Labeled Data
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Hongsheng Li
Jing Shao
Dapeng Chen
Xiaogang Wang
93
133
0
22 Mar 2018
TieNet: Text-Image Embedding Network for Common Thorax Disease
  Classification and Reporting in Chest X-rays
TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-rays
Xiaosong Wang
Yifan Peng
Le Lu
Zhiyong Lu
Ronald M. Summers
MedIm
76
469
0
12 Jan 2018
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
Keren Ye
Adriana Kovashka
79
51
0
17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery
  through Dialogs and Queries
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
ObjD
82
135
0
17 Nov 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual
  Question Answering
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
218
4,231
0
25 Jul 2017
Paying Attention to Descriptions Generated by Image Captioning Models
Paying Attention to Descriptions Generated by Image Captioning Models
Hamed R. Tavakoli
Rakshith Shetty
Ali Borji
Jorma T. Laaksonen
80
79
0
24 Apr 2017
1