ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.03925
  4. Cited By
Image Captioning with Semantic Attention

Image Captioning with Semantic Attention

12 March 2016
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
    VLM
ArXivPDFHTML

Papers citing "Image Captioning with Semantic Attention"

50 / 562 papers shown
Title
Injecting Semantic Concepts into End-to-End Image Captioning
Injecting Semantic Concepts into End-to-End Image Captioning
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lin Liang
Zhe Gan
Lijuan Wang
Yezhou Yang
Zicheng Liu
ViT
VLM
24
86
0
09 Dec 2021
Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal
  Characterization
Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal Characterization
Mesut Erhan Unal
Adriana Kovashka
Wen-Ting Chung
Yu-Ru Lin
15
4
0
05 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
32
45
0
29 Nov 2021
Ubi-SleepNet: Advanced Multimodal Fusion Techniques for Three-stage
  Sleep Classification Using Ubiquitous Sensing
Ubi-SleepNet: Advanced Multimodal Fusion Techniques for Three-stage Sleep Classification Using Ubiquitous Sensing
B. Zhai
Yu Guan
M. Catt
Thomas Ploetz
21
6
0
19 Nov 2021
Grounded Situation Recognition with Transformers
Grounded Situation Recognition with Transformers
Junhyeong Cho
Youngseok Yoon
Hyeonjun Lee
Suha Kwak
ViT
20
17
0
19 Nov 2021
CoLLIE: Continual Learning of Language Grounding from Language-Image
  Embeddings
CoLLIE: Continual Learning of Language Grounding from Language-Image Embeddings
Gabriel Skantze
Bram Willemsen
VLM
16
13
0
15 Nov 2021
Real-time Instance Segmentation of Surgical Instruments using Attention
  and Multi-scale Feature Fusion
Real-time Instance Segmentation of Surgical Instruments using Attention and Multi-scale Feature Fusion
Juan Carlos Angeles Ceron
Gilberto Ochoa-Ruiz
Leonardo Chang
Sharib Ali
29
36
0
09 Nov 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language
  Modeling
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
194
385
0
06 Nov 2021
ST-ABN: Visual Explanation Taking into Account Spatio-temporal
  Information for Video Recognition
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition
Masahiro Mitsuhara
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
27
1
0
29 Oct 2021
CoVA: Context-aware Visual Attention for Webpage Information Extraction
CoVA: Context-aware Visual Attention for Webpage Information Extraction
Anurendra Kumar
Keval Morabia
Jingjing Wang
A. Niekler
Martin Potthast
28
11
0
24 Oct 2021
Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer
  Learning with Visual Concepts
Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Minchul Shin
Jonghwan Mun
Kyoung-Woon On
Woo-Young Kang
Gunsoo Han
Eun-Sol Kim
34
4
0
13 Oct 2021
Natural Language for Human-Robot Collaboration: Problems Beyond Language
  Grounding
Natural Language for Human-Robot Collaboration: Problems Beyond Language Grounding
Seth Pate
Wei-ping Xu
Ziyi Yang
Maxwell Love
Siddarth Ganguri
Lawson L. S. Wong
19
7
0
09 Oct 2021
Geometry Attention Transformer with Position-aware LSTMs for Image
  Captioning
Geometry Attention Transformer with Position-aware LSTMs for Image Captioning
Chi-Yin Wang
Yulin Shen
Luping Ji
ViT
39
49
0
01 Oct 2021
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Ling Cheng
Wei Wei
Feida Zhu
Yong-jin Liu
Chunyan Miao
ViT
21
3
0
29 Sep 2021
Label-Attention Transformer with Geometrically Coherent Objects for
  Image Captioning
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning
Shikha Dubey
Farrukh Olimov
M. Rafique
Joonmo Kim
M. Jeon
ViT
31
37
0
16 Sep 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning
  approach
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach
Faisal Muhammad Shah
Mayeesha Humaira
Md Abidur Rahman Khan Jim
Amit Saha Ami
Shimul Paul
23
17
0
11 Sep 2021
RefineCap: Concept-Aware Refinement for Image Captioning
RefineCap: Concept-Aware Refinement for Image Captioning
Yekun Chai
Shuo Jin
Junliang Xing
VLM
12
0
0
08 Sep 2021
Attentive Neural Controlled Differential Equations for Time-series
  Classification and Forecasting
Attentive Neural Controlled Differential Equations for Time-series Classification and Forecasting
Sheo Yon Jhin
H. Shin
Seoyoung Hong
Solhee Park
Noseong Park
AI4TS
27
22
0
04 Sep 2021
IMG2SMI: Translating Molecular Structure Images to Simplified
  Molecular-input Line-entry System
IMG2SMI: Translating Molecular Structure Images to Simplified Molecular-input Line-entry System
Daniel Fernando Campos
Heng Ji
31
12
0
03 Sep 2021
Automated Generation of Accurate \& Fluent Medical X-ray Reports
Automated Generation of Accurate \& Fluent Medical X-ray Reports
Hoang T.N. Nguyen
Dong Nie
Taivanbat Badamdorj
Yujie Liu
Yingying Zhu
J. Truong
Li Cheng
MedIm
LM&MA
19
40
0
27 Aug 2021
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for
  Stylized Image Captioning
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning
Guodun Li
Yuchen Zhai
Zehao Lin
Yin Zhang
56
21
0
26 Aug 2021
Group-based Distinctive Image Captioning with Memory Attention
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
16
18
0
20 Aug 2021
Who's Waldo? Linking People Across Text and Images
Who's Waldo? Linking People Across Text and Images
Claire Yuqing Cui
Apoorv Khandelwal
Yoav Artzi
Noah Snavely
Hadar Averbuch-Elor
31
21
0
16 Aug 2021
A Single Example Can Improve Zero-Shot Data Generation
A Single Example Can Improve Zero-Shot Data Generation
Pavel Burnyshev
Valentin Malykh
A. Bout
Ekaterina Artemova
Irina Piontkovskaya
18
3
0
16 Aug 2021
Cross-Modal Graph with Meta Concepts for Video Captioning
Cross-Modal Graph with Meta Concepts for Video Captioning
Hao Wang
Guosheng Lin
Guosheng Lin
Chunyan Miao
28
6
0
14 Aug 2021
Caption Generation on Scenes with Seen and Unseen Object Categories
Caption Generation on Scenes with Seen and Unseen Object Categories
B. Demirel
R. G. Cinbis
VLM
17
1
0
13 Aug 2021
Towers of Babel: Combining Images, Language, and 3D Geometry for
  Learning Multimodal Vision
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision
Xiaoshi Wu
Hadar Averbuch-Elor
J. Sun
Noah Snavely
23
19
0
12 Aug 2021
Sentence Semantic Regression for Text Generation
Sentence Semantic Regression for Text Generation
Wei Wang
Pijian Li
Haitao Zheng
LRM
15
1
0
06 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum
  Learning for Image Captioning
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
79
66
0
05 Aug 2021
Distributed Attention for Grounded Image Captioning
Distributed Attention for Grounded Image Captioning
Nenglun Chen
Xingjia Pan
Runnan Chen
Lei Yang
Zhiwen Lin
Yuqiang Ren
Haolei Yuan
Xiaowei Guo
Feiyue Huang
Wenping Wang
27
21
0
02 Aug 2021
Knowing When to Quit: Selective Cascaded Regression with Patch Attention
  for Real-Time Face Alignment
Knowing When to Quit: Selective Cascaded Regression with Patch Attention for Real-Time Face Alignment
Gil Shapira
Noga Levy
Ishay Goldin
R. Jevnisek
CVBM
20
3
0
01 Aug 2021
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Andrew Jaegle
Sebastian Borgeaud
Jean-Baptiste Alayrac
Carl Doersch
Catalin Ionescu
...
Olivier J. Hénaff
M. Botvinick
Andrew Zisserman
Oriol Vinyals
João Carreira
MLLM
VLM
GNN
20
567
0
30 Jul 2021
The Who in XAI: How AI Background Shapes Perceptions of AI Explanations
The Who in XAI: How AI Background Shapes Perceptions of AI Explanations
Upol Ehsan
Samir Passi
Q. V. Liao
Larry Chan
I-Hsiang Lee
Michael J. Muller
Mark O. Riedl
32
85
0
28 Jul 2021
Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Wentian Zhao
Yao Hu
Heda Wang
Xinxiao Wu
Jiebo Luo
23
47
0
26 Jul 2021
Adversarial Reinforced Instruction Attacker for Robust Vision-Language
  Navigation
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation
Bingqian Lin
Yi Zhu
Yanxin Long
Xiaodan Liang
QiXiang Ye
Liang Lin
AAML
39
16
0
23 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
67
254
0
14 Jul 2021
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake
  Monitoring
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring
Jianing Qiu
Frank P.-W. Lo
Xiao Gu
M. Jobarteh
Wenyan Jia
...
M. McCrory
Edward Sazonov
Mingui Sun
Gary Frost
Benny Lo
EgoV
30
18
0
01 Jul 2021
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and
  Generation
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation
Jing Liu
Xinxin Zhu
Fei Liu
Longteng Guo
Zijia Zhao
...
Weining Wang
Hanqing Lu
Shiyu Zhou
Jiajun Zhang
Jinqiao Wang
31
37
0
01 Jul 2021
Contrastive Semantic Similarity Learning for Image Captioning Evaluation
  with Intrinsic Auto-encoder
Contrastive Semantic Similarity Learning for Image Captioning Evaluation with Intrinsic Auto-encoder
Chao Zeng
Tiesong Zhao
Sam Kwong
27
2
0
29 Jun 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Saying the Unseen: Video Descriptions via Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
22
6
0
26 Jun 2021
A Picture May Be Worth a Hundred Words for Visual Question Answering
A Picture May Be Worth a Hundred Words for Visual Question Answering
Yusuke Hirota
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
Ittetsu Taniguchi
Takao Onoye
ViT
8
5
0
25 Jun 2021
TCIC: Theme Concepts Learning Cross Language and Vision for Image
  Captioning
TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning
Zhihao Fan
Zhongyu Wei
Siyuan Wang
Ruize Wang
Zejun Li
Haijun Shan
Xuanjing Huang
24
26
0
21 Jun 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
21
7
0
20 Jun 2021
Understanding and Evaluating Racial Biases in Image Captioning
Understanding and Evaluating Racial Biases in Image Captioning
Dora Zhao
Angelina Wang
Olga Russakovsky
24
134
0
16 Jun 2021
Contrastive Attention for Automatic Chest X-ray Report Generation
Contrastive Attention for Automatic Chest X-ray Report Generation
Fenglin Liu
Changchang Yin
Xian Wu
Shen Ge
Yuexian Zou
Ping Zhang
Yuexian Zou
Xu Sun
MedIm
16
146
0
13 Jun 2021
Conversational Fashion Image Retrieval via Multiturn Natural Language
  Feedback
Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback
Yifei Yuan
W. Lam
13
43
0
08 Jun 2021
Counterfactual Maximum Likelihood Estimation for Training Deep Networks
Counterfactual Maximum Likelihood Estimation for Training Deep Networks
Xinyi Wang
Wenhu Chen
Michael Stephen Saxon
Luu Anh Tuan
OOD
CML
BDL
17
8
0
07 Jun 2021
ACE-NODE: Attentive Co-Evolving Neural Ordinary Differential Equations
ACE-NODE: Attentive Co-Evolving Neural Ordinary Differential Equations
Sheo Yon Jhin
Minju Jo
Taeyong Kong
Jinsung Jeon
Noseong Park
BDL
24
13
0
31 May 2021
Towards Diverse Paragraph Captioning for Untrimmed Videos
Towards Diverse Paragraph Captioning for Untrimmed Videos
Yuqing Song
Shizhe Chen
Qin Jin
21
37
0
30 May 2021
Recent advances and clinical applications of deep learning in medical
  image analysis
Recent advances and clinical applications of deep learning in medical image analysis
Xuxin Chen
Ximing Wang
Kecheng Zhang
K. Fung
T. Thai
K. Moore
Robert S. Mannel
Hong Liu
B. Zheng
Y. Qiu
OOD
18
572
0
27 May 2021
Previous
12345...101112
Next