ResearchTrend.AI
Image Captioning with Semantic Attention

12 March 2016
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
    VLM

Papers citing "Image Captioning with Semantic Attention"

50 / 562 papers shown
Title
"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention
Tianlang Chen
Zhongping Zhang
Quanzeng You
Chen Fang
Zhaowen Wang
Hailin Jin
Jiebo Luo
24
86
0
10 Jul 2018
Topic-Guided Attention for Image Captioning
Zhihao Zhu
Zhan Xue
Zejian Yuan
30
23
0
10 Jul 2018
MTBI Identification From Diffusion MR Images Using Bag of Adversarial Visual Features
Shervin Minaee
Yao Wang
Alp Aygar
Sohae Chung
X. Wang
Yvonne W. Lui
E. Fieremans
S. Flanagan
J. Rath
MedIm
27
34
0
27 Jun 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
25
79
0
24 May 2018
CNN+CNN: Convolutional Decoders for Image Captioning
Qingzhong Wang
Antoni B. Chan
VLM
16
85
0
23 May 2018
Learning what and where to attend
Drew Linsley
Dan Scheibler
S. Eberhardt
Thomas Serre
14
32
0
22 May 2018
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric P. Xing
MedIm
10
327
0
21 May 2018
Turbo Learning for Captionbot and Drawingbot
Qiuyuan Huang
Pengchuan Zhang
D. Wu
Lei Zhang
16
25
0
21 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
YunLong Yu
Zhong Ji
Yanwei Fu
Jichang Guo
Yanwei Pang
Zhongfei Zhang
VLM
24
27
0
21 May 2018
Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad
Laurent Besacier
Jakob Verbeek
22
19
0
14 May 2018
Deep Ordinal Hashing with Spatial Attention
Lu Jin
Xiangbo Shu
Kai Li
Zechao Li
Guo-Jun Qi
Jinhui Tang
43
78
0
07 May 2018
Object Counts! Bringing Explicit Detections Back into Image Captioning
Josiah Wang
Pranava Madhyastha
Lucia Specia
ObjD
16
37
0
23 Apr 2018
Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training
Bei Liu
Jianlong Fu
Makoto P. Kato
Masatoshi Yoshikawa
GAN
24
4
0
23 Apr 2018
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Ameya Prabhu
Vishal Batchu
Rohit Gajawada
Sri Aurobindo Munagala
A. Namboodiri
MQ
33
18
0
11 Apr 2018
Decoupled Novel Object Captioner
Yuehua Wu
Linchao Zhu
Lu Jiang
Yi Yang
10
62
0
11 Apr 2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Antoine Miech
Ivan Laptev
Josef Sivic
22
233
0
07 Apr 2018
Learn To Pay Attention
Saumya Jetley
Nicholas A. Lord
Namhoon Lee
Philip Torr
67
437
0
06 Apr 2018
Learning to Guide Decoding for Image Captioning
Wenhao Jiang
Lin Ma
Xinpeng Chen
Hanwang Zhang
Wei Liu
16
69
0
03 Apr 2018
End-to-End Dense Video Captioning with Masked Transformer
Luowei Zhou
Yingbo Zhou
Jason J. Corso
R. Socher
Caiming Xiong
20
524
0
03 Apr 2018
Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Xinpeng Chen
Lin Ma
Wenhao Jiang
Jian Yao
Wei Liu
17
92
0
30 Mar 2018
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
200
434
0
27 Mar 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Hongsheng Li
Jing Shao
Dapeng Chen
Xiaogang Wang
20
133
0
22 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning
Lijun Li
Boqing Gong
22
56
0
21 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Chenyu You
Jianfei Cai
Jiebo Luo
31
106
0
20 Mar 2018
Unpaired Image Captioning by Language Pivoting
Jiuxiang Gu
Chenyu You
Jianfei Cai
G. Wang
24
82
0
14 Mar 2018
Decoupled Spatial Neural Attention for Weakly Supervised Semantic Segmentation
Tianyi Zhang
Guosheng Lin
Jianfei Cai
T. Shen
Chunhua Shen
Alex C. Kot
32
76
0
07 Mar 2018
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
Feiyu Xiong
Qingming Huang
12
200
0
05 Mar 2018
Neural Aesthetic Image Reviewer
Wenshan Wang
Su Yang
Weishan Zhang
Jiulong Zhang
22
38
0
28 Feb 2018
Disjoint Multi-task Learning between Heterogeneous Human-centric Tasks
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
Youngjin Yoon
In So Kweon
24
27
0
14 Feb 2018
Self-Supervised Video Hashing with Hierarchical Binary Auto-encoder
Jingkuan Song
Hanwang Zhang
Xiangpeng Li
Lianli Gao
Hao Wu
Richang Hong
19
245
0
07 Feb 2018
Multimodal Sentiment Analysis with Word-Level Fusion and Reinforcement Learning
Minghai Chen
Sen Wang
Paul Pu Liang
T. Baltrušaitis
Amir Zadeh
Louis-Philippe Morency
27
278
0
03 Feb 2018
Action Recognition with Spatio-Temporal Visual Attention on Skeleton Image Sequences
Zhengyuan Yang
Y. Li
Jianchao Yang
Jiebo Luo
3DPC
27
12
0
31 Jan 2018
Image Captioning at Will: A Versatile Scheme for Effectively Injecting Sentiments into Image Descriptions
Quanzeng You
Hailin Jin
Jiebo Luo
VLM
32
52
0
30 Jan 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
Qing Li
Jianlong Fu
D. Yu
Tao Mei
Jiebo Luo
FAtt
XAI
CoGe
51
60
0
27 Jan 2018
MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu
Zhe-nan Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Joey Tianyi Zhou
Tamara L. Berg
ObjD
53
815
0
24 Jan 2018
Script Identification in Natural Scene Image and Video Frame using Attention based Convolutional-LSTM Network
A. Bhunia
Aishik Konwer
A. Bhunia
A. Bhowmick
P. Roy
Umapada Pal
19
124
0
01 Jan 2018
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning
Hongge Chen
Huan Zhang
Pin-Yu Chen
Jinfeng Yi
Cho-Jui Hsieh
GAN
AAML
32
49
0
06 Dec 2017
CAR-Net: Clairvoyant Attentive Recurrent Network
Amir Sadeghian
Ferdinand Legros
Maxime Voisin
Ricky Vesel
Alexandre Alahi
Silvio Savarese
3DPC
32
140
0
28 Nov 2017
On the Automatic Generation of Medical Imaging Reports
Baoyu Jing
P. Xie
Eric P. Xing
MedIm
35
503
0
22 Nov 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
Liwei Wang
A. Schwing
Svetlana Lazebnik
CoGe
37
175
0
19 Nov 2017
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
Keren Ye
Adriana Kovashka
27
50
0
17 Nov 2017
AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding
Jiahong Wu
He Zheng
Bo Zhao
Yixin Li
Baoming Yan
...
Shipei Zhou
G. Lin
Yanwei Fu
Yizhou Wang
Yonggang Wang
VLM
38
149
0
17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
ObjD
21
134
0
17 Nov 2017
Phrase-based Image Captioning with Hierarchical LSTM Model
Y. Tan
Chee Seng Chan
VLM
26
4
0
11 Nov 2017
Common Representation Learning Using Step-based Correlation Multi-Modal CNN
Gaurav Bhatt
Piyush Jha
Balasubramanian Raman
SSL
8
4
0
31 Oct 2017
FigureQA: An Annotated Figure Dataset for Visual Reasoning
Samira Ebrahimi Kahou
Vincent Michalski
Adam Atkinson
Ákos Kádár
Adam Trischler
Yoshua Bengio
ReLM
AIMat
25
307
0
19 Oct 2017
Describing Natural Images Containing Novel Objects with Knowledge Guided Assitance
Aditya Mogadala
Umanga Bista
Lexing Xie
Achim Rettinger
25
7
0
17 Oct 2017
Contrastive Learning for Image Captioning
Bo Dai
Dahua Lin
SSL
VLM
13
190
0
06 Oct 2017
What Does Explainable AI Really Mean? A New Conceptualization of Perspectives
Derek Doran
Sarah Schulz
Tarek R. Besold
XAI
32
437
0
02 Oct 2017
Self-Guiding Multimodal LSTM - when we do not have a perfect training dataset for image captioning
Yang Xian
Yingli Tian
VLM
25
22
0
15 Sep 2017