ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.03925
  4. Cited By
Image Captioning with Semantic Attention

Image Captioning with Semantic Attention

12 March 2016
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
    VLM
ArXivPDFHTML

Papers citing "Image Captioning with Semantic Attention"

50 / 562 papers shown
Title
Dense Relational Captioning: Triple-Stream Networks for
  Relationship-Based Captioning
Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
In So Kweon
14
84
0
14 Mar 2019
COMIC: Towards A Compact Image Captioning Model with Attention
COMIC: Towards A Compact Image Captioning Model with Attention
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
25
40
0
04 Mar 2019
Generative Visual Dialogue System via Adaptive Reasoning and Weighted
  Likelihood Estimation
Generative Visual Dialogue System via Adaptive Reasoning and Weighted Likelihood Estimation
Heming Zhang
Shalini Ghosh
Larry Heck
Stephen Walsh
Junting Zhang
Jie Zhang
C.-C. Jay Kuo
15
7
0
26 Feb 2019
Taking a HINT: Leveraging Explanations to Make Vision and Language
  Models More Grounded
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Ramprasaath R. Selvaraju
Stefan Lee
Yilin Shen
Hongxia Jin
Shalini Ghosh
Larry Heck
Dhruv Batra
Devi Parikh
FAtt
VLM
16
252
0
11 Feb 2019
Improving Image Captioning by Leveraging Knowledge Graphs
Improving Image Captioning by Leveraging Knowledge Graphs
Yimin Zhou
Yiwei Sun
Vasant Honavar
VLM
16
54
0
25 Jan 2019
Improving Sequence-to-Sequence Learning via Optimal Transport
Improving Sequence-to-Sequence Learning via Optimal Transport
Liqun Chen
Yizhe Zhang
Ruiyi Zhang
Chenyang Tao
Zhe Gan
Haichao Zhang
Bai Li
Dinghan Shen
Changyou Chen
Lawrence Carin
OT
11
92
0
18 Jan 2019
Attention-aware Multi-stroke Style Transfer
Attention-aware Multi-stroke Style Transfer
Yuan Yao
Jianqiang Ren
Xuansong Xie
Weidong Liu
Yong-jin Liu
Jun Wang
26
162
0
16 Jan 2019
Image Based Review Text Generation with Emotional Guidance
Image Based Review Text Generation with Emotional Guidance
Xuehui Sun
Zihan Zhou
Yuda Fan
18
1
0
14 Jan 2019
Automated Rationale Generation: A Technique for Explainable AI and its
  Effects on Human Perceptions
Automated Rationale Generation: A Technique for Explainable AI and its Effects on Human Perceptions
Upol Ehsan
Pradyumna Tambwekar
Larry Chan
Brent Harrison
Mark O. Riedl
19
237
0
11 Jan 2019
Action2Vec: A Crossmodal Embedding Approach to Action Learning
Action2Vec: A Crossmodal Embedding Approach to Action Learning
Meera Hahn
Andrew Silva
James M. Rehg
20
58
0
02 Jan 2019
Hierarchical LSTMs with Adaptive Attention for Visual Captioning
Hierarchical LSTMs with Adaptive Attention for Visual Captioning
Jingkuan Song
Xiangpeng Li
Lianli Gao
Heng Tao Shen
23
221
0
26 Dec 2018
Attention Branch Network: Learning of Attention Mechanism for Visual
  Explanation
Attention Branch Network: Learning of Attention Mechanism for Visual Explanation
Hiroshi Fukui
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
XAI
FAtt
11
399
0
25 Dec 2018
Attending Category Disentangled Global Context for Image Classification
Keke Tang
Guodong Wei
Runnan Chen
Jie Zhu
Zhaoquan Gu
Wenping Wang
17
0
0
17 Dec 2018
Grounded Video Description
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
27
190
0
17 Dec 2018
Visual Social Relationship Recognition
Visual Social Relationship Recognition
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
33
27
0
13 Dec 2018
Real-Time Referring Expression Comprehension by Single-Stage Grounding
  Network
Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
Xinpeng Chen
Lin Ma
Jingyuan Chen
Zequn Jie
Wei Liu
Jiebo Luo
ObjD
18
110
0
09 Dec 2018
Unsupervised Image Captioning
Unsupervised Image Captioning
Yang Feng
Lin Ma
Wei Liu
Jiebo Luo
VLM
SSL
19
201
0
27 Nov 2018
Show, Control and Tell: A Framework for Generating Controllable and
  Grounded Captions
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
DiffM
28
175
0
26 Nov 2018
Senti-Attend: Image Captioning using Sentiment and Attention
Senti-Attend: Image Captioning using Sentiment and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
VLM
24
15
0
24 Nov 2018
AttentionMask: Attentive, Efficient Object Proposal Generation Focusing
  on Small Objects
AttentionMask: Attentive, Efficient Object Proposal Generation Focusing on Small Objects
Christian Wilms
Simone Frintrop
19
17
0
21 Nov 2018
Attention-Based Deep Neural Networks for Detection of Cancerous and
  Precancerous Esophagus Tissue on Histopathological Slides
Attention-Based Deep Neural Networks for Detection of Cancerous and Precancerous Esophagus Tissue on Histopathological Slides
Naofumi Tomita
B. Abdollahi
Jason W. Wei
Bing Ren
A. Suriawinata
Saeed Hassanpour
MedIm
26
166
0
20 Nov 2018
Image Captioning Based on a Hierarchical Attention Mechanism and Policy
  Gradient Optimization
Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization
Shiyang Yan
Yuan Xie
F. Wu
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
14
5
0
13 Nov 2018
Improved Dynamic Memory Network for Dialogue Act Classification with
  Adversarial Training
Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training
Yao Wan
Wenqiang Yan
Jianwei Gao
Zhou Zhao
Jian Wu
Philip S. Yu
24
10
0
12 Nov 2018
Holistic Multi-modal Memory Network for Movie Question Answering
Holistic Multi-modal Memory Network for Movie Question Answering
Anran Wang
Anh Tuan Luu
Chuan-Sheng Foo
Erik Cambria
Yi Tay
V. Chandrasekhar
36
20
0
12 Nov 2018
Sequence Generation with Guider Network
Sequence Generation with Guider Network
Ruiyi Zhang
Changyou Chen
Zhe Gan
Wenlin Wang
Liqun Chen
Dinghan Shen
Guoyin Wang
Lawrence Carin
3DV
14
4
0
02 Nov 2018
Dial2Desc: End-to-end Dialogue Description Generation
Dial2Desc: End-to-end Dialogue Description Generation
Haojie Pan
Junpei Zhou
Zhou Zhao
Yan Liu
Deng Cai
Min Yang
VLM
15
14
0
01 Nov 2018
Evaluating Text GANs as Language Models
Evaluating Text GANs as Language Models
Guy Tevet
Gavriel Habib
Vered Shwartz
Jonathan Berant
EGVM
4
31
0
30 Oct 2018
Gated Hierarchical Attention for Image Captioning
Gated Hierarchical Attention for Image Captioning
Qingzhong Wang
Antoni B. Chan
24
18
0
30 Oct 2018
Area Attention
Area Attention
Yang Li
Lukasz Kaiser
Samy Bengio
Si Si
31
20
0
23 Oct 2018
Bringing back simplicity and lightliness into neural image captioning
Bringing back simplicity and lightliness into neural image captioning
Jean-Benoit Delbrouck
Stéphane Dupont
15
5
0
15 Oct 2018
Attention Driven Person Re-identification
Attention Driven Person Re-identification
Fan Yang
Ke Yan
Shijian Lu
Huizhu Jia
Xiaodong Xie
Wen Gao
31
152
0
13 Oct 2018
Image Captioning as Neural Machine Translation Task in SOCKEYE
Image Captioning as Neural Machine Translation Task in SOCKEYE
Loris Bazzani
Tobias Domhan
Felix Hieber
VLM
19
2
0
09 Oct 2018
h-detach: Modifying the LSTM Gradient Towards Better Optimization
h-detach: Modifying the LSTM Gradient Towards Better Optimization
Devansh Arpit
Bhargav Kanuparthi
Giancarlo Kerg
Nan Rosemary Ke
Ioannis Mitliagkas
Yoshua Bengio
25
32
0
06 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
45
760
0
06 Oct 2018
Pay attention! - Robustifying a Deep Visuomotor Policy through
  Task-Focused Attention
Pay attention! - Robustifying a Deep Visuomotor Policy through Task-Focused Attention
P. Abolghasemi
Amir Mazaheri
M. Shah
Ladislau Bölöni
AAML
11
33
0
26 Sep 2018
Global Weighted Average Pooling Bridges Pixel-level Localization and
  Image-level Classification
Global Weighted Average Pooling Bridges Pixel-level Localization and Image-level Classification
Suo Qiu
11
28
0
21 Sep 2018
MTLE: A Multitask Learning Encoder of Visual Feature Representations for
  Video and Movie Description
MTLE: A Multitask Learning Encoder of Visual Feature Representations for Video and Movie Description
Oliver A. Nina
Washington Garcia
Scott Clouse
Alper Yilmaz
18
4
0
19 Sep 2018
Exploring Visual Relationship for Image Captioning
Exploring Visual Relationship for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
41
827
0
19 Sep 2018
Image Captioning based on Deep Reinforcement Learning
Image Captioning based on Deep Reinforcement Learning
Haichao Shi
Peng Li
Bo Wang
Zhenyu Wang
20
25
0
13 Sep 2018
Response Characterization for Auditing Cell Dynamics in Long Short-term
  Memory Networks
Response Characterization for Auditing Cell Dynamics in Long Short-term Memory Networks
Ramin M. Hasani
Alexander Amini
Mathias Lechner
Felix Naser
Radu Grosu
Daniela Rus
28
25
0
11 Sep 2018
PhaseLink: A Deep Learning Approach to Seismic Phase Association
PhaseLink: A Deep Learning Approach to Seismic Phase Association
Zachary E. Ross
Yisong Yue
Men‐Andrin Meier
E. Hauksson
T. Heaton
11
149
0
08 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
A. Schwing
19
66
0
03 Sep 2018
Adapting Visual Question Answering Models for Enhancing Multimodal
  Community Q&A Platforms
Adapting Visual Question Answering Models for Enhancing Multimodal Community Q&A Platforms
Avikalp Srivastava
Hsin Wen Liu
Sumio Fujita
25
3
0
29 Aug 2018
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and
  Comprehensive Image Captions
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Houfeng Wang
Xu Sun
98
65
0
27 Aug 2018
Neural Task Planning with And-Or Graph Representations
Neural Task Planning with And-Or Graph Representations
Tianshui Chen
Riquan Chen
Lin Nie
Xiaonan Luo
Xiaobai Liu
Liang Lin
8
20
0
25 Aug 2018
Facial Action Unit Detection Using Attention and Relation Learning
Facial Action Unit Detection Using Attention and Relation Learning
Zhiwen Shao
Zhilei Liu
Jianfei Cai
Yunsheng Wu
Lizhuang Ma
ViT
17
115
0
10 Aug 2018
Pairwise Body-Part Attention for Recognizing Human-Object Interactions
Pairwise Body-Part Attention for Recognizing Human-Object Interactions
Haoshu Fang
Jinkun Cao
Yu-Wing Tai
Cewu Lu
19
134
0
28 Jul 2018
A Survey of the Usages of Deep Learning in Natural Language Processing
A Survey of the Usages of Deep Learning in Natural Language Processing
Dan Otter
Julian R. Medina
Jugal Kalita
VLM
27
11
0
27 Jul 2018
Recurrent Fusion Network for Image Captioning
Recurrent Fusion Network for Image Captioning
Wenhao Jiang
Lin Ma
Yu-Gang Jiang
Wei Liu
Tong Zhang
ObjD
33
233
0
26 Jul 2018
Distinctive-attribute Extraction for Image Captioning
Distinctive-attribute Extraction for Image Captioning
Boeun Kim
Young Han Lee
Hyedong Jung
Choongsang Cho
22
6
0
25 Jul 2018
Previous
123...10111289
Next