ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.03925
  4. Cited By
Image Captioning with Semantic Attention

Image Captioning with Semantic Attention

12 March 2016
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
    VLM
ArXivPDFHTML

Papers citing "Image Captioning with Semantic Attention"

50 / 562 papers shown
Title
Expressing Objects just like Words: Recurrent Visual Embedding for
  Image-Text Matching
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
Tianlang Chen
Jiebo Luo
11
69
0
20 Feb 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic
  Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO
  Framework
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework
C. Sur
27
7
0
16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image
  Captioning With R-CNN Feature Distribution Composition (FDC)
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
C. Sur
25
16
0
15 Feb 2020
CBAG: Conditional Biomedical Abstract Generation
CBAG: Conditional Biomedical Abstract Generation
Justin Sybrandt
Ilya Safro
MedIm
AI4CE
19
8
0
13 Feb 2020
An End-to-End Visual-Audio Attention Network for Emotion Recognition in
  User-Generated Videos
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
Sicheng Zhao
Yunsheng Ma
Yang Gu
Jufeng Yang
Tengfei Xing
Pengfei Xu
Runbo Hu
Hua Chai
Kurt Keutzer
11
98
0
12 Feb 2020
Vision-based Fight Detection from Surveillance Cameras
Vision-based Fight Detection from Surveillance Cameras
Seymanur Akti
G. A. Tataroglu
H. K. Ekenel
19
77
0
11 Feb 2020
The POLAR Framework: Polar Opposites Enable Interpretability of
  Pre-Trained Word Embeddings
The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word Embeddings
Binny Mathew
Sandipan Sikdar
Florian Lemmerich
M. Strohmaier
6
35
0
27 Jan 2020
aiTPR: Attribute Interaction-Tensor Product Representation for Image
  Caption
aiTPR: Attribute Interaction-Tensor Product Representation for Image Caption
C. Sur
18
8
0
27 Jan 2020
Show, Recall, and Tell: Image Captioning with Recall Mechanism
Show, Recall, and Tell: Image Captioning with Recall Mechanism
Li Wang
Zechen Bai
Yonghua Zhang
Hongtao Lu
27
67
0
15 Jan 2020
Fine-grained Image Classification and Retrieval by Combining Visual and
  Locally Pooled Textual Features
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
Andrés Mafla
S. Dey
Ali Furkan Biten
Lluís Gómez
Dimosthenis Karatzas
13
26
0
14 Jan 2020
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning
  Models
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning Models
Jiamei Sun
Sebastian Lapuschkin
Wojciech Samek
Alexander Binder
FAtt
42
29
0
04 Jan 2020
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence
  Generation
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation
Xinjie Fan
Yizhe Zhang
Zhendong Wang
Mingyuan Zhou
BDL
9
4
0
31 Dec 2019
Vision and Language: from Visual Perception to Content Creation
Vision and Language: from Visual Perception to Content Creation
Tao Mei
Wei Zhang
Ting Yao
VLM
17
8
0
26 Dec 2019
Meshed-Memory Transformer for Image Captioning
Meshed-Memory Transformer for Image Captioning
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
14
868
0
17 Dec 2019
Biometrics Recognition Using Deep Learning: A Survey
Biometrics Recognition Using Deep Learning: A Survey
Shervin Minaee
AmirAli Abdolrashidi
Hang Su
Bennamoun
David C. Zhang
21
84
0
30 Nov 2019
CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and
  Context Capture for Language Representation -- A Generalization of Bi
  Directional LSTM
CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and Context Capture for Language Representation -- A Generalization of Bi Directional LSTM
C. Sur
BDL
9
6
0
22 Nov 2019
Improving Non-Intrusive Load Disaggregation through an Attention-Based
  Deep Neural Network
Improving Non-Intrusive Load Disaggregation through an Attention-Based Deep Neural Network
V. Piccialli
A. M. Sudoso
14
10
0
15 Nov 2019
Conditionally Learn to Pay Attention for Sequential Visual Task
Conditionally Learn to Pay Attention for Sequential Visual Task
Jun He
Quan-Jie Cao
Lei Zhang
21
0
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion,
  and Applications
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
35
322
0
10 Nov 2019
On Architectures for Including Visual Information in Neural Language
  Models for Image Description
On Architectures for Including Visual Information in Neural Language Models for Image Description
Marc Tanti
Albert Gatt
K. Camilleri
VLM
30
2
0
09 Nov 2019
Assisting human experts in the interpretation of their visual process: A
  case study on assessing copper surface adhesive potency
Assisting human experts in the interpretation of their visual process: A case study on assessing copper surface adhesive potency
T. Hascoet
Xuejiao Deng
Daniela Mihai
Mari Sugiyama
Yuji Adachi
Sachiko Nakamura
Jonathon S. Hare
Tomoko Hayashi
T. Takiguchi
9
1
0
24 Oct 2019
Imperial College London Submission to VATEX Video Captioning Task
Imperial College London Submission to VATEX Video Captioning Task
Ozan Caglayan
Zixiu "Alex" Wu
Pranava Madhyastha
Josiah Wang
Lucia Specia
12
0
0
16 Oct 2019
Exploring Overall Contextual Information for Image Captioning in
  Human-Like Cognitive Style
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style
Hongwei Ge
Zehang Yan
Kai Zhang
Mingde Zhao
Liang Sun
30
24
0
15 Oct 2019
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating
  Referee
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee
Shuangjie Xu
Feng Xu
Yu Cheng
Pan Zhou
21
2
0
14 Oct 2019
Semantic-aware Image Deblurring
Semantic-aware Image Deblurring
Fuhai Chen
Rongrong Ji
Chengpeng Dai
Xiaoshuai Sun
Chia-Wen Lin
Jiayi Ji
Baochang Zhang
Feiyue Huang
Liujuan Cao
BDL
VLM
25
6
0
09 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic
  Explainability
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
14
27
0
07 Oct 2019
Controlled Text Generation for Data Augmentation in Intelligent
  Artificial Agents
Controlled Text Generation for Data Augmentation in Intelligent Artificial Agents
Nikolaos Malandrakis
Minmin Shen
Anuj Kumar Goyal
Shuyang Gao
Abhishek Sethi
A. Metallinou
26
54
0
04 Oct 2019
ALCNN: Attention-based Model for Fine-grained Demand Inference of
  Dock-less Shared Bike in New Cities
ALCNN: Attention-based Model for Fine-grained Demand Inference of Dock-less Shared Bike in New Cities
Chang-rui Liu
Yanan Xu
Yanmin Zhu
13
0
0
25 Sep 2019
Accept Synthetic Objects as Real: End-to-End Training of Attentive Deep
  Visuomotor Policies for Manipulation in Clutter
Accept Synthetic Objects as Real: End-to-End Training of Attentive Deep Visuomotor Policies for Manipulation in Clutter
P. Abolghasemi
Ladislau Bölöni
OffRL
17
10
0
24 Sep 2019
Pose-aware Multi-level Feature Network for Human Object Interaction
  Detection
Pose-aware Multi-level Feature Network for Human Object Interaction Detection
Bo Wan
Desen Zhou
Yongfei Liu
Rongjie Li
Xuming He
26
197
0
18 Sep 2019
Inverse Visual Question Answering with Multi-Level Attentions
Inverse Visual Question Answering with Multi-Level Attentions
Yaser Alwatter
Yuhong Guo
BDL
21
1
0
17 Sep 2019
Automatically Extracting Challenge Sets for Non local Phenomena in
  Neural Machine Translation
Automatically Extracting Challenge Sets for Non local Phenomena in Neural Machine Translation
Leshem Choshen
Omri Abend
19
18
0
15 Sep 2019
Deep Collaborative Filtering with Multi-Aspect Information in
  Heterogeneous Networks
Deep Collaborative Filtering with Multi-Aspect Information in Heterogeneous Networks
C. Shi
Xiaotian Han
Li Song
Tianlin Li
Senzhang Wang
Junping Du
Philip S. Yu
99
98
0
14 Sep 2019
What Makes A Good Story? Designing Composite Rewards for Visual
  Storytelling
What Makes A Good Story? Designing Composite Rewards for Visual Storytelling
Junjie Hu
Yu Cheng
Zhe Gan
Jingjing Liu
Jianfeng Gao
Graham Neubig
8
67
0
11 Sep 2019
Human Visual Attention Prediction Boosts Learning & Performance of
  Autonomous Driving Agents
Human Visual Attention Prediction Boosts Learning & Performance of Autonomous Driving Agents
Alexander Makrigiorgos
A. Shafti
Alex Harston
Julien Gérard
A. Faisal
14
14
0
11 Sep 2019
PDANet: Polarity-consistent Deep Attention Network for Fine-grained
  Visual Emotion Regression
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression
Sicheng Zhao
Zizhou Jia
Hui Chen
Leida Li
Guiguang Ding
Kurt Keutzer
33
62
0
11 Sep 2019
Compositional Generalization in Image Captioning
Compositional Generalization in Image Captioning
Mitja Nikolaus
Mostafa Abdou
Matthew Lamm
Rahul Aralikatte
Desmond Elliott
CoGe
27
49
0
10 Sep 2019
Hierarchy Parsing for Image Captioning
Hierarchy Parsing for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
VLM
22
164
0
09 Sep 2019
Look and Modify: Modification Networks for Image Captioning
Look and Modify: Modification Networks for Image Captioning
Fawaz Sammani
Mahmoud Elsayed
22
22
0
07 Sep 2019
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
Wei Wei
Ling Cheng
Xian-Ling Mao
Guangyou Zhou
Feida Zhu
DiffM
22
19
0
05 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question
  Answering
A Better Way to Attend: Attention with Trees for Video Question Answering
Hongyang Xue
Wenqing Chu
Zhou Zhao
Deng Cai
25
33
0
05 Sep 2019
Large-scale Tag-based Font Retrieval with Generative Feature Learning
Large-scale Tag-based Font Retrieval with Generative Feature Learning
Tianlang Chen
Zhaowen Wang
N. Xu
Hailin Jin
Jiebo Luo
3DV
VLM
12
28
0
04 Sep 2019
A Semantics-Assisted Video Captioning Model Trained with Scheduled
  Sampling
A Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling
Haoran Chen
Ke Lin
A. Maye
Jianmin Li
Xiaoling Hu
25
47
0
31 Aug 2019
Reflective Decoding Network for Image Captioning
Reflective Decoding Network for Image Captioning
Lei Ke
Wenjie Pei
Ruiyu Li
Xiaoyong Shen
Yu-Wing Tai
ObjD
8
91
0
30 Aug 2019
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Iro Laina
Christian Rupprecht
Nassir Navab
SSL
21
103
0
25 Aug 2019
Saccader: Improving Accuracy of Hard Attention Models for Vision
Saccader: Improving Accuracy of Hard Attention Models for Vision
Gamaleldin F. Elsayed
Simon Kornblith
Quoc V. Le
VLM
29
71
0
20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information
  Bottleneck
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
25
22
0
19 Aug 2019
A Fast and Accurate One-Stage Approach to Visual Grounding
A Fast and Accurate One-Stage Approach to Visual Grounding
Zhengyuan Yang
Boqing Gong
Liwei Wang
Wenbing Huang
Dong Yu
Jiebo Luo
ObjD
14
360
0
18 Aug 2019
Unpaired Cross-lingual Image Caption Generation with Self-Supervised
  Rewards
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards
Yuqing Song
Shizhe Chen
Yida Zhao
Qin Jin
SSL
23
40
0
15 Aug 2019
Efficient Inference of CNNs via Channel Pruning
Efficient Inference of CNNs via Channel Pruning
Boyu Zhang
A. Davoodi
Y. Hu
CVBM
16
6
0
08 Aug 2019
Previous
123...678...101112
Next