ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.03925
  4. Cited By
Image Captioning with Semantic Attention

Image Captioning with Semantic Attention

12 March 2016
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
    VLM
ArXivPDFHTML

Papers citing "Image Captioning with Semantic Attention"

50 / 562 papers shown
Title
A Comprehensive Survey on Community Detection with Deep Learning
A Comprehensive Survey on Community Detection with Deep Learning
Xing Su
Shan Xue
Fanzhen Liu
Jia Wu
Jian Yang
...
Cécile Paris
Surya Nepal
Di Jin
Quan Z. Sheng
Philip S. Yu
GNN
22
321
0
26 May 2021
Writing by Memorizing: Hierarchical Retrieval-based Medical Report
  Generation
Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation
Xingyi Yang
Muchao Ye
Quanzeng You
Fenglong Ma
MedIm
16
38
0
25 May 2021
SGNet: A Super-class Guided Network for Image Classification and Object
  Detection
SGNet: A Super-class Guided Network for Image Classification and Object Detection
Kaidong Li
Ningning Wang
Yiju Yang
Guanghui Wang
92
22
0
26 Apr 2021
Skeletor: Skeletal Transformers for Robust Body-Pose Estimation
Skeletor: Skeletal Transformers for Robust Body-Pose Estimation
Tao Jiang
Necati Cihan Camgöz
Richard Bowden
ViT
17
38
0
23 Apr 2021
Detector-Free Weakly Supervised Grounding by Separation
Detector-Free Weakly Supervised Grounding by Separation
Assaf Arbelle
Sivan Doveh
Amit Alfassy
J. Shtok
Guy Lev
...
Kate Saenko
S. Ullman
Raja Giryes
Rogerio Feris
Leonid Karlinsky
35
23
0
20 Apr 2021
Visual Navigation with Spatial Attention
Visual Navigation with Spatial Attention
Bar Mayo
Tamir Hazan
A. Tal
EgoV
27
73
0
20 Apr 2021
BM-NAS: Bilevel Multimodal Neural Architecture Search
BM-NAS: Bilevel Multimodal Neural Architecture Search
Yihang Yin
Siyu Huang
Xiang Zhang
32
27
0
19 Apr 2021
Compressing Visual-linguistic Model via Knowledge Distillation
Compressing Visual-linguistic Model via Knowledge Distillation
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lijuan Wang
Yezhou Yang
Zicheng Liu
VLM
39
97
0
05 Apr 2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
VGen
39
1,128
0
01 Apr 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
23
175
0
31 Mar 2021
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network
  for Video Reasoning over Traffic Events
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Li Xu
He Huang
Jun Liu
ViT
LRM
15
83
0
29 Mar 2021
SEMIE: SEMantically Infused Embeddings with Enhanced Interpretability
  for Domain-specific Small Corpus
SEMIE: SEMantically Infused Embeddings with Enhanced Interpretability for Domain-specific Small Corpus
Rishabh Gupta
Rajesh N. Rao
13
0
0
21 Mar 2021
An Unsupervised Sampling Approach for Image-Sentence Matching Using
  Document-Level Structural Information
An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information
Zejun Li
Zhongyu Wei
Zhihao Fan
Haijun Shan
Xuanjing Huang
25
5
0
21 Mar 2021
ClawCraneNet: Leveraging Object-level Relation for Text-based Video
  Segmentation
ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation
Chen Liang
Yu Wu
Yawei Luo
Yi Yang
VOS
28
30
0
19 Mar 2021
Dual Attention-in-Attention Model for Joint Rain Streak and Raindrop
  Removal
Dual Attention-in-Attention Model for Joint Rain Streak and Raindrop Removal
Kaihao Zhang
Dongxu Li
Wenhan Luo
Wenqi Ren
22
73
0
12 Mar 2021
Perspectives and Prospects on Transformer Architecture for Cross-Modal
  Tasks with Language and Vision
Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
Andrew Shin
Masato Ishii
T. Narihira
35
37
0
06 Mar 2021
Causal Attention for Vision-Language Tasks
Causal Attention for Vision-Language Tasks
Xu Yang
Hanwang Zhang
Guojun Qi
Jianfei Cai
CML
28
148
0
05 Mar 2021
Efficient Palm-Line Segmentation with U-Net Context Fusion Module
Efficient Palm-Line Segmentation with U-Net Context Fusion Module
Toan Pham Van
S. T. Nguyen
Linh Doan Bao
Ngoc N. Tran
Ta Minh Thanh
32
6
0
24 Feb 2021
Characterization and recognition of handwritten digits using Julia
Characterization and recognition of handwritten digits using Julia
Md Asifuzzaman Jishan
M. Alam
A. Islam
I. R. Mazumder
K. Mahmud
A. K. Azad
19
0
0
24 Feb 2021
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings
  and Data Augmentation
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation
Sulabh Katiyar
S. Borgohain
VLM
27
14
0
22 Feb 2021
Progressive Transformer-Based Generation of Radiology Reports
Progressive Transformer-Based Generation of Radiology Reports
Farhad Nooralahzadeh
Nicolas Andres Perez Gonzalez
T. Frauenfelder
Koji Fujimoto
Michael Krauthammer
ViT
MedIm
17
84
0
19 Feb 2021
HDMI: High-order Deep Multiplex Infomax
HDMI: High-order Deep Multiplex Infomax
Baoyu Jing
Chanyoung Park
Hanghang Tong
98
164
0
15 Feb 2021
Improved Bengali Image Captioning via deep convolutional neural network
  based encoder-decoder model
Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
VLM
30
18
0
14 Feb 2021
Referring Segmentation in Images and Videos with Cross-Modal
  Self-Attention Network
Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network
Linwei Ye
Mrigank Rochan
Zhi Liu
Xiaoqin Zhang
Yang Wang
VOS
EgoV
25
55
0
09 Feb 2021
Semantic Grouping Network for Video Captioning
Semantic Grouping Network for Video Captioning
Hobin Ryu
Sunghun Kang
Haeyong Kang
Chang D. Yoo
35
135
0
01 Feb 2021
DOC2PPT: Automatic Presentation Slides Generation from Scientific
  Documents
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents
Tsu-jui Fu
Luu Anh Tuan
Daniel J. McDuff
Yale Song
14
49
0
28 Jan 2021
Probability Trajectory: One New Movement Description for Trajectory
  Prediction
Probability Trajectory: One New Movement Description for Trajectory Prediction
Pei Lv
Hui Wei
Tianxin Gu
Yuzhen Zhang
Xiaoheng Jiang
Bing Zhou
Mingliang Xu
33
0
0
26 Jan 2021
Sequence-based Dynamic Handwriting Analysis for Parkinson's Disease
  Detection with One-dimensional Convolutions and BiGRUs
Sequence-based Dynamic Handwriting Analysis for Parkinson's Disease Detection with One-dimensional Convolutions and BiGRUs
Moisés Díaz
Momina Moetesum
Imran Siddiqi
G. Vessio
23
78
0
23 Jan 2021
Macroscopic Control of Text Generation for Image Captioning
Macroscopic Control of Text Generation for Image Captioning
Zhangzi Zhu
Tianlei Wang
Hong Qu
29
4
0
20 Jan 2021
Diagnostic Captioning: A Survey
Diagnostic Captioning: A Survey
John Pavlopoulos
Vasiliki Kougia
Ion Androutsopoulos
D. Papamichail
3DV
MedIm
91
26
0
18 Jan 2021
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense
  Generation
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation
Yiran Xing
Z. Shi
Zhao Meng
Gerhard Lakemeyer
Yunpu Ma
Roger Wattenhofer
VLM
72
40
0
02 Jan 2021
Coarse to Fine: Multi-label Image Classification with Global/Local
  Attention
Coarse to Fine: Multi-label Image Classification with Global/Local Attention
Fan Lyu
Fuyuan Hu
Victor S. Sheng
Zhengtian Wu
Qiming Fu
Baochuan Fu
11
6
0
26 Dec 2020
LCEval: Learned Composite Metric for Caption Evaluation
LCEval: Learned Composite Metric for Caption Evaluation
Naeha Sharif
Lyndon White
Bennamoun
Wei Liu
Syed Afaq Ali Shah
26
8
0
24 Dec 2020
Towards Recognizing New Semantic Concepts in New Visual Domains
Towards Recognizing New Semantic Concepts in New Visual Domains
Massimiliano Mancini
OOD
31
0
0
16 Dec 2020
Image Captioning with Context-Aware Auxiliary Guidance
Image Captioning with Context-Aware Auxiliary Guidance
Zeliang Song
Xiaofei Zhou
Zhendong Mao
Jianlong Tan
36
31
0
10 Dec 2020
Understanding Guided Image Captioning Performance across Domains
Understanding Guided Image Captioning Performance across Domains
Edwin G. Ng
Bo Pang
P. Sharma
Radu Soricut
27
24
0
04 Dec 2020
Creativity of Deep Learning: Conceptualization and Assessment
Creativity of Deep Learning: Conceptualization and Assessment
Marcus Basalla
Johannes Schneider
Jan vom Brocke
31
14
0
03 Dec 2020
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling
Jing Su
Qingyun Dai
Frank Guerin
Mian Zhou
30
24
0
03 Dec 2020
Generating Descriptions for Sequential Images with Local-Object
  Attention and Global Semantic Context Modelling
Generating Descriptions for Sequential Images with Local-Object Attention and Global Semantic Context Modelling
Jing Su
Chenghua Lin
Mian Zhou
Qingyun Dai
Haoyu Lv
16
2
0
02 Dec 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization
  Tasks
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
33
123
0
23 Nov 2020
SuperOCR: A Conversion from Optical Character Recognition to Image
  Captioning
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning
Baohua Sun
Michael Lin
Hao Sha
Lin Yang
19
5
0
21 Nov 2020
MAGNeto: An Efficient Deep Learning Method for the Extractive Tags
  Summarization Problem
MAGNeto: An Efficient Deep Learning Method for the Extractive Tags Summarization Problem
H. Phung
A. Vu
Tung D. Nguyen
Lam Thanh Do
Giang Nam Ngo
Trung Thanh Tran
Hà Nội
ViT
17
0
0
09 Nov 2020
Channel Pruning Guided by Spatial and Channel Attention for DNNs in
  Intelligent Edge Computing
Channel Pruning Guided by Spatial and Channel Attention for DNNs in Intelligent Edge Computing
Mengran Liu
Weiwei Fang
Xiaodong Ma
Wenyuan Xu
N. Xiong
Qiankun Li
AAML
24
21
0
08 Nov 2020
Dual Attention on Pyramid Feature Maps for Image Captioning
Dual Attention on Pyramid Feature Maps for Image Captioning
Litao Yu
Jian Zhang
Qiang Wu
21
47
0
02 Nov 2020
Boost Image Captioning with Knowledge Reasoning
Boost Image Captioning with Knowledge Reasoning
Feicheng Huang
Zhixin Li
Haiyang Wei
Canlong Zhang
Huifang Ma
9
25
0
02 Nov 2020
Melody-Conditioned Lyrics Generation with SeqGANs
Melody-Conditioned Lyrics Generation with SeqGANs
Yihao Chen
Alexander Lerch
GAN
MGen
32
29
0
28 Oct 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual
  Questions
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
38
22
0
24 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Wei Chen
Weiping Wang
Li Liu
M. Lew
VLM
118
31
0
16 Oct 2020
TextMage: The Automated Bangla Caption Generator Based On Deep Learning
TextMage: The Automated Bangla Caption Generator Based On Deep Learning
Abrar Hasin Kamal
Md Asifuzzaman Jishan
N. Mansoor
VLM
8
17
0
15 Oct 2020
The Benefit of Distraction: Denoising Remote Vitals Measurements using
  Inverse Attention
The Benefit of Distraction: Denoising Remote Vitals Measurements using Inverse Attention
E. Nowara
Daniel J. McDuff
Ashok Veeraraghavan
8
13
0
14 Oct 2020
Previous
123456...101112
Next