Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.5726
Cited By
CIDEr: Consensus-based Image Description Evaluation
20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CIDEr: Consensus-based Image Description Evaluation"
50 / 2,137 papers shown
Title
Entity-aware Image Caption Generation
Di Lu
Spencer Whitehead
Lifu Huang
Heng Ji
Shih-Fu Chang
VLM
25
82
0
21 Apr 2018
Learning to Guide Decoding for Image Captioning
Wenhao Jiang
Lin Ma
Xinpeng Chen
Hanwang Zhang
Wen Liu
16
69
0
03 Apr 2018
Generating Diverse and Accurate Visual Captions by Comparative Adversarial Learning
Dianqi Li
Qiuyuan Huang
Xiaodong He
Lei Zhang
Ming-Ting Sun
21
50
0
03 Apr 2018
Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning
Jingwen Wang
Wenhao Jiang
Lin Ma
Wen Liu
Yong-mei Xu
14
203
0
31 Mar 2018
Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Xinpeng Chen
Lin Ma
Wenhao Jiang
Jian Yao
Wen Liu
17
92
0
30 Mar 2018
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
38
317
0
30 Mar 2018
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
200
434
0
27 Mar 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Hongsheng Li
Jing Shao
Dapeng Chen
Xiaogang Wang
20
133
0
22 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning
Lijun Li
Boqing Gong
22
56
0
21 Mar 2018
Unpaired Image Captioning by Language Pivoting
Jiuxiang Gu
Chenyu You
Jianfei Cai
G. Wang
26
82
0
14 Mar 2018
Discriminability objective for training descriptive captions
Ruotian Luo
Brian L. Price
Scott D. Cohen
Gregory Shakhnarovich
30
202
0
12 Mar 2018
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
Feiyu Xiong
Qingming Huang
12
200
0
05 Mar 2018
Neural Aesthetic Image Reviewer
Wenshan Wang
Su Yang
Weishan Zhang
Jiulong Zhang
22
38
0
28 Feb 2018
VizWiz Grand Challenge: Answering Visual Questions from Blind People
Danna Gurari
Qing Li
Abigale Stangl
Anhong Guo
Chi Lin
Kristen Grauman
Jiebo Luo
Jeffrey P. Bigham
CoGe
32
807
0
22 Feb 2018
Attentive Tensor Product Learning
Qiuyuan Huang
Li Deng
D. Wu
Chang Liu
Xiaodong He
21
23
0
20 Feb 2018
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
37
419
0
15 Feb 2018
Multimodal Image Captioning for Marketing Analysis
Philipp Harzig
Stephan Brehm
Rainer Lienhart
Carolin Kaiser
René Schallner
18
10
0
06 Feb 2018
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis
Seunghoon Hong
Dingdong Yang
Jongwook Choi
Honglak Lee
EGVM
29
336
0
16 Jan 2018
Consensus-based Sequence Training for Video Captioning
Sang Phan Le
G. Henter
Yusuke Miyao
Shiníchi Satoh
3DV
13
22
0
27 Dec 2017
Exploring Models and Data for Remote Sensing Image Caption Generation
Xiaoqiang Lu
Binqiang Wang
Xiangtao Zheng
Xuelong Li
24
461
0
21 Dec 2017
Convolutional Image Captioning
J. Aneja
Aditya Deshpande
A. Schwing
VLM
37
359
0
24 Nov 2017
On the Automatic Generation of Medical Imaging Reports
Baoyu Jing
P. Xie
Eric P. Xing
MedIm
35
503
0
22 Nov 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
Liwei Wang
A. Schwing
Svetlana Lazebnik
CoGe
37
175
0
19 Nov 2017
AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding
Jiahong Wu
He Zheng
Bo Zhao
Yixin Li
Baoming Yan
...
Shipei Zhou
G. Lin
Yanwei Fu
Yizhou Wang
Yonggang Wang
VLM
38
149
0
17 Nov 2017
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models
Jiuxiang Gu
Jianfei Cai
Chenyu You
Li Niu
G. Wang
VLM
16
361
0
17 Nov 2017
Grounded Objects and Interactions for Video Captioning
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
35
6
0
16 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
33
145
0
16 Nov 2017
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Guohao Li
AAML
OOD
27
34
0
16 Nov 2017
Phrase-based Image Captioning with Hierarchical LSTM Model
Y. Tan
Chee Seng Chan
VLM
26
4
0
11 Nov 2017
Object Referring in Visual Scene with Spoken Language
A. Vasudevan
Dengxin Dai
Luc Van Gool
37
18
0
10 Nov 2017
Image Captioning and Classification of Dangerous Situations
Octavio Arriaga
Paul G. Plöger
Matias Valdenegro-Toro
27
8
0
07 Nov 2017
Evaluation of Automatic Video Captioning Using Direct Assessment
Yvette Graham
G. Awad
Alan F. Smeaton
16
31
0
29 Oct 2017
A Neural-Symbolic Approach to Design of CAPTCHA
Qiuyuan Huang
P. Smolensky
Xiaodong He
Li Deng
D. Wu
AAML
26
1
0
29 Oct 2017
InterpNET: Neural Introspection for Interpretable Deep Learning
Shane T. Barratt
17
19
0
26 Oct 2017
Feedback-prop: Convolutional Neural Network Inference under Partial Evidence
Tianlu Wang
Kota Yamaguchi
Vicente Ordonez
31
12
0
23 Oct 2017
Describing Natural Images Containing Novel Objects with Knowledge Guided Assitance
Aditya Mogadala
Umanga Bista
Lexing Xie
Achim Rettinger
25
7
0
17 Oct 2017
Contrastive Learning for Image Captioning
Bo Dai
Dahua Lin
SSL
VLM
15
190
0
06 Oct 2017
Cold-Start Reinforcement Learning with Softmax Policy Gradient
Nan Ding
Radu Soricut
22
46
0
27 Sep 2017
Tensor Product Generation Networks for Deep NLP Modeling
Qiuyuan Huang
P. Smolensky
Xiaodong He
Li Deng
D. Wu
28
3
0
26 Sep 2017
Visual Question Generation as Dual Task of Visual Question Answering
Yikang Li
Nan Duan
Bolei Zhou
Xiao Chu
Wanli Ouyang
Xiaogang Wang
34
165
0
21 Sep 2017
Learning Functional Causal Models with Generative Neural Networks
Hugo Jair Escalante
Sergio Escalera
Xavier Baro
Isabelle M Guyon
Umut Güçlü
Marcel van Gerven
CML
BDL
20
107
0
15 Sep 2017
Self-Guiding Multimodal LSTM - when we do not have a perfect training dataset for image captioning
Yang Xian
Yingli Tian
VLM
25
22
0
15 Sep 2017
Robustness Analysis of Visual QA Models by Basic Questions
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
C. Huck Yang
Guohao Li
OOD
30
23
0
14 Sep 2017
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Jiuxiang Gu
Jianfei Cai
G. Wang
Tsuhan Chen
32
178
0
11 Sep 2017
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen
Jia Chen
Qin Jin
Alexander G. Hauptmann
16
67
0
31 Aug 2017
Generating Video Descriptions with Topic Guidance
Shizhe Chen
Jia Chen
Qin Jin
34
20
0
31 Aug 2017
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
27
126
0
15 Aug 2017
Fluency-Guided Cross-Lingual Image Captioning
Weiyu Lan
Xirong Li
Jianfeng Dong
19
93
0
15 Aug 2017
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning
Jingkuan Song
Yuyu Guo
Lianli Gao
Xuelong Li
Alan Hanjalic
Heng Tao Shen
34
219
0
08 Aug 2017
Reinforced Video Captioning with Entailment Rewards
Ramakanth Pasunuru
Joey Tianyi Zhou
28
114
0
07 Aug 2017
Previous
1
2
3
...
39
40
41
42
43
Next