Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.5726
Cited By
v1
v2 (latest)
CIDEr: Consensus-based Image Description Evaluation
20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"CIDEr: Consensus-based Image Description Evaluation"
50 / 2,183 papers shown
Title
Learning Neural Templates for Text Generation
Sam Wiseman
Stuart M. Shieber
Alexander M. Rush
138
201
0
30 Aug 2018
Multi-Reference Training with Pseudo-References for Neural Translation and Text Generation
Renjie Zheng
Mingbo Ma
Liang Huang
80
35
0
28 Aug 2018
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Houfeng Wang
Xu Sun
129
66
0
27 Aug 2018
Approximate Distribution Matching for Sequence-to-Sequence Learning
Wenhu Chen
Guanlin Li
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
OOD
BDL
29
0
0
24 Aug 2018
Context-Aware Visual Policy Network for Sequence-Level Image Captioning
Daqing Liu
Zhengjun Zha
Hanwang Zhang
Yongdong Zhang
Feng Wu
CLIP
103
104
0
16 Aug 2018
Multimodal Differential Network for Visual Question Generation
Badri N. Patro
Sandeep Kumar
V. Kurmi
Vinay P. Namboodiri
68
40
0
12 Aug 2018
Textual Explanations for Self-Driving Vehicles
Jinkyu Kim
Anna Rohrbach
Trevor Darrell
John F. Canny
Zeynep Akata
91
348
0
30 Jul 2018
Move Forward and Tell: A Progressive Generator of Video Descriptions
Yilei Xiong
Bo Dai
Dahua Lin
83
104
0
26 Jul 2018
Recurrent Fusion Network for Image Captioning
Wenhao Jiang
Lin Ma
Yu-Gang Jiang
Wen Liu
Tong Zhang
ObjD
86
236
0
26 Jul 2018
Rethinking the Form of Latent States in Image Captioning
Bo Dai
Deming Ye
Dahua Lin
78
18
0
26 Jul 2018
Distinctive-attribute Extraction for Image Captioning
Boeun Kim
Young Han Lee
Hyedong Jung
Choongsang Cho
29
6
0
25 Jul 2018
Video Storytelling: Textual Summaries for Events
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
DiffM
72
47
0
25 Jul 2018
How Local is the Local Diversity? Reinforcing Sequential Determinantal Point Processes with Dynamic Ground Sets for Supervised Video Summarization
Yandong Li
Liqiang Wang
Tianbao Yang
Boqing Gong
91
42
0
11 Jul 2018
Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction
Xiangxi Shi
Jianfei Cai
Jiuxiang Gu
Shafiq Joty
43
19
0
08 Jul 2018
Face-Cap: Image Captioning using Facial Expression Analysis
Omid Mohamad Nezami
Mark Dras
Peter Anderson
Len Hamey
CVBM
55
27
0
06 Jul 2018
The price of debiasing automatic metrics in natural language evaluation
Arun Tejasvi Chaganty
Stephen Mussmann
Percy Liang
81
117
0
06 Jul 2018
Learning Multimodal Representations for Unseen Activities
A. Piergiovanni
Michael S. Ryoo
SSL
50
4
0
21 Jun 2018
Learning to Evaluate Image Captioning
Huayu Chen
Guandao Yang
Andreas Veit
Xun Huang
Serge J. Belongie
89
148
0
17 Jun 2018
Multimodal Grounding for Language Processing
Lisa Beinborn
Teresa Botschen
Iryna Gurevych
62
33
0
17 Jun 2018
Partially-Supervised Image Captioning
Peter Anderson
Stephen Gould
Mark Johnson
80
32
0
15 Jun 2018
Learn from Your Neighbor: Learning Multi-modal Mappings from Sparse Annotations
Ashwin Kalyan
Stefan Lee
A. Kannan
Dhruv Batra
56
6
0
08 Jun 2018
Visual Reasoning by Progressive Module Networks
Seung Wook Kim
Makarand Tapaswi
Sanja Fidler
ReLM
LRM
70
13
0
06 Jun 2018
Mining for meaning: from vision to language through multiple networks consensus
Iulia Duta
Andrei Liviu Nicolicioiu
Simion-Vlad Bogolin
Marius Leordeanu
86
3
0
05 Jun 2018
Natural Language Generation for Electronic Health Records
Scott H. Lee
SyDa
59
82
0
01 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wen Liu
Syed Zulqarnain Gilani
Mubarak Shah
124
93
0
01 Jun 2018
Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech
Aditya Deshpande
J. Aneja
Liwei Wang
Alex Schwing
David A. Forsyth
103
148
0
31 May 2018
Grow and Prune Compact, Fast, and Accurate LSTMs
Xiaoliang Dai
Hongxu Yin
N. Jha
VLM
SyDa
61
91
0
30 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat
3DV
OffRL
92
211
0
24 May 2018
Amortized Context Vector Inference for Sequence-to-Sequence Networks
S. Chatzis
Aristotelis Charalampous
Kyriacos Tolias
24
0
0
23 May 2018
CNN+CNN: Convolutional Decoders for Image Captioning
Qingzhong Wang
Antoni B. Chan
VLM
73
86
0
23 May 2018
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
52
13
0
22 May 2018
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric Xing
MedIm
79
340
0
21 May 2018
Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation
Qiuyuan Huang
Zhe Gan
Asli Celikyilmaz
D. Wu
Jianfeng Wang
Xiaodong He
BDL
88
92
0
21 May 2018
Turbo Learning for Captionbot and Drawingbot
Qiuyuan Huang
Pengchuan Zhang
D. Wu
Lei Zhang
63
25
0
21 May 2018
Improving Image Captioning with Conditional Generative Adversarial Nets
Chen Chen
Shuai Mu
Wanpeng Xiao
Zexiong Ye
Liesi Wu
Qi Ju
GAN
111
92
0
18 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text
A. Mathews
Lexing Xie
Xuming He
VLM
75
115
0
18 May 2018
Learning to Write with Cooperative Discriminators
Ari Holtzman
Jan Buys
Maxwell Forbes
Antoine Bosselut
David Golub
Yejin Choi
82
238
0
16 May 2018
Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad
Laurent Besacier
Jakob Verbeek
67
19
0
14 May 2018
Discourse-Aware Neural Rewards for Coherent Text Generation
Antoine Bosselut
Asli Celikyilmaz
Xiaodong He
Jianfeng Gao
Po-Sen Huang
Yejin Choi
84
80
0
10 May 2018
Automatic Article Commenting: the Task and Dataset
Lianhui Qin
Lemao Liu
Victoria Bi
Yan Wang
Xiaojiang Liu
Zhiting Hu
Zhao Hai
Shuming Shi
74
26
0
09 May 2018
Adversarial Semantic Alignment for Improved Image Captions
Pierre Dognin
Igor Melnyk
Youssef Mroueh
Jerret Ross
Tom Sercu
59
16
0
30 Apr 2018
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
69
7
0
27 Apr 2018
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
Xin Eric Wang
Wenhu Chen
Yuan-fang Wang
William Yang Wang
87
160
0
24 Apr 2018
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
194
500
0
24 Apr 2018
Object Counts! Bringing Explicit Detections Back into Image Captioning
Josiah Wang
Pranava Madhyastha
Lucia Specia
ObjD
74
37
0
23 Apr 2018
Jointly Localizing and Describing Events for Dense Video Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
81
175
0
23 Apr 2018
Entity-aware Image Caption Generation
Di Lu
Spencer Whitehead
Lifu Huang
Heng Ji
Shih-Fu Chang
VLM
82
83
0
21 Apr 2018
Learning to Guide Decoding for Image Captioning
Wenhao Jiang
Lin Ma
Xinpeng Chen
Hanwang Zhang
Wen Liu
85
69
0
03 Apr 2018
Generating Diverse and Accurate Visual Captions by Comparative Adversarial Learning
Dianqi Li
Qiuyuan Huang
Xiaodong He
Lei Zhang
Ming-Ting Sun
96
50
0
03 Apr 2018
Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning
Jingwen Wang
Wenhao Jiang
Lin Ma
Wen Liu
Yong-mei Xu
94
208
0
31 Mar 2018
Previous
1
2
3
...
39
40
41
42
43
44
Next