Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.5726
Cited By
CIDEr: Consensus-based Image Description Evaluation
20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CIDEr: Consensus-based Image Description Evaluation"
50 / 2,140 papers shown
Title
Designing a Symbolic Intermediate Representation for Neural Surface Realization
H. Elder
Jennifer Foster
James Barry
Alexander O’Connor
27
13
0
24 May 2019
Image Captioning based on Deep Learning Methods: A Survey
Yiyu Wang
Jungang Xu
Yingfei Sun
Xianpei Han
VLM
18
7
0
20 May 2019
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
27
377
0
20 May 2019
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations
Fenglin Liu
Yuanxin Liu
Xuancheng Ren
Xiaodong He
Xu Sun
VLM
34
81
0
15 May 2019
Memory-Attended Recurrent Network for Video Captioning
Wenjie Pei
Jiyuan Zhang
Xiangrong Wang
Lei Ke
Xiaoyong Shen
Yu-Wing Tai
14
200
0
10 May 2019
Learning Representations for Predicting Future Activities
Mohammadreza Zolfaghari
Özgün Çiçek
S. M. Ali
F. Mahdisoltani
Can Zhang
Thomas Brox
AI4TS
10
6
0
09 May 2019
Multimodal Semantic Attention Network for Video Captioning
Liang Sun
Bing Li
Chunfen Yuan
Zhengjun Zha
Weiming Hu
29
11
0
08 May 2019
Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing
Philipp Harzig
D. Zecha
Rainer Lienhart
Carolin Kaiser
René Schallner
19
2
0
06 May 2019
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
Jingwen Chen
Yingwei Pan
Yehao Li
Ting Yao
Hongyang Chao
Tao Mei
21
104
0
03 May 2019
Copy mechanism and tailored training for character-based data-to-text generation
Marco Roberti
Giovanni Bonetta
R. Cancelliere
Patrick Gallinari
11
12
0
26 Apr 2019
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
33
69
0
25 Apr 2019
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
69
5,547
0
21 Apr 2019
Deep Metric Learning Beyond Binary Supervision
Sungyeon Kim
Minkyo Seo
Ivan Laptev
Minsu Cho
Suha Kwak
SSL
20
94
0
21 Apr 2019
Multi-modal gated recurrent units for image description
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
GAN
21
26
0
20 Apr 2019
Challenges and Prospects in Vision and Language Research
Kushal Kafle
Robik Shrestha
Christopher Kanan
22
41
0
19 Apr 2019
Learning to Collocate Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Jianfei Cai
25
77
0
18 Apr 2019
Be Concise and Precise: Synthesizing Open-Domain Entity Descriptions from Facts
Rajarshi Bhowmik
Gerard de Melo
16
4
0
16 Apr 2019
Self-critical n-step Training for Image Captioning
Junlong Gao
Shiqi Wang
Shanshe Wang
Siwei Ma
Wen Gao
19
55
0
15 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
27
69
0
11 Apr 2019
Streamlined Dense Video Captioning
Jonghwan Mun
L. Yang
Zhou Ren
N. Xu
Bohyung Han
28
136
0
08 Apr 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
32
539
0
06 Apr 2019
Step-by-Step: Separating Planning from Realization in Neural Data-to-Text Generation
Amit Moryossef
Yoav Goldberg
Ido Dagan
14
181
0
06 Apr 2019
The Steep Road to Happily Ever After: An Analysis of Current Visual Storytelling Models
Yatri Modi
Natalie Parde
21
16
0
06 Apr 2019
Clinically Accurate Chest X-Ray Report Generation
Guanxiong Liu
T. Hsu
Matthew B. A. McDermott
Willie Boag
W. Weng
Peter Szolovits
Marzyeh Ghassemi
MedIm
39
271
0
04 Apr 2019
End-to-End Video Captioning
Silvio Olivastri
Gurkirt Singh
Fabio Cuzzolin
24
18
0
04 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLM
SSL
8
1,233
0
03 Apr 2019
Good News, Everyone! Context driven entity-aware captioning for news images
Ali Furkan Biten
Lluís Gómez
Marçal Rusiñol
Dimosthenis Karatzas
27
139
0
02 Apr 2019
Pragmatically Informative Text Generation
Sheng Shen
Daniel Fried
Jacob Andreas
Dan Klein
21
65
0
02 Apr 2019
HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models
Sharon Zhou
Mitchell L. Gordon
Ranjay Krishna
Austin Narcomey
Li Fei-Fei
Michael S. Bernstein
VLM
EGVM
6
118
0
01 Apr 2019
Describing like humans: on diversity in image captioning
Qingzhong Wang
Antoni B. Chan
27
98
0
28 Mar 2019
Unpaired Image Captioning via Scene Graph Alignments
Jiuxiang Gu
Chenyu You
Jianfei Cai
Handong Zhao
Xu Yang
G. Wang
GNN
8
171
0
26 Mar 2019
Knowledge-driven Encode, Retrieve, Paraphrase for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric Xing
MedIm
25
268
0
25 Mar 2019
End-to-End Learning Using Cycle Consistency for Image-to-Caption Transformations
Keisuke Hagiwara
Yusuke Mukuta
Tatsuya Harada
16
0
0
25 Mar 2019
Boosted Attention: Leveraging Human Attention for Image Captioning
Shi Chen
Qi Zhao
24
47
0
18 Mar 2019
A Weighted Multi-Criteria Decision Making Approach for Image Captioning
Hassan Maleki Galandouz
M. Moghaddam
M. Shamsfard
16
0
0
17 Mar 2019
Image captioning with weakly-supervised attention penalty
Jiayun Li
M. K. Ebrahimpour
Azadeh Moghtaderi
Yen-Yun Yu
20
5
0
06 Mar 2019
Human Attention in Image Captioning: Dataset and Analysis
Sen He
Hamed R. Tavakoli
Ali Borji
N. Pugeault
11
5
0
06 Mar 2019
COMIC: Towards A Compact Image Captioning Model with Attention
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
28
40
0
04 Mar 2019
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning
Nayyer Aafaq
Naveed Akhtar
Wei Liu
Syed Zulqarnain Gilani
Ajmal Mian
31
204
0
27 Feb 2019
Cycle-Consistency for Robust Visual Question Answering
Meet Shah
Xinlei Chen
Marcus Rohrbach
Devi Parikh
OOD
25
185
0
15 Feb 2019
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Ramprasaath R. Selvaraju
Stefan Lee
Yilin Shen
Hongxia Jin
Shalini Ghosh
Larry Heck
Dhruv Batra
Devi Parikh
FAtt
VLM
19
252
0
11 Feb 2019
Insertion-based Decoding with automatically Inferred Generation Order
Jiatao Gu
Qi Liu
Kyunghyun Cho
15
108
0
04 Feb 2019
Hierarchical Photo-Scene Encoder for Album Storytelling
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Feng-Li Zhang
11
28
0
02 Feb 2019
Audio-Visual Scene-Aware Dialog
Huda AlAmri
Vincent Cartillier
Abhishek Das
Jue Wang
A. Cherian
...
Tim K. Marks
Chiori Hori
Peter Anderson
Stefan Lee
Devi Parikh
VGen
27
189
0
25 Jan 2019
Improving Image Captioning by Leveraging Knowledge Graphs
Yimin Zhou
Yiwei Sun
Vasant Honavar
VLM
22
54
0
25 Jan 2019
Evaluating the State-of-the-Art of End-to-End Natural Language Generation: The E2E NLG Challenge
Ondrej Dusek
Jekaterina Novikova
Verena Rieser
ELM
46
232
0
23 Jan 2019
Evaluating Text-to-Image Matching using Binary Image Selection (BISON)
Hexiang Hu
Ishan Misra
L. V. D. van der Maaten
24
22
0
19 Jan 2019
Improving Sequence-to-Sequence Learning via Optimal Transport
Liqun Chen
Yizhe Zhang
Ruiyi Zhang
Chenyang Tao
Zhe Gan
Haichao Zhang
Bai Li
Dinghan Shen
Changyou Chen
Lawrence Carin
OT
11
92
0
18 Jan 2019
Dialog System Technology Challenge 7
Koichiro Yoshino
Chiori Hori
Julien Perez
L. F. D’Haro
L. Polymenakos
...
Xiang Gao
Huda AlAmri
Tim K. Marks
Devi Parikh
Dhruv Batra
24
37
0
11 Jan 2019
Robust Change Captioning
Dong Huk Park
Trevor Darrell
Anna Rohrbach
30
5
0
08 Jan 2019
Previous
1
2
3
...
36
37
38
...
41
42
43
Next