Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.5726
Cited By
CIDEr: Consensus-based Image Description Evaluation
20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CIDEr: Consensus-based Image Description Evaluation"
50 / 2,142 papers shown
Title
Support-set bottlenecks for video-text representation learning
Mandela Patrick
Po-Yao (Bernie) Huang
Yuki M. Asano
Florian Metze
Alexander G. Hauptmann
João Henriques
Andrea Vedaldi
22
244
0
06 Oct 2020
A Novel Actor Dual-Critic Model for Remote Sensing Image Captioning
Ruchika Chavhan
Biplab Banerjee
Xiaoxiang Zhu
S. Chaudhuri
11
8
0
05 Oct 2020
UNISON: Unpaired Cross-lingual Image Captioning
Jiahui Gao
Yi Zhou
Philip L. H. Yu
Shafiq Joty
Jiuxiang Gu
18
16
0
03 Oct 2020
Partially-Aligned Data-to-Text Generation with Distant Supervision
Z. Fu
Bei Shi
Wai Lam
Lidong Bing
Zhiyuan Liu
28
23
0
03 Oct 2020
MGD-GAN: Text-to-Pedestrian generation through Multi-Grained Discrimination
Shengyu Zhang
Donghui Wang
Zhou Zhao
Siliang Tang
Di Xie
Fei Wu
24
0
0
02 Oct 2020
Contrastive Learning of Medical Visual Representations from Paired Images and Text
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
MedIm
61
733
0
02 Oct 2020
Teacher-Critical Training Strategies for Image Captioning
Yiqing Huang
Jiansheng Chen
VLM
29
8
0
30 Sep 2020
Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning
Xiangxi Shi
Xu Yang
Jiuxiang Gu
Shafiq Joty
Jianfei Cai
16
52
0
30 Sep 2020
Where is the Model Looking At?--Concentrate and Explain the Network Attention
Wenjia Xu
Jiuniu Wang
Yang Wang
Guangluan Xu
Wei Dai
Yirong Wu
XAI
32
17
0
29 Sep 2020
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
Ye Liu
Yao Wan
Lifang He
Hao Peng
Philip S. Yu
32
188
0
26 Sep 2020
Learning to Plan and Realize Separately for Open-Ended Dialogue Systems
Sashank Santhanam
Zhuo Cheng
Brodie Mather
Bonnie J. Dorr
Archna Bhatia
Bryanna Hebenstreit
Alan Zemel
Adam Dalton
T. Strzalkowski
Samira Shaikh
29
6
0
26 Sep 2020
Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph
Haozhe Ji
Pei Ke
Shaohan Huang
Furu Wei
Xiaoyan Zhu
Minlie Huang
ReLM
LRM
22
114
0
24 Sep 2020
Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning
Daiki Takeuchi
Yuma Koizumi
Yasunori Ohishi
Noboru Harada
K. Kashino
8
26
0
24 Sep 2020
TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval
G. Awad
A. Butt
Keith Curtis
Yooyoung Lee
Jonathan G. Fiscus
...
Lukas L. Diduch
Alan F. Smeaton
Yyette Graham
Wessel Kraaij
Georges Quénot
20
70
0
21 Sep 2020
Towards Unique and Informative Captioning of Images
Zeyu Wang
Berthy Feng
Karthik Narasimhan
Olga Russakovsky
25
37
0
08 Sep 2020
Video Captioning Using Weak Annotation
Jingyi Hou
Yunde Jia
Xinxiao Wu
Yayun Qi
37
2
0
02 Sep 2020
A Survey of Evaluation Metrics Used for NLG Systems
Ananya B. Sai
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
35
230
0
27 Aug 2020
Protect, Show, Attend and Tell: Empowering Image Captioning Models with Ownership Protection
Jian Han Lim
Chee Seng Chan
Kam Woh Ng
Lixin Fan
Qiang Yang
124
31
0
25 Aug 2020
In-Home Daily-Life Captioning Using Radio Signals
Lijie Fan
Tianhong Li
Yuan. Yuan
Dina Katabi
45
47
0
25 Aug 2020
Identity-Aware Multi-Sentence Video Description
J. S. Park
Trevor Darrell
Anna Rohrbach
26
17
0
22 Aug 2020
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
19
13
0
18 Aug 2020
Poet: Product-oriented Video Captioner for E-commerce
Shengyu Zhang
Ziqi Tan
Jin Yu
Zhou Zhao
Kun Kuang
Jie Liu
Jingren Zhou
Hongxia Yang
Fei Wu
14
34
0
16 Aug 2020
Textual Description for Mathematical Equations
Ajoy Mondal
C. V. Jawahar
19
2
0
07 Aug 2020
Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards
Xuewen Yang
Heming Zhang
Di Jin
Yingru Liu
Chi-Hao Wu
Jianchao Tan
Dongliang Xie
Jue Wang
Xin Wang
19
68
0
06 Aug 2020
Describing Textures using Natural Language
Chenyun Wu
Mikayla Timm
Subhransu Maji
3DV
28
10
0
03 Aug 2020
Evaluating Automatically Generated Phoneme Captions for Images
Justin van der Hout
Zoltán D'Haese
M. Hasegawa-Johnson
O. Scharenborg
EGVM
8
3
0
31 Jul 2020
Neural Language Generation: Formulation, Methods, and Evaluation
Cristina Garbacea
Qiaozhu Mei
45
30
0
31 Jul 2020
Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos
Shaoxiang Chen
Wenhao Jiang
Wei Liu
Yu-Gang Jiang
25
101
0
28 Jul 2020
Active Learning for Video Description With Cluster-Regularized Ensemble Ranking
David M. Chan
Sudheendra Vijayanarasimhan
David A. Ross
John F. Canny
VLM
14
6
0
27 Jul 2020
SummEval: Re-evaluating Summarization Evaluation
Alexander R. Fabbri
Wojciech Kry'sciñski
Bryan McCann
Caiming Xiong
R. Socher
Dragomir R. Radev
HILM
38
691
0
24 Jul 2020
Comprehensive Image Captioning via Scene Graph Decomposition
Yiwu Zhong
Liwei Wang
Jianshu Chen
Dong Yu
Yin Li
87
124
0
23 Jul 2020
Fine-Grained Image Captioning with Global-Local Discriminative Objective
Jie Wu
Tianshui Chen
Hefeng Wu
Zhi Yang
Guangchun Luo
Liang Lin
28
59
0
21 Jul 2020
Multimodal Dialogue State Tracking By QA Approach with Data Augmentation
Xiangyang Mou
Brandyn Sigouin
Ian Steenstra
Hui Su
12
9
0
20 Jul 2020
Length-Controllable Image Captioning
Chaorui Deng
Ning Ding
Mingkui Tan
Qi Wu
VLM
33
56
0
19 Jul 2020
Learning to Discretely Compose Reasoning Module Networks for Video Captioning
Ganchao Tan
Daqing Liu
Meng Wang
Zhengjun Zha
LRM
30
73
0
17 Jul 2020
Explore and Explain: Self-supervised Navigation and Recounting
Roberto Bigazzi
Federico Landi
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
EgoV
LM&Ro
29
17
0
14 Jul 2020
Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
37
45
0
14 Jul 2020
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation
Aditya Mogadala
Marius Mosbach
Dietrich Klakow
VLM
187
0
0
12 Jul 2020
Image Captioning with Compositional Neural Module Networks
Junjiao Tian
Jean Oh
11
11
0
10 Jul 2020
EMIXER: End-to-end Multimodal X-ray Generation via Self-supervision
Siddharth Biswal
Peiye Zhuang
A. Pyrros
Nasir Siddiqui
Oluwasanmi Koyejo
Jimeng Sun
MedIm
27
5
0
10 Jul 2020
Multi-task Regularization Based on Infrequent Classes for Audio Captioning
Emre Çakir
Konstantinos Drossos
Tuomas Virtanen
19
17
0
09 Jul 2020
Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network
Tadashi Ogura
A. Magassouba
K. Sugiura
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
Hisashi Kawai
24
11
0
09 Jul 2020
IQ-VQA: Intelligent Visual Question Answering
Vatsal Goel
Mohit Chandak
A. Anand
Prithwijit Guha
28
5
0
08 Jul 2020
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent Experts
Marzi Heidari
M. Ghatee
A. Nickabadi
Arash Pourhasan Nezhad
DiffM
MoE
35
1
0
07 Jul 2020
Temporal Sub-sampling of Audio Feature Sequences for Automated Audio Captioning
K. Nguyen
Konstantinos Drossos
Tuomas Virtanen
20
12
0
06 Jul 2020
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
Wanrong Zhu
Xinze Wang
Tsu-Jui Fu
An Yan
P. Narayana
Kazoo Sone
Sugato Basu
Wenjie Wang
37
33
0
01 Jul 2020
Improving VQA and its Explanations \\ by Comparing Competing Explanations
Jialin Wu
Liyan Chen
Raymond J. Mooney
FAtt
AAML
33
18
0
28 Jun 2020
Listen carefully and tell: an audio captioning system based on residual learning and gammatone audio representation
Sergi Perez-Castanos
Javier Naranjo-Alcazar
P. Zuccarello
M. Cobos
26
11
0
27 Jun 2020
Evaluation of Text Generation: A Survey
Asli Celikyilmaz
Elizabeth Clark
Jianfeng Gao
ELM
LM&MA
44
378
0
26 Jun 2020
Comprehensive Information Integration Modeling Framework for Video Titling
Shengyu Zhang
Ziqi Tan
Jin Yu
Zhou Zhao
Kun Kuang
Tan Jiang
Jingren Zhou
Hongxia Yang
Fei Wu
31
40
0
24 Jun 2020
Previous
1
2
3
...
31
32
33
...
41
42
43
Next