ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.5726
  4. Cited By
CIDEr: Consensus-based Image Description Evaluation

CIDEr: Consensus-based Image Description Evaluation

20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
ArXivPDFHTML

Papers citing "CIDEr: Consensus-based Image Description Evaluation"

50 / 2,140 papers shown
Title
Controllable Video Captioning with POS Sequence Guidance Based on Gated
  Fusion Network
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
74
163
0
27 Aug 2019
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Iro Laina
Christian Rupprecht
Nassir Navab
SSL
24
103
0
25 Aug 2019
ViCo: Word Embeddings from Visual Co-occurrences
ViCo: Word Embeddings from Visual Co-occurrences
Tanmay Gupta
Alex Schwing
Derek Hoiem
15
24
0
22 Aug 2019
Attention on Attention for Image Captioning
Attention on Attention for Image Captioning
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
24
823
0
19 Aug 2019
Abductive Commonsense Reasoning
Abductive Commonsense Reasoning
Chandra Bhagavatula
Ronan Le Bras
Chaitanya Malaviya
Keisuke Sakaguchi
Ari Holtzman
Hannah Rashkin
Doug Downey
Scott Yih
Yejin Choi
ReLM
LRM
25
452
0
15 Aug 2019
Unpaired Cross-lingual Image Caption Generation with Self-Supervised
  Rewards
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards
Yuqing Song
Shizhe Chen
Yida Zhao
Qin Jin
SSL
26
40
0
15 Aug 2019
Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling
Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling
Yi-Ting Yeh
Tzu-Chuan Lin
Hsiao-Hua Cheng
Yuanyuan Deng
Shang-Yu Su
Yun-Nung Chen
11
16
0
14 Aug 2019
Towards Diverse and Accurate Image Captions via Reinforcing
  Determinantal Point Process
Towards Diverse and Accurate Image Captions via Reinforcing Determinantal Point Process
Qingzhong Wang
Antoni B. Chan
27
7
0
14 Aug 2019
Towards Generating Stylized Image Captions via Adversarial Training
Towards Generating Stylized Image Captions via Adversarial Training
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
Len Hamey
GAN
23
18
0
08 Aug 2019
Image Captioning using Facial Expression and Attention
Image Captioning using Facial Expression and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
CVBM
17
8
0
08 Aug 2019
Scene-based Factored Attention for Image Captioning
Scene-based Factored Attention for Image Captioning
Chen Shen
Rongrong Ji
Fuhai Chen
Xiaoshuai Sun
Xiangming Li
24
0
0
07 Aug 2019
Addressing Data Bias Problems for Chest X-ray Image Report Generation
Addressing Data Bias Problems for Chest X-ray Image Report Generation
Philipp Harzig
Yan-Ying Chen
Francine Chen
Rainer Lienhart
MedIm
16
50
0
06 Aug 2019
Visual-Relation Conscious Image Generation from Structured-Text
Visual-Relation Conscious Image Generation from Structured-Text
D. Vo
Akihiro Sugimoto
22
17
0
05 Aug 2019
Prediction and Description of Near-Future Activities in Video
Prediction and Description of Near-Future Activities in Video
T. Mahmud
Mohammad Billah
Mahmudul Hasan
Amit K. Roy-Chowdhury
31
16
0
02 Aug 2019
Convolutional Auto-encoding of Sentence Topics for Image Paragraph
  Generation
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
Jing Wang
Yingwei Pan
Ting Yao
Jinhui Tang
Tao Mei
VLM
BDL
DiffM
27
36
0
01 Aug 2019
Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph
  Generation
Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation
Yadan Luo
Zi Huang
Zheng-Wei Zhang
Ziwei Wang
Jingjing Li
Yang Yang
26
40
0
01 Aug 2019
ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a
  Mapping from Parts Detected in Multiple Views to Sentences
ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences
Zhizhong Han
Chao Chen
Yu-Shen Liu
Matthias Zwicker
3DPC
27
46
0
31 Jul 2019
Learning Question-Guided Video Representation for Multi-Turn Video
  Question Answering
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Guan-Lin Chao
Abhinav Rastogi
Semih Yavuz
Dilek Z. Hakkani-Tür
Jindong Chen
Ian Lane
16
6
0
31 Jul 2019
Cooperative image captioning
Cooperative image captioning
Gilad Vered
Gal Oren
Yuval Atzmon
Gal Chechik
31
2
0
26 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
25
132
0
22 Jul 2019
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
Pranava Madhyastha
Josiah Wang
Lucia Specia
8
32
0
22 Jul 2019
Watch It Twice: Video Captioning with a Refocused Video Encoder
Watch It Twice: Video Captioning with a Refocused Video Encoder
Xiangxi Shi
Jianfei Cai
Chenyu You
Jiuxiang Gu
21
29
0
21 Jul 2019
Justifying Diagnosis Decisions by Deep Neural Networks
Justifying Diagnosis Decisions by Deep Neural Networks
Graham Spinks
Marie-Francine Moens
37
13
0
12 Jul 2019
On the Evaluation of Conditional GANs
On the Evaluation of Conditional GANs
Terrance Devries
Adriana Romero
Luis Villaseñor-Pineda
Graham W. Taylor
M. Drozdzal
EGVM
31
41
0
11 Jul 2019
Informative Visual Storytelling with Cross-modal Rules
Informative Visual Storytelling with Cross-modal Rules
Jiacheng Li
Haizhou Shi
Siliang Tang
Fei Wu
Yueting Zhuang
24
24
0
07 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue
  Systems
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
22
111
0
02 Jul 2019
A Deep Decoder Structure Based on WordEmbedding Regression for An
  Encoder-Decoder Based Model for Image Captioning
A Deep Decoder Structure Based on WordEmbedding Regression for An Encoder-Decoder Based Model for Image Captioning
A. Asadi
Reza Safabakhsh
12
3
0
26 Jun 2019
Informative Image Captioning with External Sources of Information
Informative Image Captioning with External Sources of Information
Sanqiang Zhao
Piyush Sharma
Tomer Levinboim
Radu Soricut
21
45
0
20 Jun 2019
Automatic Source Code Summarization with Extended Tree-LSTM
Automatic Source Code Summarization with Extended Tree-LSTM
Yusuke Shido
Yasuaki Kobayashi
Akihiro Yamamoto
A. Miyamoto
Tadayuki Matsumura
19
87
0
19 Jun 2019
Expressing Visual Relationships via Language
Expressing Visual Relationships via Language
Hao Tan
Franck Dernoncourt
Zhe-nan Lin
Trung Bui
Joey Tianyi Zhou
29
63
0
18 Jun 2019
Generating Diverse and Informative Natural Language Fashion Feedback
Generating Diverse and Informative Natural Language Fashion Feedback
Gil Sadeh
L. Fritz
Gabi Shalev
Eduard Oks
11
5
0
15 Jun 2019
Comparison of Diverse Decoding Methods from Conditional Language Models
Comparison of Diverse Decoding Methods from Conditional Language Models
Daphne Ippolito
Reno Kriz
M. Kustikova
João Sedoc
Chris Callison-Burch
AI4CE
25
113
0
14 Jun 2019
Improving Visual Question Answering by Referring to Generated Paragraph
  Captions
Improving Visual Question Answering by Referring to Generated Paragraph Captions
Hyounghun Kim
Joey Tianyi Zhou
CoGe
19
20
0
14 Jun 2019
Image Captioning: Transforming Objects into Words
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
45
462
0
14 Jun 2019
Continual and Multi-Task Architecture Search
Continual and Multi-Task Architecture Search
Ramakanth Pasunuru
Joey Tianyi Zhou
CLL
25
48
0
12 Jun 2019
Object-aware Aggregation with Bidirectional Temporal Graph for Video
  Captioning
Object-aware Aggregation with Bidirectional Temporal Graph for Video Captioning
Junchao Zhang
Yuxin Peng
24
170
0
11 Jun 2019
Generation of Multimodal Justification Using Visual Word Constraint
  Model for Explainable Computer-Aided Diagnosis
Generation of Multimodal Justification Using Visual Word Constraint Model for Explainable Computer-Aided Diagnosis
Hyebin Lee
S. T. Kim
Yong Man Ro
MedIm
29
44
0
10 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training
Figure Captioning with Reasoning and Sequence-Level Training
Charles C. Chen
Ruiyi Zhang
Eunyee Koh
Sungchul Kim
Scott D. Cohen
Tong Yu
Ryan Rossi
Razvan Bunescu
AIMat
31
38
0
07 Jun 2019
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via
  Question Answering
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering
Zhou Yu
D. Xu
Jun-chen Yu
Ting Yu
Zhou Zhao
Yueting Zhuang
Dacheng Tao
24
439
0
06 Jun 2019
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Zhengjun Zha
Daqing Liu
Hanwang Zhang
Yongdong Zhang
Feng Wu
25
120
0
06 Jun 2019
Relational Reasoning using Prior Knowledge for Visual Captioning
Relational Reasoning using Prior Knowledge for Visual Captioning
Jingyi Hou
Xinxiao Wu
Yayun Qi
Wentian Zhao
Jiebo Luo
Yunde Jia
17
14
0
04 Jun 2019
Handling Divergent Reference Texts when Evaluating Table-to-Text
  Generation
Handling Divergent Reference Texts when Evaluating Table-to-Text Generation
Bhuwan Dhingra
Manaal Faruqui
Ankur P. Parikh
Ming-Wei Chang
Dipanjan Das
William W. Cohen
39
193
0
03 Jun 2019
Masked Non-Autoregressive Image Captioning
Masked Non-Autoregressive Image Captioning
Junlong Gao
Xi Meng
Shiqi Wang
Xia Li
Shanshe Wang
Siwei Ma
Wen Gao
19
36
0
03 Jun 2019
Reconstruct and Represent Video Contents for Captioning via
  Reinforcement Learning
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning
Wei Zhang
Bairui Wang
Lin Ma
Wei Liu
20
67
0
03 Jun 2019
Learning to Generate Grounded Visual Captions without Localization
  Supervision
Learning to Generate Grounded Visual Captions without Localization Supervision
Chih-Yao Ma
Yannis Kalantidis
Ghassan AlRegib
Peter Vajda
Marcus Rohrbach
Z. Kira
SSL
19
10
0
01 Jun 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
21
37
0
29 May 2019
Ensuring Readability and Data-fidelity using Head-modifier Templates in
  Deep Type Description Generation
Ensuring Readability and Data-fidelity using Head-modifier Templates in Deep Type Description Generation
Jiangjie Chen
Ao Wang
Haiyun Jiang
Suo Feng
Chenguang Li
Yanghua Xiao
32
3
0
29 May 2019
A Survey on Biomedical Image Captioning
A Survey on Biomedical Image Captioning
Vasiliki Kougia
John Pavlopoulos
Ion Androutsopoulos
MedIm
22
80
0
26 May 2019
Bivariate Beta-LSTM
Bivariate Beta-LSTM
Kyungwoo Song
Joonho Jang
Seung-Jae Shin
Il-Chul Moon
20
6
0
25 May 2019
Triple-to-Text: Converting RDF Triples into High-Quality Natural
  Languages via Optimizing an Inverse KL Divergence
Triple-to-Text: Converting RDF Triples into High-Quality Natural Languages via Optimizing an Inverse KL Divergence
Yaoming Zhu
Juncheng Wan
Zhiming Zhou
Liheng Chen
Lin Qiu
Weinan Zhang
Xin Jiang
Yong Yu
22
27
0
25 May 2019
Previous
123...353637...414243
Next