ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.5726
  4. Cited By
CIDEr: Consensus-based Image Description Evaluation
v1v2 (latest)

CIDEr: Consensus-based Image Description Evaluation

20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
ArXiv (abs)PDFHTML

Papers citing "CIDEr: Consensus-based Image Description Evaluation"

50 / 2,183 papers shown
Title
Hidden State Guidance: Improving Image Captioning using An Image
  Conditioned Autoencoder
Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Jialin Wu
Raymond J. Mooney
53
0
0
31 Oct 2019
ViGGO: A Video Game Corpus for Data-To-Text Generation in Open-Domain
  Conversation
ViGGO: A Video Game Corpus for Data-To-Text Generation in Open-Domain Conversation
Juraj Juraska
Kevin K. Bowden
M. Walker
65
44
0
26 Oct 2019
Diverse Video Captioning Through Latent Variable Expansion
Diverse Video Captioning Through Latent Variable Expansion
Huanhou Xiao
Jinglun Shi
DiffM
95
15
0
26 Oct 2019
TCT: A Cross-supervised Learning Method for Multimodal Sequence
  Representation
TCT: A Cross-supervised Learning Method for Multimodal Sequence Representation
Wubo Li
Wei Zou
Xiangang Li
ViT
21
0
0
23 Oct 2019
Imperial College London Submission to VATEX Video Captioning Task
Imperial College London Submission to VATEX Video Captioning Task
Ozan Caglayan
Zixiu "Alex" Wu
Pranava Madhyastha
Josiah Wang
Lucia Specia
20
0
0
16 Oct 2019
Exploring Overall Contextual Information for Image Captioning in
  Human-Like Cognitive Style
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style
Hongwei Ge
Zehang Yan
Kai Zhang
Mingde Zhao
Liang Sun
54
25
0
15 Oct 2019
VATEX Captioning Challenge 2019: Multi-modal Information Fusion and
  Multi-stage Training Strategy for Video Captioning
VATEX Captioning Challenge 2019: Multi-modal Information Fusion and Multi-stage Training Strategy for Video Captioning
Ziqi Zhang
Yaya Shi
Jiutong Wei
Chunfen Yuan
Bing Li
Weiming Hu
47
0
0
13 Oct 2019
Neural Generation for Czech: Data and Baselines
Neural Generation for Czech: Data and Baselines
Ondrej Dusek
Filip Jurvcívcek
97
22
0
11 Oct 2019
Automatic Quality Estimation for Natural Language Generation: Ranting
  (Jointly Rating and Ranking)
Automatic Quality Estimation for Natural Language Generation: Ranting (Jointly Rating and Ranking)
Ondrej Dusek
Karin Sevegnani
Ioannis Konstas
Verena Rieser
ALM
62
9
0
10 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic
  Explainability
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
162
29
0
07 Oct 2019
A Case Study on Combining ASR and Visual Features for Generating
  Instructional Video Captions
A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions
Jack Hessel
Bo Pang
Zhenhai Zhu
Radu Soricut
98
37
0
07 Oct 2019
Template-free Data-to-Text Generation of Finnish Sports News
Template-free Data-to-Text Generation of Finnish Sports News
Jenna Kanerva
Samuel Rönnqvist
Riina Kekki
T. Salakoski
Filip Ginter
66
19
0
04 Oct 2019
A Hierarchical Approach for Visual Storytelling Using Image Description
A Hierarchical Approach for Visual Storytelling Using Image Description
Md Sultan al Nahian
Tasmia Tasrin
Sagar Gandhi
Ryan Gaines
Brent Harrison
49
13
0
26 Sep 2019
Read, Attend and Comment: A Deep Architecture for Automatic News Comment
  Generation
Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation
Ze Yang
Can Xu
Wei Wu
Zhoujun Li
3DV
80
29
0
26 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image
  Captioning with Neural Scene Graph Generators
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
67
37
0
22 Sep 2019
Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event
  Captioning
Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning
Tanzila Rahman
Bicheng Xu
Leonid Sigal
75
81
0
22 Sep 2019
Visuallly Grounded Generation of Entailments from Premises
Visuallly Grounded Generation of Entailments from Premises
Somayeh Jafaritazehjani
Albert Gatt
Marc Tanti
LRM
46
1
0
21 Sep 2019
Adaptively Aligned Image Captioning via Adaptive Attention Time
Adaptively Aligned Image Captioning via Adaptive Attention Time
Lun Huang
Wenmin Wang
Yaxian Xia
Jie Chen
74
63
0
19 Sep 2019
ContCap: A scalable framework for continual image captioning
ContCap: A scalable framework for continual image captioning
Giang Nguyen
Tae Joon Jun
T. Tran
Tolcha Yalew
Daeyoung Kim
VLMCLL
63
10
0
19 Sep 2019
Inverse Visual Question Answering with Multi-Level Attentions
Inverse Visual Question Answering with Multi-Level Attentions
Yaser Alwatter
Yuhong Guo
BDL
35
1
0
17 Sep 2019
Communication-based Evaluation for Natural Language Generation
Communication-based Evaluation for Natural Language Generation
Benjamin Newman
Reuben Cohn-Gordon
Christopher Potts
54
7
0
16 Sep 2019
VizSeq: A Visual Analysis Toolkit for Text Generation Tasks
VizSeq: A Visual Analysis Toolkit for Text Generation Tasks
Changhan Wang
Anirudh Jain
Danlu Chen
Jiatao Gu
74
29
0
12 Sep 2019
What Makes A Good Story? Designing Composite Rewards for Visual
  Storytelling
What Makes A Good Story? Designing Composite Rewards for Visual Storytelling
Junjie Hu
Yu Cheng
Zhe Gan
Jingjing Liu
Jianfeng Gao
Graham Neubig
73
67
0
11 Sep 2019
Compositional Generalization in Image Captioning
Compositional Generalization in Image Captioning
Mitja Nikolaus
Mostafa Abdou
Matthew Lamm
Rahul Aralikatte
Desmond Elliott
CoGe
89
49
0
10 Sep 2019
Learning Actions from Human Demonstration Video for Robotic Manipulation
Learning Actions from Human Demonstration Video for Robotic Manipulation
Shuo Yang
Wei Zhang
Weizhi Lu
Hesheng Wang
Yibin Li
42
26
0
10 Sep 2019
Neural Naturalist: Generating Fine-Grained Image Comparisons
Neural Naturalist: Generating Fine-Grained Image Comparisons
Maxwell Forbes
Christine Kaeser-Chen
Piyush Sharma
Serge J. Belongie
VLM
136
58
0
09 Sep 2019
Hierarchy Parsing for Image Captioning
Hierarchy Parsing for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
VLM
96
166
0
09 Sep 2019
Transfer Reward Learning for Policy Gradient-Based Text Generation
Transfer Reward Learning for Policy Gradient-Based Text Generation
James OÑeill
Danushka Bollegala
25
1
0
09 Sep 2019
Conditional Text Generation for Harmonious Human-Machine Interaction
Conditional Text Generation for Harmonious Human-Machine Interaction
Bin Guo
Hao Wang
Yasan Ding
Wei Wu
Shaoyang Hao
Yueqi Sun
Zhiwen Yu
103
4
0
08 Sep 2019
Quality Estimation for Image Captions Based on Large-scale Human
  Evaluations
Quality Estimation for Image Captions Based on Large-scale Human Evaluations
Tomer Levinboim
Ashish V. Thapliyal
Piyush Sharma
Radu Soricut
54
27
0
08 Sep 2019
Look and Modify: Modification Networks for Image Captioning
Look and Modify: Modification Networks for Image Captioning
Fawaz Sammani
Mahmoud Elsayed
52
22
0
07 Sep 2019
MoverScore: Text Generation Evaluating with Contextualized Embeddings
  and Earth Mover Distance
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
Wei Zhao
Maxime Peyrard
Fei Liu
Yang Gao
Christian M. Meyer
Steffen Eger
246
602
0
05 Sep 2019
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
Wei Wei
Ling Cheng
Xian-Ling Mao
Guangyou Zhou
Feida Zhu
DiffM
79
19
0
05 Sep 2019
REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image
  Captioning
REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning
Ming Jiang
Junjie Hu
Qiuyuan Huang
Lei Zhang
Jana Diesner
Jianfeng Gao
62
15
0
05 Sep 2019
Image Captioning with Very Scarce Supervised Data: Adversarial
  Semi-Supervised Learning Approach
Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
In So Kweon
SSLVLM
89
56
0
05 Sep 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic
  Labels Improve Image Captioning and Visual Question Answering
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Soravit Changpinyo
Bo Pang
Piyush Sharma
Radu Soricut
ObjD
58
20
0
04 Sep 2019
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
Ming Jiang
Qiuyuan Huang
Lei Zhang
Xin Eric Wang
Pengchuan Zhang
Zhe Gan
Jana Diesner
Jianfeng Gao
108
68
0
04 Sep 2019
Cosmos QA: Machine Reading Comprehension with Contextual Commonsense
  Reasoning
Cosmos QA: Machine Reading Comprehension with Contextual Commonsense Reasoning
Lifu Huang
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
AIMatRALMLRM
146
459
0
31 Aug 2019
Reflective Decoding Network for Image Captioning
Reflective Decoding Network for Image Captioning
Lei Ke
Wenjie Pei
Ruiyu Li
Xiaoyong Shen
Yu-Wing Tai
ObjD
60
94
0
30 Aug 2019
Aesthetic Image Captioning From Weakly-Labelled Photographs
Aesthetic Image Captioning From Weakly-Labelled Photographs
Koustav Ghosal
A. Rana
A. Smolic
67
25
0
29 Aug 2019
Out the Window: A Crowd-Sourced Dataset for Activity Classification in
  Security Video
Out the Window: A Crowd-Sourced Dataset for Activity Classification in Security Video
Greg Castañón
N. Shnidman
Tim Anderson
J. Byrne
44
2
0
28 Aug 2019
Image Captioning with Sparse Recurrent Neural Network
Image Captioning with Sparse Recurrent Neural Network
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
47
6
0
28 Aug 2019
DeepCopy: Grounded Response Generation with Hierarchical Pointer
  Networks
DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks
Semih Yavuz
Abhinav Rastogi
Guan-Lin Chao
Dilek Z. Hakkani-Tür
80
80
0
28 Aug 2019
Controllable Video Captioning with POS Sequence Guidance Based on Gated
  Fusion Network
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
132
163
0
27 Aug 2019
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Iro Laina
Christian Rupprecht
Nassir Navab
SSL
76
103
0
25 Aug 2019
ViCo: Word Embeddings from Visual Co-occurrences
ViCo: Word Embeddings from Visual Co-occurrences
Tanmay Gupta
Alex Schwing
Derek Hoiem
65
25
0
22 Aug 2019
Attention on Attention for Image Captioning
Attention on Attention for Image Captioning
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
87
836
0
19 Aug 2019
Abductive Commonsense Reasoning
Abductive Commonsense Reasoning
Chandra Bhagavatula
Ronan Le Bras
Chaitanya Malaviya
Keisuke Sakaguchi
Ari Holtzman
Hannah Rashkin
Doug Downey
Scott Yih
Yejin Choi
ReLMLRM
130
464
0
15 Aug 2019
Unpaired Cross-lingual Image Caption Generation with Self-Supervised
  Rewards
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards
Yuqing Song
Shizhe Chen
Yida Zhao
Qin Jin
SSL
52
41
0
15 Aug 2019
Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling
Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling
Yi-Ting Yeh
Tzu-Chuan Lin
Hsiao-Hua Cheng
Yuanyuan Deng
Shang-Yu Su
Yun-Nung Chen
74
16
0
14 Aug 2019
Previous
123...353637...424344
Next