ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.5726
  4. Cited By
CIDEr: Consensus-based Image Description Evaluation
v1v2 (latest)

CIDEr: Consensus-based Image Description Evaluation

20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
ArXiv (abs)PDFHTML

Papers citing "CIDEr: Consensus-based Image Description Evaluation"

50 / 2,183 papers shown
Title
Regularizing RNNs for Caption Generation by Reconstructing The Past with
  The Present
Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Xinpeng Chen
Lin Ma
Wenhao Jiang
Jian Yao
Wen Liu
88
92
0
30 Mar 2018
Reconstruction Network for Video Captioning
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
121
319
0
30 Mar 2018
Neural Baby Talk
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
242
436
0
27 Mar 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with
  Partially Labeled Data
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Hongsheng Li
Jing Shao
Dapeng Chen
Xiaogang Wang
93
133
0
22 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning
End-to-End Video Captioning with Multitask Reinforcement Learning
Lijun Li
Boqing Gong
71
56
0
21 Mar 2018
Unpaired Image Captioning by Language Pivoting
Unpaired Image Captioning by Language Pivoting
Jiuxiang Gu
Shafiq Joty
Jianfei Cai
G. Wang
89
83
0
14 Mar 2018
Discriminability objective for training descriptive captions
Discriminability objective for training descriptive captions
Ruotian Luo
Brian L. Price
Scott D. Cohen
Gregory Shakhnarovich
134
203
0
12 Mar 2018
Less Is More: Picking Informative Frames for Video Captioning
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
Wentao Zhang
Qingming Huang
82
201
0
05 Mar 2018
Neural Aesthetic Image Reviewer
Neural Aesthetic Image Reviewer
Wenshan Wang
Su Yang
Weishan Zhang
Jiulong Zhang
58
39
0
28 Feb 2018
VizWiz Grand Challenge: Answering Visual Questions from Blind People
VizWiz Grand Challenge: Answering Visual Questions from Blind People
Danna Gurari
Qing Li
Abigale Stangl
Anhong Guo
Chi Lin
Kristen Grauman
Jiebo Luo
Jeffrey P. Bigham
CoGe
171
864
0
22 Feb 2018
Attentive Tensor Product Learning
Attentive Tensor Product Learning
Qiuyuan Huang
Li Deng
D. Wu
Chang Liu
Xiaodong He
82
23
0
20 Feb 2018
Multimodal Explanations: Justifying Decisions and Pointing to the
  Evidence
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Anna Rohrbach
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
87
424
0
15 Feb 2018
Multimodal Image Captioning for Marketing Analysis
Multimodal Image Captioning for Marketing Analysis
Philipp Harzig
Stephan Brehm
Rainer Lienhart
Carolin Kaiser
René Schallner
36
10
0
06 Feb 2018
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis
Seunghoon Hong
Dingdong Yang
Jongwook Choi
Honglak Lee
EGVM
142
337
0
16 Jan 2018
Consensus-based Sequence Training for Video Captioning
Consensus-based Sequence Training for Video Captioning
Sang Phan Le
G. Henter
Yusuke Miyao
Shiníchi Satoh
3DV
32
22
0
27 Dec 2017
Exploring Models and Data for Remote Sensing Image Caption Generation
Exploring Models and Data for Remote Sensing Image Caption Generation
Xiaoqiang Lu
Binqiang Wang
Xiangtao Zheng
Xuelong Li
67
481
0
21 Dec 2017
Convolutional Image Captioning
Convolutional Image Captioning
J. Aneja
Aditya Deshpande
Alex Schwing
VLM
137
361
0
24 Nov 2017
On the Automatic Generation of Medical Imaging Reports
On the Automatic Generation of Medical Imaging Reports
Baoyu Jing
P. Xie
Eric Xing
MedIm
97
516
0
22 Nov 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder
  with an Additive Gaussian Encoding Space
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
Liwei Wang
Alex Schwing
Svetlana Lazebnik
CoGe
111
175
0
19 Nov 2017
AI Challenger : A Large-scale Dataset for Going Deeper in Image
  Understanding
AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding
Jiahong Wu
He Zheng
Bo Zhao
Yixin Li
Baoming Yan
...
Shipei Zhou
G. Lin
Yanwei Fu
Yizhou Wang
Yonggang Wang
VLM
98
152
0
17 Nov 2017
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval
  with Generative Models
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models
Jiuxiang Gu
Jianfei Cai
Shafiq Joty
Li Niu
G. Wang
VLM
122
361
0
17 Nov 2017
Grounded Objects and Interactions for Video Captioning
Grounded Objects and Interactions for Video Captioning
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
52
6
0
16 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video
  Understanding
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
83
145
0
16 Nov 2017
A Novel Framework for Robustness Analysis of Visual QA Models
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Guohao Li
AAMLOOD
82
34
0
16 Nov 2017
Phrase-based Image Captioning with Hierarchical LSTM Model
Phrase-based Image Captioning with Hierarchical LSTM Model
Y. Tan
Chee Seng Chan
VLM
31
4
0
11 Nov 2017
Object Referring in Visual Scene with Spoken Language
Object Referring in Visual Scene with Spoken Language
A. Vasudevan
Dengxin Dai
Luc Van Gool
98
19
0
10 Nov 2017
Image Captioning and Classification of Dangerous Situations
Image Captioning and Classification of Dangerous Situations
Octavio Arriaga
Paul G. Plöger
Matias Valdenegro-Toro
37
8
0
07 Nov 2017
Evaluation of Automatic Video Captioning Using Direct Assessment
Evaluation of Automatic Video Captioning Using Direct Assessment
Yvette Graham
G. Awad
Alan F. Smeaton
58
30
0
29 Oct 2017
A Neural-Symbolic Approach to Design of CAPTCHA
A Neural-Symbolic Approach to Design of CAPTCHA
Qiuyuan Huang
P. Smolensky
Xiaodong He
Li Deng
D. Wu
AAML
63
1
0
29 Oct 2017
InterpNET: Neural Introspection for Interpretable Deep Learning
InterpNET: Neural Introspection for Interpretable Deep Learning
Shane T. Barratt
65
20
0
26 Oct 2017
Feedback-prop: Convolutional Neural Network Inference under Partial
  Evidence
Feedback-prop: Convolutional Neural Network Inference under Partial Evidence
Tianlu Wang
Kota Yamaguchi
Vicente Ordonez
92
12
0
23 Oct 2017
Describing Natural Images Containing Novel Objects with Knowledge Guided
  Assitance
Describing Natural Images Containing Novel Objects with Knowledge Guided Assitance
Aditya Mogadala
Umanga Bista
Lexing Xie
Achim Rettinger
58
7
0
17 Oct 2017
Contrastive Learning for Image Captioning
Contrastive Learning for Image Captioning
Bo Dai
Dahua Lin
SSLVLM
105
194
0
06 Oct 2017
Cold-Start Reinforcement Learning with Softmax Policy Gradient
Cold-Start Reinforcement Learning with Softmax Policy Gradient
Nan Ding
Radu Soricut
64
46
0
27 Sep 2017
Tensor Product Generation Networks for Deep NLP Modeling
Tensor Product Generation Networks for Deep NLP Modeling
Qiuyuan Huang
P. Smolensky
Xiaodong He
Li Deng
D. Wu
87
3
0
26 Sep 2017
Visual Question Generation as Dual Task of Visual Question Answering
Visual Question Generation as Dual Task of Visual Question Answering
Yikang Li
Nan Duan
Bolei Zhou
Xiao Chu
Wanli Ouyang
Xiaogang Wang
98
166
0
21 Sep 2017
Learning Functional Causal Models with Generative Neural Networks
Learning Functional Causal Models with Generative Neural Networks
Hugo Jair Escalante
Sergio Escalera
Xavier Baro
Isabelle M Guyon
Umut Güçlü
Marcel van Gerven
CMLBDL
107
108
0
15 Sep 2017
Self-Guiding Multimodal LSTM - when we do not have a perfect training
  dataset for image captioning
Self-Guiding Multimodal LSTM - when we do not have a perfect training dataset for image captioning
Yang Xian
Yingli Tian
VLM
59
23
0
15 Sep 2017
Robustness Analysis of Visual QA Models by Basic Questions
Robustness Analysis of Visual QA Models by Basic Questions
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
C. Huck Yang
Guohao Li
OOD
65
24
0
14 Sep 2017
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Jiuxiang Gu
Jianfei Cai
G. Wang
Tsuhan Chen
110
181
0
11 Sep 2017
Video Captioning with Guidance of Multimodal Latent Topics
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen
Jia Chen
Qin Jin
Alexander G. Hauptmann
116
67
0
31 Aug 2017
Generating Video Descriptions with Topic Guidance
Generating Video Descriptions with Topic Guidance
Shizhe Chen
Jia Chen
Qin Jin
88
21
0
31 Aug 2017
VQS: Linking Segmentations to Questions and Answers for Supervised
  Attention in VQA and Question-Focused Semantic Segmentation
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
106
127
0
15 Aug 2017
Fluency-Guided Cross-Lingual Image Captioning
Fluency-Guided Cross-Lingual Image Captioning
Weiyu Lan
Xirong Li
Jianfeng Dong
71
95
0
15 Aug 2017
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video
  Captioning
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning
Jingkuan Song
Yuyu Guo
Lianli Gao
Xuelong Li
Alan Hanjalic
Heng Tao Shen
104
222
0
08 Aug 2017
Reinforced Video Captioning with Entailment Rewards
Reinforced Video Captioning with Entailment Rewards
Ramakanth Pasunuru
Joey Tianyi Zhou
73
115
0
07 Aug 2017
What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption
  Generator?
What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator?
Marc Tanti
Albert Gatt
K. Camilleri
48
56
0
07 Aug 2017
Referenceless Quality Estimation for Natural Language Generation
Referenceless Quality Estimation for Natural Language Generation
Ondrej Dusek
Jekaterina Novikova
Verena Rieser
96
29
0
05 Aug 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual
  Question Answering
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
224
4,231
0
25 Jul 2017
OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts
OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts
Xuwang Yin
Vicente Ordonez
VLM
100
55
0
22 Jul 2017
Previous
123...4041424344
Next