v1v2 (latest)

CIDEr: Consensus-based Image Description Evaluation

20 November 2014

Ramakrishna Vedantam

C. L. Zitnick

Devi Parikh

ArXiv (abs)PDF HTML

Papers citing "CIDEr: Consensus-based Image Description Evaluation"

50 / 2,184 papers shown

Title
Subspace Representations for Soft Set Operations and Sentence Similarities Yoichi Ishibashi Sho Yokoi Katsuhito Sudoh Satoshi Nakamura NAI 66 1 0 24 Oct 2022
Retrieval Augmentation for Commonsense Reasoning: A Unified Approach Wenhao Yu Chenguang Zhu Zhihan Zhang Shuohang Wang Zhuosheng Zhang Yuwei Fang Meng Jiang LRM ReLM 66 19 0 23 Oct 2022
ComFact: A Benchmark for Linking Contextual Commonsense Knowledge Silin Gao Jena D. Hwang Saya Kanno Hiromi Wakaki Yuki Mitsufuji Antoine Bosselut HILM 92 16 0 23 Oct 2022
Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation Xueliang Zhao Yuxuan Wang Chongyang Tao Chenshuo Wang Dongyan Zhao 71 6 0 22 Oct 2022
Improving the Factual Correctness of Radiology Report Generation with Semantic Rewards Jean-Benoit Delbrouck Pierre J. Chambon Christian Blüthgen E. Tsai Omar Almusa C. Langlotz MedIm 116 81 0 21 Oct 2022
Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning Xingwei He Yeyun Gong Alex Jin Weizhen Qi Hang Zhang Jian Jiao Bartuer Zhou Biao Cheng Sm Yiu Nan Duan 64 11 0 21 Oct 2022
Image-Text Retrieval with Binary and Continuous Label Supervision Zheng Li Caili Guo Zerun Feng Lei Li Ying Jin Yufeng Zhang VLM 71 4 0 20 Oct 2022
Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation Yu Zhao Jianguo Wei Zhichao Lin Yueheng Sun Meishan Zhang Hao Fei 79 16 0 20 Oct 2022
Prophet Attention: Predicting Attention with Future Attention for Image Captioning Fenglin Liu Xuancheng Ren Xian Wu Wei Fan Yuexian Zou Xu Sun 114 48 0 19 Oct 2022
Grounded Video Situation Recognition Zeeshan Khan C. V. Jawahar Makarand Tapaswi 100 14 0 19 Oct 2022
Summary Workbench: Unifying Application and Evaluation of Text Summarization Models S. Syed Dominik Schwabe Martin Potthast 47 0 0 18 Oct 2022
Probing Cross-modal Semantics Alignment Capability from the Textual Perspective Zheng Ma Shi Zong Mianzhi Pan Jianbing Zhang Shujian Huang Xinyu Dai Jiajun Chen 59 4 0 18 Oct 2022
Social Biases in Automatic Evaluation Metrics for NLG Mingqi Gao Xiaojun Wan 59 3 0 17 Oct 2022
Hybrid Reinforced Medical Report Generation with M-Linear Attention and Repetition Penalty Wenting Xu Zhenghua Xu Junyang Chen Chang Qi Thomas Lukasiewicz MedIm 69 8 0 14 Oct 2022
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning Tiannan Wang Wangchunshu Zhou Yan Zeng Xinsong Zhang VLM 82 44 0 14 Oct 2022
LEATHER: A Framework for Learning to Generate Human-like Text in Dialogue Anthony Sicilia Malihe Alikhani 115 4 0 14 Oct 2022
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training Wenliang Dai Zihan Liu Ziwei Ji Jane Polak Scowcroft Pascale Fung MLLM VLM 94 67 0 14 Oct 2022
Few-Shot Visual Question Generation: A Novel Task and Benchmark Datasets Anurag Roy David Johnson Ekka Saptarshi Ghosh Abir Das 62 1 0 13 Oct 2022
OpenCQA: Open-ended Question Answering with Charts Shankar Kantharaj Do Xuan Long Rixie Tiffany Ko Leong J. Tan Enamul Hoque Shafiq Joty 83 53 0 12 Oct 2022
DATScore: Evaluating Translation with Data Augmented Translations Moussa Kamal Eddine Guokan Shang Michalis Vazirgiannis 73 5 0 12 Oct 2022
On Text Style Transfer via Style Masked Language Models Sharan Narasimhan P. Shekar Suvodip Dey M. Desarkar 59 0 0 12 Oct 2022
Automated Audio Captioning via Fusion of Low- and High- Dimensional Features Jianyuan Sun Xubo Liu Xinhao Mei Mark D. Plumbley V. Kılıç Wenwu Wang 80 3 0 10 Oct 2022
Not All Errors are Equal: Learning Text Generation Metrics using Stratified Error Synthesis Wenda Xu Yi-Lin Tuan Yujie Lu Michael Stephen Saxon Lei Li William Yang Wang 118 22 0 10 Oct 2022
Generating image captions with external encyclopedic knowledge S. Nikiforova Tejaswini Deoskar Denis Paperno Yoad Winter 72 2 0 10 Oct 2022
Improving Multi-turn Emotional Support Dialogue Generation with Lookahead Strategy Planning Yi Cheng Wenge Liu Wenjie Li Jiashuo Wang Ruihui Zhao Bang Liu Xiaodan Liang Yefeng Zheng 94 55 0 09 Oct 2022
CHARD: Clinical Health-Aware Reasoning Across Dimensions for Text Generation Models Steven Y. Feng Vivek Khetan Bogdan Sacaleanu A. Gershman Eduard H. Hovy LRM 90 10 0 09 Oct 2022
VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment Shraman Pramanick Li Jing Sayan Nag Jiachen Zhu Hardik Shah Yann LeCun Ramalingam Chellappa 82 22 0 09 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation Wanrong Zhu An Yan Yujie Lu Wenda Xu Xinze Wang Miguel P. Eckstein William Yang Wang 130 36 0 07 Oct 2022
Unsupervised Neural Stylistic Text Generation using Transfer learning and Adapters Vinayshekhar Bannihatti Kumar Rashmi Gangadharaiah Dan Roth 71 1 0 07 Oct 2022
Multiview Contextual Commonsense Inference: A New Dataset and Task Siqi Shen Deepanway Ghosal Navonil Majumder Henry Lim Rada Mihalcea Soujanya Poria LRM 74 12 0 06 Oct 2022
What Should the System Do Next?: Operative Action Captioning for Estimating System Actions Taiki Nakamura Seiya Kawano Akishige Yuguchi Yasutomo Kawanishi Koichiro Yoshino 114 0 0 06 Oct 2022
Progressive Text-to-Image Generation Zhengcong Fei Mingyuan Fan Li Zhu Junshi Huang 162 4 0 05 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data Ye Zhu Yuehua Wu N. Sebe Yan Yan 114 19 0 05 Oct 2022
Affection: Learning Affective Explanations for Real-World Visual Data Panos Achlioptas M. Ovsjanikov Leonidas Guibas Sergey Tulyakov 109 12 0 04 Oct 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning Xu Yang Hanwang Zhang Chongyang Gao Jianfei Cai MLLM 87 10 0 04 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization Rajkumar Ramamurthy Prithviraj Ammanabrolu Kianté Brantley Jack Hessel R. Sifa Christian Bauckhage Hannaneh Hajishirzi Yejin Choi OffRL 105 250 0 03 Oct 2022
Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity Swapnil Bhosale Rupayan Chakraborty Sunil Kumar Kopparapu 65 1 0 03 Oct 2022
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation R. Ramos Bruno Martins Desmond Elliott Yova Kementchedjhieva VLM 89 89 0 30 Sep 2022
Linearly Mapping from Image to Text Space Jack Merullo Louis Castricato Carsten Eickhoff Ellie Pavlick VLM 251 118 0 30 Sep 2022
MUG: Interactive Multimodal Grounding on User Interfaces Tao Li Gang Li Jingjie Zheng Purple Wang Yang Li LLMAG 84 9 0 29 Sep 2022
REST: REtrieve & Self-Train for generative action recognition Adrian Bulat Enrique Sanchez Brais Martínez Georgios Tzimiropoulos VLM 61 4 0 29 Sep 2022
Medical Image Captioning via Generative Pretrained Transformers Alexander Selivanov Oleg Y. Rogov Daniil Chesakov Artem Shelmanov Irina Fedulova Dmitry V. Dylov MedIm 102 64 0 28 Sep 2022
Thinking Hallucination for Video Captioning Nasib Ullah Partha Pratim Mohanta VLM 84 6 0 28 Sep 2022
Improving Radiology Report Generation Systems by Removing Hallucinated References to Non-existent Priors Vignav Ramesh Nathan Chi Pranav Rajpurkar MedIm 93 50 0 27 Sep 2022
Word to Sentence Visual Semantic Similarity for Caption Generation: Lessons Learned Ahmed Sabir 130 0 0 26 Sep 2022
DRAMA: Joint Risk Localization and Captioning in Driving Srikanth Malla Chiho Choi Isht Dwivedi Joonhyang Choi Jiachen Li 183 100 0 22 Sep 2022
INFINITY: A Simple Yet Effective Unsupervised Framework for Graph-Text Mutual Conversion Yi Xu Luoyi Fu Zhouhan Lin Jiexing Qi Xinbing Wang 77 3 0 22 Sep 2022
Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia K. Nguyen Ali Furkan Biten Andrés Mafla Lluís Gómez Dimosthenis Karatzas 67 11 0 21 Sep 2022
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering Hao Li Jinfa Huang Peng Jin Guoli Song Qi Wu Jie Chen 145 22 0 21 Sep 2022
Recipe Generation from Unsegmented Cooking Videos Taichi Nishimura Atsushi Hashimoto Yoshitaka Ushiku Hirotaka Kameko Shinsuke Mori 44 3 0 21 Sep 2022