ResearchTrend.AI
Self-critical Sequence Training for Image Captioning

2 December 2016
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel

Papers citing "Self-critical Sequence Training for Image Captioning"

50 / 858 papers shown
Title
Keep It Private: Unsupervised Privatization of Online Text
Calvin Bao
Marine Carpuat
DeLMO
37
3
0
16 May 2024
Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation
Hao Wang
Tetsuro Morimura
Ukyo Honda
Daisuke Kawahara
21
0
0
02 May 2024
Guiding Attention in End-to-End Driving Models
Diego Porres
Yi Xiao
Gabriel Villalonga
Alexandre Levy
Antonio M. López
39
0
0
30 Apr 2024
Filtered Direct Preference Optimization
Tetsuro Morimura
Mitsuki Sakamoto
Yuu Jinnai
Kenshi Abe
Kaito Ariu
48
13
0
22 Apr 2024
Sentiment-oriented Transformer-based Variational Autoencoder Network for Live Video Commenting
Fengyi Fu
Shancheng Fang
Weidong Chen
Zhendong Mao
ViT
VGen
34
4
0
19 Apr 2024
Beyond Average: Individualized Visual Scanpath Prediction
Xianyu Chen
Ming Jiang
Qi Zhao
29
5
0
18 Apr 2024
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
Yue Jiang
Zixin Guo
Hamed R. Tavakoli
Luis A. Leiva
Antti Oulasvirta
31
5
0
15 Apr 2024
Memory-based Cross-modal Semantic Alignment Network for Radiology Report Generation
Yitian Tao
Liyan Ma
Jing Yu
Han Zhang
MedIm
34
6
0
31 Mar 2024
Semi-Supervised Image Captioning Considering Wasserstein Graph Matching
Yang Yang
41
0
0
26 Mar 2024
Self-Improvement for Neural Combinatorial Optimization: Sample without Replacement, but Improvement
Jonathan Pirnay
D. G. Grimm
56
11
0
22 Mar 2024
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Zeyu Han
Chao Gao
Jinyang Liu
Jeff Zhang
Sai Qian Zhang
150
319
0
21 Mar 2024
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation
Do June Min
Verónica Pérez-Rosas
Kenneth Resnicow
Rada Mihalcea
OffRL
53
2
0
20 Mar 2024
Graph Attention Network-based Block Propagation with Optimal AoI and Reputation in Web 3.0
Jiana Liao
Jinbo Wen
Jiawen Kang
Changyan Yi
Yang Zhang
Yutao Jiao
Dusit Niyato
Dong In Kim
Shengli Xie
29
4
0
20 Mar 2024
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu
Xiaojun Lin
Shuhui Wang
Weiguo Sheng
Qingming Huang
Jun-chen Yu
3DV
59
10
0
12 Mar 2024
Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback
L. AdarshN
V. ArunP
L. AravindhN
37
1
0
11 Mar 2024
How to Understand Named Entities: Using Common Sense for News Captioning
Ning Xu
Yanhui Wang
Tingting Zhang
Hongshuo Tian
Mohan Kankanhalli
An-An Liu
40
0
0
11 Mar 2024
An Efficient Learning-based Solver Comparable to Metaheuristics for the Capacitated Arc Routing Problem
Runze Guo
Feng Xue
Anlong Ming
N. Sebe
50
0
0
11 Mar 2024
Rule-driven News Captioning
Ning Xu
Tingting Zhang
Hongshuo Tian
An-An Liu
68
0
0
08 Mar 2024
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
Saeed Najafi
Alona Fyshe
37
1
0
04 Mar 2024
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Haowei Liu
Yaya Shi
Haiyang Xu
Chunfen Yuan
Qinghao Ye
...
Mingshi Yan
Ji Zhang
Fei Huang
Bing Li
Weiming Hu
VLM
43
0
0
01 Mar 2024
VIXEN: Visual Text Comparison Network for Image Difference Captioning
Alexander Black
Jing Shi
Yifei Fan
Tu Bui
John Collomosse
52
5
0
29 Feb 2024
Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Yuiga Wada
Kanta Kaneda
Daichi Saito
Komei Sugiura
39
24
0
28 Feb 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
39
5
0
24 Feb 2024
MerRec: A Large-scale Multipurpose Mercari Dataset for Consumer-to-Consumer Recommendation Systems
Lichi Li
Zainul Din
Zhen Tan
Sam London
Tianlong Chen
Ajay Daptardar
49
0
0
22 Feb 2024
Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning
Antoine Chaffin
Ewa Kijak
Vincent Claveau
47
0
0
21 Feb 2024
Cobra Effect in Reference-Free Image Captioning Metrics
Zheng Ma
Changxin Wang
Yawen Ouyang
Fei Zhao
Jianbing Zhang
Shujian Huang
Jiajun Chen
38
2
0
18 Feb 2024
EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models
Jun Gao
Huan Zhao
Wei Wang
Changlong Yu
Ruifeng Xu
OffRL
27
4
0
18 Feb 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
34
23
0
13 Feb 2024
Intensive Vision-guided Network for Radiology Report Generation
Fudan Zheng
Mengfei Li
Ying Wang
Weijiang Yu
Ruixuan Wang
Zhiguang Chen
Nong Xiao
Yutong Lu
33
1
0
06 Feb 2024
ARGS: Alignment as Reward-Guided Search
Maxim Khanov
Jirayu Burapacheep
Yixuan Li
40
47
0
23 Jan 2024
Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion
Aly M. Kassem
Sherif Saad
AAML
25
1
0
21 Jan 2024
KTVIC: A Vietnamese Image Captioning Dataset on the Life Domain
Anh-Cuong Pham
Van-Quang Nguyen
Thi-Hong Vuong
Quang-Thuy Ha
29
1
0
16 Jan 2024
Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection
Wei Ye
Chaoya Jiang
Haiyang Xu
Chenhao Ye
Chenliang Li
Mingshi Yan
Shikun Zhang
Songhang Huang
Fei Huang
VLM
39
0
0
11 Jan 2024
Enhancing Multimodal Understanding with CLIP-Based Image-to-Text Transformation
Change Che
Qunwei Lin
Xinyu Zhao
Jiaxin Huang
Liqiang Yu
VLM
17
37
0
02 Jan 2024
CamPro: Camera-based Anti-Facial Recognition
Wenjun Zhu
Yuan Sun
Jiani Liu
Yushi Cheng
Xiaoyu Ji
Wenyuan Xu
PICV
31
1
0
30 Dec 2023
LLM4VG: Large Language Models Evaluation for Video Grounding
Wei Feng
Xin Wang
Hong Chen
Zeyang Zhang
Zihan Song
Yuwei Zhou
Wenwu Zhu
39
8
0
21 Dec 2023
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Shraman Pramanick
Guangxing Han
Rui Hou
Sayan Nag
Ser-Nam Lim
Nicolas Ballas
Qifan Wang
Rama Chellappa
Amjad Almahairi
VLM
MLLM
48
29
0
19 Dec 2023
UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts
Chenlu Zhan
Yufei Zhang
Yu Lin
Gaoang Wang
Hongwei Wang
VLM
MedIm
42
5
0
18 Dec 2023
TiMix: Text-aware Image Mixing for Effective Vision-Language Pre-training
Chaoya Jiang
Wei Ye
Haiyang Xu
Qinghao Ye
Mingshi Yan
Ji Zhang
Shikun Zhang
CLIP
VLM
27
4
0
14 Dec 2023
RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
Jiashuo Fan
Yaoyuan Liang
Leyao Liu
Shao-Lun Huang
Lei Zhang
30
2
0
11 Dec 2023
Mitigating Open-Vocabulary Caption Hallucinations
Assaf Ben-Kish
Moran Yanuka
Morris Alper
Raja Giryes
Hadar Averbuch-Elor
MLLM
VLM
26
6
0
06 Dec 2023
MedXChat: A Unified Multimodal Large Language Model Framework towards CXRs Understanding and Generation
Ling Yang
Zhanyu Wang
Zhenghao Chen
Xinyu Liang
Luping Zhou
LM&MA
MedIm
64
6
0
04 Dec 2023
WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole-Slide Images
Pingyi Chen
Honglin Li
Chenglu Zhu
Sunyi Zheng
Zhongyi Shui
Lin Yang
29
7
0
27 Nov 2023
A Systematic Review of Deep Learning-based Research on Radiology Report Generation
Chang Liu
Yuanhe Tian
Yan Song
MedIm
44
16
0
23 Nov 2023
Trustworthy Large Models in Vision: A Survey
Ziyan Guo
Li Xu
Jun Liu
MU
66
0
0
16 Nov 2023
Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder
Abdelrahman Mohamed
Fakhraddin Alwajih
El Moatez Billah Nagoudi
Alcides Alcoba Inciarte
Muhammad Abdul-Mageed
VLM
MLLM
33
7
0
15 Nov 2023
Complex Organ Mask Guided Radiology Report Generation
Tiancheng Gu
Dongnan Liu
Zhiyuan Li
Weidong Cai
MedIm
32
14
0
04 Nov 2023
Learning A Multi-Task Transformer Via Unified And Customized Instruction Tuning For Chest Radiograph Interpretation
Lijian Xu
Ziyu Ni
Xinglong Liu
Xiaosong Wang
Hongsheng Li
Shaoting Zhang
MedIm
LM&MA
32
4
0
02 Nov 2023
Generating Context-Aware Natural Answers for Questions in 3D Scenes
Mohammed Munzer Dwedari
Matthias Niessner
Dave Zhenyu Chen
32
1
0
30 Oct 2023
Beyond MLE: Convex Learning for Text Generation
Chenze Shao
Zhengrui Ma
Min Zhang
Yang Feng
30
3
0
26 Oct 2023