ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.00563
  4. Cited By
Self-critical Sequence Training for Image Captioning

Self-critical Sequence Training for Image Captioning

2 December 2016
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
ArXivPDFHTML

Papers citing "Self-critical Sequence Training for Image Captioning"

50 / 858 papers shown
Title
I Want This Product but Different : Multimodal Retrieval with Synthetic Query Expansion
Ivona Tautkute
Tomasz Trzciñski
34
4
0
17 Feb 2021
Image Captioning using Multiple Transformers for Self-Attention
  Mechanism
Image Captioning using Multiple Transformers for Self-Attention Mechanism
Farrukh Olimov
Shikha Dubey
Labina Shrestha
Tran Trung Tin
M. Jeon
ViT
34
2
0
14 Feb 2021
The MSR-Video to Text Dataset with Clean Annotations
The MSR-Video to Text Dataset with Clean Annotations
Haoran Chen
Jianmin Li
Simone Frintrop
Xiaolin Hu
29
18
0
12 Feb 2021
In Defense of Scene Graphs for Image Captioning
In Defense of Scene Graphs for Image Captioning
Kien Nguyen
Subarna Tripathi
Bang Du
T. Guha
Truong Thao Nguyen
39
42
0
09 Feb 2021
DRAG: Director-Generator Language Modelling Framework for Non-Parallel
  Author Stylized Rewriting
DRAG: Director-Generator Language Modelling Framework for Non-Parallel Author Stylized Rewriting
Hrituraj Singh
Gaurav Verma
Aparna Garimella
Balaji Vasan Srinivasan
DiffM
25
5
0
28 Jan 2021
Scheduled Sampling in Vision-Language Pretraining with Decoupled
  Encoder-Decoder Network
Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network
Yehao Li
Yingwei Pan
Ting Yao
Jingwen Chen
Tao Mei
VLM
29
52
0
27 Jan 2021
CPTR: Full Transformer Network for Image Captioning
CPTR: Full Transformer Network for Image Captioning
Wei Liu
Sihan Chen
Longteng Guo
Xinxin Zhu
Jing Liu
ViT
18
141
0
26 Jan 2021
ECOL-R: Encouraging Copying in Novel Object Captioning with
  Reinforcement Learning
ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
Yufei Wang
Ian D. Wood
Stephen Wan
Mark Johnson
28
7
0
25 Jan 2021
Fast Sequence Generation with Multi-Agent Reinforcement Learning
Fast Sequence Generation with Multi-Agent Reinforcement Learning
Longteng Guo
Jing Liu
Xinxin Zhu
Hanqing Lu
LRM
56
6
0
24 Jan 2021
Visual Question Answering based on Local-Scene-Aware Referring
  Expression Generation
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Jungjun Kim
Dong-Gyu Lee
Jialin Wu
Hong G Jung
Seong-Whan Lee
ObjD
30
21
0
22 Jan 2021
Macroscopic Control of Text Generation for Image Captioning
Macroscopic Control of Text Generation for Image Captioning
Zhangzi Zhu
Tianlei Wang
Hong Qu
34
4
0
20 Jan 2021
Diagnostic Captioning: A Survey
Diagnostic Captioning: A Survey
John Pavlopoulos
Vasiliki Kougia
Ion Androutsopoulos
D. Papamichail
3DV
MedIm
91
26
0
18 Jan 2021
Dual-Level Collaborative Transformer for Image Captioning
Dual-Level Collaborative Transformer for Image Captioning
Yunpeng Luo
Jiayi Ji
Xiaoshuai Sun
Liujuan Cao
Yongjian Wu
Feiyue Huang
Chia-Wen Lin
Rongrong Ji
ViT
19
274
0
16 Jan 2021
Persistent Anti-Muslim Bias in Large Language Models
Persistent Anti-Muslim Bias in Large Language Models
Abubakar Abid
Maheen Farooqi
James Zou
AILaw
42
539
0
14 Jan 2021
Exploration of Visual Features and their weighted-additive fusion for
  Video Captioning
Exploration of Visual Features and their weighted-additive fusion for Video Captioning
V. PraveenS.
Akhilesh Bharadwaj
Harsh Raj
Janhavi Dadhania
Ganesh Samarth C.A
Nikhil Pareek
S. M. I. S. R. Mahadeva Prasanna
35
1
0
14 Jan 2021
Unifying Relational Sentence Generation and Retrieval for Medical Image
  Report Composition
Unifying Relational Sentence Generation and Retrieval for Medical Image Report Composition
Fuyu Wang
Xiaodan Liang
Lin Xu
Liang Lin
MedIm
34
25
0
09 Jan 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD
VLM
265
157
0
02 Jan 2021
SDA: Improving Text Generation with Self Data Augmentation
SDA: Improving Text Generation with Self Data Augmentation
Ping Yu
Ruiyi Zhang
Yang Zhao
Yizhe Zhang
Chunyuan Li
Changyou Chen
33
2
0
02 Jan 2021
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
37
66
0
31 Dec 2020
Neural Text Generation with Artificial Negative Examples
Neural Text Generation with Artificial Negative Examples
Keisuke Shirai
Kazuma Hashimoto
Akiko Eriguchi
Takashi Ninomiya
Shinsuke Mori
13
7
0
28 Dec 2020
SubICap: Towards Subword-informed Image Captioning
SubICap: Towards Subword-informed Image Captioning
Naeha Sharif
Bennamoun
Wei Liu
Syed Afaq Ali Shah
30
2
0
24 Dec 2020
Pattern-aware Data Augmentation for Query Rewriting in Voice Assistant
  Systems
Pattern-aware Data Augmentation for Query Rewriting in Voice Assistant Systems
Yunmo Chen
Sixing Lu
Fan Yang
Xiaojiang Huang
Xing Fan
Chenlei Guo
30
3
0
21 Dec 2020
Exploring Fluent Query Reformulations with Text-to-Text Transformers and
  Reinforcement Learning
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning
Jerry Zikun Chen
S. Yu
Haoran Wang
259
5
0
18 Dec 2020
AutoCaption: Image Captioning with Neural Architecture Search
AutoCaption: Image Captioning with Neural Architecture Search
Xinxin Zhu
Weining Wang
Longteng Guo
Jing Liu
32
9
0
16 Dec 2020
Intrinsic Image Captioning Evaluation
Intrinsic Image Captioning Evaluation
Chao Zeng
Sam Kwong
21
0
0
14 Dec 2020
Improving Image Captioning by Leveraging Intra- and Inter-layer Global
  Representation in Transformer Network
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
54
170
0
13 Dec 2020
Image Captioning with Context-Aware Auxiliary Guidance
Image Captioning with Context-Aware Auxiliary Guidance
Zeliang Song
Xiaofei Zhou
Zhendong Mao
Jianlong Tan
41
31
0
10 Dec 2020
Understanding Guided Image Captioning Performance across Domains
Understanding Guided Image Captioning Performance across Domains
Edwin G. Ng
Bo Pang
P. Sharma
Radu Soricut
37
24
0
04 Dec 2020
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Dave Zhenyu Chen
A. Gholami
Matthias Nießner
Angel X. Chang
3DPC
23
161
0
03 Dec 2020
A Comprehensive Review on Recent Methods and Challenges of Video
  Description
A Comprehensive Review on Recent Methods and Challenges of Video Description
Ashutosh Kumar Singh
Thoudam Doren Singh
Sivaji Bandyopadhyay
3DV
VLM
21
5
0
30 Nov 2020
Multimodal Learning for Hateful Memes Detection
Multimodal Learning for Hateful Memes Detection
Yi Zhou
Zhenhao Chen
24
56
0
25 Nov 2020
SuperOCR: A Conversion from Optical Character Recognition to Image
  Captioning
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning
Baohua Sun
Michael Lin
Hao Sha
Lin Yang
27
5
0
21 Nov 2020
Structural and Functional Decomposition for Personality Image Captioning
  in a Communication Game
Structural and Functional Decomposition for Personality Image Captioning in a Communication Game
Minh-Thu Nguyen
Duy Phung
Minh Hoai
Thien Huu Nguyen
38
4
0
17 Nov 2020
Reinforced Medical Report Generation with X-Linear Attention and
  Repetition Penalty
Reinforced Medical Report Generation with X-Linear Attention and Repetition Penalty
Wenting Xu
Chang Qi
Zhenghua Xu
Thomas Lukasiewicz
MedIm
20
4
0
16 Nov 2020
DORB: Dynamically Optimizing Multiple Rewards with Bandits
DORB: Dynamically Optimizing Multiple Rewards with Bandits
Ramakanth Pasunuru
Han Guo
Joey Tianyi Zhou
OffRL
34
6
0
15 Nov 2020
CapWAP: Captioning with a Purpose
CapWAP: Captioning with a Purpose
Adam Fisch
Kenton Lee
Ming-Wei Chang
J. Clark
Regina Barzilay
13
11
0
09 Nov 2020
Dual Attention on Pyramid Feature Maps for Image Captioning
Dual Attention on Pyramid Feature Maps for Image Captioning
Litao Yu
Jian Zhang
Qiang Wu
24
47
0
02 Nov 2020
Exploring Question-Specific Rewards for Generating Deep Questions
Exploring Question-Specific Rewards for Generating Deep Questions
Yuxi Xie
Liangming Pan
Dongzhe Wang
Min-Yen Kan
Yansong Feng
53
27
0
02 Nov 2020
Boost Image Captioning with Knowledge Reasoning
Boost Image Captioning with Knowledge Reasoning
Feicheng Huang
Zhixin Li
Haiyang Wei
Canlong Zhang
Huifang Ma
17
25
0
02 Nov 2020
Generating Radiology Reports via Memory-driven Transformer
Generating Radiology Reports via Memory-driven Transformer
Zhihong Chen
Yan Song
Tsung-Hui Chang
Xiang Wan
MedIm
30
466
0
30 Oct 2020
POMO: Policy Optimization with Multiple Optima for Reinforcement
  Learning
POMO: Policy Optimization with Multiple Optima for Reinforcement Learning
Yeong-Dae Kwon
Jinho Choo
Byoungjip Kim
Iljoo Yoon
Youngjune Gwon
Seungjai Min
26
317
0
30 Oct 2020
Less is More: Data-Efficient Complex Question Answering over Knowledge
  Bases
Less is More: Data-Efficient Complex Question Answering over Knowledge Bases
Yuncheng Hua
Yuan-Fang Li
Guilin Qi
Wei Wu
Jingyao Zhang
Daiqing Qi
29
28
0
29 Oct 2020
Quantifying Learnability and Describability of Visual Concepts Emerging
  in Representation Learning
Quantifying Learnability and Describability of Visual Concepts Emerging in Representation Learning
Iro Laina
Ruth C. Fong
Andrea Vedaldi
OCL
33
13
0
27 Oct 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual
  Questions
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
43
22
0
24 Oct 2020
Show and Speak: Directly Synthesize Spoken Description of Images
Show and Speak: Directly Synthesize Spoken Description of Images
Xinsheng Wang
Siyuan Feng
Jihua Zhu
M. Hasegawa-Johnson
O. Scharenborg
26
4
0
23 Oct 2020
Learning to Optimise General TSP Instances
Learning to Optimise General TSP Instances
N. Sultana
Jeffrey Chan
•. A. K. Qin
Tabinda Sarwar
63
13
0
23 Oct 2020
Incorporating Stylistic Lexical Preferences in Generative Language
  Models
Incorporating Stylistic Lexical Preferences in Generative Language Models
Hrituraj Singh
Gaurav Verma
Balaji Vasan Srinivasan
14
5
0
22 Oct 2020
Sample Efficient Reinforcement Learning with REINFORCE
Sample Efficient Reinforcement Learning with REINFORCE
Junzi Zhang
Jongho Kim
Brendan O'Donoghue
Stephen P. Boyd
46
101
0
22 Oct 2020
PARENTing via Model-Agnostic Reinforcement Learning to Correct
  Pathological Behaviors in Data-to-Text Generation
PARENTing via Model-Agnostic Reinforcement Learning to Correct Pathological Behaviors in Data-to-Text Generation
Clément Rebuffel
Laure Soulier
Geoffrey Scoutheeten
Patrick Gallinari
18
9
0
21 Oct 2020
Bayesian Attention Modules
Bayesian Attention Modules
Xinjie Fan
Shujian Zhang
Bo Chen
Mingyuan Zhou
117
59
0
20 Oct 2020
Previous
123...91011...161718
Next