2308.14328
Reinforcement Learning for Generative AI: A Survey
28 August 2023
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
Papers citing "Reinforcement Learning for Generative AI: A Survey"
50 / 205 papers shown
Title
How to Train Your Energy-Based Models
Yang Song
Diederik P. Kingma
DiffM
95
265
0
09 Jan 2021
A Distributional Approach to Controlled Text Generation
Muhammad Khalifa
Hady ElSahar
Marc Dymetman
167
119
0
21 Dec 2020
Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks
Julia Kreutzer
Stefan Riezler
Carolin (Haas) Lawrence
RALM
OffRL
65
15
0
04 Nov 2020
Improving Dialog Systems for Negotiation with Personality Modeling
Runzhe Yang
Jingxiao Chen
Karthik Narasimhan
106
50
0
20 Oct 2020
Text Generation by Learning from Demonstrations
Richard Yuanzhe Pang
He He
OffRL
70
80
0
16 Sep 2020
Learning to summarize from human feedback
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
265
2,194
0
02 Sep 2020
Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning
D. Mohan
R. Lenain
Lorenzo Foglianti
Tian Huey Teh
Marlene Staib
Alexandra Torresquintero
Jiameng Gao
AI4TS
53
11
0
07 Aug 2020
CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search
Xin Chen
Yawen Duan
Zewei Chen
Hang Xu
Zihao Chen
Xiaodan Liang
Tong Zhang
Zhenguo Li
OffRL
75
21
0
18 Jul 2020
Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search
Yuan Tian
Qin Wang
Zhiwu Huang
Wen Li
Dengxin Dai
Minghao Yang
Jun Wang
Olga Fink
OffRL
91
61
0
17 Jul 2020
Unsupervised Paraphrasing via Deep Reinforcement Learning
A.B. Siddique
Samet Oymak
Vagelis Hristidis
126
58
0
05 Jul 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
842
18,437
0
19 Jun 2020
Sample-Efficient Optimization in the Latent Space of Deep Generative Models via Weighted Retraining
Austin Tripp
Erik A. Daxberger
José Miguel Hernández-Lobato
MedIm
97
142
0
16 Jun 2020
ColdGANs: Taming Language GANs with Cautious Sampling Strategies
Thomas Scialom
Paul-Alexis Dray
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
GAN
SyDa
68
18
0
08 Jun 2020
TAG : Type Auxiliary Guiding for Code Comment Generation
Ruichu Cai
Zhihao Liang
Boyan Xu
Zijian Li
Yuexing Hao
Yao-Liang Chen
38
26
0
06 May 2020
Reward Constrained Interactive Recommendation with Natural Language Feedback
Ruiyi Zhang
Tong Yu
Yilin Shen
Hongxia Jin
Changyou Chen
Lawrence Carin
67
17
0
04 May 2020
An Imitation Game for Learning Semantic Parsers from User Interaction
Ziyu Yao
Yiqi Tang
Wen-tau Yih
Huan Sun
Yu-Chuan Su
82
35
0
02 May 2020
Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning
S. Gottipati
B. Sattarov
Sufeng Niu
Yashaswi Pathak
Haoran Wei
...
Simon R. Blackburn
Connor W. Coley
Jian Tang
Sarath Chandar
Yoshua Bengio
77
110
0
26 Apr 2020
Reinforced Curriculum Learning on Pre-trained Neural Machine Translation Models
Mingjun Zhao
Haijiang Wu
Di Niu
Xiaoli Wang
72
42
0
13 Apr 2020
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
412
1,994
0
11 Apr 2020
TextGAIL: Generative Adversarial Imitation Learning for Text Generation
Qingyang Wu
Lei Li
Zhou Yu
GAN
78
50
0
07 Apr 2020
Modeling 3D Shapes by Reinforcement Learning
Cheng Lin
Tingxiang Fan
Wenping Wang
Matthias Nießner
OffRL
3DV
59
37
0
27 Mar 2020
Reinforcement Learning for Molecular Design Guided by Quantum Mechanics
G. Simm
Robert Pinsler
José Miguel Hernández-Lobato
AI4CE
86
85
0
18 Feb 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
575
10,591
0
17 Feb 2020
Self-Adversarial Learning with Comparative Discrimination for Text Generation
Wangchunshu Zhou
Tao Ge
Ke Xu
Furu Wei
Ming Zhou
56
20
0
31 Jan 2020
Goal-directed graph construction using reinforcement learning
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
63
15
0
30 Jan 2020
PCGRL: Procedural Content Generation via Reinforcement Learning
Ahmed Khalifa
Philip Bontrager
Sam Earle
Julian Togelius
69
146
0
24 Jan 2020
Distributional Reinforcement Learning for Energy-Based Sequential Models
Tetiana Parshakova
J. Andreoli
Marc Dymetman
83
21
0
18 Dec 2019
InfoCNF: An Efficient Conditional Continuous Normalizing Flow with Adaptive Solvers
T. Nguyen
Animesh Garg
Richard G. Baraniuk
Anima Anandkumar
TPM
104
9
0
09 Dec 2019
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
Paul Hongsuck Seo
Piyush Sharma
Tomer Levinboim
Bohyung Han
Radu Soricut
OffRL
67
22
0
21 Nov 2019
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
496
1,771
0
18 Sep 2019
Hierarchical Reinforcement Learning for Open-Domain Dialog
Abdelrhman Saleh
Natasha Jaques
Asma Ghandeharioun
J. Shen
Rosalind W. Picard
OffRL
86
59
0
17 Sep 2019
ARAML: A Stable Adversarial Training Framework for Text Generation
Pei Ke
Fei Huang
Minlie Huang
Xiaoyan Zhu
GAN
52
23
0
20 Aug 2019
Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation
Yang Gao
Christian M. Meyer
Mohsen Mesgar
Iryna Gurevych
93
23
0
30 Jul 2019
On the Weaknesses of Reinforcement Learning for Neural Machine Translation
Leshem Choshen
Lior Fox
Zohar Aizenbud
Omri Abend
131
110
0
03 Jul 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
143
343
0
30 Jun 2019
Preference-based Interactive Multi-Document Summarisation
Yang Gao
Christian M. Meyer
Iryna Gurevych
41
27
0
07 Jun 2019
RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion
M. Sarmad
H. J. Lee
Y. Kim
3DPC
90
181
0
28 Apr 2019
CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning
Ziyu Yao
Jayavardhan Reddy Peddamail
Huan Sun
70
102
0
13 Mar 2019
IRLAS: Inverse Reinforcement Learning for Architecture Search
Minghao Guo
Zhaobai Zhong
Wei Wu
Dahua Lin
Junjie Yan
3DV
88
37
0
13 Dec 2018
GuacaMol: Benchmarking Models for De Novo Molecular Design
Nathan Brown
Marco Fiscato
Marwin H. S. Segler
Alain C. Vaucher
ELM
123
716
0
22 Nov 2018
Improving Automatic Source Code Summarization via Deep Reinforcement Learning
Yao Wan
Zhou Zhao
Min Yang
Guandong Xu
Haochao Ying
Jian Wu
Philip S. Yu
72
392
0
17 Nov 2018
Reinforcement Learning for Improving Agent Design
David R Ha
106
127
0
09 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
132
777
0
06 Oct 2018
Towards one-shot learning for rare-word translation with external experts
Ngoc-Quan Pham
Jan Niehues
A. Waibel
AAML
49
24
0
10 Sep 2018
A Study of Reinforcement Learning for Neural Machine Translation
Lijun Wu
Fei Tian
Tao Qin
Jianhuang Lai
Tie-Yan Liu
OffRL
68
183
0
27 Aug 2018
MolGAN: An implicit generative model for small molecular graphs
Nicola De Cao
Thomas Kipf
GNN
GAN
181
930
0
30 May 2018
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
Yen-Chun Chen
Joey Tianyi Zhou
BDL
198
584
0
28 May 2018
Detecting Deceptive Reviews using Generative Adversarial Networks
H. Aghakhani
Aravind Machiry
Shirin Nilizadeh
Christopher Krügel
Giovanni Vigna
82
79
0
25 May 2018
A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization
Li Wang
Junlin Yao
Yunzhe Tao
Li Zhong
Wen Liu
Q. Du
BDL
107
136
0
09 May 2018
A Reinforcement Learning Approach to Interactive-Predictive Neural Machine Translation
Tsz Kin Lam
Julia Kreutzer
Stefan Riezler
79
32
0
03 May 2018