2308.14328
Reinforcement Learning for Generative AI: A Survey
28 August 2023
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
Papers citing "Reinforcement Learning for Generative AI: A Survey"
50 / 205 papers shown
Title
Toward Diverse Text Generation with Inverse Reinforcement Learning
Zhan Shi
Xinchi Chen
Xipeng Qiu
Xuanjing Huang
65
104
0
30 Apr 2018
Learning to Extract Coherent Summary via Deep Reinforcement Learning
Yuxiang Wu
Baotian Hu
AI4TS
60
170
0
19 Apr 2018
Ranking Sentences for Extractive Summarization with Reinforcement Learning
Shashi Narayan
Shay B. Cohen
Mirella Lapata
201
551
0
23 Feb 2018
Efficient Neural Architecture Search via Parameter Sharing
Hieu H. Pham
M. Guan
Barret Zoph
Quoc V. Le
J. Dean
135
2,775
0
09 Feb 2018
SCH-GAN: Semi-supervised Cross-modal Hashing by Generative Adversarial Network
Jian Zhang
Yuxin Peng
Mingkuan Yuan
GAN
81
118
0
07 Feb 2018
DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text
Jingjing Xu
Xuancheng Ren
Junyang Lin
Xu Sun
76
144
0
05 Feb 2018
MaskGAN: Better Text Generation via Filling in the______
W. Fedus
Ian Goodfellow
Andrew M. Dai
123
470
0
23 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
319
8,439
0
04 Jan 2018
Hierarchical Text Generation and Planning for Strategic Dialogue
Denis Yarats
M. Lewis
89
56
0
15 Dec 2017
Deep Reinforcement Learning for De-Novo Drug Design
Mariya Popova
Olexandr Isayev
Alexander Tropsha
103
1,035
0
29 Nov 2017
A Survey on Dialogue Systems: Recent Advances and New Frontiers
Hongshen Chen
Xiaorui Liu
Dawei Yin
Jiliang Tang
VLM
LLMAG
100
704
0
06 Nov 2017
Long Text Generation via Adversarial Training with Leaked Information
Jiaxian Guo
Sidi Lu
Han Cai
Weinan Zhang
Yong Yu
Jun Wang
GAN
87
500
0
24 Sep 2017
A Deep Reinforcement Learning Chatbot
Iulian Serban
Chinnadhurai Sankar
M. Germain
Saizheng Zhang
Zhouhan Lin
...
Dendi Suhubdy
Vincent Michalski
A. Nguyen
Joelle Pineau
Yoshua Bengio
106
236
0
07 Sep 2017
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
Khanh Nguyen
Hal Daumé
Jordan L. Boyd-Graber
85
138
0
24 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
604
19,318
0
20 Jul 2017
Actor-Critic Sequence Training for Image Captioning
Li Zhang
Flood Sung
Feng Liu
Tao Xiang
S. Gong
Yongxin Yang
Timothy M. Hospedales
76
111
0
29 Jun 2017
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
M. Lewis
Denis Yarats
Yann N. Dauphin
Devi Parikh
Dhruv Batra
LLMAG
113
415
0
16 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
845
132,854
0
12 Jun 2017
Adversarial Ranking for Language Generation
Kevin Lin
Dianqi Li
Xiaodong He
Zhengyou Zhang
Ming-Ting Sun
GAN
92
334
0
31 May 2017
Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models
G. L. Guimaraes
Benjamín Sánchez-Lengeling
Carlos Outeiral
Pedro Luis Cunha Farias
Alán Aspuru-Guzik
GAN
99
525
0
30 May 2017
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
218
1,560
0
11 May 2017
Molecular De Novo Design through Deep Reinforcement Learning
Marcus Olivecrona
T. Blaschke
Ola Engkvist
Hongming Chen
BDL
156
1,020
0
25 Apr 2017
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Zhou Ren
Xiaoyu Wang
Ning Zhang
Xutao Lv
Li Li
65
324
0
12 Apr 2017
Sentence Simplification with Deep Reinforcement Learning
Xingxing Zhang
Mirella Lapata
73
398
0
31 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
853
11,971
0
09 Mar 2017
FeUdal Networks for Hierarchical Reinforcement Learning
A. Vezhnevets
Simon Osindero
Tom Schaul
N. Heess
Max Jaderberg
David Silver
Koray Kavukcuoglu
FedML
100
907
0
03 Mar 2017
End-to-End Task-Completion Neural Dialogue Systems
Xiujun Li
Yun-Nung Chen
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
96
371
0
03 Mar 2017
Boundary-Seeking Generative Adversarial Networks
R. Devon Hjelm
Athul Paul Jacob
Tong Che
Adam Trischler
Kyunghyun Cho
Yoshua Bengio
GAN
97
170
0
27 Feb 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
120
1,350
0
27 Feb 2017
Maximum-Likelihood Augmented Discrete Generative Adversarial Networks
Tong Che
Yanran Li
Ruixiang Zhang
R. Devon Hjelm
Wenjie Li
Yangqiu Song
Yoshua Bengio
GAN
79
235
0
26 Feb 2017
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning
Jason D. Williams
Kavosh Asadi
Geoffrey Zweig
OffRL
79
335
0
10 Feb 2017
Beam Search Strategies for Neural Machine Translation
Markus Freitag
Yaser Al-Onaizan
124
396
0
06 Feb 2017
Adversarial Learning for Neural Dialogue Generation
Jiwei Li
Will Monroe
Tianlin Shi
Sébastien Jean
Alan Ritter
Dan Jurafsky
101
899
0
23 Jan 2017
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
111
1,893
0
02 Dec 2016
GuessWhat?! Visual object discovery through multi-modal dialogue
H. D. Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
VLM
110
428
0
23 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
149
173
0
09 Nov 2016
Designing Neural Network Architectures using Reinforcement Learning
Bowen Baker
O. Gupta
Nikhil Naik
Ramesh Raskar
137
1,472
0
07 Nov 2016
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
517
5,388
0
05 Nov 2016
Personalizing a Dialogue System with Transfer Reinforcement Learning
Kaixiang Mo
Shuangyin Li
Yu Zhang
Jiajun Li
Qiang Yang
OffRL
114
93
0
10 Oct 2016
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Lantao Yu
Weinan Zhang
Jun Wang
Yong Yu
GAN
72
2,409
0
18 Sep 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Mohammad Norouzi
Samy Bengio
Zhifeng Chen
Navdeep Jaitly
M. Schuster
Yonghui Wu
Dale Schuurmans
109
253
0
01 Sep 2016
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
Tiancheng Zhao
M. Eskénazi
103
265
0
08 Jun 2016
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
290
1,341
0
05 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
225
5,088
0
05 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
212
8,883
0
04 Feb 2016
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
495
2,580
0
25 Jan 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,774
0
10 Dec 2015
Sequence Level Training with Recurrent Neural Networks
Marc'Aurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
132
1,619
0
20 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
362
13,297
0
09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
154
3,442
0
08 Jun 2015