Reinforcement Learning for Generative AI: A Survey

28 August 2023
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
    SyDa

Papers citing "Reinforcement Learning for Generative AI: A Survey"

50 / 205 papers shown
Toward Diverse Text Generation with Inverse Reinforcement Learning
Zhan Shi
Xinchi Chen
Xipeng Qiu
Xuanjing Huang
65
104
0
30 Apr 2018
Learning to Extract Coherent Summary via Deep Reinforcement Learning
Yuxiang Wu
Baotian Hu
AI4TS
60
170
0
19 Apr 2018
Ranking Sentences for Extractive Summarization with Reinforcement Learning
Shashi Narayan
Shay B. Cohen
Mirella Lapata
201
551
0
23 Feb 2018
Efficient Neural Architecture Search via Parameter Sharing
Hieu H. Pham
M. Guan
Barret Zoph
Quoc V. Le
J. Dean
135
2,775
0
09 Feb 2018
SCH-GAN: Semi-supervised Cross-modal Hashing by Generative Adversarial Network
Jian Zhang
Yuxin Peng
Mingkuan Yuan
GAN
81
118
0
07 Feb 2018
DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text
Jingjing Xu
Xuancheng Ren
Junyang Lin
Xu Sun
76
144
0
05 Feb 2018
MaskGAN: Better Text Generation via Filling in the ______
W. Fedus
Ian Goodfellow
Andrew M. Dai
123
470
0
23 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
319
8,439
0
04 Jan 2018
Hierarchical Text Generation and Planning for Strategic Dialogue
Denis Yarats
M. Lewis
89
56
0
15 Dec 2017
Deep Reinforcement Learning for De-Novo Drug Design
Mariya Popova
Olexandr Isayev
Alexander Tropsha
103
1,035
0
29 Nov 2017
A Survey on Dialogue Systems: Recent Advances and New Frontiers
Hongshen Chen
Xiaorui Liu
Dawei Yin
Jiliang Tang
VLM
LLMAG
100
704
0
06 Nov 2017
Long Text Generation via Adversarial Training with Leaked Information
Jiaxian Guo
Sidi Lu
Han Cai
Weinan Zhang
Yong Yu
Jun Wang
GAN
87
500
0
24 Sep 2017
A Deep Reinforcement Learning Chatbot
Iulian Serban
Chinnadhurai Sankar
M. Germain
Saizheng Zhang
Zhouhan Lin
...
Dendi Suhubdy
Vincent Michalski
A. Nguyen
Joelle Pineau
Yoshua Bengio
106
236
0
07 Sep 2017
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
Khanh Nguyen
Hal Daumé
Jordan L. Boyd-Graber
85
138
0
24 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
604
19,318
0
20 Jul 2017
Actor-Critic Sequence Training for Image Captioning
Li Zhang
Flood Sung
Feng Liu
Tao Xiang
S. Gong
Yongxin Yang
Timothy M. Hospedales
76
111
0
29 Jun 2017
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
M. Lewis
Denis Yarats
Yann N. Dauphin
Devi Parikh
Dhruv Batra
LLMAG
113
415
0
16 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
845
132,854
0
12 Jun 2017
Adversarial Ranking for Language Generation
Kevin Lin
Dianqi Li
Xiaodong He
Zhengyou Zhang
Ming-Ting Sun
GAN
92
334
0
31 May 2017
Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models
G. L. Guimaraes
Benjamín Sánchez-Lengeling
Carlos Outeiral
Pedro Luis Cunha Farias
Alán Aspuru-Guzik
GAN
99
525
0
30 May 2017
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
218
1,560
0
11 May 2017
Molecular De Novo Design through Deep Reinforcement Learning
Marcus Olivecrona
T. Blaschke
Ola Engkvist
Hongming Chen
BDL
156
1,020
0
25 Apr 2017
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Zhou Ren
Xiaoyu Wang
Ning Zhang
Xutao Lv
Li Li
65
324
0
12 Apr 2017
Sentence Simplification with Deep Reinforcement Learning
Xingxing Zhang
Mirella Lapata
73
398
0
31 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
853
11,971
0
09 Mar 2017
FeUdal Networks for Hierarchical Reinforcement Learning
A. Vezhnevets
Simon Osindero
Tom Schaul
N. Heess
Max Jaderberg
David Silver
Koray Kavukcuoglu
FedML
100
907
0
03 Mar 2017
End-to-End Task-Completion Neural Dialogue Systems
Xiujun Li
Yun-Nung Chen
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
96
371
0
03 Mar 2017
Boundary-Seeking Generative Adversarial Networks
R. Devon Hjelm
Athul Paul Jacob
Tong Che
Adam Trischler
Kyunghyun Cho
Yoshua Bengio
GAN
97
170
0
27 Feb 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
120
1,350
0
27 Feb 2017
Maximum-Likelihood Augmented Discrete Generative Adversarial Networks
Tong Che
Yanran Li
Ruixiang Zhang
R. Devon Hjelm
Wenjie Li
Yangqiu Song
Yoshua Bengio
GAN
79
235
0
26 Feb 2017
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning
Jason D. Williams
Kavosh Asadi
Geoffrey Zweig
OffRL
79
335
0
10 Feb 2017
Beam Search Strategies for Neural Machine Translation
Markus Freitag
Yaser Al-Onaizan
124
396
0
06 Feb 2017
Adversarial Learning for Neural Dialogue Generation
Jiwei Li
Will Monroe
Tianlin Shi
Sébastien Jean
Alan Ritter
Dan Jurafsky
101
899
0
23 Jan 2017
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
111
1,893
0
02 Dec 2016
GuessWhat?! Visual object discovery through multi-modal dialogue
Harm de Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
VLM
110
428
0
23 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
149
173
0
09 Nov 2016
Designing Neural Network Architectures using Reinforcement Learning
Bowen Baker
O. Gupta
Nikhil Naik
Ramesh Raskar
137
1,472
0
07 Nov 2016
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
517
5,388
0
05 Nov 2016
Personalizing a Dialogue System with Transfer Reinforcement Learning
Kaixiang Mo
Shuangyin Li
Yu Zhang
Jiajun Li
Qiang Yang
OffRL
114
93
0
10 Oct 2016
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Lantao Yu
Weinan Zhang
Jun Wang
Yong Yu
GAN
72
2,409
0
18 Sep 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Mohammad Norouzi
Samy Bengio
Zhifeng Chen
Navdeep Jaitly
M. Schuster
Yonghui Wu
Dale Schuurmans
109
253
0
01 Sep 2016
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
Tiancheng Zhao
M. Eskénazi
103
265
0
08 Jun 2016
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
290
1,341
0
05 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
225
5,088
0
05 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
212
8,883
0
04 Feb 2016
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
495
2,580
0
25 Jan 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,774
0
10 Dec 2015
Sequence Level Training with Recurrent Neural Networks
Marc'Aurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
132
1,619
0
20 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
362
13,297
0
09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
154
3,442
0
08 Jun 2015