Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.11258
Cited By
Toward Diverse Text Generation with Inverse Reinforcement Learning
30 April 2018
Zhan Shi
Xinchi Chen
Xipeng Qiu
Xuanjing Huang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Toward Diverse Text Generation with Inverse Reinforcement Learning"
15 / 15 papers shown
Title
Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks
Xinyu Wang
Jinbo Bi
Minghu Song
CLL
69
0
0
01 May 2025
It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF
Taiming Lu
Lingfeng Shen
Xinyu Yang
Weiting Tan
Beidi Chen
Huaxiu Yao
61
2
0
12 Jun 2024
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
46
10
0
28 Aug 2023
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking
Chris Cundy
Stefano Ermon
16
10
0
08 Jun 2023
Token Imbalance Adaptation for Radiology Report Generation
Yuexin Wu
I. Huang
Xiaolei Huang
MedIm
29
7
0
18 Apr 2023
A survey on text generation using generative adversarial networks
Gustavo de Rosa
João Paulo Papa
GAN
32
88
0
20 Dec 2022
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention
Wenhao Li
Xiaoyuan Yi
Jinyi Hu
Maosong Sun
Xing Xie
29
0
0
14 Nov 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRL
AI4TS
28
27
0
24 Oct 2022
Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods
Evan Crothers
Nathalie Japkowicz
H. Viktor
DeLMO
38
107
0
13 Oct 2022
Why is constrained neural language generation particularly challenging?
Cristina Garbacea
Qiaozhu Mei
59
14
0
11 Jun 2022
Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph Experts
W. Yu
Chenguang Zhu
Lianhui Qin
Zhihan Zhang
Tong Zhao
Meng Jiang
LRM
28
31
0
14 Mar 2022
DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer
Haozhe Ji
Minlie Huang
18
23
0
12 Oct 2021
Survey on reinforcement learning for language processing
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
24
101
0
12 Apr 2021
Adversarial Sub-sequence for Text Generation
Xingyuan Chen
Yanzhe Li
Peng Jin
Jiuhua Zhang
Xinyu Dai
Jiajun Chen
Gang Song
GAN
35
5
0
30 May 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
ELM
25
29
0
02 Jan 2019
1