1907.12894
Cited By
Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation
30 July 2019
Yang Gao
Christian M. Meyer
Mohsen Mesgar
Iryna Gurevych
Papers citing
"Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation"
17 / 17 papers shown
Title
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan Z. Sheng
Julian McAuley
Lina Yao
SyDa
123
13
0
28 Aug 2023
Why is constrained neural language generation particularly challenging?
Cristina Garbacea
Qiaozhu Mei
96
15
0
11 Jun 2022
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
82
393
0
15 Nov 2018
APRIL: Interactively Learning to Summarise by Combining Active Preference Learning and Reinforcement Learning
Yang Gao
Christian M. Meyer
Iryna Gurevych
43
34
0
29 Aug 2018
Improving Abstraction in Text Summarization
Wojciech Kryściński
Romain Paulus
Caiming Xiong
R. Socher
47
147
0
23 Aug 2018
The price of debiasing automatic metrics in natural language evaluation
Arun Tejasvi Chaganty
Stephen Mussmann
Percy Liang
53
117
0
06 Jul 2018
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
Julia Kreutzer
Joshua Uyheng
Stefan Riezler
50
86
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
58
83
0
26 May 2018
Discourse-Aware Neural Rewards for Coherent Text Generation
Antoine Bosselut
Asli Celikyilmaz
Xiaodong He
Jianfeng Gao
Po-Sen Huang
Yejin Choi
68
79
0
10 May 2018
Learning to Extract Coherent Summary via Deep Reinforcement Learning
Yuxiang Wu
Baotian Hu
AI4TS
38
170
0
19 Apr 2018
On Learning Intrinsic Rewards for Policy Gradient Methods
Zeyu Zheng
Junhyuk Oh
Satinder Singh
57
205
0
17 Apr 2018
Ranking Sentences for Extractive Summarization with Reinforcement Learning
Shashi Narayan
Shay B. Cohen
Mirella Lapata
151
548
0
23 Feb 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
77
601
0
15 Jan 2018
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
Khanh Nguyen
Hal Daumé
Jordan L. Boyd-Graber
62
138
0
24 Jul 2017
Sentence Simplification with Deep Reinforcement Learning
Xingxing Zhang
Mirella Lapata
57
398
0
31 Mar 2017
Sequence Level Training with Recurrent Neural Networks
Marc'Aurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
96
1,614
0
20 Nov 2015
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
597
13,416
0
25 Aug 2014