2104.05565
Cited By
v3 (latest)
Survey on reinforcement learning for language processing
12 April 2021
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
Papers citing
"Survey on reinforcement learning for language processing"
50 / 83 papers shown
Title
Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning
Chi Zhang
Ziying Jia
George Atia
Sihong He
Yue Wang
105
0
0
24 May 2025
Slot: Provenance-Driven APT Detection through Graph Reinforcement Learning
Wei Qiao
Yebo Feng
Teng Li
Zijian Zhang
Zhengzi Xu
Zhuo Ma
Yulong Shen
112
0
0
23 Oct 2024
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan Z. Sheng
Julian McAuley
Lina Yao
SyDa
167
13
0
28 Aug 2023
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
298
30,152
0
01 Mar 2022
Generalization in Multimodal Language Learning from Simulation
Aaron Eisermann
Jae Hee Lee
C. Weber
S. Wermter
49
9
0
03 Aug 2021
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management
Zhi Chen
Lu Chen
Xiaoyuan Liu
Kai Yu
83
20
0
22 Sep 2020
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI
Katya Kudashkina
P. Pilarski
R. Sutton
KELM
77
6
0
27 Aug 2020
Crossmodal Language Grounding in an Embodied Neurocognitive Model
Stefan Heinrich
Yuan Yao
Tobias Hinz
Zhiyuan Liu
Thomas Hummel
Matthias Kerzel
C. Weber
S. Wermter
LM&Ro
56
19
0
24 Jun 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
904
42,463
0
28 May 2020
Curious Hierarchical Actor-Critic Reinforcement Learning
Frank Röder
Manfred Eppe
Phuong D. H. Nguyen
S. Wermter
57
21
0
07 May 2020
Towards Embodied Scene Description
Sinan Tan
Huaping Liu
Di Guo
Xinyu Zhang
F. Sun
LM&Ro
33
9
0
30 Apr 2020
Dual Learning for Semi-Supervised Natural Language Understanding
Su Zhu
Ruisheng Cao
Kai Yu
80
31
0
26 Apr 2020
You Impress Me: Dialogue Generation via Mutual Persona Perception
Qian Liu
Yihong Chen
B. Chen
Jian-Guang Lou
Zixuan Chen
Bin Zhou
Dongmei Zhang
72
169
0
11 Apr 2020
MQA: Answering the Question via Robotic Manipulation
Yuhong Deng
Di Guo
F. Sun
Naifu Zhang
Huaping Liu
Chen Pang
57
22
0
10 Mar 2020
Plato Dialogue System: A Flexible Conversational AI Research Platform
Alexandros Papangelis
Mahdi Namazifar
Chandra Khatri
Yi-Chia Wang
Piero Molino
Gokhan Tur
LLMAG
128
23
0
17 Jan 2020
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
106
330
0
04 Dec 2019
Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation
Yang Gao
Christian M. Meyer
Mohsen Mesgar
Iryna Gurevych
91
23
0
30 Jul 2019
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning
Alexandros Papangelis
Yi-Chia Wang
Piero Molino
Gokhan Tur
70
32
0
11 Jul 2019
Semantic Parsing with Dual Learning
Ruisheng Cao
Su Zhu
Chen Liu
Jieyu Li
Kai Yu
81
62
0
10 Jul 2019
Deep Reinforcement Learning For Modeling Chit-Chat Dialog With Discrete Attributes
Chinnadhurai Sankar
Sujith Ravi
OffRL
57
33
0
05 Jul 2019
Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
AI4CE
57
18
0
04 Jul 2019
A Survey of Reinforcement Learning Informed by Natural Language
Jelena Luketina
Nantas Nardelli
Gregory Farquhar
Jakob N. Foerster
Jacob Andreas
Edward Grefenstette
Shimon Whiteson
Tim Rocktäschel
LM&Ro
KELM
OffRL
LRM
101
282
0
10 Jun 2019
From semantics to execution: Integrating action planning with reinforcement learning for robotic causal problem-solving
Manfred Eppe
Phuong D. H. Nguyen
S. Wermter
62
42
0
23 May 2019
Improving interactive reinforcement learning: What makes a good teacher?
Francisco Cruz
S. Magg
Y. Nagai
S. Wermter
35
31
0
15 Apr 2019
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models
Tiancheng Zhao
Kaige Xie
M. Eskénazi
83
142
0
23 Feb 2019
Deep Intrinsically Motivated Continuous Actor-Critic for Efficient Robotic Visuomotor Skill Learning
Muhammad Burhan Hafez
C. Weber
Matthias Kerzel
S. Wermter
39
22
0
26 Oct 2018
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
181
144
0
15 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,229
0
11 Oct 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
145
676
0
21 Sep 2018
A Study of Reinforcement Learning for Neural Machine Translation
Lijun Wu
Fei Tian
Tao Qin
Jianhuang Lai
Tie-Yan Liu
OffRL
62
183
0
27 Aug 2018
Goal-oriented Dialogue Policy Learning from Failures
Keting Lu
Shiqi Zhang
Xiaoping Chen
OffRL
36
29
0
20 Aug 2018
Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning
Francisco Cruz
G. I. Parisi
S. Wermter
OffRL
33
29
0
26 Jul 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat
3DV
OffRL
70
211
0
24 May 2018
Toward Diverse Text Generation with Inverse Reinforcement Learning
Zhan Shi
Xinchi Chen
Xipeng Qiu
Xuanjing Huang
55
104
0
30 Apr 2018
Universal Sentence Encoder
Daniel Cer
Yinfei Yang
Sheng-yi Kong
Nan Hua
Nicole Limtiaco
...
Steve Yuan
Chris Tar
Yun-hsuan Sung
B. Strope
R. Kurzweil
446
1,907
0
29 Mar 2018
Quality expectations of machine translation
Andy Way
52
93
0
22 Mar 2018
Achieving Human Parity on Automatic Chinese to English News Translation
Hany Hassan
Anthony Aue
Chang Chen
Vishal Chowdhary
Jonathan Clark
...
Shuangzhi Wu
Yingce Xia
Dongdong Zhang
Zhirui Zhang
Ming Zhou
83
607
0
15 Mar 2018
Concatenated Power Mean Word Embeddings as Universal Cross-Lingual Sentence Representations
Andreas Rücklé
Steffen Eger
Maxime Peyrard
Iryna Gurevych
180
99
0
04 Mar 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
233
11,565
0
15 Feb 2018
From Eliza to XiaoIce: Challenges and Opportunities with Social Chatbots
H. Shum
Xiaodong He
Di Li
98
555
0
06 Jan 2018
Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning
Rajarshi Das
Shehzaad Dhuliawala
Manzil Zaheer
Luke Vilnis
Ishan Durugkar
A. Krishnamurthy
Alex Smola
Andrew McCallum
KELM
101
513
0
15 Nov 2017
Paraphrase Generation with Deep Reinforcement Learning
Zichao Li
Xin Jiang
Lifeng Shang
Hang Li
OffRL
103
214
0
01 Nov 2017
Long Text Generation via Adversarial Training with Leaked Information
Jiaxian Guo
Sidi Lu
Han Cai
Weinan Zhang
Yong Yu
Jun Wang
GAN
82
498
0
24 Sep 2017
DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning
Wenhan Xiong
Thi-Lan-Giao Hoang
William Yang Wang
100
728
0
20 Jul 2017
Adversarial Ranking for Language Generation
Kevin Qinghong Lin
Dianqi Li
Xiaodong He
Zhengyou Zhang
Ming-Ting Sun
GAN
89
333
0
31 May 2017
Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
Alexis Conneau
Douwe Kiela
Holger Schwenk
Loïc Barrault
Antoine Bordes
AI4TS
SSL
238
2,105
0
05 May 2017
Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads
Ji He
Mari Ostendorf
Xiaodong He
OffRL
LRM
29
10
0
20 Apr 2017
Reading Wikipedia to Answer Open-Domain Questions
Danqi Chen
Adam Fisch
Jason Weston
Antoine Bordes
RALM
121
2,019
0
31 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
129
425
0
20 Mar 2017
Emergence of Grounded Compositional Language in Multi-Agent Populations
Igor Mordatch
Pieter Abbeel
LLMAG
143
704
0
15 Mar 2017