Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.06339
Cited By
Deep Reinforcement Learning
15 October 2018
Yuxi Li
VLM
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning"
50 / 521 papers shown
Title
Cognitive Mapping and Planning for Visual Navigation
Saurabh Gupta
Varun Tolani
James Davidson
Sergey Levine
Rahul Sukthankar
Jitendra Malik
91
715
0
13 Feb 2017
Batch Policy Gradient Methods for Improving Neural Conversation Models
Kirthevasan Kandasamy
Yoram Bachrach
Ryota Tomioka
Daniel Tarlow
David Carter
OffRL
56
37
0
10 Feb 2017
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning
Jason D. Williams
Kavosh Asadi
Geoffrey Zweig
OffRL
77
335
0
10 Feb 2017
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Joel Z Leibo
V. Zambaldi
Marc Lanctot
J. Marecki
T. Graepel
78
612
0
10 Feb 2017
Adversarial Attacks on Neural Network Policies
Sandy Huang
Nicolas Papernot
Ian Goodfellow
Yan Duan
Pieter Abbeel
MLAU
AAML
102
839
0
08 Feb 2017
Semi-Supervised QA with Generative Domain-Adaptive Nets
Zhilin Yang
Junjie Hu
Ruslan Salakhutdinov
William W. Cohen
OOD
70
152
0
07 Feb 2017
Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning
Francois Belletti
Daniel Haziza
G. Gomes
Alexandre M. Bayen
48
139
0
30 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
246
1,544
0
25 Jan 2017
Deep Network Guided Proof Search
Sarah M. Loos
G. Irving
Christian Szegedy
C. Kaliszyk
AIMat
84
159
0
24 Jan 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
330
1,900
0
10 Jan 2017
Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Yanzhe Zhang
Mohammad Pezeshki
Philemon Brakel
Saizheng Zhang
Yoshua Bengio
Aaron Courville
68
367
0
10 Jan 2017
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
211
913
0
06 Jan 2017
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
77
291
0
28 Dec 2016
Learning from Simulated and Unsupervised Images through Adversarial Training
A. Shrivastava
Tomas Pfister
Oncel Tuzel
J. Susskind
Wenda Wang
Russ Webb
GAN
107
1,801
0
22 Dec 2016
First-Person Activity Forecasting with Online Inverse Reinforcement Learning
Nicholas Rhinehart
Kris Kitani
EgoV
61
141
0
22 Dec 2016
Self-Correcting Models for Model-Based Reinforcement Learning
Erik Talvitie
LRM
84
94
0
19 Dec 2016
An Alternative Softmax Operator for Reinforcement Learning
Kavosh Asadi
Michael L. Littman
60
10
0
16 Dec 2016
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
Jingwei Zhang
Jost Tobias Springenberg
Joschka Boedecker
Wolfram Burgard
74
295
0
16 Dec 2016
Learning through Dialogue Interactions by Asking Questions
Jiwei Li
Alexander H. Miller
S. Chopra
MarcÁurelio Ranzato
Jason Weston
70
55
0
15 Dec 2016
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
Sergey Zagoruyko
N. Komodakis
147
2,586
0
12 Dec 2016
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Jiasen Lu
Caiming Xiong
Devi Parikh
R. Socher
130
1,456
0
06 Dec 2016
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
374
7,572
0
02 Dec 2016
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
109
1,892
0
02 Dec 2016
Generalizing Skills with Semi-Supervised Reinforcement Learning
Chelsea Finn
Tianhe Yu
Justin Fu
Pieter Abbeel
Sergey Levine
OffRL
SSL
94
69
0
01 Dec 2016
Playing Doom with SLAM-Augmented Deep Reinforcement Learning
Shehroze Bhatti
Alban Desmaison
O. Mikšík
Nantas Nardelli
N. Siddharth
Philip Torr
OffRL
80
69
0
01 Dec 2016
Improved Image Captioning via Policy Gradient optimization of SPIDEr
Siqi Liu
Zhenhai Zhu
Ning Ye
S. Guadarrama
Kevin Patrick Murphy
164
446
0
01 Dec 2016
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
545
1,412
0
01 Dec 2016
Neural Combinatorial Optimization with Reinforcement Learning
Irwan Bello
Hieu H. Pham
Quoc V. Le
Mohammad Norouzi
Samy Bengio
158
1,494
0
29 Nov 2016
Dialogue Learning With Human-In-The-Loop
Jiwei Li
Alexander H. Miller
S. Chopra
MarcÁurelio Ranzato
Jason Weston
OffRL
288
134
0
29 Nov 2016
Quantum Machine Learning
Jacob Biamonte
P. Wittek
Nicola Pancotti
Patrick Rebentrost
N. Wiebe
S. Lloyd
66
2,024
0
28 Nov 2016
Improving Policy Gradient by Exploring Under-appreciated Rewards
Ofir Nachum
Mohammad Norouzi
Dale Schuurmans
80
44
0
28 Nov 2016
Learning to Compose Words into Sentences with Reinforcement Learning
Dani Yogatama
Phil Blunsom
Chris Dyer
Edward Grefenstette
Wang Ling
NAI
65
159
0
28 Nov 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
75
259
0
18 Nov 2016
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
97
983
0
17 Nov 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
111
1,229
0
16 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
111
775
0
15 Nov 2016
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
Melvin Johnson
M. Schuster
Quoc V. Le
M. Krikun
Yonghui Wu
...
F. Viégas
Martin Wattenberg
Gregory S. Corrado
Macduff Hughes
Jeffrey Dean
129
2,096
0
14 Nov 2016
Least Squares Generative Adversarial Networks
Xudong Mao
Qing Li
Haoran Xie
Raymond Y. K. Lau
Zhen Wang
Stephen Paul Smolley
GAN
340
4,579
0
13 Nov 2016
Learning to Navigate in Complex Environments
Piotr Wojciech Mirowski
Razvan Pascanu
Fabio Viola
Hubert Soyer
Andy Ballard
...
Ross Goroshin
Laurent Sifre
Koray Kavukcuoglu
D. Kumaran
R. Hadsell
107
880
0
11 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
105
173
0
09 Nov 2016
RL
2
^2
2
: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
102
1,028
0
09 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
88
345
0
07 Nov 2016
Designing Neural Network Architectures using Reinforcement Learning
Bowen Baker
O. Gupta
Nikhil Naik
Ramesh Raskar
132
1,472
0
07 Nov 2016
DeepCoder: Learning to Write Programs
Matej Balog
Alexander L. Gaunt
Marc Brockschmidt
Sebastian Nowozin
Daniel Tarlow
AIMat
NAI
103
575
0
07 Nov 2016
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
Oron Anschel
Nir Baram
N. Shimkin
87
317
0
07 Nov 2016
Neuro-Symbolic Program Synthesis
Emilio Parisotto
Abdel-rahman Mohamed
Rishabh Singh
Lihong Li
Dengyong Zhou
Pushmeet Kohli
NAI
111
322
0
06 Nov 2016
Learning to Perform Physics Experiments via Deep Reinforcement Learning
Misha Denil
Pulkit Agrawal
Tejas D. Kulkarni
Tom Erez
Peter W. Battaglia
Nando de Freitas
AI4CE
95
333
0
06 Nov 2016
Hierarchical Question Answering for Long Documents
Eunsol Choi
D. Hewlett
Alexandre Lacoste
Illia Polosukhin
Jakob Uszkoreit
Jonathan Berant
RALM
83
168
0
06 Nov 2016
Modular Multitask Reinforcement Learning with Policy Sketches
Jacob Andreas
Dan Klein
Sergey Levine
OffRL
171
463
0
06 Nov 2016
Learning to Act by Predicting the Future
Alexey Dosovitskiy
V. Koltun
154
281
0
06 Nov 2016
Previous
1
2
3
...
10
11
6
7
8
9
Next