ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.06339
  4. Cited By
Deep Reinforcement Learning

Deep Reinforcement Learning

15 October 2018
Yuxi Li
    VLMOffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning"

50 / 521 papers shown
Title
Learning Language Games through Interaction
Learning Language Games through Interaction
Sida I. Wang
Percy Liang
Christopher D. Manning
59
190
0
08 Jun 2016
Deep Successor Reinforcement Learning
Deep Successor Reinforcement Learning
Tejas D. Kulkarni
A. Saeedi
Simanta Gautam
S. Gershman
72
209
0
08 Jun 2016
Natural Language Comprehension with the EpiReader
Natural Language Comprehension with the EpiReader
Adam Trischler
Zheng Ye
Xingdi Yuan
Kaheer Suleman
68
95
0
07 Jun 2016
Learning to Optimize
Learning to Optimize
Ke Li
Jitendra Malik
63
257
0
06 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
179
1,484
0
06 Jun 2016
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
285
1,339
0
05 Jun 2016
Predicting Personal Traits from Facial Images using Convolutional Neural
  Networks Augmented with Facial Landmark Information
Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information
Yoad Lewenberg
Valliappa Chockalingam
Satinder Singh
Honglak Lee
CVBM
62
305
0
29 May 2016
Model-Free Imitation Learning with Policy Optimization
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho
Jayesh K. Gupta
Stefano Ermon
55
149
0
26 May 2016
Discovering Causal Signals in Images
Discovering Causal Signals in Images
David Lopez-Paz
Robert Nishihara
Soumith Chintala
Bernhard Schölkopf
Léon Bottou
CML
50
226
0
26 May 2016
Learning Multiagent Communication with Backpropagation
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
227
1,150
0
25 May 2016
Learning End-to-End Goal-Oriented Dialog
Learning End-to-End Goal-Oriented Dialog
Antoine Bordes
Y-Lan Boureau
Jason Weston
82
782
0
24 May 2016
On-line Active Reward Learning for Policy Optimisation in Spoken
  Dialogue Systems
On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
OffRL
80
170
0
24 May 2016
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
155
1,614
0
21 May 2016
Learning Representations for Counterfactual Inference
Learning Representations for Counterfactual Inference
Fredrik D. Johansson
Uri Shalit
David Sontag
CMLOODBDL
298
729
0
12 May 2016
The IBM 2016 English Conversational Telephone Speech Recognition System
The IBM 2016 English Conversational Telephone Speech Recognition System
G. Saon
Tom Sercu
Steven J. Rennie
H. Kuo
39
107
0
27 Apr 2016
End to End Learning for Self-Driving Cars
End to End Learning for Self-Driving Cars
Mariusz Bojarski
D. Testa
Daniel Dworakowski
Bernhard Firner
B. Flepp
...
Urs Muller
Jiakai Zhang
Xin Zhang
Jake Zhao
Karol Zieba
SSL
100
4,178
0
25 Apr 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Chen Tessler
Shahar Givony
Tom Zahavy
D. Mankowitz
Shie Mannor
CLL
139
381
0
25 Apr 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
96
1,695
0
22 Apr 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal
  Abstraction and Intrinsic Motivation
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum
76
1,141
0
20 Apr 2016
A Network-based End-to-End Trainable Task-oriented Dialogue System
A Network-based End-to-End Trainable Task-oriented Dialogue System
Tsung-Hsien Wen
David Vandyke
N. Mrksic
Milica Gasic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
S. Young
77
1,109
0
15 Apr 2016
Interactive Perception: Leveraging Action in Perception and Perception
  in Action
Interactive Perception: Leveraging Action in Perception and Perception in Action
Jeannette Bohg
Karol Hausman
Bharathwaj Sankaran
Oliver Brock
Danica Kragic
S. Schaal
Gaurav Sukhatme
143
305
0
13 Apr 2016
The CMA Evolution Strategy: A Tutorial
The CMA Evolution Strategy: A Tutorial
N. Hansen
74
1,378
0
04 Apr 2016
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models
S. M. Ali Eslami
N. Heess
T. Weber
Yuval Tassa
David Szepesvari
Koray Kavukcuoglu
Geoffrey E. Hinton
3DVBDLOCL
131
551
0
28 Mar 2016
Improving Information Extraction by Acquiring External Evidence with
  Reinforcement Learning
Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning
Karthik Narasimhan
Adam Yala
Regina Barzilay
OffRL
84
152
0
25 Mar 2016
Discriminative Embeddings of Latent Variable Models for Structured Data
Discriminative Embeddings of Latent Variable Models for Structured Data
H. Dai
Bo Dai
Le Song
BDL
122
697
0
17 Mar 2016
Text Understanding with the Attention Sum Reader Network
Text Understanding with the Attention Sum Reader Network
Rudolf Kadlec
Martin Schmid
Ondrej Bajgar
Jan Kleindienst
AIMatRALM
68
314
0
04 Mar 2016
Deep Reinforcement Learning from Self-Play in Imperfect-Information
  Games
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
Johannes Heinrich
David Silver
SSL
66
399
0
03 Mar 2016
Continuous Deep Q-Learning with Model-based Acceleration
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
94
1,013
0
02 Mar 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy
  Optimization
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
108
952
0
01 Mar 2016
Investigating practical linear temporal difference learning
Investigating practical linear temporal difference learning
Adam White
Martha White
OffRL
71
41
0
28 Feb 2016
Deep Exploration via Bootstrapped DQN
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
123
1,314
0
15 Feb 2016
Unsupervised Domain Adaptation with Residual Transfer Networks
Unsupervised Domain Adaptation with Residual Transfer Networks
Mingsheng Long
Hanjing Zhu
Jianmin Wang
Michael I. Jordan
OOD
97
1,492
0
14 Feb 2016
Associative Long Short-Term Memory
Associative Long Short-Term Memory
Ivo Danihelka
Greg Wayne
Benigno Uria
Nal Kalchbrenner
Alex Graves
75
179
0
09 Feb 2016
Value Iteration Networks
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
79
656
0
09 Feb 2016
Data-Efficient Reinforcement Learning in Continuous-State POMDPs
Data-Efficient Reinforcement Learning in Continuous-State POMDPs
R. McAllister
C. Rasmussen
67
12
0
08 Feb 2016
End-to-End Goal-Driven Web Navigation
End-to-End Goal-Driven Web Navigation
Rodrigo Nogueira
Kyunghyun Cho
LLMAG
87
35
0
06 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
207
8,881
0
04 Feb 2016
Taming the Noise in Reinforcement Learning via Soft Updates
Taming the Noise in Reinforcement Learning via Soft Updates
Roy Fox
Ari Pakman
Naftali Tishby
86
341
0
28 Dec 2015
A Survey of Available Corpora for Building Data-Driven Dialogue Systems
A Survey of Available Corpora for Building Data-Driven Dialogue Systems
Iulian Serban
Ryan J. Lowe
Peter Henderson
Laurent Charlin
Joelle Pineau
61
342
0
17 Dec 2015
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,510
0
10 Dec 2015
State of the Art Control of Atari Games Using Shallow Reinforcement
  Learning
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang
Marlos C. Machado
Erik Talvitie
Michael Bowling
83
113
0
04 Dec 2015
Natural Language Understanding with Distributed Representation
Natural Language Understanding with Distributed Representation
Kyunghyun Cho
GNNBDL
77
55
0
24 Nov 2015
Sequence Level Training with Recurrent Neural Networks
Sequence Level Training with Recurrent Neural Networks
MarcÁurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
106
1,620
0
20 Nov 2015
Dueling Network Architectures for Deep Reinforcement Learning
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
91
3,769
0
20 Nov 2015
Better Computer Go Player with Neural Network and Long-term Prediction
Better Computer Go Player with Neural Network and Long-term Prediction
Yuandong Tian
Yan Zhu
AI4CE
86
86
0
19 Nov 2015
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
OffRL
99
600
0
19 Nov 2015
Neural Programmer-Interpreters
Neural Programmer-Interpreters
Scott E. Reed
Nando de Freitas
101
411
0
19 Nov 2015
Active Object Localization with Deep Reinforcement Learning
Active Object Localization with Deep Reinforcement Learning
Juan C. Caicedo
Svetlana Lazebnik
ObjD
67
445
0
18 Nov 2015
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
231
3,796
0
18 Nov 2015
Deep Reinforcement Learning with a Natural Language Action Space
Deep Reinforcement Learning with a Natural Language Action Space
Ji He
Jianshu Chen
Xiaodong He
Jianfeng Gao
Lihong Li
Li Deng
Mari Ostendorf
106
246
0
14 Nov 2015
Previous
123...101189
Next