ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.01606
  4. Cited By
Learning to Play in a Day: Faster Deep Reinforcement Learning by
  Optimality Tightening

Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening

5 November 2016
Frank S. He
Yang Liu
Alex Schwing
Jian-wei Peng
ArXivPDFHTML

Papers citing "Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening"

19 / 19 papers shown
Title
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
27
0
0
02 Nov 2022
MAN: Multi-Action Networks Learning
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
21
3
0
19 Sep 2022
Constrained unsupervised anomaly segmentation
Constrained unsupervised anomaly segmentation
Julio Silva-Rodríguez
Valery Naranjo
Jose Dolz
20
23
0
03 Mar 2022
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
30
19
0
03 Nov 2021
Looking at the whole picture: constrained unsupervised anomaly
  segmentation
Looking at the whole picture: constrained unsupervised anomaly segmentation
Julio Silva-Rodríguez
Valery Naranjo
Jose Dolz
29
5
0
01 Sep 2021
Human-Level Reinforcement Learning through Theory-Based Modeling,
  Exploration, and Planning
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
LM&Ro
24
44
0
27 Jul 2021
Regularized Softmax Deep Multi-Agent $Q$-Learning
Regularized Softmax Deep Multi-Agent QQQ-Learning
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
42
31
0
22 Mar 2021
Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum
  Sharing for 5G and Beyond
Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum Sharing for 5G and Beyond
Hao-Hsuan Chang
Lingjia Liu
Yuhao Yi
8
46
0
12 Oct 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
An Inductive Bias for Distances: Neural Nets that Respect the Triangle
  Inequality
An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality
Silviu Pitis
Harris Chan
Kiarash Jamali
Jimmy Ba
8
24
0
14 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
A Data-Efficient Deep Learning Approach for Deployable Multimodal Social
  Robots
A Data-Efficient Deep Learning Approach for Deployable Multimodal Social Robots
Heriberto Cuayáhuitl
OffRL
32
17
0
27 Aug 2019
Constrained domain adaptation for Image segmentation
Constrained domain adaptation for Image segmentation
M. Bateson
Jose Dolz
H. Kervadec
H. Lombaert
Ismail Ben Ayed
GAN
26
25
0
08 Aug 2019
Sample-Efficient Deep Reinforcement Learning via Episodic Backward
  Update
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
21
73
0
31 May 2018
Episodic Memory Deep Q-Networks
Episodic Memory Deep Q-Networks
Zichuan Lin
Tianqi Zhao
Guangwen Yang
Lintao Zhang
OffRL
24
85
0
19 May 2018
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and
  Request for Research
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Matthias Plappert
Marcin Andrychowicz
Alex Ray
Bob McGrew
Bowen Baker
...
Joshua Tobin
Maciek Chociej
Peter Welinder
Vikash Kumar
Wojciech Zaremba
33
557
0
26 Feb 2018
Neural Episodic Control
Neural Episodic Control
Alexander Pritzel
Benigno Uria
Sriram Srinivasan
A. Badia
Oriol Vinyals
Demis Hassabis
Daan Wierstra
Charles Blundell
OffRL
BDL
35
345
0
06 Mar 2017
Learning What Data to Learn
Learning What Data to Learn
Yang Fan
Fei Tian
Tao Qin
Jiang Bian
Tie-Yan Liu
18
79
0
28 Feb 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,505
0
25 Jan 2017
1