Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.03497
Cited By
Value Prediction Network
11 July 2017
Junhyuk Oh
Satinder Singh
Honglak Lee
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Value Prediction Network"
50 / 66 papers shown
Title
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
Linfeng Zhao
Lawson L. S. Wong
82
1
0
16 Dec 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
114
2
0
23 Oct 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
39
1
0
27 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
40
3
0
20 May 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
22
20
0
17 Jan 2024
Pixel State Value Network for Combined Prediction and Planning in Interactive Environments
Sascha Rosbach
Stefan M. Leupold
S. Großjohann
Stefan Roth
27
0
0
11 Oct 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
30
12
0
15 Jun 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
34
8
0
05 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núnez-Molina
Pablo Mesejo
Juan Fernández-Olivares
30
3
0
20 Apr 2023
Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration
Chentian Jiang
Nan Rosemary Ke
Hado van Hasselt
16
3
0
08 Feb 2023
Continuous Neural Algorithmic Planners
Yu He
Petar Velivcković
Pietro Lio
Andreea Deac
29
5
0
29 Nov 2022
Operator Splitting Value Iteration
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
33
7
0
25 Nov 2022
Reward-Predictive Clustering
Lucas Lehnert
M. Frank
Michael L. Littman
OffRL
19
0
0
07 Nov 2022
Disentangled (Un)Controllable Features
Jacob E. Kooi
Mark Hoogendoorn
Vincent François-Lavet
DRL
24
0
0
31 Oct 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
28
4
0
30 Oct 2022
Spectral Decomposition Representation for Reinforcement Learning
Tongzheng Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
40
27
0
19 Aug 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
50
101
0
19 Jun 2022
Integrating Symmetry into Differentiable Planning with Steerable Convolutions
Linfeng Zhao
Xu Zhu
Lingzhi Kong
Robin G. Walters
Lawson L. S. Wong
20
7
0
08 Jun 2022
Goal-Space Planning with Subgoal Models
Chun-Ping Lo
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Gábor Mihucz
Farzane Aminmansour
Martha White
24
5
0
06 Jun 2022
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning
Dilip Arumugam
Benjamin Van Roy
OffRL
38
1
0
04 Jun 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
632
0
20 May 2022
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Dapeng Li
Bin Zhang
Yuan Zhan
Yunru Bai
Guoliang Fan
OffRL
27
6
0
20 Apr 2022
SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning
A. Chester
Michael Dann
Fabio Zambetta
John Thangarajah
11
0
0
09 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei-ping Xu
Haonan Yu
38
10
0
24 Jan 2022
Trajectory-Constrained Deep Latent Visual Attention for Improved Local Planning in Presence of Heterogeneous Terrain
Stefan Wapnick
Travis Manderson
D. Meger
Gregory Dudek
31
5
0
09 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao
Yifu Yuan
Cong Wang
Zhen Wang
OffRL
16
1
0
06 Dec 2021
Visual Goal-Directed Meta-Learning with Contextual Planning Networks
Corban G. Rivera
D. Handelman
39
0
0
18 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng-Tao Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
32
41
0
04 Nov 2021
Self-Consistent Models and Values
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
Neural Algorithmic Reasoners are Implicit Planners
Andreea Deac
Petar Velivcković
Ognjen Milinković
Pierre-Luc Bacon
Jian Tang
Mladen Nikolic
OffRL
32
23
0
11 Oct 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Erik Cambria
OffRL
47
73
0
01 Jan 2021
Planning from Pixels in Atari with Learned Symbolic Representations
Andrea Dittadi
Frederik K. Drachmann
Thomas Bolander
26
11
0
16 Dec 2020
DeepKoCo: Efficient latent planning with a task-relevant Koopman representation
B. V. D. Heijden
L. Ferranti
Jens Kober
Robert Babuška
10
6
0
25 Nov 2020
Deep Affordance Foresight: Planning Through What Can Be Done in the Future
Danfei Xu
Ajay Mandlekar
Roberto Martín-Martín
Yuke Zhu
Silvio Savarese
Li Fei-Fei
33
70
0
17 Nov 2020
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Michael Janner
Igor Mordatch
Sergey Levine
AI4CE
15
34
0
27 Oct 2020
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
13
25
0
26 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
48
810
0
05 Oct 2020
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
30
43
0
28 Sep 2020
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
24
73
0
24 Jul 2020
State Action Separable Reinforcement Learning
Ziyao Zhang
Liang Ma
K. Leung
Konstantinos Poularakis
M. Srivatsa
31
2
0
05 Jun 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
43
139
0
30 Apr 2020
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen
Lukás Jendele
Emre Aksan
Otmar Hilliges
OffRL
24
25
0
14 Feb 2020
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
24
32
0
07 Feb 2020
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
35
34
0
23 Dec 2019
Combining Q-Learning and Search with Amortized Value Estimates
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
Tobias Pfaff
T. Weber
Lars Buesing
Peter W. Battaglia
OffRL
27
47
0
05 Dec 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
16
31
0
01 Nov 2019
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Arunkumar Byravan
Jost Tobias Springenberg
A. Abdolmaleki
Roland Hafner
Michael Neunert
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
11
41
0
09 Oct 2019
Gradient-Aware Model-based Policy Search
P. DÓro
Alberto Maria Metelli
Andrea Tirinzoni
Matteo Papini
Marcello Restelli
21
34
0
09 Sep 2019
1
2
Next