ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.01178
  4. Cited By

Differentiable Information Enhanced Model-Based Reinforcement Learning

3 March 2025
Xiaoyuan Zhang
Xinyan Cai
Bo Liu
Weidong Huang
Song-Chun Zhu
Siyuan Qi
Y. Yang
ArXiv (abs)PDFHTML

Papers citing "Differentiable Information Enhanced Model-Based Reinforcement Learning"

14 / 14 papers shown
Title
Transformer-based World Models Are Happy With 100k Interactions
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
82
88
0
13 Mar 2023
DiffPD: Differentiable Projective Dynamics
DiffPD: Differentiable Projective Dynamics
Tao Du
Kui Wu
Pingchuan Ma
Sebastien Wah
Andrew Spielberg
Daniela Rus
Wojciech Matusik
67
104
0
15 Jan 2021
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
117
869
0
05 Oct 2020
On the model-based stochastic value gradient for continuous
  reinforcement learning
On the model-based stochastic value gradient for continuous reinforcement learning
Brandon Amos
Samuel Stanton
Denis Yarats
A. Wilson
67
71
0
28 Aug 2020
Learning Off-Policy with Online Planning
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
115
49
0
23 Aug 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale
  Empirical Study
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
Matthieu Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
70
224
0
10 Jun 2020
MOPO: Model-based Offline Policy Optimization
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
76
773
0
27 May 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
75
88
0
16 May 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator
  Policy Optimization
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P. DÓro
Wojciech Ja'skowski
OffRL
60
27
0
29 Apr 2020
Dream to Control: Learning Behaviors by Latent Imagination
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
126
1,371
0
03 Dec 2019
When to Trust Your Model: Model-Based Policy Optimization
When to Trust Your Model: Model-Based Policy Optimization
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
102
956
0
19 Jun 2019
Learning Latent Dynamics for Planning from Pixels
Learning Latent Dynamics for Planning from Pixels
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
88
1,446
0
12 Nov 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,406
0
04 Jan 2018
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
532
19,265
0
20 Jul 2017
1