Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.00953
Cited By
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
1 July 2019
Alex X. Lee
Anusha Nagabandi
Pieter Abbeel
Sergey Levine
OffRL
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model"
50 / 84 papers shown
Title
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
58
0
0
17 Mar 2025
Learning Fused State Representations for Control from Multi-View Observations
Zeyu Wang
Yao Li
Xin Li
Hongyu Zang
Romain Laroche
Riashat Islam
OffRL
54
0
0
03 Feb 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
44
0
0
08 Jan 2025
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
Jianda Chen
Wen Zheng Terence Ng
Zichen Chen
Sinno Jialin Pan
Tianwei Zhang
OffRL
37
0
0
09 Nov 2024
FOSP: Fine-tuning Offline Safe Policy through World Models
Chenyang Cao
Yucheng Xin
Silang Wu
Longxiang He
Zichen Yan
Junbo Tan
Xueqian Wang
OffRL
61
0
0
06 Jul 2024
Effective Reinforcement Learning Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
40
0
0
15 Apr 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
22
20
0
17 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
22
10
0
06 Jan 2024
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu
Max Simchowitz
Siri Gadipudi
Abhishek Gupta
46
13
0
31 Aug 2023
Structured World Models from Human Videos
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
49
87
0
21 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
35
28
0
14 Aug 2023
World-Model-Based Control for Industrial box-packing of Multiple Objects using NewtonianVAE
Yusuke Kato
Ryogo Okumura
T. Taniguchi
DRL
27
1
0
04 Aug 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
Latent Interactive A2C for Improved RL in Open Many-Agent Systems
Keyang He
Prashant Doshi
Bikramjit Banerjee
OffRL
28
3
0
09 May 2023
Hierarchical State Abstraction Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Angsheng Li
Chunyang Liu
Lifang He
Philip S. Yu
31
18
0
24 Apr 2023
Filter-Aware Model-Predictive Control
Baris Kayalibay
Atanas Mirchev
Ahmed Agha
Patrick van der Smagt
Justin Bayer
44
0
0
20 Apr 2023
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning
Aravind Venugopal
Stephanie Milani
Fei Fang
Balaraman Ravindran
OffRL
21
0
0
12 Apr 2023
Explicitly Minimizing the Blur Error of Variational Autoencoders
G. Bredell
Kyriakos Flouris
K. Chaitanya
Ertunc Erdil
E. Konukoglu
26
22
0
12 Apr 2023
Fast exploration and learning of latent graphs with aliased observations
Miguel Lazaro-Gredilla
Ishani Deshpande
Siva K. Swaminathan
Meet Dave
Dileep George
28
3
0
13 Mar 2023
Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation
Wenyan Yang
A. Angleraud
R. Pieters
Joni Pajarinen
Joni-Kristian Kämäräinen
37
6
0
05 Mar 2023
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning
Archit Sharma
Ahmed M. Ahmed
Rehaan Ahmad
Chelsea Finn
SSL
59
17
0
02 Mar 2023
Offline Learning of Closed-Loop Deep Brain Stimulation Controllers for Parkinson Disease Treatment
Qitong Gao
Stephen L. Schimdt
Afsana Chowdhury
Guangyu Feng
Jennifer J. Peters
Katherine Genty
W. Grill
Dennis A. Turner
Miroslav Pajic
OffRL
33
11
0
05 Feb 2023
Visual Imitation Learning with Patch Rewards
Minghuan Liu
Tairan He
Weinan Zhang
Shuicheng Yan
Zhongwen Xu
SSL
22
13
0
02 Feb 2023
Variational Latent Branching Model for Off-Policy Evaluation
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
36
6
0
28 Jan 2023
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning
Zhao Mandi
Homanga Bharadhwaj
Vincent Moens
Shuran Song
Aravind Rajeswaran
Vikash Kumar
LM&Ro
28
70
0
12 Dec 2022
PRISM: Probabilistic Real-Time Inference in Spatial World Models
Atanas Mirchev
Baris Kayalibay
Ahmed Agha
Patrick van der Smagt
Daniel Cremers
Justin Bayer
VGen
31
0
0
06 Dec 2022
Tackling Visual Control via Multi-View Exploration Maximization
Mingqi Yuan
Xin Jin
Bo Li
Wenjun Zeng
30
1
0
28 Nov 2022
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
38
4
0
12 Nov 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
31
21
0
18 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
43
27
0
10 Oct 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
16
10
0
02 Oct 2022
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning
Daesol Cho
D. Shim
H. J. Kim
OffRL
42
11
0
30 Sep 2022
Learning Parsimonious Dynamics for Generalization in Reinforcement Learning
Tankred Saanum
Eric Schulz
26
1
0
29 Sep 2022
Sampling Through the Lens of Sequential Decision Making
J. Dou
Alvin Pan
Runxue Bao
Haiyi Mao
Lei Luo
Zhi-Hong Mao
26
19
0
17 Aug 2022
Sparse Representation Learning with Modified q-VAE towards Minimal Realization of World Model
Taisuke Kobayashi
Ryoma Watanuki
DRL
29
6
0
08 Aug 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
32
36
0
03 Jul 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
30
31
0
10 Jun 2022
Flow-based Recurrent Belief State Learning for POMDPs
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
45
18
0
23 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
32
12
0
02 May 2022
Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining
Qihang Zhang
Zhenghao Peng
Bolei Zhou
SSL
30
38
0
05 Apr 2022
Multi-View Dreaming: Multi-View World Model with Contrastive Learning
Akira Kinose
Masashi Okada
Ryogo Okumura
T. Taniguchi
OffRL
21
10
0
15 Mar 2022
MIRROR: Differentiable Deep Social Projection for Assistive Human-Robot Communication
Kaiqi Chen
J. Fong
Harold Soh
21
10
0
06 Mar 2022
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Aivar Sootla
Alexander I. Cowen-Rivers
Taher Jafferjee
Ziyan Wang
D. Mguni
Jun Wang
Haitham Bou-Ammar
32
54
0
14 Feb 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
44
0
28 Jan 2022
Accelerating Representation Learning with View-Consistent Dynamics in Data-Efficient Reinforcement Learning
Tao Huang
Jiacheng Wang
Xiao Chen
34
4
0
18 Jan 2022
Linear Variational State-Space Filtering
Daniel Pfrommer
Nikolai Matni
30
1
0
04 Jan 2022
Transfer RL across Observation Feature Spaces via Model-Based Regularization
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Andrew Cohen
Furong Huang
OOD
OffRL
22
21
0
01 Jan 2022
Stochastic Actor-Executor-Critic for Image-to-Image Translation
Ziwei Luo
Jing Hu
Xin Wang
Siwei Lyu
Bin Kong
Youbing Yin
Qi Song
Xi Wu
BDL
EGVM
30
5
0
14 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
1
2
Next