ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.07696
  4. Cited By
Towards A Unified Policy Abstraction Theory and Representation Learning
  Approach in Markov Decision Processes

Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes

16 September 2022
Hao Fei
Hongyao Tang
Jianye Hao
Yan Zheng
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes"

22 / 22 papers shown
Title
PAnDR: Fast Adaptation to New Environments from Offline Experiences via
  Decoupling Policy and Environment Representations
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations
Tong Sang
Hongyao Tang
Yi-An Ma
Jianye Hao
Yan Zheng
Zhaopeng Meng
Boyan Li
Zhen Wang
OffRL
65
5
0
06 Apr 2022
Reward-Free Policy Space Compression for Reinforcement Learning
Reward-Free Policy Space Compression for Reinforcement Learning
Mirco Mutti
Stefano Del Col
Marcello Restelli
36
3
0
22 Feb 2022
VICReg: Variance-Invariance-Covariance Regularization for
  Self-Supervised Learning
VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning
Adrien Bardes
Jean Ponce
Yann LeCun
SSLDML
153
945
0
11 May 2021
Benchmarks for Deep Off-Policy Evaluation
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELMOffRL
82
104
0
30 Mar 2021
Structured Policy Representation: Imposing Stability in arbitrarily
  conditioned dynamic systems
Structured Policy Representation: Imposing Stability in arbitrarily conditioned dynamic systems
Julen Urain
Davide Tateo
Tianyu Ren
Jan Peters
20
1
0
11 Dec 2020
General Characterization of Agents by States they Visit
General Characterization of Agents by States they Visit
Anssi Kanervisto
Tomi Kinnunen
Ville Hautamaki
51
3
0
02 Dec 2020
What About Inputing Policy in Value Function: Policy Representation and
  Policy-extended Value Function Approximator
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chong Chen
D. Graves
...
Hangyu Mao
Wulong Liu
Yaodong Yang
Wenyuan Tao
Li Wang
OffRL
71
7
0
19 Oct 2020
Learning Invariant Representations for Reinforcement Learning without
  Reconstruction
Learning Invariant Representations for Reinforcement Learning without Reconstruction
Amy Zhang
R. McAllister
Roberto Calandra
Y. Gal
Sergey Levine
OODSSL
126
479
0
18 Jun 2020
Parameter-Based Value Functions
Parameter-Based Value Functions
Francesco Faccio
Louis Kirsch
Jürgen Schmidhuber
OffRL
80
26
0
16 Jun 2020
Policy Evaluation Networks
Policy Evaluation Networks
J. Harb
Tom Schaul
Doina Precup
Pierre-Luc Bacon
OffRL
52
37
0
26 Feb 2020
AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human
  Videos
AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos
Laura M. Smith
Nikita Dhawan
Marvin Zhang
Pieter Abbeel
Sergey Levine
140
160
0
10 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
140
1,374
0
03 Dec 2019
Learning to Score Behaviors for Guided Policy Optimization
Learning to Score Behaviors for Guided Policy Optimization
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
A. Choromańska
K. Choromanski
Michael I. Jordan
71
39
0
11 Jun 2019
MolecularRNN: Generating realistic molecular graphs with optimized
  properties
MolecularRNN: Generating realistic molecular graphs with optimized properties
Mariya Popova
Mykhailo Shvets
Junier Oliva
Olexandr Isayev
GNN
77
167
0
31 May 2019
Representation Learning with Contrastive Predictive Coding
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRLSSL
356
10,369
0
10 Jul 2018
Learning Policy Representations in Multiagent Systems
Learning Policy Representations in Multiagent Systems
Aditya Grover
Maruan Al-Shedivat
Jayesh K. Gupta
Yuri Burda
Harrison Edwards
AI4CE
73
125
0
17 Jun 2018
Graph Convolutional Policy Network for Goal-Directed Molecular Graph
  Generation
Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation
Jiaxuan You
Bowen Liu
Rex Ying
Vijay S. Pande
J. Leskovec
GNN
299
905
0
07 Jun 2018
Structured Evolution with Compact Architectures for Scalable Policy
  Optimization
Structured Evolution with Compact Architectures for Scalable Policy Optimization
K. Choromanski
Mark Rowland
Vikas Sindhwani
Richard Turner
Adrian Weller
98
149
0
06 Apr 2018
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
580
19,315
0
20 Jul 2017
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
165
3,125
0
10 Jun 2016
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRLODL
225
5,087
0
05 Jun 2016
Trust Region Policy Optimization
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
283
6,807
0
19 Feb 2015
1