ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.05891
  4. Cited By
PID-Inspired Inductive Biases for Deep Reinforcement Learning in
  Partially Observable Control Tasks
v1v2 (latest)

PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks

12 July 2023
I. Char
J. Schneider
ArXiv (abs)PDFHTML

Papers citing "PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks"

37 / 37 papers shown
Title
Model-free reinforcement learning with noisy actions for automated experimental control in optics
Model-free reinforcement learning with noisy actions for automated experimental control in optics
Lea Richtmann
Viktoria-S. Schmiesing
Dennis Wilken
Jan Heine
Aaron Tranter
Avishek Anand
Tobias J. Osborne
M. Heurs
85
2
0
24 May 2024
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAGMLLM
1.5K
14,761
0
15 Mar 2023
A Survey on Transformers in Reinforcement Learning
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRLMUAI4CE
97
58
0
08 Jan 2023
Legged Locomotion in Challenging Terrains using Egocentric Vision
Legged Locomotion in Challenging Terrains using Egocentric Vision
Ananye Agarwal
Ashish Kumar
Jitendra Malik
Deepak Pathak
80
216
0
14 Nov 2022
Exploration via Planning for Information about the Optimal Trajectory
Exploration via Planning for Information about the Optimal Trajectory
Viraj Mehta
I. Char
J. Abbate
R. Conlin
M. Boyer
Stefano Ermon
J. Schneider
Willie Neiswanger
OffRL
72
6
0
06 Oct 2022
Transformers are Meta-Reinforcement Learners
Transformers are Meta-Reinforcement Learners
Luckeciano C. Melo
OffRL
77
50
0
14 Jun 2022
An Experimental Design Perspective on Model-Based Reinforcement Learning
An Experimental Design Perspective on Model-Based Reinforcement Learning
Viraj Mehta
Biswajit Paria
J. Schneider
Stefano Ermon
Willie Neiswanger
OffRL
65
21
0
09 Dec 2021
Recurrent Off-policy Baselines for Memory-based Continuous Control
Recurrent Off-policy Baselines for Memory-based Continuous Control
Zhihan Yang
Hai V. Nguyen
CLLOffRL
72
24
0
25 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
69
110
0
11 Oct 2021
Robust Predictable Control
Robust Predictable Control
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
62
45
0
07 Sep 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
106
190
0
27 Jul 2021
Estimating Disentangled Belief about Hidden State and Hidden Task for
  Meta-RL
Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL
K. Akuzawa
Yusuke Iwasawa
Y. Matsuo
60
4
0
14 May 2021
Memory-based Deep Reinforcement Learning for POMDPs
Memory-based Deep Reinforcement Learning for POMDPs
Lingheng Meng
R. Gorbet
Dana Kulic
89
99
0
24 Feb 2021
Dynamics Generalization via Information Bottleneck in Deep Reinforcement
  Learning
Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning
Xingyu Lu
Kimin Lee
Pieter Abbeel
Stas Tiomkin
DRLAI4CE
54
34
0
03 Aug 2020
Reinforcement Learning based Design of Linear Fixed Structure
  Controllers
Reinforcement Learning based Design of Linear Fixed Structure Controllers
Nathan P. Lawrence
G. Stewart
Philip D. Loewen
M. Forbes
Johan U. Backstrom
R. Bhushan Gopaluni
48
7
0
10 May 2020
Variational Recurrent Models for Solving Partially Observable Control
  Tasks
Variational Recurrent Models for Solving Partially Observable Control Tasks
Dongqi Han
Kenji Doya
Jun Tani
DRLOffRL
62
63
0
23 Dec 2019
Generalization in Reinforcement Learning with Selective Noise Injection
  and Information Bottleneck
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck
Maximilian Igl
K. Ciosek
Yingzhen Li
Sebastian Tschiatschek
Cheng Zhang
Sam Devlin
Katja Hofmann
OffRL
73
173
0
28 Oct 2019
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
L. Zintgraf
K. Shiarlis
Maximilian Igl
Sebastian Schulze
Y. Gal
Katja Hofmann
Shimon Whiteson
OffRL
70
279
0
18 Oct 2019
Stabilizing Transformers for Reinforcement Learning
Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
96
367
0
13 Oct 2019
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic
  Context Variables
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
OffRL
83
661
0
19 Mar 2019
Assessing Generalization in Deep Reinforcement Learning
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
Basel Alomair
OffRL
122
238
0
29 Oct 2018
Deep Variational Reinforcement Learning for POMDPs
Deep Variational Reinforcement Learning for POMDPs
Maximilian Igl
L. Zintgraf
T. Le
Frank Wood
Shimon Whiteson
BDLOffRL
71
262
0
06 Jun 2018
Addressing Function Approximation Error in Actor-Critic Methods
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
195
5,218
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,420
0
04 Jan 2018
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
550
19,296
0
20 Jul 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
805
132,725
0
12 Jun 2017
The Statistical Recurrent Unit
The Statistical Recurrent Unit
Junier B. Oliva
Barnabás Póczós
J. Schneider
64
50
0
01 Mar 2017
Preparing for the Unknown: Learning a Universal Policy with Online
  System Identification
Preparing for the Unknown: Learning a Universal Policy with Online System Identification
Wenhao Yu
Jie Tan
Chenxi Liu
Greg Turk
OffRL
103
309
0
08 Feb 2017
Deep Variational Information Bottleneck
Deep Variational Information Bottleneck
Alexander A. Alemi
Ian S. Fischer
Joshua V. Dillon
Kevin Patrick Murphy
130
1,728
0
01 Dec 2016
Learning to reinforcement learn
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
97
983
0
17 Nov 2016
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
RL2^22: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
105
1,028
0
09 Nov 2016
EPOpt: Learning Robust Neural Network Policies Using Model Ensembles
EPOpt: Learning Robust Neural Network Policies Using Model Ensembles
Aravind Rajeswaran
Sarvjeet Ghotra
Balaraman Ravindran
Sergey Levine
205
353
0
05 Oct 2016
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
435
10,541
0
21 Jul 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
210
8,881
0
04 Feb 2016
Memory-based control with recurrent neural networks
Memory-based control with recurrent neural networks
N. Heess
Jonathan J. Hunt
Timothy Lillicrap
David Silver
89
303
0
14 Dec 2015
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
467
43,347
0
11 Feb 2015
Learning Phrase Representations using RNN Encoder-Decoder for
  Statistical Machine Translation
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
1.1K
23,396
0
03 Jun 2014
1