Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01853
Cited By
Episodic Policy Gradient Training
3 December 2021
Hung Le
Majid Abdolshah
Thommen George Karimpanal
Kien Do
D. Nguyen
Svetha Venkatesh
BDL
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Episodic Policy Gradient Training"
3 / 3 papers shown
Title
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
Lei Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
26
14
0
27 Jan 2023
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits
Jack Parker-Holder
Vu Nguyen
Stephen J. Roberts
OffRL
75
83
0
06 Feb 2020
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
301
1,616
0
18 Sep 2019
1