Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.10259
Cited By
Active Policy Improvement from Multiple Black-box Oracles
17 June 2023
Xuefeng Liu
Takuma Yoneda
Chaoqi Wang
Matthew R. Walter
Yuxin Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Active Policy Improvement from Multiple Black-box Oracles"
8 / 8 papers shown
Title
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization
Xuefeng Liu
Songhao Jiang
Siyu Chen
Zhuoran Yang
Yuxin Chen
Ian Foster
Rick L. Stevens
LM&MA
OffRL
58
0
0
11 Feb 2025
Entropy-Reinforced Planning with Large Language Models for Drug Discovery
Xuefeng Liu
Chih-chan Tien
Peng Ding
Songhao Jiang
Rick L. Stevens
45
4
0
11 Jun 2024
Model Ensembling for Constrained Optimization
Ira Globus-Harris
Varun Gupta
Michael Kearns
Aaron Roth
38
0
0
27 May 2024
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Marcel Hussing
Michael Kearns
Aaron Roth
S. B. Sengupta
Jessica Sorrell
43
0
0
27 May 2024
Inverse Reinforcement Learning with Sub-optimal Experts
Riccardo Poiani
Gabriele Curti
Alberto Maria Metelli
Marcello Restelli
26
1
0
08 Jan 2024
Blending Imitation and Reinforcement Learning for Robust Policy Improvement
Xuefeng Liu
Takuma Yoneda
Rick L. Stevens
Matthew R. Walter
Yuxin Chen
39
10
0
03 Oct 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare
P. Kebria
Abbas Khosravi
Saeid Nahavandi
24
81
0
05 Sep 2023
Multi-expert learning of adaptive legged locomotion
Chuanyu Yang
Kai Yuan
Qiuguo Zhu
Wanming Yu
Zhibin Li
111
184
0
10 Dec 2020
1