ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.10259
  4. Cited By
Active Policy Improvement from Multiple Black-box Oracles

Active Policy Improvement from Multiple Black-box Oracles

17 June 2023
Xuefeng Liu
Takuma Yoneda
Chaoqi Wang
Matthew R. Walter
Yuxin Chen
ArXivPDFHTML

Papers citing "Active Policy Improvement from Multiple Black-box Oracles"

8 / 8 papers shown
Title
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization
Xuefeng Liu
Songhao Jiang
Siyu Chen
Zhuoran Yang
Yuxin Chen
Ian Foster
Rick L. Stevens
LM&MA
OffRL
58
0
0
11 Feb 2025
Entropy-Reinforced Planning with Large Language Models for Drug Discovery
Entropy-Reinforced Planning with Large Language Models for Drug Discovery
Xuefeng Liu
Chih-chan Tien
Peng Ding
Songhao Jiang
Rick L. Stevens
45
4
0
11 Jun 2024
Model Ensembling for Constrained Optimization
Model Ensembling for Constrained Optimization
Ira Globus-Harris
Varun Gupta
Michael Kearns
Aaron Roth
38
0
0
27 May 2024
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Marcel Hussing
Michael Kearns
Aaron Roth
S. B. Sengupta
Jessica Sorrell
43
0
0
27 May 2024
Inverse Reinforcement Learning with Sub-optimal Experts
Inverse Reinforcement Learning with Sub-optimal Experts
Riccardo Poiani
Gabriele Curti
Alberto Maria Metelli
Marcello Restelli
26
1
0
08 Jan 2024
Blending Imitation and Reinforcement Learning for Robust Policy
  Improvement
Blending Imitation and Reinforcement Learning for Robust Policy Improvement
Xuefeng Liu
Takuma Yoneda
Rick L. Stevens
Matthew R. Walter
Yuxin Chen
36
10
0
03 Oct 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and
  Challenges
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare
P. Kebria
Abbas Khosravi
Saeid Nahavandi
24
81
0
05 Sep 2023
Multi-expert learning of adaptive legged locomotion
Multi-expert learning of adaptive legged locomotion
Chuanyu Yang
Kai Yuan
Qiuguo Zhu
Wanming Yu
Zhibin Li
111
184
0
10 Dec 2020
1