Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.13424
Cited By
Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories
26 April 2023
Li-Cheng Lan
Huan Zhang
Cho-Jui Hsieh
OODD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories"
11 / 11 papers shown
Title
Exploring Expert Failures Improves LLM Agent Tuning
Li-Cheng Lan
Andrew Bai
Minhao Cheng
Ruochen Wang
Cho-Jui Hsieh
LRM
183
0
0
17 Apr 2025
Imagining In-distribution States: How Predictable Robot Behavior Can Enable User Control Over Learned Policies
Isaac S. Sheidlower
Emma Bethel
Douglas Lilly
Reuben M. Aronson
E. Short
34
0
0
19 Jun 2024
FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
Yuwei Fu
Haichao Zhang
Di Wu
Wei-ping Xu
Benoit Boulet
VLM
24
12
0
02 Jun 2024
RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model
Junyi Fan
Yuxuan Han
Jialin Zeng
Jian-Feng Cai
Yang Wang
Yang Xiang
Jiheng Zhang
40
1
0
18 Mar 2024
Modifying RL Policies with Imagined Actions: How Predictable Policies Can Enable Users to Perform Novel Tasks
Isaac S. Sheidlower
Reuben M. Aronson
E. Short
OffRL
63
1
0
10 Dec 2023
Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Miguel Suau
M. Spaan
F. Oliehoek
CML
24
4
0
04 Jun 2023
Sample Efficient Deep Reinforcement Learning via Local Planning
Dong Yin
S. Thiagarajan
N. Lazić
Nived Rajaraman
Botao Hao
Csaba Szepesvári
25
4
0
29 Jan 2023
Adaptable Agent Populations via a Generative Model of Policies
Kenneth Derek
Phillip Isola
62
16
0
15 Jul 2021
Robust Reinforcement Learning on State Observations with Learned Optimal Adversary
Huan Zhang
Hongge Chen
Duane S. Boning
Cho-Jui Hsieh
67
162
0
21 Jan 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
Inaam Ilahi
Muhammad Usama
Junaid Qadir
M. Janjua
Ala I. Al-Fuqaha
D. Hoang
Dusit Niyato
AAML
59
132
0
27 Jan 2020
1