Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.05763
Cited By
v1
v2
v3 (latest)
Learning to reinforcement learn
17 November 2016
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning to reinforcement learn"
50 / 584 papers shown
Title
Distinct Computations Emerge From Compositional Curricula in In-Context Learning
Jin Hwa Lee
Andrew Kyle Lampinen
Aaditya K. Singh
Andrew Saxe
37
0
0
16 Jun 2025
Scaling Algorithm Distillation for Continuous Control with Mamba
Samuel Beaussant
Mehdi Mounsif
30
0
0
16 Jun 2025
Self-Adapting Language Models
Adam Zweiger
Jyothish Pari
Han Guo
Ekin Akyürek
Yoon Kim
Pulkit Agrawal
KELM
LRM
155
0
0
12 Jun 2025
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
Jingjing Jiang
Chongjie Si
Jun Luo
Hanwang Zhang
Chao Ma
198
0
0
23 May 2025
Understanding Prompt Tuning and In-Context Learning via Meta-Learning
Tim Genewein
Kevin Wenliang Li
Jordi Grau-Moya
Anian Ruoss
Laurent Orseau
Marcus Hutter
VPVLM
106
1
0
22 May 2025
Reward Is Enough: LLMs Are In-Context Reinforcement Learners
Kefan Song
Amir Moeini
Peng Wang
Lei Gong
Rohan Chandra
Yanjun Qi
Shangtong Zhang
ReLM
LRM
37
3
0
21 May 2025
Neural Fidelity Calibration for Informative Sim-to-Real Adaptation
Youwei Yu
Lantao Liu
79
1
0
11 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
89
1
0
06 Apr 2025
Meta-Reinforcement Learning with Discrete World Models for Adaptive Load Balancing
Cameron Redovian
115
0
0
11 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
91
0
0
04 Mar 2025
FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users
Anikait Singh
Sheryl Hsu
Kyle Hsu
E. Mitchell
Stefano Ermon
Tatsunori Hashimoto
Archit Sharma
Chelsea Finn
SyDa
OffRL
132
3
0
26 Feb 2025
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Jaehyeon Son
Soochan Lee
Gunhee Kim
OffRL
135
4
0
26 Feb 2025
Provable Benefits of Unsupervised Pre-training and Transfer Learning via Single-Index Models
Taj Jones-McCormick
Aukosh Jagannath
S. Sen
127
0
0
24 Feb 2025
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Songming Liu
Dong Yan
Jun Zhu
227
3
0
17 Feb 2025
Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization
Maxence Faldor
Robert Tjarko Lange
Antoine Cully
164
1
0
04 Feb 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Donglin Zhan
Leonardo F. Toso
James Anderson
229
3
0
04 Feb 2025
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning
Kaixi Bao
Chenhao Li
Yarden As
Andreas Krause
Marco Hutter
OffRL
CLL
275
1
0
03 Feb 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
203
2
0
28 Jan 2025
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Lucía Güitta-López
Jaime Boal
Álvaro J. López-López
117
6
0
24 Jan 2025
TIMRL: A Novel Meta-Reinforcement Learning Framework for Non-Stationary and Multi-Task Environments
Chenyang Qi
Huiping Li
Panfeng Huang
OffRL
89
0
0
13 Jan 2025
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps
Benjamin Ellis
Matthew Jackson
Andrei Lupu
Alexander David Goldie
Mattie Fellows
Shimon Whiteson
Jakob Foerster
152
3
0
22 Dec 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
74
6
0
17 Nov 2024
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
Robby Costales
Stefanos Nikolaidis
AI4CE
86
0
0
07 Nov 2024
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRL
OnRL
100
2
0
06 Nov 2024
Multi-agent cooperation through learning-aware policy gradients
Alexander Meulemans
Seijin Kobayashi
J. Oswald
Nino Scherrer
Eric Elmoznino
Blake A. Richards
Guillaume Lajoie
Blaise Agüera y Arcas
João Sacramento
90
1
0
24 Oct 2024
Neural networks that overcome classic challenges through practice
Kazuki Irie
Brenden M. Lake
97
6
0
14 Oct 2024
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Siyuan Xu
Minghui Zhu
OffRL
74
1
0
13 Oct 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
Thomas Schmied
Fabian Paischer
Vihang Patil
M. Hofmarcher
Razvan Pascanu
Sepp Hochreiter
OffRL
103
7
0
09 Oct 2024
ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI
Ahmad Elawady
Gunjan Chhablani
Ram Ramrakhya
Karmesh Yadav
Dhruv Batra
Z. Kira
Andrew Szot
OffRL
107
0
0
03 Oct 2024
A New First-Order Meta-Learning Algorithm with Convergence Guarantees
El Mahdi Chayti
Martin Jaggi
63
1
0
05 Sep 2024
Automated Design of Agentic Systems
Shengran Hu
Cong Lu
Jeff Clune
AI4CE
150
62
0
15 Aug 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
109
1
0
13 Aug 2024
Black box meta-learning intrinsic rewards for sparse-reward environments
Octavio Pappalardo
Rodrigo Ramele
Juan Miguel Santos
OffRL
90
0
0
31 Jul 2024
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments
Shu Ishida
João F. Henriques
102
0
0
26 Jul 2024
MetaAug: Meta-Data Augmentation for Post-Training Quantization
Cuong Pham
Hoang Anh Dung
Cuong C. Nguyen
Trung Le
Dinh Q. Phung
Gustavo Carneiro
Thanh-Toan Do
MQ
83
0
0
20 Jul 2024
Graceful task adaptation with a bi-hemispheric RL agent
Grant Nicholas
L. Kuhlmann
Gideon Kowadlo
77
0
0
16 Jul 2024
Reinforcement Learning of Adaptive Acquisition Policies for Inverse Problems
Gianluigi Silvestri
F. V. Massoli
Tribhuvanesh Orekondy
Afshin Abdi
Arash Behboodi
72
0
0
10 Jul 2024
Adversaries Can Misuse Combinations of Safe Models
Erik Jones
Anca Dragan
Jacob Steinhardt
78
13
0
20 Jun 2024
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents
Menglong Zhang
Fuyuan Qian
Quanying Liu
94
1
0
18 Jun 2024
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Alexander Nikulin
Ilya Zisman
Alexey Zemtsov
Viacheslav Sinii
209
7
0
13 Jun 2024
Few-Shot Load Forecasting Under Data Scarcity in Smart Grids: A Meta-Learning Approach
Georgios Tsoumplekas
C. Athanasiadis
Dimitrios I. Doukas
Antonios C. Chrysopoulos
P. Mitkas
AI4TS
98
3
0
09 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
260
2
0
07 Jun 2024
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network
Erik Jenner
Shreyas Kapur
Vasil Georgiev
Cameron Allen
Scott Emmons
Stuart J. Russell
115
13
0
02 Jun 2024
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning
Jonathan Cook
Chris Xiaoxuan Lu
Edward Hughes
Joel Z Leibo
Jakob N. Foerster
86
6
0
01 Jun 2024
Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling
Sili Huang
Jifeng Hu
Zhe Yang
Liwei Yang
Yaoyu Zhang
Hechang Chen
Lichao Sun
Bo Yang
Mamba
83
4
0
31 May 2024
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
Sili Huang
Jifeng Hu
Hechang Chen
Lichao Sun
Bo Yang
OffRL
LRM
64
11
0
31 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine Learning
Ming Jin
114
2
0
18 May 2024
CIER: A Novel Experience Replay Approach with Causal Inference in Deep Reinforcement Learning
Jingwen Wang
Dehui Du
Yida Li
Yiyang Li
Yikang Chen
AI4TS
CML
37
0
0
14 May 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
106
9
0
22 Apr 2024
Self-adaptive PSRO: Towards an Automatic Population-based Game Solver
Pengdeng Li
Shuxin Li
Chang Yang
Xinrun Wang
Xiao Huang
Hau Chan
Bo An
79
1
0
17 Apr 2024
1
2
3
4
...
10
11
12
Next