Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.05763
Cited By
v1
v2
v3 (latest)
Learning to reinforcement learn
17 November 2016
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning to reinforcement learn"
50 / 584 papers shown
Title
Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning
Tidiane Camaret Ndir
André Biedenkapp
Noor H. Awad
OffRL
82
1
0
15 Apr 2024
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
88
7
0
28 Mar 2024
DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation
Mu-Yi Shen
Chia-Chi Hsu
Hao-Yu Hou
Yu-Chen Huang
Wei-Fang Sun
Chia-Che Chang
Yu-Lun Liu
Chun-Yi Lee
116
3
0
23 Mar 2024
Towards Understanding the Relationship between In-context Learning and Compositional Generalization
Sungjun Han
Sebastian Padó
CoGe
72
2
0
18 Mar 2024
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
Sai Prasanna
Karim Farid
Raghu Rajan
André Biedenkapp
123
6
0
16 Mar 2024
MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning
Zohar Rimon
Tom Jurgenson
Orr Krupnik
Gilad Adler
Aviv Tamar
69
10
0
14 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
95
11
0
14 Mar 2024
In-context Exploration-Exploitation for Reinforcement Learning
Zhenwen Dai
Federico Tomasi
Sina Ghiassian
OffRL
OnRL
102
6
0
11 Mar 2024
SplAgger: Split Aggregation for Meta-Reinforcement Learning
Jacob Beck
Matthew Jackson
Risto Vuorio
Zheng Xiong
Shimon Whiteson
OffRL
103
2
0
05 Mar 2024
Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Yuan Lin
Antai Xie
Xiao Liu
73
2
0
05 Mar 2024
Large Language Model-Based Evolutionary Optimizer: Reasoning with elitism
Shuvayan Brahmachary
Subodh M. Joshi
Aniruddha Panda
K. Koneripalli
A. Sagotra
Harshil Patel
Ankush Sharma
Ameya Dilip Jagtap
Kaushic Kalyanaraman
LRM
116
22
0
04 Mar 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
Anthony Liang
Guy Tennenholtz
Chih-Wei Hsu
Yinlam Chow
Erdem Biyik
Craig Boutilier
OffRL
86
1
0
25 Feb 2024
Discovering Temporally-Aware Reinforcement Learning Algorithms
Matthew Jackson
Chris Xiaoxuan Lu
Louis Kirsch
R. T. Lange
Shimon Whiteson
Jakob N. Foerster
110
18
0
08 Feb 2024
In-context learning agents are asymmetric belief updaters
Johannes A. Schubert
Akshay K. Jagadish
Marcel Binz
Eric Schulz
LLMAG
79
10
0
06 Feb 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Junqiao Zhao
Pheng-Ann Heng
OffRL
111
11
0
04 Feb 2024
Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
Akshay K. Jagadish
Julian Coda-Forno
Mirko Thalmann
Eric Schulz
Marcel Binz
64
4
0
02 Feb 2024
Control-Theoretic Techniques for Online Adaptation of Deep Neural Networks in Dynamical Systems
Jacob G. Elkins
F. Fahimi
AI4CE
95
0
0
01 Feb 2024
Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML Approach for Model-free LQR
Leonardo F. Toso
Donglin Zhan
James Anderson
Han Wang
111
10
0
25 Jan 2024
Memory, Space, and Planning: Multiscale Predictive Representations
Ida Momennejad
61
3
0
16 Jan 2024
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning
Bram Grooten
Tristan Tomilin
Gautham Vasan
Matthew E. Taylor
A. R. Mahmood
Meng Fang
Mykola Pechenizkiy
Decebal Constantin Mocanu
88
10
0
23 Dec 2023
In-Context Reinforcement Learning for Variable Action Spaces
Viacheslav Sinii
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Sergey Kolesnikov
140
17
0
20 Dec 2023
Emergence of In-Context Reinforcement Learning from Noise Distillation
Ilya Zisman
Vladislav Kurenkov
Alexander Nikulin
Viacheslav Sinii
Sergey Kolesnikov
OffRL
113
15
0
19 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
123
30
0
19 Dec 2023
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He
Anjie Zhu
Shuang Liang
Feiyu Chen
Jie Shao
OffRL
92
4
0
11 Dec 2023
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Jaeuk Shin
Giho Kim
Howon Lee
Joonho Han
Insoon Yang
OffRL
85
1
0
09 Dec 2023
Scalable Meta-Learning with Gaussian Processes
Petru Tighineanu
Lukas Großberger
P. Baireuther
Kathrin Skubch
Stefan Falkner
Julia Vinogradska
Felix Berkenkamp
77
5
0
01 Dec 2023
Designing Long-term Group Fair Policies in Dynamical Systems
Miriam Rateike
Isabel Valera
Patrick Forré
100
5
0
21 Nov 2023
Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning
Arjun Bhardwaj
Jonas Rothfuss
Bhavya Sukhija
Yarden As
Marco Hutter
Stelian Coros
Andreas Krause
94
5
0
13 Nov 2023
An introduction to reinforcement learning for neuroscience
Kristopher T. Jensen
OOD
OffRL
AI4CE
55
1
0
13 Nov 2023
Dream to Adapt: Meta Reinforcement Learning by Latent Context Imagination and MDP Imagination
Lu Wen
Songan Zhang
E. Tseng
Huei Peng
VLM
OffRL
76
6
0
11 Nov 2023
Hypothesis Network Planned Exploration for Rapid Meta-Reinforcement Learning Adaptation
Maxwell J. Jacobson
Yexiang Xue
88
0
0
07 Nov 2023
Astrocytes as a mechanism for meta-plasticity and contextually-guided network function
Lulu Gong
Fabio Pasqualetti
Thomas Papouin
ShiNung Ching
25
2
0
06 Nov 2023
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
Annie S. Chen
Govind Chada
Laura M. Smith
Archit Sharma
Zipeng Fu
Sergey Levine
Chelsea Finn
100
8
0
02 Nov 2023
Emergence of Collective Open-Ended Exploration from Decentralized Meta-Reinforcement Learning
Richard Bornemann
Gautier Hamon
Eleni Nisioti
Clément Moulin-Frier
LRM
113
1
0
01 Nov 2023
Meta-Learning Strategies through Value Maximization in Neural Networks
Rodrigo Carrasco-Davis
Javier Masís
Andrew M. Saxe
68
1
0
30 Oct 2023
Are LSTMs Good Few-Shot Learners?
Mike Huisman
Thomas M. Moerland
Aske Plaat
Jan N. van Rijn
VLM
83
9
0
22 Oct 2023
Solving Expensive Optimization Problems in Dynamic Environments with Meta-learning
Huan Zhang
Jinliang Ding
Liang Feng
Kay Chen Tan
Ke Li
75
7
0
19 Oct 2023
Few-Shot Learning Patterns in Financial Time-Series for Trend-Following Strategies
Kieran Wood
Samuel Kessler
Stephen J. Roberts
S. Zohren
AI4TS
85
2
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
126
34
0
15 Oct 2023
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
Licong Lin
Yu Bai
Song Mei
OffRL
102
53
0
12 Oct 2023
Cross-Episodic Curriculum for Transformer Agents
Lucy Xiaoyang Shi
Yunfan Jiang
Jake Grigsby
Linxi "Jim" Fan
Yuke Zhu
77
7
0
12 Oct 2023
Deep Model Predictive Optimization
Jacob Sacks
Rwik Rana
Kevin Huang
Alex Spitzer
Guanya Shi
Byron Boots
91
7
0
06 Oct 2023
Deep reinforcement learning for machine scheduling: Methodology, the state-of-the-art, and future directions
Maziyar Khadivi
Todd Charter
Marjan Yaghoubi
Masoud Jalayer
Maryam Ahang
Ardeshir Shojaeinasab
Homayoun Najjaran
75
11
0
04 Oct 2023
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Matthew Jackson
Minqi Jiang
Jack Parker-Holder
Risto Vuorio
Chris Xiaoxuan Lu
Gregory Farquhar
Shimon Whiteson
Jakob N. Foerster
OOD
72
9
0
04 Oct 2023
Conceptual Framework for Autonomous Cognitive Entities
David Shapiro
Wangfan Li
Manuel Delaflor
Carlos Toxtli
85
1
0
03 Oct 2023
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Jacob Beck
Risto Vuorio
Zheng Xiong
Shimon Whiteson
100
10
0
26 Sep 2023
Uncovering mesa-optimization algorithms in Transformers
J. Oswald
Eyvind Niklasson
Maximilian Schlegel
Seijin Kobayashi
Nicolas Zucchet
...
Mark Sandler
Blaise Agüera y Arcas
Max Vladymyrov
Razvan Pascanu
João Sacramento
72
64
0
11 Sep 2023
Thinker: Learning to Plan and Act
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAG
OffRL
LRM
56
9
0
27 Jul 2023
Automatically Reconciling the Trade-off between Prediction Accuracy and Earliness in Prescriptive Business Process Monitoring
Andreas Metzger
Tristan Kley
Aristide Rothweiler
Klaus Pohl
71
4
0
12 Jul 2023
PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks
I. Char
J. Schneider
80
4
0
12 Jul 2023
Previous
1
2
3
4
5
...
10
11
12
Next