ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.05763
  4. Cited By
Learning to reinforcement learn
v1v2v3 (latest)

Learning to reinforcement learn

17 November 2016
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Learning to reinforcement learn"

50 / 584 papers shown
Title
Inferring Behavior-Specific Context Improves Zero-Shot Generalization in
  Reinforcement Learning
Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning
Tidiane Camaret Ndir
André Biedenkapp
Noor H. Awad
OffRL
82
1
0
15 Apr 2024
Jointly Training and Pruning CNNs via Learnable Agent Guidance and
  Alignment
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
88
7
0
28 Mar 2024
DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving
  Environment for Real-World Performance Validation
DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation
Mu-Yi Shen
Chia-Chi Hsu
Hao-Yu Hou
Yu-Chen Huang
Wei-Fang Sun
Chia-Che Chang
Yu-Lun Liu
Chun-Yi Lee
116
3
0
23 Mar 2024
Towards Understanding the Relationship between In-context Learning and
  Compositional Generalization
Towards Understanding the Relationship between In-context Learning and Compositional Generalization
Sungjun Han
Sebastian Padó
CoGe
72
2
0
18 Mar 2024
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot
  Generalization
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
Sai Prasanna
Karim Farid
Raghu Rajan
André Biedenkapp
123
6
0
16 Mar 2024
MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning
MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning
Zohar Rimon
Tom Jurgenson
Orr Krupnik
Gilad Adler
Aviv Tamar
69
10
0
14 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
95
11
0
14 Mar 2024
In-context Exploration-Exploitation for Reinforcement Learning
In-context Exploration-Exploitation for Reinforcement Learning
Zhenwen Dai
Federico Tomasi
Sina Ghiassian
OffRLOnRL
102
6
0
11 Mar 2024
SplAgger: Split Aggregation for Meta-Reinforcement Learning
SplAgger: Split Aggregation for Meta-Reinforcement Learning
Jacob Beck
Matthew Jackson
Risto Vuorio
Zheng Xiong
Shimon Whiteson
OffRL
103
2
0
05 Mar 2024
Autonomous vehicle decision and control through reinforcement learning
  with traffic flow randomization
Autonomous vehicle decision and control through reinforcement learning with traffic flow randomization
Yuan Lin
Antai Xie
Xiao Liu
73
2
0
05 Mar 2024
Large Language Model-Based Evolutionary Optimizer: Reasoning with
  elitism
Large Language Model-Based Evolutionary Optimizer: Reasoning with elitism
Shuvayan Brahmachary
Subodh M. Joshi
Aniruddha Panda
K. Koneripalli
A. Sagotra
Harshil Patel
Ankush Sharma
Ameya Dilip Jagtap
Kaushic Kalyanaraman
LRM
116
22
0
04 Mar 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement
  Learning
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
Anthony Liang
Guy Tennenholtz
Chih-Wei Hsu
Yinlam Chow
Erdem Biyik
Craig Boutilier
OffRL
86
1
0
25 Feb 2024
Discovering Temporally-Aware Reinforcement Learning Algorithms
Discovering Temporally-Aware Reinforcement Learning Algorithms
Matthew Jackson
Chris Xiaoxuan Lu
Louis Kirsch
R. T. Lange
Shimon Whiteson
Jakob N. Foerster
110
18
0
08 Feb 2024
In-context learning agents are asymmetric belief updaters
In-context learning agents are asymmetric belief updaters
Johannes A. Schubert
Akshay K. Jagadish
Marcel Binz
Eric Schulz
LLMAG
79
10
0
06 Feb 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Junqiao Zhao
Pheng-Ann Heng
OffRL
111
11
0
04 Feb 2024
Human-like Category Learning by Injecting Ecological Priors from Large
  Language Models into Neural Networks
Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
Akshay K. Jagadish
Julian Coda-Forno
Mirko Thalmann
Eric Schulz
Marcel Binz
64
4
0
02 Feb 2024
Control-Theoretic Techniques for Online Adaptation of Deep Neural
  Networks in Dynamical Systems
Control-Theoretic Techniques for Online Adaptation of Deep Neural Networks in Dynamical Systems
Jacob G. Elkins
F. Fahimi
AI4CE
95
0
0
01 Feb 2024
Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML
  Approach for Model-free LQR
Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML Approach for Model-free LQR
Leonardo F. Toso
Donglin Zhan
James Anderson
Han Wang
111
10
0
25 Jan 2024
Memory, Space, and Planning: Multiscale Predictive Representations
Memory, Space, and Planning: Multiscale Predictive Representations
Ida Momennejad
61
3
0
16 Jan 2024
MaDi: Learning to Mask Distractions for Generalization in Visual Deep
  Reinforcement Learning
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning
Bram Grooten
Tristan Tomilin
Gautham Vasan
Matthew E. Taylor
A. R. Mahmood
Meng Fang
Mykola Pechenizkiy
Decebal Constantin Mocanu
88
10
0
23 Dec 2023
In-Context Reinforcement Learning for Variable Action Spaces
In-Context Reinforcement Learning for Variable Action Spaces
Viacheslav Sinii
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Sergey Kolesnikov
140
17
0
20 Dec 2023
Emergence of In-Context Reinforcement Learning from Noise Distillation
Emergence of In-Context Reinforcement Learning from Noise Distillation
Ilya Zisman
Vladislav Kurenkov
Alexander Nikulin
Viacheslav Sinii
Sergey Kolesnikov
OffRL
113
15
0
19 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
123
30
0
19 Dec 2023
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and
  Skills
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He
Anjie Zhu
Shuang Liang
Feiyu Chen
Jie Shao
OffRL
92
4
0
11 Dec 2023
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and
  Online LQR
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Jaeuk Shin
Giho Kim
Howon Lee
Joonho Han
Insoon Yang
OffRL
85
1
0
09 Dec 2023
Scalable Meta-Learning with Gaussian Processes
Scalable Meta-Learning with Gaussian Processes
Petru Tighineanu
Lukas Großberger
P. Baireuther
Kathrin Skubch
Stefan Falkner
Julia Vinogradska
Felix Berkenkamp
77
5
0
01 Dec 2023
Designing Long-term Group Fair Policies in Dynamical Systems
Designing Long-term Group Fair Policies in Dynamical Systems
Miriam Rateike
Isabel Valera
Patrick Forré
100
5
0
21 Nov 2023
Data-Efficient Task Generalization via Probabilistic Model-based Meta
  Reinforcement Learning
Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning
Arjun Bhardwaj
Jonas Rothfuss
Bhavya Sukhija
Yarden As
Marco Hutter
Stelian Coros
Andreas Krause
94
5
0
13 Nov 2023
An introduction to reinforcement learning for neuroscience
An introduction to reinforcement learning for neuroscience
Kristopher T. Jensen
OODOffRLAI4CE
55
1
0
13 Nov 2023
Dream to Adapt: Meta Reinforcement Learning by Latent Context
  Imagination and MDP Imagination
Dream to Adapt: Meta Reinforcement Learning by Latent Context Imagination and MDP Imagination
Lu Wen
Songan Zhang
E. Tseng
Huei Peng
VLMOffRL
76
6
0
11 Nov 2023
Hypothesis Network Planned Exploration for Rapid Meta-Reinforcement
  Learning Adaptation
Hypothesis Network Planned Exploration for Rapid Meta-Reinforcement Learning Adaptation
Maxwell J. Jacobson
Yexiang Xue
88
0
0
07 Nov 2023
Astrocytes as a mechanism for meta-plasticity and contextually-guided
  network function
Astrocytes as a mechanism for meta-plasticity and contextually-guided network function
Lulu Gong
Fabio Pasqualetti
Thomas Papouin
ShiNung Ching
25
2
0
06 Nov 2023
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
Annie S. Chen
Govind Chada
Laura M. Smith
Archit Sharma
Zipeng Fu
Sergey Levine
Chelsea Finn
100
8
0
02 Nov 2023
Emergence of Collective Open-Ended Exploration from Decentralized
  Meta-Reinforcement Learning
Emergence of Collective Open-Ended Exploration from Decentralized Meta-Reinforcement Learning
Richard Bornemann
Gautier Hamon
Eleni Nisioti
Clément Moulin-Frier
LRM
113
1
0
01 Nov 2023
Meta-Learning Strategies through Value Maximization in Neural Networks
Meta-Learning Strategies through Value Maximization in Neural Networks
Rodrigo Carrasco-Davis
Javier Masís
Andrew M. Saxe
68
1
0
30 Oct 2023
Are LSTMs Good Few-Shot Learners?
Are LSTMs Good Few-Shot Learners?
Mike Huisman
Thomas M. Moerland
Aske Plaat
Jan N. van Rijn
VLM
83
9
0
22 Oct 2023
Solving Expensive Optimization Problems in Dynamic Environments with
  Meta-learning
Solving Expensive Optimization Problems in Dynamic Environments with Meta-learning
Huan Zhang
Jinliang Ding
Liang Feng
Kay Chen Tan
Ke Li
75
7
0
19 Oct 2023
Few-Shot Learning Patterns in Financial Time-Series for Trend-Following
  Strategies
Few-Shot Learning Patterns in Financial Time-Series for Trend-Following Strategies
Kieran Wood
Samuel Kessler
Stephen J. Roberts
S. Zohren
AI4TS
85
2
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRLLM&Ro
126
34
0
15 Oct 2023
Transformers as Decision Makers: Provable In-Context Reinforcement
  Learning via Supervised Pretraining
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
Licong Lin
Yu Bai
Song Mei
OffRL
102
53
0
12 Oct 2023
Cross-Episodic Curriculum for Transformer Agents
Cross-Episodic Curriculum for Transformer Agents
Lucy Xiaoyang Shi
Yunfan Jiang
Jake Grigsby
Linxi "Jim" Fan
Yuke Zhu
77
7
0
12 Oct 2023
Deep Model Predictive Optimization
Deep Model Predictive Optimization
Jacob Sacks
Rwik Rana
Kevin Huang
Alex Spitzer
Guanya Shi
Byron Boots
91
7
0
06 Oct 2023
Deep reinforcement learning for machine scheduling: Methodology, the
  state-of-the-art, and future directions
Deep reinforcement learning for machine scheduling: Methodology, the state-of-the-art, and future directions
Maziyar Khadivi
Todd Charter
Marjan Yaghoubi
Masoud Jalayer
Maryam Ahang
Ardeshir Shojaeinasab
Homayoun Najjaran
75
11
0
04 Oct 2023
Discovering General Reinforcement Learning Algorithms with Adversarial
  Environment Design
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Matthew Jackson
Minqi Jiang
Jack Parker-Holder
Risto Vuorio
Chris Xiaoxuan Lu
Gregory Farquhar
Shimon Whiteson
Jakob N. Foerster
OOD
72
9
0
04 Oct 2023
Conceptual Framework for Autonomous Cognitive Entities
Conceptual Framework for Autonomous Cognitive Entities
David Shapiro
Wangfan Li
Manuel Delaflor
Carlos Toxtli
85
1
0
03 Oct 2023
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Jacob Beck
Risto Vuorio
Zheng Xiong
Shimon Whiteson
100
10
0
26 Sep 2023
Uncovering mesa-optimization algorithms in Transformers
Uncovering mesa-optimization algorithms in Transformers
J. Oswald
Eyvind Niklasson
Maximilian Schlegel
Seijin Kobayashi
Nicolas Zucchet
...
Mark Sandler
Blaise Agüera y Arcas
Max Vladymyrov
Razvan Pascanu
João Sacramento
72
64
0
11 Sep 2023
Thinker: Learning to Plan and Act
Thinker: Learning to Plan and Act
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAGOffRLLRM
56
9
0
27 Jul 2023
Automatically Reconciling the Trade-off between Prediction Accuracy and
  Earliness in Prescriptive Business Process Monitoring
Automatically Reconciling the Trade-off between Prediction Accuracy and Earliness in Prescriptive Business Process Monitoring
Andreas Metzger
Tristan Kley
Aristide Rothweiler
Klaus Pohl
71
4
0
12 Jul 2023
PID-Inspired Inductive Biases for Deep Reinforcement Learning in
  Partially Observable Control Tasks
PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks
I. Char
J. Schneider
80
4
0
12 Jul 2023
Previous
12345...101112
Next