Learning to reinforcement learn

17 November 2016
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
    OffRL

Papers citing "Learning to reinforcement learn"

50 / 584 papers shown
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
Guy Azran
Mohamad H. Danesh
Stefano V. Albrecht
Sarah Keren
AI4CE
132
2
0
11 Jul 2023
First-Explore, then Exploit: Meta-Learning Intelligent Exploration
Ben Norman
Jeff Clune
58
0
0
05 Jul 2023
Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning
E. Hurwitz
N. Peace
G. Cevora
OffRL
20
0
0
03 Jul 2023
RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$
Abhinav Bhatia
Samer B. Nashed
S. Zilberstein
OffRL
107
0
0
28 Jun 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
121
86
0
26 Jun 2023
Recurrent Action Transformer with Memory
A. Staroverov
A. Bessonov
Dmitry A. Yudin
A. Kovalev
Aleksandr I. Panov
OffRL
106
7
0
15 Jun 2023
One-Shot Learning of Visual Path Navigation for Autonomous Vehicles
Zhongying CuiZhu
François Charette
A. Ghafourian
Debo Shi
Matthew Cui
Anjali Krishnamachar
I. S. Bozchalooi
78
1
0
15 Jun 2023
Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning
Emmy Liu
S. Suri
Tong Mu
Allan Zhou
Chelsea Finn
LLMAG LM&Ro
51
2
0
14 Jun 2023
ContraBAR: Contrastive Bayes-Adaptive Deep RL
Era Choshen
Aviv Tamar
BDL OffRL
61
9
0
04 Jun 2023
Offline Meta Reinforcement Learning with In-Distribution Online Adaptation
Jianhao Wang
Jin Zhang
Haozhe Jiang
Junyu Zhang
Liwei Wang
Chongjie Zhang
OffRL
90
10
0
31 May 2023
Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity
Yiran Mao
Madeline G. Reinecke
M. Kunesch
Edgar A. Duénez-Guzmán
Ramona Comanescu
Julia Haas
Joel Z Leibo
66
2
0
29 May 2023
Online Nonstochastic Model-Free Reinforcement Learning
Udaya Ghai
Arushi Gupta
Wenhan Xia
Karan Singh
Elad Hazan
OffRL
96
6
0
27 May 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
Hao Liu
Pieter Abbeel
OffRL
93
29
0
26 May 2023
Meta-in-context learning in large language models
Julian Coda-Forno
Marcel Binz
Zeynep Akata
M. Botvinick
Jane X. Wang
Eric Schulz
LRM
327
44
0
22 May 2023
Brain-inspired learning in artificial neural networks: a review
Samuel Schmidgall
Jascha Achterberg
Thomas Miconi
Louis Kirsch
Rojin Ziaei
S. P. Hajiseyedrazi
Jason K. Eshraghian
85
63
0
18 May 2023
DAC-MR: Data Augmentation Consistency Based Meta-Regularization for Meta-Learning
Jun Shu
Xiang Yuan
Deyu Meng
Zongben Xu
98
4
0
13 May 2023
Learning and Adapting Agile Locomotion Skills by Transferring Experience
Laura M. Smith
J. Kew
Tianyu Li
Linda Luu
Xue Bin Peng
Sehoon Ha
Jie Tan
Sergey Levine
103
56
0
19 Apr 2023
A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer in Autonomous Driving
Dian-Tao Li
Ostap Okhrin
108
3
0
14 Apr 2023
Meta-Learned Models of Cognition
Marcel Binz
Ishita Dasgupta
Akshay K. Jagadish
M. Botvinick
Jane X. Wang
Eric Schulz
111
27
0
12 Apr 2023
Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization
R. T. Lange
Tom Schaul
Yutian Chen
Chris Xiaoxuan Lu
Tom Zahavy
Valentin Dalibard
Sebastian Flennerhag
114
36
0
08 Apr 2023
On Context Distribution Shift in Task Representation Learning for Offline Meta RL
Chenyang Zhao
Zihao Zhou
Bing-Quan Liu
OffRL
61
4
0
01 Apr 2023
FindView: Precise Target View Localization Task for Look Around Agents
Haruya Ishikawa
Y. Aoki
52
0
0
16 Mar 2023
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro OffRL LRM AI4CE
203
172
0
07 Mar 2023
Structured State Space Models for In-Context Reinforcement Learning
Chris Xiaoxuan Lu
Yannick Schroecker
Albert Gu
Emilio Parisotto
Jakob N. Foerster
Satinder Singh
Feryal M. P. Behbahani
AI4TS
180
99
0
07 Mar 2023
Domain Adaptation of Reinforcement Learning Agents based on Network Service Proximity
Kaushik Dey
S. K. Perepu
P. Dasgupta
Abir Das
69
1
0
02 Mar 2023
Bayes meets Bernstein at the Meta Level: an Analysis of Fast Rates in Meta-Learning with PAC-Bayes
Charles Riou
Pierre Alquier
Badr-Eddine Chérief-Abdellatif
121
10
0
23 Feb 2023
Minimax-Bayes Reinforcement Learning
Thomas Kleine Buening
Christos Dimitrakakis
Hannes Eriksson
Divya Grover
Emilio Jorge
OffRL
67
5
0
21 Feb 2023
Meta-Reinforcement Learning via Exploratory Task Clustering
Zhendong Chu
Hongning Wang
OffRL
88
7
0
15 Feb 2023
Graph schemas as abstractions for transfer learning, inference, and planning
J. S. Guntupalli
Rajkumar Vasudeva Raju
Shrinu Kushagra
Carter Wendelken
Daniel P. Sawyer
Ishani Deshpande
Guangyao Zhou
Miguel Lazaro-Gredilla
Dileep George
111
10
0
14 Feb 2023
Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration
Chentian Jiang
Nan Rosemary Ke
Hado van Hasselt
72
4
0
08 Feb 2023
Memory-Based Meta-Learning on Non-Stationary Distributions
Tim Genewein
Grégoire Delétang
Anian Ruoss
L. Wenliang
Elliot Catt
Vincent Dutordoir
Jordi Grau-Moya
Laurent Orseau
Marcus Hutter
J. Veness
BDL
100
12
0
06 Feb 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
65
21
0
31 Jan 2023
Incorporating Recurrent Reinforcement Learning into Model Predictive Control for Adaptive Control in Autonomous Driving
Yehui Zhang
Joschka Boedecker
Chuxuan Li
Guyue Zhou
55
0
0
30 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro OffRL AI4CE LRM
139
119
0
18 Jan 2023
Optimistic Meta-Gradients
Sebastian Flennerhag
Tom Zahavy
Brendan O'Donoghue
Hado van Hasselt
András Gyorgy
Satinder Singh
93
3
0
09 Jan 2023
Eliminating Meta Optimization Through Self-Referential Meta Learning
Louis Kirsch
Jürgen Schmidhuber
65
7
0
29 Dec 2022
Hyperparameters in Contextual RL are Highly Situational
Theresa Eimer
C. Benjamins
Marius Lindauer
143
4
0
21 Dec 2022
Evaluating Human-Language Model Interaction
Mina Lee
Megha Srivastava
Amelia Hardy
John Thickstun
Esin Durmus
...
Hancheng Cao
Tony Lee
Rishi Bommasani
Michael S. Bernstein
Percy Liang
LM&MA ALM
114
102
0
19 Dec 2022
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
Minghuan Liu
Zhengbang Zhu
Menghui Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
64
0
0
18 Dec 2022
Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
Zhecheng Yuan
Zhengrong Xue
Bo Yuan
Xueqian Wang
Yi Wu
Yang Gao
Huazhe Xu
SSL OffRL
117
74
0
17 Dec 2022
Learning Options via Compression
Yiding Jiang
Emmy Liu
Benjamin Eysenbach
Zico Kolter
Chelsea Finn
OffRL
95
15
0
08 Dec 2022
General-Purpose In-Context Learning by Meta-Learning Transformers
Louis Kirsch
James Harrison
Jascha Narain Sohl-Dickstein
Luke Metz
134
78
0
08 Dec 2022
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
115
101
0
06 Dec 2022
Learning to Optimize in Model Predictive Control
Jacob Sacks
Byron Boots
81
22
0
05 Dec 2022
Cooperative control of environmental extremes by artificial intelligent agents
Martí Sánchez-Fibla
Clément Moulin-Frier
Ricard Solé
AI4CE
64
2
0
05 Dec 2022
Active learning using adaptable task-based prioritisation
Shaheer U. Saeed
João Ramalhinho
Mark A. Pinnock
Ziyi Shen
Yunguan Fu
...
D. Barratt
Stephen P. Pereira
Brian R. Davidson
Matthew J. Clarkson
Yipeng Hu
89
7
0
03 Dec 2022
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation
Hiroki Furuta
Yusuke Iwasawa
Yutaka Matsuo
S. Gu
83
17
0
25 Nov 2022
Adaptive Prototypical Networks
Manas Gogoi
Sambhavi Tiwari
Shekhar Verma
74
2
0
22 Nov 2022
Discovering Evolution Strategies via Meta-Black-Box Optimization
R. T. Lange
Tom Schaul
Yutian Chen
Tom Zahavy
Valentin Dalibard
Chris Xiaoxuan Lu
Satinder Singh
Sebastian Flennerhag
121
49
0
21 Nov 2022
Giving Feedback on Interactive Student Programs with Meta-Exploration
Emmy Liu
Moritz Stephan
Allen Nie
Chris Piech
Emma Brunskill
Chelsea Finn
AI4Ed
109
7
0
16 Nov 2022