2402.02868
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
5 February 2024
Maciej Wołczyk
Bartłomiej Cupiał
M. Ostaszewski
Michal Bortkiewicz
Michał Zając
Razvan Pascanu
Łukasz Kuciński
Piotr Miłoś
CLL
Papers citing
"Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem"
43 / 43 papers shown
Title
VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning
Guanxing Lu
Wenkai Guo
Chubin Zhang
Yuheng Zhou
Haonan Jiang
Zifeng Gao
Yansong Tang
Ziwei Wang
OffRL
103
0
0
24 May 2025
Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation
Mohit Pandey
G. Subbaraj
Artem Cherkasov
Martin Ester
Emmanuel Bengio
AI4CE
131
1
0
08 Mar 2025
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wołczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktäschel
LLMAG
LRM
202
22
0
20 Nov 2024
An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning
Yun Luo
Zhen Yang
Fandong Meng
Yafu Li
Jie Zhou
Yue Zhang
CLL
KELM
184
315
0
17 Aug 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
69
24
0
14 Mar 2023
NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
J. Bornschein
Alexandre Galashov
Ross Hemsley
Amal Rannen-Triki
Yutian Chen
...
Angeliki Lazaridou
Yee Whye Teh
Andrei A. Rusu
Razvan Pascanu
Marc'Aurelio Ranzato
OOD
VLM
AI4TS
90
18
0
15 Nov 2022
Dungeons and Data: A Large-Scale NetHack Dataset
Eric Hambro
Roberta Raileanu
Dan Rothermel
Vegard Mella
Tim Rocktäschel
Heinrich Küttler
Naila Murray
OffRL
218
19
0
01 Nov 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
62
6
0
22 Oct 2022
Scaling Instruction-Finetuned Language Models
Hyung Won Chung
Le Hou
Shayne Longpre
Barret Zoph
Yi Tay
...
Jacob Devlin
Adam Roberts
Denny Zhou
Quoc V. Le
Jason W. Wei
ReLM
LRM
215
3,150
0
20 Oct 2022
Disentangling Transfer in Continual Reinforcement Learning
Maciej Wołczyk
Michał Zając
Razvan Pascanu
Łukasz Kuciński
Piotr Miłoś
CLL
121
28
0
28 Sep 2022
Modular Lifelong Reinforcement Learning via Neural Composition
Jorge Armando Mendez Mendez
H. V. Seijen
Eric Eaton
OffRL
KELM
CLL
134
41
0
01 Jul 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
132
303
0
23 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
114
34
0
07 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
115
65
0
03 Jun 2022
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation
Alex X. Lee
Coline Devin
Jost Tobias Springenberg
Yuxiang Zhou
Thomas Lampe
A. Abdolmaleki
Konstantinos Bousmalis
OffRL
OnRL
78
15
0
06 May 2022
Fine-tuning Image Transformers using Learnable Memory
Mark Sandler
A. Zhmoginov
Max Vladymyrov
Andrew Jackson
ViT
77
48
0
29 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
92
123
0
25 Mar 2022
Insights From the NeurIPS 2021 NetHack Challenge
Eric Hambro
Sharada Mohanty
Dmitrii Babaev
Mi-Ra Byeon
Dipam Chakraborty
...
Dan Rothermel
Mikayel Samvelyan
Dmitry Sorokin
Maciej Sypetkowski
Michal Sypetkowski
62
19
0
22 Mar 2022
Architecture Matters in Continual Learning
Seyed Iman Mirzadeh
Arslan Chaudhry
Dong Yin
Timothy Nguyen
Razvan Pascanu
Dilan Görür
Mehrdad Farajtabar
OOD
KELM
162
62
0
01 Feb 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
301
927
0
12 Oct 2021
Continual Learning in the Teacher-Student Setup: Impact of Task Similarity
Sebastian Lee
Sebastian Goldt
Andrew M. Saxe
CLL
75
74
0
09 Jul 2021
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Seunghyun Lee
Younggyo Seo
Kimin Lee
Pieter Abbeel
Jinwoo Shin
OffRL
OnRL
62
191
0
01 Jul 2021
Pretraining Representations for Data-Efficient Reinforcement Learning
Max Schwarzer
Nitarshan Rajkumar
Michael Noukhovitch
Ankesh Anand
Laurent Charlin
Devon Hjelm
Philip Bachman
Aaron Courville
OffRL
98
118
0
09 Jun 2021
Continual World: A Robotic Benchmark For Continual Reinforcement Learning
Maciej Wołczyk
Michał Zając
Razvan Pascanu
Łukasz Kuciński
Piotr Miłoś
CLL
OffRL
68
98
0
23 May 2021
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research
J. Obando-Ceron
Pablo Samuel Castro
OffRL
78
109
0
20 Nov 2020
Rethinking Experience Replay: a Bag of Tricks for Continual Learning
Pietro Buzzega
Matteo Boschini
Angelo Porrello
Simone Calderara
CLL
45
153
0
12 Oct 2020
Decoupling Representation Learning from Reinforcement Learning
Adam Stooke
Kimin Lee
Pieter Abbeel
Michael Laskin
SSL
DRL
367
345
0
14 Sep 2020
What is being transferred in transfer learning?
Behnam Neyshabur
Hanie Sedghi
Chiyuan Zhang
117
528
0
26 Aug 2020
The NetHack Learning Environment
Heinrich Küttler
Nantas Nardelli
Alexander H. Miller
Roberta Raileanu
Marco Selvatici
Edward Grefenstette
Tim Rocktäschel
82
181
0
24 Jun 2020
Similarity of Neural Network Representations Revisited
Simon Kornblith
Mohammad Norouzi
Honglak Lee
Geoffrey E. Hinton
143
1,435
0
01 May 2019
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Tom Schaul
Diana Borsa
Joseph Modayil
Razvan Pascanu
70
63
0
25 Apr 2019
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
145
2,450
0
13 Dec 2018
Experience Replay for Continual Learning
David Rolnick
Arun Ahuja
Jonathan Richard Schwarz
Timothy Lillicrap
Greg Wayne
CLL
116
1,171
0
28 Nov 2018
Kickstarting Deep Reinforcement Learning
Simon Schmitt
Jonathan J. Hudson
Augustin Žídek
Simon Osindero
Carl Doersch
...
Joel Z Leibo
Heinrich Küttler
Andrew Zisserman
Karen Simonyan
S. M. Ali Eslami
OnRL
64
134
0
10 Mar 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,406
0
04 Jan 2018
Memory Aware Synapses: Learning what (not) to forget
Rahaf Aljundi
F. Babiloni
Mohamed Elhoseiny
Marcus Rohrbach
Tinne Tuytelaars
KELM
CLL
87
1,646
0
27 Nov 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
541
19,265
0
20 Jul 2017
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
374
7,572
0
02 Dec 2016
iCaRL: Incremental Classifier and Representation Learning
Sylvestre-Alvise Rebuffi
Alexander Kolesnikov
G. Sperl
Christoph H. Lampert
CLL
OOD
160
3,781
0
23 Nov 2016
Progressive Neural Networks
Andrei A. Rusu
Neil C. Rabinowitz
Guillaume Desjardins
Hubert Soyer
J. Kirkpatrick
Koray Kavukcuoglu
Razvan Pascanu
R. Hadsell
CLL
AI4CE
81
2,464
0
15 Jun 2016
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
129
12,269
0
19 Dec 2013
Rich feature hierarchies for accurate object detection and semantic segmentation
Ross B. Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
ObjD
291
26,217
0
11 Nov 2013
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
120
3,021
0
19 Jul 2012