ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.11435
  4. Cited By
InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning
  Large Language Models with Instructions

InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning Large Language Models with Instructions

18 March 2024
Yifan Wang
Yafei Liu
Chufan Shi
Haoling Li
Chen Chen
H. Lu
Yujiu Yang
    CLL
ArXiv (abs)PDFHTML

Papers citing "InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning Large Language Models with Instructions"

32 / 32 papers shown
Title
Unlocking the Power of Function Vectors for Characterizing and Mitigating Catastrophic Forgetting in Continual Instruction Tuning
Unlocking the Power of Function Vectors for Characterizing and Mitigating Catastrophic Forgetting in Continual Instruction Tuning
Gangwei Jiang
Caigao Jiang
Zhaoyi Li
Siqiao Xue
Jun-ping Zhou
Linqi Song
Defu Lian
Yin Wei
CLLMU
153
2
0
16 Feb 2025
Estimating the Optimal Number of Clusters in Categorical Data Clustering by Silhouette Coefficient
Duy-Tai Dinh
Tsutomu Fujinami
Van-Nam Huynh
77
119
0
28 Jan 2025
LoRTA: Low Rank Tensor Adaptation of Large Language Models
LoRTA: Low Rank Tensor Adaptation of Large Language Models
Ignacio Hounie
Charilaos I. Kanatsoulis
Arnuv Tandon
Alejandro Ribeiro
145
0
0
05 Oct 2024
Towards LifeSpan Cognitive Systems
Towards LifeSpan Cognitive Systems
Yu Wang
Chi Han
Tongtong Wu
Xiaoxin He
Wangchunshu Zhou
...
Zexue He
Wei Wang
Gholamreza Haffari
Heng Ji
Julian McAuley
KELMCLL
461
2
0
20 Sep 2024
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
Yupeng Chen
Senmiao Wang
Zhihang Lin
Zhihang Lin
Yushun Zhang
Tian Ding
Ruoyu Sun
Ruoyu Sun
CLL
153
5
0
30 Jul 2024
Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning
Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning
Yongquan He
Wenyuan Zhang
Xuancheng Huang
Peng Zhang
Lingxun Meng
Wei Lin
Wenyuan Zhang
Yifu Gao
CLLALM
130
5
0
15 Mar 2024
D4: Improving LLM Pretraining via Document De-Duplication and
  Diversification
D4: Improving LLM Pretraining via Document De-Duplication and Diversification
Kushal Tirumala
Daniel Simig
Armen Aghajanyan
Ari S. Morcos
SyDa
64
115
0
23 Aug 2023
An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning
An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning
Yun Luo
Zhen Yang
Fandong Meng
Yafu Li
Jie Zhou
Yue Zhang
CLLKELM
192
318
0
17 Aug 2023
Exploring the Benefits of Training Expert Language Models over
  Instruction Tuning
Exploring the Benefits of Training Expert Language Models over Instruction Tuning
Joel Jang
Seungone Kim
Seonghyeon Ye
Doyoung Kim
Lajanugen Logeswaran
Moontae Lee
Kyungjae Lee
Minjoon Seo
LRMALM
106
82
0
07 Feb 2023
Scaling Instruction-Finetuned Language Models
Scaling Instruction-Finetuned Language Models
Hyung Won Chung
Le Hou
Shayne Longpre
Barret Zoph
Yi Tay
...
Jacob Devlin
Adam Roberts
Denny Zhou
Quoc V. Le
Jason W. Wei
ReLMLRM
231
3,158
0
20 Oct 2022
Fine-tuned Language Models are Continual Learners
Fine-tuned Language Models are Continual Learners
Thomas Scialom
Tuhin Chakrabarty
Smaranda Muresan
CLLLRM
191
122
0
24 May 2022
ELLE: Efficient Lifelong Pre-training for Emerging Data
ELLE: Efficient Lifelong Pre-training for Emerging Data
Yujia Qin
Jiajie Zhang
Yankai Lin
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
95
73
0
12 Mar 2022
TIDF-DLPM: Term and Inverse Document Frequency based Data Leakage
  Prevention Model
TIDF-DLPM: Term and Inverse Document Frequency based Data Leakage Prevention Model
Ishu Gupta
Sloni Mittal
Ankit Tiwari
Priyanka Agarwal
Ashutosh Kumar Singh
119
18
0
10 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
888
13,207
0
04 Mar 2022
Achieving Forgetting Prevention and Knowledge Transfer in Continual
  Learning
Achieving Forgetting Prevention and Knowledge Transfer in Continual Learning
Zixuan Ke
Bing-Quan Liu
Nianzu Ma
Hu Xu
Lei Shu
CLL
217
126
0
05 Dec 2021
Lifelong Pretraining: Continually Adapting Language Models to Emerging
  Corpora
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora
Xisen Jin
Dejiao Zhang
Henghui Zhu
Wei Xiao
Shang-Wen Li
Xiaokai Wei
Andrew O. Arnold
Xiang Ren
KELMCLL
106
117
0
16 Oct 2021
Multitask Prompted Training Enables Zero-Shot Task Generalization
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
355
1,709
0
15 Oct 2021
Phrase-BERT: Improved Phrase Embeddings from BERT with an Application to
  Corpus Exploration
Phrase-BERT: Improved Phrase Embeddings from BERT with an Application to Corpus Exploration
Shufan Wang
Laure Thompson
Mohit Iyyer
229
68
0
13 Sep 2021
Finetuned Language Models Are Zero-Shot Learners
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALMUQCV
251
3,789
0
03 Sep 2021
LoRA: Low-Rank Adaptation of Large Language Models
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRLAI4TSAI4CEALMAIMat
502
10,526
0
17 Jun 2021
Continual Learning in Task-Oriented Dialogue Systems
Continual Learning in Task-Oriented Dialogue Systems
Andrea Madotto
Zhaojiang Lin
Zhenpeng Zhou
Seungwhan Moon
Paul A. Crook
Bing-Quan Liu
Zhou Yu
Eunjoon Cho
Zhiguang Wang
CLL
130
132
0
31 Dec 2020
Efficient Meta Lifelong-Learning with Limited Memory
Efficient Meta Lifelong-Learning with Limited Memory
Zirui Wang
Sanket Vaibhav Mehta
Barnabás Póczós
J. Carbonell
CLLKELM
75
76
0
06 Oct 2020
Continual Learning for Natural Language Generation in Task-oriented
  Dialog Systems
Continual Learning for Natural Language Generation in Task-oriented Dialog Systems
Fei Mi
Liangwei Chen
Mengjie Zhao
Minlie Huang
Boi Faltings
CLLKELM
50
71
0
02 Oct 2020
Geometric Dataset Distances via Optimal Transport
Geometric Dataset Distances via Optimal Transport
David Alvarez-Melis
Nicolò Fusi
OT
130
205
0
07 Feb 2020
LAMOL: LAnguage MOdeling for Lifelong Language Learning
LAMOL: LAnguage MOdeling for Lifelong Language Learning
Fan-Keng Sun
Cheng-Hao Ho
Hung-yi Lee
CLLKELM
90
213
0
07 Sep 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
697
24,557
0
26 Jul 2019
Class-incremental Learning via Deep Model Consolidation
Class-incremental Learning via Deep Model Consolidation
Junting Zhang
Jie Zhang
Shalini Ghosh
Dawei Li
Serafettin Tasci
Larry Heck
Heming Zhang
C.-C. Jay Kuo
CLL
84
339
0
19 Mar 2019
An Empirical Study of Example Forgetting during Deep Neural Network
  Learning
An Empirical Study of Example Forgetting during Deep Neural Network Learning
Mariya Toneva
Alessandro Sordoni
Rémi Tachet des Combes
Adam Trischler
Yoshua Bengio
Geoffrey J. Gordon
134
741
0
12 Dec 2018
Overcoming catastrophic forgetting in neural networks
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
374
7,572
0
02 Dec 2016
Progressive Neural Networks
Progressive Neural Networks
Andrei A. Rusu
Neil C. Rabinowitz
Guillaume Desjardins
Hubert Soyer
J. Kirkpatrick
Koray Kavukcuoglu
Razvan Pascanu
R. Hadsell
CLLAI4CE
83
2,465
0
15 Jun 2016
An Empirical Investigation of Catastrophic Forgetting in Gradient-Based
  Neural Networks
An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks
Ian Goodfellow
M. Berk Mirza
Xia Da
Aaron Courville
Yoshua Bengio
159
1,455
0
21 Dec 2013
Sinkhorn Distances: Lightspeed Computation of Optimal Transportation
  Distances
Sinkhorn Distances: Lightspeed Computation of Optimal Transportation Distances
Marco Cuturi
OT
220
4,294
0
04 Jun 2013
1