How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

9 October 2023
Guanting Dong
Hongyi Yuan
Keming Lu
Chengpeng Li
Mingfeng Xue
Dayiheng Liu
Wei Wang
Zheng Yuan
Chang Zhou
Jingren Zhou
LRM
CLL
arXiv:2310.05492

Papers citing "How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition"

45 / 95 papers shown
Title
Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
Keming Lu
Bowen Yu
Fei Huang
Yang Fan
Runji Lin
Chang Zhou
MoMe
32
18
0
28 May 2024
Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Shanghaoran Quan
43
4
0
26 May 2024
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
Jiajie Jin
Yutao Zhu
Xinyu Yang
Chenghao Zhang
Zhicheng Dou
Tong Zhao
Zhao Yang
Ji-Rong Wen
VLM
85
54
0
22 May 2024
When LLMs Meet Cybersecurity: A Systematic Literature Review
Jie Zhang
Haoyu Bu
Hui Wen
Yu Chen
Lun Li
Hongsong Zhu
52
36
0
06 May 2024
Aloe: A Family of Fine-tuned Open Healthcare LLMs
Ashwin Kumar Gururajan
Enrique Lopez-Cuena
Jordi Bayarri-Planas
Adrián Tormos
Daniel Hinjos
...
Lucia Urcelay-Ganzabal
Marta Gonzalez-Mallo
Sergio Alvarez-Napagao
Eduard Ayguadé-Parra
Ulises Cortés
Dario Garcia-Gasulla
ELM
LM&MA
39
14
0
03 May 2024
HFT: Half Fine-Tuning for Large Language Models
Tingfeng Hui
Zhenyu Zhang
Shuohuan Wang
Weiran Xu
Yu Sun
Hua Wu
CLL
50
5
0
29 Apr 2024
Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model
Runzhe Zhan
Xinyi Yang
Derek F. Wong
Lidia S. Chao
Yue Zhang
58
10
0
25 Apr 2024
Self-playing Adversarial Language Game Enhances LLM Reasoning
Pengyu Cheng
Tianhao Hu
Han Xu
Zhisong Zhang
Yong Dai
Lei Han
Nan Du
Xiaolong Li
SyDa
LRM
ReLM
98
30
0
16 Apr 2024
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
Hengyuan Zhang
Yanru Wu
Dawei Li
Zacc Yang
Rui Zhao
Yong Jiang
Fei Tan
ALM
40
0
0
16 Apr 2024
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models
Hyeonwoo Kim
Gyoungjin Gim
Yungi Kim
Jihoo Kim
Byungju Kim
Wonseok Lee
Chanjun Park
ReLM
LRM
39
1
0
05 Apr 2024
Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages
Jakub Hoscilowicz
Pawel Pawlowski
Marcin Skorupa
Marcin Sowanski
Artur Janicki
23
4
0
03 Apr 2024
Exploring the Mystery of Influential Data for Mathematical Reasoning
Xinzhe Ni
Yeyun Gong
Zhibin Gou
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
47
9
0
01 Apr 2024
Token-Efficient Leverage Learning in Large Language Models
Yuanhao Zeng
Min Wang
Yihang Wang
Yingxia Shao
42
0
0
01 Apr 2024
SMART: Submodular Data Mixture Strategy for Instruction Tuning
Kowndinya Renduchintala
S. Bhatia
Ganesh Ramakrishnan
51
3
0
13 Mar 2024
ORPO: Monolithic Preference Optimization without Reference Model
Jiwoo Hong
Noah Lee
James Thorne
OSLM
42
213
0
12 Mar 2024
Common 7B Language Models Already Possess Strong Math Capabilities
Chen Li
Weiqi Wang
Jingcheng Hu
Yixuan Wei
Nanning Zheng
Han Hu
Zheng-Wei Zhang
Houwen Peng
ALM
LRM
45
78
0
07 Mar 2024
Yi: Open Foundation Models by 01.AI
01.AI
Alex Young
Bei Chen
Chao Li
...
Yue Wang
Yuxuan Cai
Zhenyu Gu
Zhiyuan Liu
Zonghong Dai
OSLM
LRM
150
512
0
07 Mar 2024
Do Large Language Model Understand Multi-Intent Spoken Language ?
Shangjian Yin
Peijie Huang
Yuhong Xu
Haojing Huang
Jiatian Chen
45
5
0
07 Mar 2024
DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
Shanghaoran Quan
MoE
OffRL
52
9
0
02 Mar 2024
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
Yuan Ge
Yilun Liu
Chi Hu
Weibin Meng
Shimin Tao
Xiaofeng Zhao
Hongxia Ma
Li Zhang
Hao Yang
Tong Xiao
ALM
42
27
0
28 Feb 2024
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Yao Mu
Junting Chen
Qinglong Zhang
Shoufa Chen
Qiaojun Yu
...
Wenhai Wang
Jifeng Dai
Yu Qiao
Mingyu Ding
Ping Luo
51
22
0
25 Feb 2024
Kuaiji: the First Chinese Accounting Large Language Model
Jiayuan Luo
Songhua Yang
Xiaoling Qiu
Panyu Chen
Yufei Nai
Wenxuan Zeng
Wentao Zhang
Xinke Jiang
RALM
ALM
38
1
0
21 Feb 2024
Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint
Xiaowei Yuan
Zhao Yang
Yequan Wang
Shengping Liu
Jun Zhao
Kang Liu
26
9
0
19 Feb 2024
Multi-Perspective Consistency Enhances Confidence Estimation in Large Language Models
Pei Wang
Yejie Wang
Muxi Diao
Keqing He
Guanting Dong
Weiran Xu
32
0
0
17 Feb 2024
DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning
Yejie Wang
Keqing He
Guanting Dong
Pei Wang
Weihao Zeng
...
Yutao Mou
Mengdi Zhang
Jingang Wang
Xunliang Cai
Weiran Xu
ALM
33
10
0
14 Feb 2024
Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets
Israel Abebe Azime
A. Tonja
Tadesse Destaw Belay
Mitiku Yohannes Fuge
A. Wassie
Eyasu Shiferaw Jada
Yonas Chanie
W. Sewunetie
Seid Muhie Yimam
27
1
0
12 Feb 2024
On Catastrophic Inheritance of Large Foundation Models
Hao Chen
Bhiksha Raj
Xing Xie
Jindong Wang
AI4CE
61
12
0
02 Feb 2024
Continual Learning for Large Language Models: A Survey
Tongtong Wu
Linhao Luo
Yuan-Fang Li
Shirui Pan
Thuy-Trang Vu
Gholamreza Haffari
CLL
LRM
KELM
47
104
0
02 Feb 2024
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent
Weizhou Shen
Chenliang Li
Hongzhan Chen
Ming Yan
Xiaojun Quan
Hehong Chen
Ji Zhang
Fei Huang
LLMAG
48
50
0
14 Jan 2024
Extending LLMs' Context Window with 100 Samples
Yikai Zhang
Junlong Li
Pengfei Liu
37
11
0
13 Jan 2024
LightHouse: A Survey of AGI Hallucination
Feng Wang
LRM
HILM
VLM
37
3
0
08 Jan 2024
LLaMA Pro: Progressive LLaMA with Block Expansion
Chengyue Wu
Yukang Gan
Yixiao Ge
Zeyu Lu
Jiahao Wang
Ye Feng
Ying Shan
Ping Luo
CLL
37
61
0
04 Jan 2024
DrugAssist: A Large Language Model for Molecule Optimization
Geyan Ye
Xibao Cai
Houtim Lai
Xing Wang
Junhong Huang
Longyue Wang
Wei Liu
Xian Zeng
60
26
0
28 Dec 2023
LARP: Language-Agent Role Play for Open-World Games
Ming Yan
Ruihao Li
Hao Zhang
Hao Wang
Zhilan Yang
Ji Yan
LLMAG
LM&Ro
AI4CE
35
17
0
24 Dec 2023
On Diversified Preferences of Large Language Model Alignment
Dun Zeng
Yong Dai
Pengyu Cheng
Longyue Wang
Tianhao Hu
Wanshun Chen
Nan Du
Zenglin Xu
ALM
43
16
0
12 Dec 2023
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT
Xiaoshuai Song
Keqing He
Pei Wang
Guanting Dong
Yutao Mou
Jingang Wang
Yunsen Xian
Xunliang Cai
Weiran Xu
LRM
47
14
0
16 Oct 2023
Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task
Guanting Dong
Jinxu Zhao
Tingfeng Hui
Daichi Guo
Wenlong Wan
...
Yueyan Qiu
Zhuoma Gongque
Keqing He
Zechen Wang
Weiran Xu
AAML
40
20
0
10 Oct 2023
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning
Chengpeng Li
Zheng Yuan
Hongyi Yuan
Guanting Dong
Keming Lu
Jiancan Wu
Chuanqi Tan
Xiang Wang
Chang Zhou
LRM
30
22
0
09 Oct 2023
InstructERC: Reforming Emotion Recognition in Conversation with a Retrieval Multi-task LLMs Framework
Shanglin Lei
Guanting Dong
Xiaoping Wang
Keheng Wang
Sirui Wang
60
2
0
21 Sep 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
275
2,575
0
06 Oct 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
417
12,150
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
450
8,699
0
28 Jan 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
221
1,664
0
15 Oct 2021
Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
Runxin Xu
Fuli Luo
Zhiyuan Zhang
Chuanqi Tan
Baobao Chang
Songfang Huang
Fei Huang
LRM
151
178
0
13 Sep 2021
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
266
4,532
0
23 Jan 2020