How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

9 October 2023
Guanting Dong
Hongyi Yuan
Keming Lu
Chengpeng Li
Mingfeng Xue
Dayiheng Liu
Wei Wang
Zheng Yuan
Chang Zhou
Jingren Zhou
    LRM
    CLL

Papers citing "How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition"

50 / 95 papers shown
IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment
Chenlin Ming
Chendi Qu
Mengzhang Cai
Qizhi Pei
Zhuoshi Pan
Yu Li
Xiaoming Duan
Lijun Wu
Zeang Sheng
17
0
0
19 May 2025
PsyMem: Fine-grained psychological alignment and Explicit Memory Control for Advanced Role-Playing LLMs
Xilong Cheng
Yunxiao Qin
Yuting Tan
Zhengnan Li
Ye Wang
Hongjiang Xiao
Yuan Zhang
17
0
0
19 May 2025
DIMT25@ICDAR2025: HW-TSC's End-to-End Document Image Machine Translation System Leveraging Large Vision-Language Model
Zhanglin Wu
Tengfei Song
Ning Xie
Feiyu Xiong
Pengfei Li
Shuang Wu
Chong Li
Junhao Zhu
Hao Yang
41
0
0
24 Apr 2025
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
Jabez Magomere
Elena Kochkina
Samuel Mensah
Simerjot Kaur
Charese Smiley
36
1
0
22 Apr 2025
From Reviews to Dialogues: Active Synthesis for Zero-Shot LLM-based Conversational Recommender System
Rohan Surana
Junda Wu
Zhouhang Xie
Yu Xia
Harald Steck
Dawen Liang
Nathan Kallus
Julian McAuley
37
0
0
21 Apr 2025
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
Yicheng Chen
Yining Li
Kai Hu
Zerun Ma
Haochen Ye
Kai Chen
34
0
0
18 Apr 2025
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation
Juzheng Zhang
Jiacheng You
Ashwinee Panda
Tom Goldstein
MoMe
53
1
0
10 Apr 2025
Beyond Accuracy: The Role of Calibration in Self-Improving Large Language Models
Liangjie Huang
Dawei Li
Huan Liu
Lu Cheng
LRM
41
0
0
03 Apr 2025
Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words
Hongyu Su
Yifeng Gao
Yifan Ding
Jie Zhang
52
0
0
10 Mar 2025
Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment
Matthew DosSantos DiSorbo
Harang Ju
Sinan Aral
ELM
LRM
55
0
0
04 Mar 2025
Can Large Language Models Extract Customer Needs as well as Professional Analysts?
Artem Timoshenko
Chengfeng Mao
J. Hauser
ELM
60
0
0
25 Feb 2025
BA-LoRA: Bias-Alleviating Low-Rank Adaptation to Mitigate Catastrophic Inheritance in Large Language Models
Yupeng Chang
Yi-Ju Chang
Yuan Wu
AI4CE
ALM
95
0
0
24 Feb 2025
Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning
Xuetao Ma
Wenbin Jiang
Hua Huang
LRM
73
1
0
21 Feb 2025
Multi-Attribute Steering of Language Models via Targeted Intervention
Duy Nguyen
Archiki Prasad
Elias Stengel-Eskin
Joey Tianyi Zhou
LLMSV
110
0
0
18 Feb 2025
Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models
Yingqian Cui
Pengfei He
Jingying Zeng
Hui Liu
Xianfeng Tang
...
Zhen Li
Suhang Wang
Yue Xing
Jiliang Tang
Qi He
LRM
52
9
0
18 Feb 2025
Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarcity
Dylan Zhang
Justin Wang
Tianran Sun
58
1
0
17 Feb 2025
A Dynamic and High-Precision Method for Scenario-Based HRA Synthetic Data Collection in Multi-Agent Collaborative Environments Driven by LLMs
Xingyu Xiao
Peng Chen
Qianqian Jia
Jiejuan Tong
Jingang Liang
Haitao Wang
77
0
0
16 Jan 2025
MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning
Yufei Ma
Zihan Liang
Huangyu Dai
Bin Chen
D. Gao
...
Linbo Jin
Wen Jiang
Guannan Zhang
Xiaoyan Cai
Libin Yang
MoE
MoMe
99
1
0
10 Dec 2024
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
Xiaoye Qu
Daize Dong
Xuyang Hu
Tong Zhu
Weigao Sun
Yu Cheng
MoE
95
11
0
24 Nov 2024
VersaTune: An Efficient Data Composition Framework for Training Multi-Capability LLMs
Keer Lu
Keshi Zhao
Zheng Liang
Zhuoran Zhang
Da Pan
...
Xin Wu
Zenan Zhou
Guosheng Dong
Bin Cui
Wentao Zhang
VLM
CLL
35
0
0
18 Nov 2024
Meta-Learning Adaptable Foundation Models
Jacob L. Block
Sundararajan Srinivasan
Liam Collins
Aryan Mokhtari
Sanjay Shakkottai
28
0
0
29 Oct 2024
Fine-Tuning LLMs for Reliable Medical Question-Answering Services
Ali Anaissi
Ali Braytee
Junaid Akram
LM&MA
AI4MH
36
2
0
21 Oct 2024
CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning
Huimu Yu
Xing Wu
Weidong Yin
Debing Zhang
Songlin Hu
LRM
36
5
0
03 Oct 2024
FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization
Mingye Zhu
Yi Liu
Quan Wang
Junbo Guo
Zhendong Mao
39
1
0
01 Oct 2024
Supervised Fine-Tuning Achieve Rapid Task Adaption Via Alternating Attention Head Activation Patterns
Yang Zhao
Li Du
Xiao Ding
Kai Xiong
Ting Liu
Bing Qin
25
2
0
24 Sep 2024
HW-TSC's Submission to the CCMT 2024 Machine Translation Tasks
Zhanglin Wu
Yuanchang Luo
Daimeng Wei
Jiawei Zheng
Bin Wei
...
Jiaxin Guo
Shaojun Li
Mengli Zhu
Ning Xie
Hao Yang
45
1
0
23 Sep 2024
Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency
Hanyu Zhao
Li Du
Yiming Ju
Chengwei Wu
Tengfei Pan
35
5
0
11 Sep 2024
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data
Yejie Wang
Keqing He
Dayuan Fu
Zhuoma Gongque
Heyang Xu
...
Muxi Diao
Jingang Wang
Hao Fei
Xunliang Cai
Weiran Xu
ALM
SyDa
48
3
0
05 Sep 2024
Leveraging Web-Crawled Data for High-Quality Fine-Tuning
Jing Zhou
Chenglin Jiang
Wei Shen
Xiao Zhou
Xiaonan He
ALM
50
3
0
15 Aug 2024
P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for Optimizing LLM Training
Yingxuan Yang
Huayi Wang
Muning Wen
Weinan Zhang
52
0
0
10 Aug 2024
ChipExpert: The Open-Source Integrated-Circuit-Design-Specific Large Language Model
Ning Xu
Zhaoyang Zhang
Lei Qi
Wensuo Wang
Chao Zhang
...
Mengyao Zhao
Junbo Liu
Yufan Song
Xin Geng
Jun Yang
28
0
0
26 Jul 2024
A Survey on Employing Large Language Models for Text-to-SQL Tasks
Liang Shi
Zhengju Tang
Nan Zhang
Xiaotong Zhang
Zhi Yang
36
22
0
21 Jul 2024
Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Thought
Jue Chen
Yongxin Deng
Xihe Qiu
Weidi Xu
Chao Qu
Wei Chu
Yinghui Xu
Yuan Qi
LRM
AI4CE
LM&Ro
49
2
0
18 Jul 2024
Weak-to-Strong Reasoning
Yuqing Yang
Yan Ma
Pengfei Liu
LRM
39
14
0
18 Jul 2024
Qwen2 Technical Report
An Yang
Baosong Yang
Binyuan Hui
Jian Xu
Bowen Yu
...
Yuqiong Liu
Zeyu Cui
Zhenru Zhang
Zhifang Guo
Zhi-Wei Fan
OSLM
VLM
MU
60
815
0
15 Jul 2024
BAPO: Base-Anchored Preference Optimization for Personalized Alignment in Large Language Models
Gihun Lee
Minchan Jeong
Yujin Kim
Hojung Jung
Jaehoon Oh
Sangmook Kim
Se-Young Yun
43
1
0
30 Jun 2024
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation
Guanting Dong
Yutao Zhu
Chenghao Zhang
Zechen Wang
Zhicheng Dou
Ji-Rong Wen
RALM
51
10
0
26 Jun 2024
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
Ashwinee Panda
Berivan Isik
Xiangyu Qi
Sanmi Koyejo
Tsachy Weissman
Prateek Mittal
MoMe
47
15
0
24 Jun 2024
WARP: On the Benefits of Weight Averaged Rewarded Policies
Alexandre Ramé
Johan Ferret
Nino Vieillard
Robert Dadashi
Léonard Hussenot
Pierre-Louis Cedoz
Pier Giuseppe Sessa
Sertan Girgin
Arthur Douillard
Olivier Bachem
62
14
0
24 Jun 2024
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
Amrith Rajagopal Setlur
Saurabh Garg
Xinyang Geng
Naman Garg
Virginia Smith
Aviral Kumar
47
48
0
20 Jun 2024
CityGPT: Empowering Urban Spatial Cognition of Large Language Models
Jie Feng
Yuwei Du
Tianhui Liu
Siqi Guo
Yuming Lin
Yong Li
47
13
0
20 Jun 2024
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models
Guanting Dong
Keming Lu
Chengpeng Li
Tingyu Xia
Bowen Yu
Chang Zhou
Jingren Zhou
SyDa
ALM
LRM
55
15
0
19 Jun 2024
Aqulia-Med LLM: Pioneering Full-Process Open-Source Medical Language Models
Lulu Zhao
Weihao Zeng
Xiaofeng Shi
Hua Zhou
Donglin Hao
Yonghua Lin
LM&MA
48
4
0
18 Jun 2024
How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment
Heyan Huang
Yinghao Li
Huashan Sun
Yu Bai
Yang Gao
52
3
0
17 Jun 2024
Self and Cross-Model Distillation for LLMs: Effective Methods for Refusal Pattern Alignment
Jie Li
Yi Liu
Chongyang Liu
Xiaoning Ren
Ling Shi
Weisong Sun
Yinxing Xue
37
0
0
17 Jun 2024
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Xiaoshuai Song
Muxi Diao
Guanting Dong
Zhengyang Wang
Yujia Fu
...
Yejie Wang
Zhuoma Gongque
Jianing Yu
Qiuna Tan
Weiran Xu
ELM
60
11
0
12 Jun 2024
Sparsity-Accelerated Training for Large Language Models
Da Ma
Lu Chen
Pengyu Wang
Hongshen Xu
Hanqi Li
Liangtai Sun
Su Zhu
Shuai Fan
Kai Yu
LRM
33
0
0
03 Jun 2024
Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training
Maximillian Chen
Ruoxi Sun
Sercan O. Arik
Tomas Pfister
LLMAG
50
6
0
31 May 2024
From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers
Dylan Zhang
Justin Wang
Francois Charton
38
0
0
30 May 2024
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
Zifan Song
Yudong Wang
Wenwei Zhang
Kuikun Liu
Chengqi Lyu
...
Qipeng Guo
Hang Yan
Dahua Lin
Kai-xiang Chen
Cairong Zhao
SyDa
46
2
0
29 May 2024