LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

20 March 2024
Yaowei Zheng
Richong Zhang
Junhao Zhang
Yanhan Ye
Zheyan Luo
Zhangchi Feng
Yongqiang Ma

Papers citing "LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models"

50 / 246 papers shown
RESTOR: Knowledge Recovery through Machine Unlearning
Keivan Rezaei
Khyathi Raghavi Chandu
S. Feizi
Yejin Choi
Faeze Brahman
Abhilasha Ravichander
KELM
CLL
MU
58
0
0
31 Oct 2024
Language Models can Self-Lengthen to Generate Long Texts
Shanghaoran Quan
Tianyi Tang
Bowen Yu
An Yang
Dayiheng Liu
Bofei Gao
Jianhong Tu
Yichang Zhang
Jingren Zhou
Junyang Lin
ALM
SyDa
55
6
0
31 Oct 2024
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
Yiruo Cheng
Kelong Mao
Ziliang Zhao
Guanting Dong
Hongjin Qian
Yongkang Wu
Tetsuya Sakai
Ji-Rong Wen
Zhicheng Dou
RALM
37
2
0
30 Oct 2024
MatExpert: Decomposing Materials Discovery by Mimicking Human Experts
Qianggang Ding
Santiago Miret
Bang Liu
MoE
32
7
0
26 Oct 2024
GCoder: Improving Large Language Model for Generalized Graph Problem Solving
Qifan Zhang
Xiaobin Hong
Jianheng Tang
Nuo Chen
Yuhan Li
Wenzhong Li
Jing Tang
Jia Li
OffRL
AI4CE
LRM
37
1
0
24 Oct 2024
Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model
Wenhong Zhu
Zhiwei He
Xiaofeng Wang
Pengfei Liu
Rui Wang
OSLM
56
3
0
24 Oct 2024
An Adaptive Framework for Generating Systematic Explanatory Answer in Online Q&A Platforms
Ziyang Chen
Xiaobin Wang
Yong Jiang
Jinzhi Liao
Pengjun Xie
Fei Huang
Xiang Zhao
155
0
0
23 Oct 2024
Markov Chain of Thought for Efficient Mathematical Reasoning
Wen Yang
Kai Fan
Minpeng Liao
LRM
39
4
0
23 Oct 2024
Atomic Fact Decomposition Helps Attributed Question Answering
Zhichao Yan
J. Wang
Jiaoyan Chen
Xiaoli Li
Ru Li
Jeff Z. Pan
KELM
HILM
36
0
0
22 Oct 2024
DEAN: Deactivating the Coupled Neurons to Mitigate Fairness-Privacy Conflicts in Large Language Models
Chen Qian
Dongrui Liu
Jie Zhang
Yong Liu
Jing Shao
37
1
0
22 Oct 2024
Understanding and Alleviating Memory Consumption in RLHF for LLMs
Jin Zhou
Hanmei Yang
Steven Tang
Mingcan Xiang
Hui Guan
Tongping Liu
36
0
0
21 Oct 2024
Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection
Jianfei He
Lilin Wang
Jiaying Wang
Zhenyu Liu
Hongbin Na
Zehua Wang
Wei Wang
Qi Chen
30
2
0
21 Oct 2024
Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation
Shaonan Wu
Shuai Lu
Y. Gong
Nan Duan
Ping Wei
AIMat
45
0
0
21 Oct 2024
Boosting LLM Translation Skills without General Ability Loss via Rationale Distillation
Junhong Wu
Yang Zhao
Yangyifan Xu
Bing Liu
Chengqing Zong
CLL
40
1
0
17 Oct 2024
MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Yanyue Xie
Zhi Zhang
Ding Zhou
Cong Xie
Ziang Song
Xin Liu
Yanzhi Wang
Xue Lin
An Xu
LLMAG
40
3
0
15 Oct 2024
Self-adaptive Multimodal Retrieval-Augmented Generation
Wenjia Zhai
VLM
42
0
0
15 Oct 2024
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models
Jiaxin Zhang
Wendi Cui
Yiran Huang
Kamalika Das
Sricharan Kumar
KELM
SyDa
27
2
0
12 Oct 2024
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation
Guanting Dong
Xiaoshuai Song
Bo Li
Runqi Qiao
Zhicheng Dou
Ji-Rong Wen
3DV
86
4
0
12 Oct 2024
Rethinking Data Selection at Scale: Random Selection is Almost All You Need
Tingyu Xia
Bowen Yu
K. Dang
An Yang
Yuan Wu
Yuan Tian
Yi-Ju Chang
Junyang Lin
ALM
49
5
0
12 Oct 2024
Language Imbalance Driven Rewarding for Multilingual Self-improving
Wen Yang
Junhong Wu
Chen Wang
Chengqing Zong
Junzhe Zhang
ALM
LRM
68
4
0
11 Oct 2024
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Yougang Lyu
Lingyong Yan
Zihan Wang
Dawei Yin
Pengjie Ren
Maarten de Rijke
Z. Z. Ren
60
6
0
10 Oct 2024
Learning Evolving Tools for Large Language Models
Guoxin Chen
Zhong Zhang
Xin Cong
Fangda Guo
Yesai Wu
Yankai Lin
Wenzheng Feng
Yasheng Wang
KELM
52
1
0
09 Oct 2024
Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content?
Shenbin Qian
Constantin Orasan
Diptesh Kanojia
Félix do Carmo
ELM
27
0
0
08 Oct 2024
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan
Elias Stengel-Eskin
Jaemin Cho
Joey Tianyi Zhou
VGen
43
1
0
08 Oct 2024
Gamified crowd-sourcing of high-quality data for visual fine-tuning
Shashank Yadav
Rohan Tomar
Garvit Jain
Chirag Ahooja
Shubham Chaudhary
Charles Elkan
33
0
0
05 Oct 2024
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement
Xiangyu Peng
Congying Xia
Xinyi Yang
Caiming Xiong
Chien-Sheng Wu
Chen Xing
LRM
48
2
0
03 Oct 2024
Speculative Coreset Selection for Task-Specific Fine-tuning
Xiaoyu Zhang
Juan Zhai
Shiqing Ma
Chao Shen
Tianlin Li
Weipeng Jiang
Yang Liu
30
1
0
02 Oct 2024
House of Cards: Massive Weights in LLMs
Jaehoon Oh
Seungjun Shin
Dokwan Oh
37
1
0
02 Oct 2024
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
Hossein Rajabzadeh
A. Jafari
Aman Sharma
Benyamin Jami
Hyock Ju Kwon
Ali Ghodsi
Boxing Chen
Mehdi Rezagholizadeh
30
0
0
22 Sep 2024
Thought-Path Contrastive Learning via Premise-Oriented Data Augmentation for Logical Reading Comprehension
Chenxu Wang
Ping Jian
Zhen Yang
LRM
22
0
0
22 Sep 2024
Large Language Model Should Understand Pinyin for Chinese ASR Error Correction
Yuang Li
Xiaosong Qiao
Xiaofeng Zhao
Huan Zhao
Wei Tang
Min Zhang
Hao Yang
43
1
0
20 Sep 2024
SKIntern: Internalizing Symbolic Knowledge for Distilling Better CoT Capabilities into Small Language Models
Huanxuan Liao
Shizhu He
Yupu Hao
Xiang Li
Yuanzhe Zhang
Kang Liu
Jun Zhao
LRM
44
0
0
20 Sep 2024
Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks
Huanxuan Liao
Shizhu He
Yao Xu
Yuanzhe Zhang
Kang Liu
Jun Zhao
LRM
53
3
0
20 Sep 2024
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Chen Zhang
Dading Chong
Feng Jiang
Chengguang Tang
Anningzhe Gao
Guohua Tang
Haizhou Li
ALM
33
2
0
20 Sep 2024
Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models
Haoran Ye
Yuhang Xie
Yuanyi Ren
Hanjun Fang
Xin Zhang
Guojie Song
LM&MA
37
1
0
18 Sep 2024
MindGuard: Towards Accessible and Sitgma-free Mental Health First Aid via Edge LLM
Sijie Ji
Xinzhe Zheng
Jiawei Sun
Renqi Chen
Wei Gao
Mani Srivastava
AI4MH
34
3
0
16 Sep 2024
LLM-as-BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning
Jicong Ao
Fan Wu
Yansong Wu
Abdalla Swikir
Sami Haddadin
37
5
0
16 Sep 2024
LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems
Hakan T. Otal
M. Abdullah Canbaz
16
1
0
12 Sep 2024
Full-text Error Correction for Chinese Speech Recognition with Large Language Model
Zhiyuan Tang
Dong Wang
Shen Huang
Shidong Shang
AuLLM
27
1
0
12 Sep 2024
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio
Ningyuan Xi
Yetao Wu
Kun Fan
Teng Chen
Qingqing Gu
...
Jinxian Qu
Chenxi Liu
Zhonglin Jiang
Yong Chen
Luo Ji
ALM
40
0
0
10 Sep 2024
Differentially Private Kernel Density Estimation
Erzhi Liu
Jerry Yao-Chieh Hu
Alex Reneau
Zhao Song
Han Liu
66
3
0
03 Sep 2024
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
Dian Yu
Baolin Peng
Ye Tian
Linfeng Song
Haitao Mi
Dong Yu
ALM
LRM
49
1
0
28 Aug 2024
What Do You Want? User-centric Prompt Generation for Text-to-image Synthesis via Multi-turn Guidance
Yilun Liu
Minggui He
Feiyu Yao
Yuhe Ji
Shimin Tao
...
Jian Gao
Li Zhang
Hao Yang
Boxing Chen
Osamu Yoshie
46
5
0
23 Aug 2024
Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Vivek Iyer
Bhavitvya Malik
Pavel Stepachev
Pinzhen Chen
Barry Haddow
Alexandra Birch
ALM
34
3
0
23 Aug 2024
Minor SFT loss for LLM fine-tune to increase performance and reduce model deviation
Shiming Xie
Hong Chen
Fred Yu
Zeye Sun
Xiuyu Wu
30
0
0
20 Aug 2024
Minor DPO reject penalty to increase training robustness
Shiming Xie
Hong Chen
Fred Yu
Zeye Sun
Xiuyu Wu
Yingfan Hu
35
2
0
19 Aug 2024
Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
Dingwei Chen
Feiteng Fang
Shiwen Ni
Feng Liang
Ruifeng Xu
Min Yang
Chengming Li
HILM
26
1
0
16 Aug 2024
Fine-tuning LLMs for Autonomous Spacecraft Control: A Case Study Using Kerbal Space Program
Alejandro Carrasco
Victor Rodriguez-Fernandez
Richard Linares
33
1
0
16 Aug 2024
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm
Yiming Liang
Ge Zhang
Xingwei Qu
Tianyu Zheng
Jiawei Guo
...
Jiaheng Liu
Chenghua Lin
Lei Ma
Wenhao Huang
Jiajun Zhang
ALM
51
5
0
15 Aug 2024
Understanding the Performance and Estimating the Cost of LLM Fine-Tuning
Yuchen Xia
Jiho Kim
Yuhan Chen
Haojie Ye
Souvik Kundu
Cong Hao
Nishil Talati
MoE
35
20
0
08 Aug 2024