LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models


20 March 2024
Yaowei Zheng
Richong Zhang
Junhao Zhang
Yanhan Ye
Zheyan Luo
Zhangchi Feng
Yongqiang Ma

Papers citing "LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models"

50 / 246 papers shown
Small Models Struggle to Learn from Strong Reasoners
Yuetai Li
Xiang Yue
Zhangchen Xu
Fengqing Jiang
Luyao Niu
Bill Yuchen Lin
Bhaskar Ramasubramanian
Radha Poovendran
LRM
46
12
0
17 Feb 2025
SURGE: On the Potential of Large Language Models as General-Purpose Surrogate Code Executors
Bohan Lyu
Siqiao Huang
Zichen Liang
Qi-An Sun
Jiaming Zhang
ELM
LRM
60
0
0
16 Feb 2025
Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training
Yao-Ching Yu
Tsun-Han Chiang
Cheng-Wei Tsai
Chien-Ming Huang
Wen-Kwang Tsao
62
5
0
16 Feb 2025
RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation
C. Zhou
Xinyu Zhang
Dandan Song
Xiancai Chen
Wanli Gu
Huipeng Ma
Yuhang Tian
M. Zhang
Linmei Hu
63
1
0
13 Feb 2025
Typhoon T1: An Open Thai Reasoning Model
Pittawat Taveekitworachai
Potsawee Manakul
Kasima Tharnpipitchai
Kunat Pipatanakul
OffRL
LRM
102
0
0
13 Feb 2025
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM
Zhi-Hua Zhou
Kun-Yang Yu
Shi-Yu Tian
Jiang-Xin Shi
Xiao-Wen Yang
Pengxiao Song
Yi-Xuan Jin
Lan-Zhe Guo
Yu-Feng Li
ELM
AILaw
55
1
0
10 Feb 2025
C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation
Guoxin Chen
Minpeng Liao
Peiying Yu
Dingmin Wang
Zile Qiao
Chao Yang
Xin Zhao
Kai Fan
58
1
0
10 Feb 2025
OntoTune: Ontology-Driven Self-training for Aligning Large Language Models
Zhiqiang Liu
Chengtao Gan
Junjie Wang
Yuhang Zhang
Zhongpu Bo
Mengshu Sun
H. Chen
Wen Zhang
65
0
0
08 Feb 2025
Leveraging Reasoning with Guidelines to Elicit and Utilize Knowledge for Enhancing Safety Alignment
Haoyu Wang
Zeyu Qin
Li Shen
Xueqian Wang
Minhao Cheng
Dacheng Tao
99
2
0
06 Feb 2025
Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis
Lin Yuan
Jun Xu
Honghao Gui
Mengshu Sun
Qing Cui
Lei Liang
Jun Zhou
AI4CE
208
0
0
06 Feb 2025
STAIR: Improving Safety Alignment with Introspective Reasoning
Y. Zhang
Siyuan Zhang
Yao Huang
Zeyu Xia
Zhengwei Fang
Xiao Yang
Ranjie Duan
Dong Yan
Yinpeng Dong
Jun Zhu
LRM
LLMSV
56
3
0
04 Feb 2025
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Maohao Shen
Guangtao Zeng
Zhenting Qi
Zhang-Wei Hong
Zhenfang Chen
Wei Lu
G. Wornell
Subhro Das
David D. Cox
Chuang Gan
LLMAG
LRM
171
6
0
04 Feb 2025
Preference Leakage: A Contamination Problem in LLM-as-a-judge
Dawei Li
Renliang Sun
Yue Huang
Ming Zhong
Bohan Jiang
J. Han
Xuzhi Zhang
Wei Wang
Huan Liu
65
11
0
03 Feb 2025
MorphBPE: A Morpho-Aware Tokenizer Bridging Linguistic Complexity for Efficient LLM Training Across Morphologies
Ehsaneddin Asgari
Yassine El Kheir
Mohammad Ali Sadraei Javaheri
58
0
0
02 Feb 2025
GuardReasoner: Towards Reasoning-based LLM Safeguards
Yue Liu
Hongcheng Gao
Shengfang Zhai
Jun-Xiong Xia
Tianyi Wu
Zhiwei Xue
Y. Chen
Kenji Kawaguchi
Jiaheng Zhang
Bryan Hooi
AI4TS
LRM
131
14
0
30 Jan 2025
CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering
Yumeng Wang
Zhiyuan Fan
Q. Wang
May Fung
Heng Ji
80
1
0
30 Jan 2025
Speech Translation Refinement using Large Language Models
Huaixia Dou
Xinyu Tian
Xinglin Lyu
Jie Zhu
Junhui Li
Lifan Guo
149
0
0
28 Jan 2025
StringLLM: Understanding the String Processing Capability of Large Language Models
Xilong Wang
Hao Fu
Jindong Wang
Neil Zhenqiang Gong
51
0
0
28 Jan 2025
MDEval: Evaluating and Enhancing Markdown Awareness in Large Language Models
Zhongpu Chen
Y. Liu
Long Shi
Zhi-Jie Wang
Xingyan Chen
Yu Zhao
Fuji Ren
46
0
0
28 Jan 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
46
8
0
23 Jan 2025
HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor
Zihui Wu
Haichang Gao
Jiacheng Luo
Zhaoxiang Liu
41
0
0
23 Jan 2025
HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja
Shri Kiran Srinivasan
Haneul Yoo
Jiho Jin
Kyunghyun Cho
Alice H. Oh
AI4TS
VLM
32
0
0
21 Jan 2025
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
Yafu Li
Zhilin Wang
Tingchen Fu
Ganqu Cui
Sen Yang
Yu Cheng
45
1
0
21 Jan 2025
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
Chaoqi Wang
Zhuokai Zhao
Yibo Jiang
Zhaorun Chen
Chen Zhu
...
Jiayi Liu
Lizhu Zhang
Xiangjun Fan
Hao Ma
Sinong Wang
77
3
0
17 Jan 2025
A Dynamic and High-Precision Method for Scenario-Based HRA Synthetic Data Collection in Multi-Agent Collaborative Environments Driven by LLMs
Xingyu Xiao
Peng Chen
Qianqian Jia
Jiejuan Tong
Jingang Liang
Haitao Wang
77
0
0
16 Jan 2025
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning
Zhongzhen Huang
Gui Geng
Shengyi Hua
Zhen Huang
Haoyang Zou
S. Zhang
Pengfei Liu
Xiaofan Zhang
LRM
38
10
0
11 Jan 2025
LogLM: From Task-based to Instruction-based Automated Log Analysis
Yilun Liu
Yuhe Ji
Shimin Tao
Minggui He
Weibin Meng
Shenglin Zhang
Yongqian Sun
Yuming Xie
Boxing Chen
Hao Yang
47
2
0
10 Jan 2025
When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages
Archchana Sindhujan
Diptesh Kanojia
Constantin Orasan
Shenbin Qian
38
1
0
08 Jan 2025
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
Zhangyang Qi
Zhixiong Zhang
Ye Fang
Jiaqi Wang
Hengshuang Zhao
83
6
0
02 Jan 2025
GaLore+: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection
Xutao Liao
Shaohui Li
Yuhui Xu
Zhi Li
Y. Liu
You He
VLM
59
2
0
31 Dec 2024
BaiJia: A Large-Scale Role-Playing Agent Corpus of Chinese Historical Characters
Ting Bai
Jiazheng Kang
Jiayang Fan
AI4CE
42
2
0
28 Dec 2024
scReader: Prompting Large Language Models to Interpret scRNA-seq Data
Cong Li
Qingqing Long
Yuanchun Zhou
Meng Xiao
33
1
0
24 Dec 2024
Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval
Luo Ji
Feixiang Guo
Teng Chen
Qingqing Gu
Xiaoyu Wang
...
Peng Yu
Yue Zhao
Hongyang Lei
Zhonglin Jiang
Yong Chen
RALM
LRM
99
0
0
21 Dec 2024
From General to Specific: Tailoring Large Language Models for Personalized Healthcare
Ruize Shi
Hong Huang
Wei Zhou
Kehan Yin
Kai Zhao
Yun Zhao
LM&MA
AI4MH
79
0
0
20 Dec 2024
Outcome-Refining Process Supervision for Code Generation
Zhuohao Yu
Weizheng Gu
Yidong Wang
Zhengran Zeng
Jindong Wang
Wei Ye
Shikun Zhang
LRM
89
4
0
19 Dec 2024
PsyDT: Using LLMs to Construct the Digital Twin of Psychological Counselor with Personalized Counseling Style for Psychological Counseling
Haojie Xie
Yirong Chen
Xiaofen Xing
Jingkai Lin
Xiangmin Xu
OffRL
82
2
0
18 Dec 2024
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Reasoning
Hongbin Zhang
K. Chen
Xuefeng Bai
Yang Xiang
Min Zhang
79
0
0
17 Dec 2024
Empowering LLMs to Understand and Generate Complex Vector Graphics
Ximing Xing
Juncheng Hu
Guotao Liang
Jing Zhang
Dong Xu
Qian Yu
92
7
0
15 Dec 2024
MLD-EA: Check and Complete Narrative Coherence by Introducing Emotions and Actions
Jinming Zhang
Yunfei Long
77
0
0
03 Dec 2024
Adapting Large Language Models to Log Analysis with Interpretable Domain Knowledge
Yuhe Ji
Yilun Liu
Feiyu Yao
Minggui He
Shimin Tao
...
Xinhua Yang
Weibin Meng
Yuming Xie
Boxing Chen
Hao Yang
90
3
0
02 Dec 2024
On Domain-Specific Post-Training for Multimodal Large Language Models
Daixuan Cheng
Shaohan Huang
Ziyu Zhu
Xintong Zhang
Wayne Xin Zhao
Zhongzhi Luan
Bo Dai
Zhenliang Zhang
VLM
102
2
0
29 Nov 2024
Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning
Zhu Xu
Zhiqiang Zhao
Zihan Zhang
Yuchi Liu
Quanwei Shen
Fei Liu
Yu Kuang
Jian He
Conglin Liu
83
1
0
26 Nov 2024
IterIS: Iterative Inference-Solving Alignment for LoRA Merging
Hongxu Chen
Runshi Li
Bowei Zhu
Zhen Wang
Long Chen
MoMe
98
0
0
21 Nov 2024
ORID: Organ-Regional Information Driven Framework for Radiology Report Generation
Tiancheng Gu
Kaicheng Yang
Xiang An
Ziyong Feng
Dongnan Liu
Weidong Cai
74
1
0
20 Nov 2024
Teaching VLMs to Localize Specific Objects from In-context Examples
Sivan Doveh
Nimrod Shabtay
Wei Lin
Eli Schwartz
Hilde Kuehne
...
Leonid Karlinsky
James Glass
Assaf Arbelle
S. Ullman
Muhammad Jehanzeb Mirza
VLM
103
1
0
20 Nov 2024
Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions
Moran Yanuka
Assaf Ben-Kish
Yonatan Bitton
Idan Szpektor
Raja Giryes
VLM
47
2
0
13 Nov 2024
Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models
Xiaojun Wu
Junxi Liu
Huanyi Su
Zhouchi Lin
Yiyan Qi
...
Fuwei Wang
Saizhuo Wang
Fengrui Hua
Jia Li
Jian Guo
50
0
0
09 Nov 2024
Smart-LLaMA: Two-Stage Post-Training of Large Language Models for Smart Contract Vulnerability Detection and Explanation
Lei Yu
Shiqi Chen
Hang Yuan
Peng Wang
Zhirong Huang
Jingyang Zhang
Chenjie Shen
Fengjun Zhang
Li Yang
Jiajia Ma
36
2
0
09 Nov 2024
Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs
Shan Zhong
Jiahao Zeng
Yongxin Yu
Bohong Lin
36
1
0
09 Nov 2024
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Bohan Lyu
Yadi Cao
Duncan Watson-Parris
Leon Bergen
Taylor Berg-Kirkpatrick
Rose Yu
61
3
0
01 Nov 2024