LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

20 March 2024
Yaowei Zheng
Richong Zhang
Junhao Zhang
Yanhan Ye
Zheyan Luo
Zhangchi Feng
Yongqiang Ma

Papers citing "LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models"

50 / 246 papers shown
Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench
Ziyi Liu
Priyanka Dey
Zhenyu Zhao
Zitong Yu
Rahul Gupta
Yong-Jin Liu
Jieyu Zhao
33
0
0
01 Apr 2025
Surgical Action Planning with Large Language Models
Mengya Xu
Zhongzhen Huang
Jie Zhang
Xiaofan Zhang
Qi Dou
46
0
0
24 Mar 2025
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild
Weihao Zeng
Yuzhen Huang
Qian Liu
Wei Liu
Keqing He
Zejun Ma
Junxian He
OffRL
ReLM
LRM
91
38
0
24 Mar 2025
SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging
Aladin Djuhera
S. Kadhe
Farhan Ahmed
Syed Zawad
Holger Boche
MoMe
51
0
0
21 Mar 2025
VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning
Y. Tan
Chen Liu
Jingyuan Gao
Banghao Wu
Mingchen Li
...
Lingrong Zhang
Huiqun Yu
Guisheng Fan
Liang Hong
Bingxin Zhou
50
1
0
19 Mar 2025
Task-Specific Data Selection for Instruction Tuning via Monosemantic Neuronal Activations
Da Ma
Gonghu Shang
Zhi Chen
L. Qin
Yijie Luo
Lei Pan
Shuai Fan
Lu Chen
Kai Yu
43
0
0
19 Mar 2025
Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models
Mingming Peng
Zhendong Chen
Jie Yang
Jin Huang
Zhengqi Shi
Qihao Liu
Xinyu Li
Liang Gao
48
1
0
18 Mar 2025
RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving
Yujin Wang
Quanfeng Liu
Zhengxin Jiang
Tianyi Wang
Junfeng Jiao
Hongqing Chu
B. Gao
Hong Chen
60
1
0
18 Mar 2025
Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning
Hai-Long Sun
Zhun Sun
Houwen Peng
Han-Jia Ye
LRM
43
0
0
17 Mar 2025
Pensez: Less Data, Better Reasoning -- Rethinking French LLM
Huy Hoang Ha
ReLM
LRM
68
1
0
17 Mar 2025
General Table Question Answering via Answer-Formula Joint Generation
Zhongyuan Wang
Richong Zhang
Zhijie Nie
LMTD
149
0
0
16 Mar 2025
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing
Cheng Deng
Luoyang Sun
Jiwen Jiang
Yongcheng Zeng
Xinjian Wu
...
Haoyang Li
Lei Chen
Lionel M. Ni
Hongzhi Zhang
Jun Wang
168
0
0
15 Mar 2025
NeurIPS 2023 LLM Efficiency Fine-tuning Competition
Mark Saroufim
Yotam Perlitz
Leshem Choshen
Luca Antiga
Greg Bowyer
...
Ashvini Kumar Jindal
Pawan Kumar Rajpoot
Ankur Parikh
Joe Isaacson
Weiwei Yang
ELM
49
0
0
13 Mar 2025
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
Wenxuan Huang
Bohan Jia
Zijie Zhai
Shaosheng Cao
Zheyu Ye
Fei Zhao
Zhe Xu
Yao Hu
Shaohui Lin
MU
OffRL
LRM
MLLM
ReLM
VLM
59
41
0
09 Mar 2025
Agent models: Internalizing Chain-of-Action Generation into Reasoning models
Yuxiang Zhang
Yuqi Yang
Jiangming Shu
Xinyan Wen
Jitao Sang
LRM
LLMAG
LM&Ro
49
1
0
09 Mar 2025
GenAI for Simulation Model in Model-Based Systems Engineering
Lin Zhang
Y. Zhang
Dusit Niyato
Lei Ren
Pengfei Gu
Zhen Chen
Y. Laili
Wentong Cai
Agostino Bruzzone
AI4CE
36
0
0
09 Mar 2025
Adding Alignment Control to Language Models
Wenhong Zhu
Weinan Zhang
Rui Wang
57
0
0
06 Mar 2025
DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL
Haoyuan Ma
Yongliang Shen
Hengwei Liu
Wenqi Zhang
Haolei Xu
Qiuying Peng
Jun Wang
Weiming Lu
49
0
0
06 Mar 2025
Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment
Wen Yang
Junhong Wu
Chen Wang
Chengqing Zong
Junzhe Zhang
73
1
0
06 Mar 2025
Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
Yafu Li
Ronghao Zhang
Zhilin Wang
Huajian Zhang
Leyang Cui
Yongjing Yin
Tong Xiao
Yue Zhang
74
0
0
06 Mar 2025
How to Mitigate Overfitting in Weak-to-strong Generalization?
Junhao Shi
Qinyuan Cheng
Zhaoye Fei
Y. Zheng
Qipeng Guo
Xipeng Qiu
70
0
0
06 Mar 2025
COSINT-Agent: A Knowledge-Driven Multimodal Agent for Chinese Open Source Intelligence
Wentao Li
Congcong Wang
Xiaoxiao Cui
Zhi Liu
Wei Guo
Lizhen Cui
55
0
0
05 Mar 2025
RiskAgent: Autonomous Medical AI Copilot for Generalist Risk Prediction
Fenglin Liu
Jinge Wu
Hongjian Zhou
Xiao Gu
Soheila Molaei
A. Thakur
Lei A. Clifton
Honghan Wu
David A. Clifton
LM&MA
41
0
0
05 Mar 2025
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Rui Ye
Shuo Tang
Rui Ge
Yaxin Du
Zhenfei Yin
S. Chen
Jing Shao
LLMAG
90
1
0
05 Mar 2025
MPO: Boosting LLM Agents with Meta Plan Optimization
Weimin Xiong
Yifan Song
Qingxiu Dong
Bingchan Zhao
Feifan Song
Xun Wang
Sujian Li
LLMAG
81
0
0
04 Mar 2025
Towards Event Extraction with Massive Types: LLM-based Collaborative Annotation and Partitioning Extraction
Wenxuan Liu
Zehan Li
Long Bai
Yuxin Zuo
Daozhu Xu
Xiaolong Jin
J. Guo
Xueqi Cheng
61
1
0
04 Mar 2025
AskToAct: Enhancing LLMs Tool Use via Self-Correcting Clarification
Xuan Zhang
Yongliang Shen
Zhe Zheng
Linjuan Wu
Wenqi Zhang
Yuchen Yan
Qiuying Peng
Jun Wang
Weiming Lu
KELM
77
1
0
03 Mar 2025
CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom
Yisen Li
Lingfeng Yang
Wenxuan Shen
Pan Zhou
Yao Wan
Weiwei Lin
Danny Chen
70
0
0
03 Mar 2025
Smoothing Grounding and Reasoning for MLLM-Powered GUI Agents with Query-Oriented Pivot Tasks
Zongru Wu
Pengzhou Cheng
Zheng Wu
Tianjie Ju
Zhuosheng Zhang
Gongshen Liu
LRM
32
1
0
01 Mar 2025
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
Dawei Zhu
Xiyu Wei
Guangxiang Zhao
Wenhao Wu
Haosheng Zou
Junfeng Ran
Xun Wang
Lin Sun
Xiangzheng Zhang
Sujian Li
LRM
56
1
0
28 Feb 2025
Steering Dialogue Dynamics for Robustness against Multi-turn Jailbreaking Attacks
Hanjiang Hu
Alexander Robey
Changliu Liu
AAML
LLMSV
47
1
0
28 Feb 2025
Efficient Jailbreaking of Large Models by Freeze Training: Lower Layers Exhibit Greater Sensitivity to Harmful Content
Hongyuan Shen
Min Zheng
Jincheng Wang
Yang Zhao
44
0
0
28 Feb 2025
Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation
Kuang-Da Wang
Teng-Ruei Chen
Yu-Heng Hung
Shuoyang Ding
Yueh-Hua Wu
Yu-Chun Wang
Chao-Han Huck Yang
Wen-Chih Peng
Ping-Chun Hsieh
74
0
0
28 Feb 2025
SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers
Kechen Li
Wenqi Zhu
Coralia Cartis
Tianbo Ji
Shiwei Liu
ReLM
LRM
49
0
0
27 Feb 2025
Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time
Jiazheng Li
Yuxiang Zhou
Junru Lu
Gladys Tyen
Lin Gui
Cesare Aloisi
Yulan He
LRM
39
2
0
26 Feb 2025
Bián: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation
Zhouyu Jiang
Mengshu Sun
Qing Cui
Lei Liang
RALM
3DV
233
0
0
26 Feb 2025
Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Xiaofeng Wang
Z. Zhang
Jinguang Zheng
Yiming Ai
Rui Wang
45
1
0
25 Feb 2025
CuDIP: Enhancing Theorem Proving in LLMs via Curriculum Learning-based Direct Preference Optimization
Shuming Shi
Ruobing Zuo
Gaolei He
Jianlin Wang
Chenyang Xu
Zhengfeng Yang
60
0
0
25 Feb 2025
Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning
Xinghao Chen
Zhijing Sun
Wenjin Guo
Miaoran Zhang
Yanjun Chen
...
Hui Su
Yijie Pan
Dietrich Klakow
Wenjie Li
Xiaoyu Shen
LRM
56
5
0
25 Feb 2025
Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts
Zhenghao Liu
Xingsheng Zhu
Tianshuo Zhou
Xinyi Zhang
Xiaoyuan Yi
Yukun Yan
Yu Gu
Ge Yu
Maosong Sun
RALM
VLM
43
0
0
24 Feb 2025
Learning to Retrieve and Reason on Knowledge Graph through Active Self-Reflection
Han Zhang
Langshi Zhou
Hanfang Yang
LRM
RALM
ReLM
KELM
167
1
0
24 Feb 2025
CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale
Chenlong Wang
Zhaoyang Chu
Zhengxiang Cheng
Xuyi Yang
Kaiyue Qiu
Yao Wan
Zhou Zhao
Xuanhua Shi
Danny Chen
ALM
SyDa
43
0
0
23 Feb 2025
RAG-Optimized Tibetan Tourism LLMs: Enhancing Accuracy and Personalization
Jinhu Qi
Shuai Yan
Yibo Zhang
Wentao Zhang
R. L. Jin
Yihan Hu
Ke Wang
3DV
62
1
0
21 Feb 2025
IPAD: Inverse Prompt for AI Detection -- A Robust and Explainable LLM-Generated Text Detector
Zheng Chen
Yushi Feng
Changyang He
Yue Deng
Hongxi Pu
Bo-wen Li
DeLMO
47
1
0
21 Feb 2025
LESA: Learnable LLM Layer Scaling-Up
Yifei Yang
Zouying Cao
Xinbei Ma
Yao Yao
L. Qin
Z. Chen
Hai Zhao
64
0
0
20 Feb 2025
GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning
Sifan Zhou
Shuo Wang
Zhihang Yuan
Mingjia Shi
Yuzhang Shang
Dawei Yang
ALM
MQ
90
0
0
18 Feb 2025
Small Models Struggle to Learn from Strong Reasoners
Yuetai Li
Xiang Yue
Zhangchen Xu
Fengqing Jiang
Luyao Niu
Bill Yuchen Lin
Bhaskar Ramasubramanian
Radha Poovendran
LRM
46
12
0
17 Feb 2025
Diversity-Oriented Data Augmentation with Large Language Models
Zaitian Wang
Jinghan Zhang
Xinhao Zhang
Kunpeng Liu
Pengfei Wang
Yuanchun Zhou
80
1
0
17 Feb 2025
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
Fengqing Jiang
Zhangchen Xu
Yuetai Li
Luyao Niu
Zhen Xiang
Bo-wen Li
Bill Yuchen Lin
Radha Poovendran
KELM
ELM
LRM
83
14
0
17 Feb 2025
InsBank: Evolving Instruction Subset for Ongoing Alignment
Jiayi Shi
Yiwei Li
Shaoxiong Feng
Peiwen Yuan
X. U. Wang
...
Chuyi Tan
Boyuan Pan
Huan Ren
Yao Hu
Kan Li
ALM
92
0
0
17 Feb 2025