Understanding the Performance and Estimating the Cost of LLM Fine-Tuning

Understanding the Performance and Estimating the Cost of LLM Fine-Tuning

8 August 2024

Cong

Papers citing "Understanding the Performance and Estimating the Cost of LLM Fine-Tuning"

17 / 17 papers shown

Title
LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades Yanan Li Fanxu Meng Muhan Zhang Shiai Zhu Shangguang Wang Mengwei Xu MoMe 7 0 0 17 May 2025
The Hitchhikers Guide to Production-ready Trustworthy Foundation Model powered Software (FMware) Kirill Vasilevski Benjamin Rombaut Gopi Krishnan Rajbahadur G. Oliva Keheliya Gallaba ... Bouyan Chen Kishanthan Thangarajah Ahmed E. Hassan Zhen Ming Jiang 22 0 0 15 May 2025
Investigating Task Arithmetic for Zero-Shot Information Retrieval Marco Braga Pranav Kasela Alessandro Raganato G. Pasi RALM 71 0 0 01 May 2025
LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems Zhengwu Liu Carlos Rabat Villarreal Mostafa Rahgouy Amit Das Zheng Zhang Chang Ren Dongji Feng ReLM LRM 56 0 0 03 Apr 2025
Operating Room Workflow Analysis via Reasoning Segmentation over Digital Twins Yiqing Shen Chenjia Li Bohan Liu Cheng-Yi Li Tito Porras Mathias Unberath 62 2 0 26 Mar 2025
Collaboration is all you need: LLM Assisted Safe Code Translation Rabimba Karanjai Sam Blackshear Lei Xu W. Shi 54 0 0 14 Mar 2025
OntologyRAG: Better and Faster Biomedical Code Mapping with Retrieval-Augmented Generation (RAG) Leveraging Ontology Knowledge Graphs and Large Language Models Hui Feng Yuntzu Yin Emiliano Reynares Jay Nanavati 60 0 0 26 Feb 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap Gopi Krishnan Rajbahadur G. Oliva Dayi Lin Ahmed E. Hassan 49 1 0 28 Jan 2025
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models Qinggang Zhang Shengyuan Chen Yuanchen Bei Zheng Yuan Huachi Zhou Zijin Hong Junnan Dong Hao-Heng Chen Yi-Ju Chang Xiao Huang 3DV 78 8 0 21 Jan 2025
Software Performance Engineering for Foundation Model-Powered Software (FMware) Haoxiang Zhang Shi Chang Arthur Leung Kishanthan Thangarajah Boyuan Chen Hanan Lutfiyya Ahmed E. Hassan 143 1 0 14 Nov 2024
LLMs: A Game-Changer for Software Engineers? Md Asraful Haque LLMAG SyDa 36 0 0 01 Nov 2024
Accelerating Direct Preference Optimization with Prefix Sharing Franklin Wang Sumanth Hegde 36 0 0 27 Oct 2024
Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading Avinash Maurya Jie Ye M. Rafique Franck Cappello Bogdan Nicolae 31 1 0 26 Oct 2024
Opportunities and Challenges of Generative-AI in Finance Akshar Prabhu Desai Ganesh Satish Mallya Mohammad Luqman Tejasvi Ravi Nithya Kota Pranjul Yadav AIFin 47 2 0 21 Oct 2024
Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models Longteng Zhang Xiang Liu Zeyu Li Xinglin Pan Peijie Dong ... Rui Guo Xin Wang Qiong Luo S. Shi Xiaowen Chu 49 7 0 07 Nov 2023
Tutel: Adaptive Mixture-of-Experts at Scale Changho Hwang Wei Cui Yifan Xiong Ziyue Yang Ze Liu ... Joe Chau Peng Cheng Fan Yang Mao Yang Y. Xiong MoE 118 112 0 07 Jun 2022
Mixture-of-Experts with Expert Choice Routing Yan-Quan Zhou Tao Lei Han-Chu Liu Nan Du Yanping Huang Vincent Zhao Andrew M. Dai Zhifeng Chen Quoc V. Le James Laudon MoE 160 331 0 18 Feb 2022