2408.04693
Cited By
Understanding the Performance and Estimating the Cost of LLM Fine-Tuning
8 August 2024
Yuchen Xia
Jiho Kim
Yuhan Chen
Haojie Ye
Souvik Kundu
Cong Hao
Nishil Talati
MoE
Papers citing "Understanding the Performance and Estimating the Cost of LLM Fine-Tuning"
17 / 17 papers shown
Title
LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades
Yanan Li
Fanxu Meng
Muhan Zhang
Shiai Zhu
Shangguang Wang
Mengwei Xu
MoMe
7
0
0
17 May 2025
The Hitchhikers Guide to Production-ready Trustworthy Foundation Model powered Software (FMware)
Kirill Vasilevski
Benjamin Rombaut
Gopi Krishnan Rajbahadur
G. Oliva
Keheliya Gallaba
...
Bouyan Chen
Kishanthan Thangarajah
Ahmed E. Hassan
Zhen Ming Jiang
22
0
0
15 May 2025
Investigating Task Arithmetic for Zero-Shot Information Retrieval
Marco Braga
Pranav Kasela
Alessandro Raganato
G. Pasi
RALM
71
0
0
01 May 2025
LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems
Zhengwu Liu
Carlos Rabat Villarreal
Mostafa Rahgouy
Amit Das
Zheng Zhang
Chang Ren
Dongji Feng
ReLM
LRM
56
0
0
03 Apr 2025
Operating Room Workflow Analysis via Reasoning Segmentation over Digital Twins
Yiqing Shen
Chenjia Li
Bohan Liu
Cheng-Yi Li
Tito Porras
Mathias Unberath
62
2
0
26 Mar 2025
Collaboration is all you need: LLM Assisted Safe Code Translation
Rabimba Karanjai
Sam Blackshear
Lei Xu
W. Shi
54
0
0
14 Mar 2025
OntologyRAG: Better and Faster Biomedical Code Mapping with Retrieval-Augmented Generation (RAG) Leveraging Ontology Knowledge Graphs and Large Language Models
Hui Feng
Yuntzu Yin
Emiliano Reynares
Jay Nanavati
60
0
0
26 Feb 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
49
1
0
28 Jan 2025
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models
Qinggang Zhang
Shengyuan Chen
Yuanchen Bei
Zheng Yuan
Huachi Zhou
Zijin Hong
Junnan Dong
Hao-Heng Chen
Yi-Ju Chang
Xiao Huang
3DV
78
8
0
21 Jan 2025
Software Performance Engineering for Foundation Model-Powered Software (FMware)
Haoxiang Zhang
Shi Chang
Arthur Leung
Kishanthan Thangarajah
Boyuan Chen
Hanan Lutfiyya
Ahmed E. Hassan
143
1
0
14 Nov 2024
LLMs: A Game-Changer for Software Engineers?
Md Asraful Haque
LLMAG
SyDa
36
0
0
01 Nov 2024
Accelerating Direct Preference Optimization with Prefix Sharing
Franklin Wang
Sumanth Hegde
36
0
0
27 Oct 2024
Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading
Avinash Maurya
Jie Ye
M. Rafique
Franck Cappello
Bogdan Nicolae
31
1
0
26 Oct 2024
Opportunities and Challenges of Generative-AI in Finance
Akshar Prabhu Desai
Ganesh Satish Mallya
Mohammad Luqman
Tejasvi Ravi
Nithya Kota
Pranjul Yadav
AIFin
47
2
0
21 Oct 2024
Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models
Longteng Zhang
Xiang Liu
Zeyu Li
Xinglin Pan
Peijie Dong
...
Rui Guo
Xin Wang
Qiong Luo
S. Shi
Xiaowen Chu
49
7
0
07 Nov 2023
Tutel: Adaptive Mixture-of-Experts at Scale
Changho Hwang
Wei Cui
Yifan Xiong
Ziyue Yang
Ze Liu
...
Joe Chau
Peng Cheng
Fan Yang
Mao Yang
Y. Xiong
MoE
118
112
0
07 Jun 2022
Mixture-of-Experts with Expert Choice Routing
Yanqi Zhou
Tao Lei
Hanxiao Liu
Nan Du
Yanping Huang
Vincent Zhao
Andrew M. Dai
Zhifeng Chen
Quoc V. Le
James Laudon
MoE
160
331
0
18 Feb 2022