2106.09685
LoRA: Low-Rank Adaptation of Large Language Models
17 June 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
Github (11998★)
Papers citing "LoRA: Low-Rank Adaptation of Large Language Models"
50 / 6,911 papers shown
Title
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models
Longze Chen
Ziqiang Liu
Wanwei He
Yunshui Li
Run Luo
Min Yang
89
12
0
28 May 2024
Near-Infrared and Low-Rank Adaptation of Vision Transformers in Remote Sensing
Irem Ülkü
Ö. Ö. Tanriöver
Erdem Akagündüz
ViT
47
0
0
28 May 2024
Arithmetic Reasoning with LLM: Prolog Generation & Permutation
Xiaocheng Yang
Bingsen Chen
Yik-Cheung Tam
LRM
98
12
0
28 May 2024
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
Jiaxiang Li
Siliang Zeng
Hoi-To Wai
Chenliang Li
Alfredo García
Mingyi Hong
138
18
0
28 May 2024
Sparsity- and Hybridity-Inspired Visual Parameter-Efficient Fine-Tuning for Medical Diagnosis
Mingyuan Liu
Lu Xu
Shengnan Liu
Jicong Zhang
83
2
0
28 May 2024
More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs
Chengyuan Liu
Shihang Wang
Yangyang Kang
Lizhi Qing
Fubang Zhao
Changlong Sun
Kun Kuang
Leilei Gan
ELM
AILaw
CLL
79
7
0
28 May 2024
Diffusion Model Patching via Mixture-of-Prompts
Seokil Ham
Sangmin Woo
Jin-Young Kim
Hyojun Go
Byeongjun Park
Changick Kim
VLM
89
2
0
28 May 2024
Detection-Correction Structure via General Language Model for Grammatical Error Correction
Wei Li
Houfeng Wang
107
6
0
28 May 2024
Online Analytic Exemplar-Free Continual Learning with Large Models for Imbalanced Autonomous Driving Task
Huiping Zhuang
Di Fang
Kai Tong
Yuchen Liu
Huiping Zhuang
Xu Zhou
Cen Chen
CLL
87
3
0
28 May 2024
LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design
Rui Kong
Qiyang Li
Xinyu Fang
Qingtian Feng
Qingfeng He
Yazhu Dong
Weijun Wang
Yuanchun Li
Linghe Kong
Yunxin Liu
MoE
107
7
0
28 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
155
5
0
28 May 2024
Cross-Modal Safety Alignment: Is textual unlearning all you need?
Trishna Chakraborty
Erfan Shayegani
Zikui Cai
Nael B. Abu-Ghazaleh
M. Salman Asif
Yue Dong
Amit K. Roy-Chowdhury
Chengyu Song
88
18
0
27 May 2024
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters
Klaudia Bałazy
Mohammadreza Banaei
Karl Aberer
Jacek Tabor
93
34
0
27 May 2024
QUB-Cirdan at "Discharge Me!": Zero shot discharge letter generation by open-source LLM
Rui Guo
Greg Farnan
Niall McLaughlin
Barry Devereux
42
4
0
27 May 2024
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control
Litu Rout
Yujia Chen
Nataniel Ruiz
Abhishek Kumar
Constantine Caramanis
Sanjay Shakkottai
Wen-Sheng Chu
DiffM
101
26
0
27 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
173
103
0
27 May 2024
MindMerger: Efficient Boosting LLM Reasoning in non-English Languages
Zixian Huang
Wenhao Zhu
Gong Cheng
Lei Li
Fei Yuan
LRM
98
14
0
27 May 2024
Controllable Longer Image Animation with Diffusion Models
Qiang Wang
Minghua Liu
Junjun Hu
Fan Jiang
Mu Xu
VGen
82
0
0
27 May 2024
Efficient Ensembles Improve Training Data Attribution
Junwei Deng
Ting-Wei Li
Shichang Zhang
Jiaqi Ma
TDI
83
3
0
27 May 2024
Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning
Runqian Wang
Soumya Ghosh
David D. Cox
Diego Antognini
Aude Oliva
Rogerio Feris
Leonid Karlinsky
80
2
0
27 May 2024
From Text to Blueprint: Leveraging Text-to-Image Tools for Floor Plan Creation
Xiaoyu Li
Jonathan Benjamin
Xin Zhang
105
1
0
27 May 2024
CLAQ: Pushing the Limits of Low-Bit Post-Training Quantization for LLMs
Haoyu Wang
Bei Liu
Hang Shao
Bo Xiao
Ke Zeng
Guanglu Wan
Yanmin Qian
MQ
55
1
0
27 May 2024
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
116
15
0
27 May 2024
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings
Robert Wolfe
Isaac Slaughter
Bin Han
Bingbing Wen
Yiwei Yang
...
Bernease Herman
E. Brown
Zening Qu
Nicholas Weber
Bill Howe
107
8
0
27 May 2024
PromptFix: You Prompt and We Fix the Photo
Yongsheng Yu
Ziyun Zeng
Hang Hua
Jianlong Fu
Jiebo Luo
MLLM
DiffM
VLM
90
28
0
27 May 2024
CHESS: Contextual Harnessing for Efficient SQL Synthesis
Shayan Talaei
Mohammadreza Pourreza
Yu-Chen Chang
Azalia Mirhoseini
Amin Saberi
87
76
0
27 May 2024
Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective
Akiyoshi Tomihari
Issei Sato
75
4
0
27 May 2024
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee
Rajarshi Roy
Mengyao Xu
Jonathan Raiman
Mohammad Shoeybi
Bryan Catanzaro
Ming-Yu Liu
RALM
312
205
0
27 May 2024
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales
Ju-Seung Byun
Andrew Perrault
57
1
0
27 May 2024
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
Zhuoling Li
Xiaogang Xu
Zhenhua Xu
Sernam Lim
Hengshuang Zhao
LM&Ro
161
2
0
27 May 2024
Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization
Amir Saeidi
Shivanshu Verma
Aswin Rrv
Chitta Baral
85
5
0
26 May 2024
Mixture of Experts Using Tensor Products
Zhan Su
Fengran Mo
Prayag Tiwari
Benyou Wang
Jian-Yun Nie
J. Simonsen
MoE
MoMe
58
3
0
26 May 2024
Compressing Lengthy Context With UltraGist
Peitian Zhang
Zheng Liu
Shitao Xiao
Ninglu Shao
Qiwei Ye
Zhicheng Dou
51
4
0
26 May 2024
Protect-Your-IP: Scalable Source-Tracing and Attribution against Personalized Generation
Runyi Li
Xuanyu Zhang
Zhipei Xu
Yongbing Zhang
Jian Zhang
WIGM
90
4
0
26 May 2024
Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Shanghaoran Quan
82
4
0
26 May 2024
ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling
F. Babiloni
Alexandros Lattas
Jiankang Deng
Stefanos Zafeiriou
DiffM
100
4
0
26 May 2024
I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models
Wenqi Ouyang
Yi Dong
Lei Yang
Jianlou Si
Xingang Pan
VGen
DiffM
104
16
0
26 May 2024
LoQT: Low Rank Adapters for Quantized Training
Sebastian Loeschcke
M. Toftrup
M. Kastoryano
Serge Belongie
Vésteinn Snæbjarnarson
MQ
74
0
0
26 May 2024
Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors
Soumava Paul
Christopher Wewer
Bernt Schiele
J. E. Lenssen
3DGS
82
4
0
26 May 2024
M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought
Qiguang Chen
Libo Qin
Jin Zhang
Zhi Chen
Xiao Xu
Wanxiang Che
LRM
123
61
0
26 May 2024
GRAG: Graph Retrieval-Augmented Generation
Yuntong Hu
Zhihan Lei
Zhengwu Zhang
Bo Pan
Chen Ling
Liang Zhao
123
31
0
26 May 2024
RLSF: Fine-tuning LLMs via Symbolic Feedback
Piyush Jha
Prithwish Jana
Pranavkrishna Suresh
Arnav Arora
Vijay Ganesh
LRM
116
4
0
26 May 2024
Learning to Reason via Program Generation, Emulation, and Search
Nathaniel Weir
Muhammad Khalifa
Linlu Qiu
Orion Weller
Peter Clark
SyDa
ReLM
LRM
158
9
0
25 May 2024
LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters
Xinyu Zhou
Boris Knyazev
Alexia Jolicoeur-Martineau
Jie Fu
AI4CE
85
0
0
25 May 2024
ModelLock: Locking Your Model With a Spell
Yifeng Gao
Yuhua Sun
Xingjun Ma
Zuxuan Wu
Yu-Gang Jiang
VLM
98
1
0
25 May 2024
Generating clickbait spoilers with an ensemble of large language models
M. Woźny
Mateusz Lango
61
1
0
25 May 2024
TURNIP: A "Nondeterministic" GPU Runtime with CPU RAM Offload
Zhimin Ding
Jiawen Yao
Brianna Barrow
Tania Lorido-Botran
Christopher M. Jermaine
Yu-Shuen Tang
Jiehui Li
Xinyu Yao
Sleem Mahmoud Abdelghafar
Daniel Bourgeois
73
2
0
25 May 2024
Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection
Yun Zhu
Jia-Chen Gu
Caitlin Sikora
Ho Ko
Yinxiao Liu
...
Lei Shu
Liangchen Luo
Lei Meng
Bang Liu
Jindong Chen
RALM
99
19
0
25 May 2024
Mixture of In-Context Prompters for Tabular PFNs
Derek Xu
Olcay Cirit
Reza Asadi
Yizhou Sun
Wei Wang
109
15
0
25 May 2024
5W1H Extraction With Large Language Models
Yang Cao
Yangsong Lan
Feiyan Zhai
Piji Li
105
1
0
25 May 2024