2106.09685
v2 (latest)
LoRA: Low-Rank Adaptation of Large Language Models
17 June 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
Github (11998★)
Papers citing
"LoRA: Low-Rank Adaptation of Large Language Models"
50 / 6,909 papers shown
Title
GPT4All: An Ecosystem of Open Source Compressed Language Models
Yuvanesh Anand
Zach Nussbaum
Adam Treat
Aaron Miller
Richard Guo
Ben Schmidt
GPT4All Community
Brandon Duderstadt
Andriy Mulyar
27
20
0
06 Nov 2023
Safurai-Csharp: Harnessing Synthetic Data to improve language-specific Code LLM
Davide Cifarelli
Leonardo Boiardi
Alessandro Puppo
Leon Jovanovic
SyDa
64
1
0
06 Nov 2023
CogVLM: Visual Expert for Pretrained Language Models
Weihan Wang
Qingsong Lv
Wenmeng Yu
Wenyi Hong
Ji Qi
...
Bin Xu
Juanzi Li
Yuxiao Dong
Ming Ding
Jie Tang
VLM
MLLM
176
517
0
06 Nov 2023
QualEval: Qualitative Evaluation for Model Improvement
Vishvak Murahari
Ameet Deshpande
Peter Clark
Tanmay Rajpurohit
Ashish Sabharwal
Karthik Narasimhan
Ashwin Kalyan
63
5
0
06 Nov 2023
GQKVA: Efficient Pre-training of Transformers by Grouping Queries, Keys, and Values
Farnoosh Javadi
Walid Ahmed
Habib Hajimolahoseini
Foozhan Ataiefard
Mohammad Hassanpour
Saina Asani
Austin Wen
Omar Mohamed Awad
Kangling Liu
Yang Liu
VLM
109
8
0
06 Nov 2023
AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs
Yann Hicke
Anmol Agarwal
Qianou Ma
Paul Denny
AI4Ed
88
24
0
05 Nov 2023
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE
Zeren Chen
Ziqin Wang
Zhen Wang
Huayang Liu
Zhen-fei Yin
Si Liu
Lu Sheng
Wanli Ouyang
Yu Qiao
Jing Shao
MoE
85
8
0
05 Nov 2023
Task Arithmetic with LoRA for Continual Learning
Rajas Chitale
Ankit Vaidya
Aditya Kane
Archana Ghotkar
93
17
0
04 Nov 2023
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning
Jing Pan
Jian Wu
Yashesh Gaur
S. Sivasankaran
Zhuo Chen
Shujie Liu
Jinyu Li
ELM
86
29
0
03 Nov 2023
Automating Governing Knowledge Commons and Contextual Integrity (GKC-CI) Privacy Policy Annotations with Large Language Models
Jake Chanenson
Madison Pickering
Noah J. Apthorpe
34
1
0
03 Nov 2023
A Simple and Efficient Baseline for Data Attribution on Images
Vasu Singla
Pedro Sandoval-Segura
Micah Goldblum
Jonas Geiping
Tom Goldstein
FAtt
100
4
0
03 Nov 2023
ProSG: Using Prompt Synthetic Gradients to Alleviate Prompt Forgetting of RNN-like Language Models
Haotian Luo
Kunming Wu
Cheng Dai
Sixian Ding
Xinhao Chen
37
1
0
03 Nov 2023
BoschAI @ PLABA 2023: Leveraging Edit Operations in End-to-End Neural Sentence Simplification
Valentin Knappich
Simon Razniewski
Annemarie Friedrich
113
1
0
03 Nov 2023
TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese Medicine
Guoxing Yang
Jianyu Shi
Zan Wang
Xiaohong Liu
Guangyu Wang
34
22
0
03 Nov 2023
The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing
Shen Nie
Hanzhong Guo
Cheng Lu
Yuhao Zhou
Chenyu Zheng
Chongxuan Li
DiffM
134
43
0
02 Nov 2023
Making Harmful Behaviors Unlearnable for Large Language Models
Xin Zhou
Yi Lu
Ruotian Ma
Tao Gui
Qi Zhang
Xuanjing Huang
MU
84
12
0
02 Nov 2023
Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts
Thomas Palmeira Ferraz
Marcely Zanon Boito
Caroline Brun
Vassilina Nikoulina
77
13
0
02 Nov 2023
Multimodal Foundation Models for Zero-shot Animal Species Recognition in Camera Trap Images
Zalan Fabian
Zhongqi Miao
Chunyuan Li
Yuanhan Zhang
Ziwei Liu
...
Laura Siabatto
Andrés Link
Pablo Arbelaez
Rahul Dodhia
J. L. Ferres
98
11
0
02 Nov 2023
Task-Agnostic Low-Rank Adapters for Unseen English Dialects
Zedian Xiao
William B. Held
Yanchen Liu
Diyi Yang
102
9
0
02 Nov 2023
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models
Xiaoyue Wang
Xin Liu
Lijie Wang
Yaoxiang Wang
Jinsong Su
Hua Wu
74
2
0
01 Nov 2023
Continuous Training and Fine-tuning for Domain-Specific Language Models in Medical Question Answering
Zhen Guo
Yining Hua
LM&MA
CLL
ALM
AI4MH
65
5
0
01 Nov 2023
ChipNeMo: Domain-Adapted LLMs for Chip Design
Mingjie Liu
Teodor-Dumitru Ene
Robert M. Kirby
Chris Cheng
N. Pinckney
...
Pratik P Suthar
Varun Tej
Walker J. Turner
Kaizhe Xu
Haoxin Ren
184
164
0
31 Oct 2023
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B
Pranav M. Gade
Simon Lermen
Charlie Rogers-Smith
Jeffrey Ladish
ALM
AI4MH
92
27
0
31 Oct 2023
BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text
Aarohi Srivastava
David Chiang
77
7
0
31 Oct 2023
Learning From Mistakes Makes LLM Better Reasoner
Shengnan An
Zexiong Ma
Zeqi Lin
Nanning Zheng
Jian-Guang Lou
Weizhu Chen
LRM
120
82
0
31 Oct 2023
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Simon Lermen
Charlie Rogers-Smith
Jeffrey Ladish
ALM
77
92
0
31 Oct 2023
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Ruizhe Shi
Yuyao Liu
Yanjie Ze
Simon S. Du
Huazhe Xu
OffRL
RALM
120
23
0
31 Oct 2023
Dealing with Structure Constraints in Evolutionary Pareto Set Learning
Xi Lin
Xiao-Yan Zhang
Zhiyuan Yang
Qingfu Zhang
141
1
0
31 Oct 2023
Interactive Multi-fidelity Learning for Cost-effective Adaptation of Language Model with Sparse Human Supervision
Jiaxin Zhang
Zhuohang Li
Kamalika Das
Kumar Sricharan
91
2
0
31 Oct 2023
Improving Prompt Tuning with Learned Prompting Layers
Wei Zhu
Ming Tan
VLM
117
1
0
31 Oct 2023
Emotional Theory of Mind: Bridging Fast Visual Processing with Slow Linguistic Reasoning
Yasaman Etesam
Özge Nilay Yalçin
Chuxuan Zhang
Angelica Lim
105
2
0
30 Oct 2023
BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing
Hieu Tran
Zhichao Yang
Zonghai Yao
Hong-ye Yu
ALM
LM&MA
90
27
0
30 Oct 2023
Herd: Using multiple, smaller LLMs to match the performances of proprietary, large LLMs via an intelligent composer
S. N. Hari
Matt Thomson
55
0
0
30 Oct 2023
Promise: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models
Hao Li
Han Liu
Dewei Hu
Jiacheng Wang
I. Oguz
MedIm
55
22
0
30 Oct 2023
A Survey on Knowledge Editing of Neural Networks
Vittorio Mazzia
Alessandro Pedrani
Andrea Caciolai
Kay Rottmann
Davide Bernardi
KELM
125
25
0
30 Oct 2023
When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations
Aleksandar Petrov
Philip Torr
Adel Bibi
VPVLM
82
28
0
30 Oct 2023
Text-to-3D with Classifier Score Distillation
Xin Yu
Yuanchen Guo
Yangguang Li
Ding Liang
Song-Hai Zhang
Xiaojuan Qi
DiffM
117
87
0
30 Oct 2023
CoLLM: Integrating Collaborative Embeddings into Large Language Models for Recommendation
Yang Zhang
Fuli Feng
Jizhi Zhang
Keqin Bao
Qifan Wang
Xiangnan He
97
88
0
30 Oct 2023
PACuna: Automated Fine-Tuning of Language Models for Particle Accelerators
Antonin Sulc
Raimund Kammering
Annika Eichler
T. Wilksen
97
3
0
29 Oct 2023
Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models
Hai Wang
Xiaoyu Xiang
Yuchen Fan
Jing-Hao Xue
155
29
0
28 Oct 2023
Foundational Models in Medical Imaging: A Comprehensive Survey and Future Vision
Bobby Azad
Reza Azad
Sania Eskandari
Afshin Bozorgpour
Amirhossein Kazerouni
I. Rekik
Dorit Merhof
VLM
MedIm
146
68
0
28 Oct 2023
Fine-Tuning Language Models Using Formal Methods Feedback
Yunhao Yang
N. Bhatt
Tyler Ingebrand
William Ward
Steven Carr
Zhangyang Wang
Ufuk Topcu
69
9
0
27 Oct 2023
Personas as a Way to Model Truthfulness in Language Models
Nitish Joshi
Javier Rando
Abulhair Saparov
Najoung Kim
He He
HILM
123
34
0
27 Oct 2023
Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey
Weixu Zhang
Yifei Wang
Yuanfeng Song
Victor Junqiu Wei
Yuxing Tian
Yiyan Qi
Jonathan H. Chan
Raymond Chi-Wing Wong
Haiqin Yang
LMTD
81
20
0
27 Oct 2023
PockEngine: Sparse and Efficient Fine-tuning in a Pocket
Ligeng Zhu
Lanxiang Hu
Ji Lin
Wei-Chen Wang
Wei-Ming Chen
Chuang Gan
Song Han
49
23
0
26 Oct 2023
Orchestration of Emulator Assisted Mobile Edge Tuning for AI Foundation Models: A Multi-Agent Deep Reinforcement Learning Approach
Wen-li Yu
Terence Jie Chua
Junfeng Zhao
49
2
0
26 Oct 2023
FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation Models with Mobile Edge Computing
Terence Jie Chua
Wen-li Yu
Junfeng Zhao
Kwok-Yan Lam
FedML
47
5
0
26 Oct 2023
Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge
Tanel Alumäe
Jiaming Kong
Daniil Robnikov
34
2
0
26 Oct 2023
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors
You-Ming Chang
Chen Yeh
Wei-Chen Chiu
Ning Yu
VPVLM
VLM
154
30
0
26 Oct 2023
Low-Dimensional Gradient Helps Out-of-Distribution Detection
Yingwen Wu
Tao Li
Xinwen Cheng
Jie Yang
Xiaolin Huang
OODD
129
5
0
26 Oct 2023