Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.09685
Cited By
v1
v2 (latest)
LoRA: Low-Rank Adaptation of Large Language Models
17 June 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (11998★)
Papers citing
"LoRA: Low-Rank Adaptation of Large Language Models"
50 / 6,911 papers shown
Title
LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems
Tasnim Ahmed
Salimur Choudhury
73
12
0
02 Mar 2024
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning
Alexander Scarlatos
Digory Smith
Simon Woodhead
Andrew Lan
OffRL
80
12
0
02 Mar 2024
Analysis of Privacy Leakage in Federated Large Language Models
Minh Nhat Vu
Truc D. T. Nguyen
Tre' R. Jeter
My T. Thai
84
7
0
02 Mar 2024
Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Jianheng Huang
Leyang Cui
Ante Wang
Chengyi Yang
Xinting Liao
Linfeng Song
Junfeng Yao
Jinsong Su
KELM
CLL
90
46
0
02 Mar 2024
STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models
Linhai Zhang
Jialong Wu
Deyu Zhou
Guoqiang Xu
98
5
0
02 Mar 2024
MuseGraph: Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining
Yanchao Tan
Hang Lv
Xin Huang
Jiawei Zhang
Shiping Wang
Carl Yang
112
12
0
02 Mar 2024
Face Swap via Diffusion Model
Feifei Wang
DiffM
66
1
0
02 Mar 2024
FaiMA: Feature-aware In-context Learning for Multi-domain Aspect-based Sentiment Analysis
Songhua Yang
Xinke Jiang
Hanjie Zhao
Wenxuan Zeng
Hongde Liu
Yuxiang Jia
91
6
0
02 Mar 2024
Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks
Fakhraddin Alwajih
El Moatez Billah Nagoudi
Gagan Bhatia
Abdelrahman Mohamed
Muhammad Abdul-Mageed
VLM
LRM
83
16
0
01 Mar 2024
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Vithursan Thangarasa
Mahmoud Salem
Shreyas Saxena
Kevin Leong
Joel Hestness
Sean Lie
MedIm
81
1
0
01 Mar 2024
ATP: Enabling Fast LLM Serving via Attention on Top Principal Keys
Yue Niu
Saurav Prakash
Salman Avestimehr
51
1
0
01 Mar 2024
Differentially Private Knowledge Distillation via Synthetic Text Generation
James Flemings
Murali Annavaram
SyDa
85
14
0
01 Mar 2024
Bias Mitigation in Fine-tuning Pre-trained Models for Enhanced Fairness and Efficiency
Yixuan Zhang
Feng Zhou
52
3
0
01 Mar 2024
TempCompass: Do Video LLMs Really Understand Videos?
Yuanxin Liu
Shicheng Li
Yi Liu
Yuxiang Wang
Shuhuai Ren
Lei Li
Sishuo Chen
Xu Sun
Lu Hou
VLM
162
141
0
01 Mar 2024
Cross-Lingual Learning vs. Low-Resource Fine-Tuning: A Case Study with Fact-Checking in Turkish
R. Çekinel
Pinar Senkul
Çağrı Çöltekin
83
2
0
01 Mar 2024
Crimson: Empowering Strategic Reasoning in Cybersecurity through Large Language Models
Jiandong Jin
Bowen Tang
Mingxuan Ma
Xiao Liu
Yunfei Wang
Qingnan Lai
Jia Yang
Changling Zhou
76
6
0
01 Mar 2024
Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign Strategy
Jieyong Kim
Ryang Heo
Yongsik Seo
SeongKu Kang
Jinyoung Yeo
Dongha Lee
ReLM
LRM
59
8
0
01 Mar 2024
Never-Ending Behavior-Cloning Agent for Robotic Manipulation
Wenqi Liang
Gan Sun
Qian He
Yu Ren
Jiahua Dong
Yang Cong
LM&Ro
87
1
0
01 Mar 2024
Large Convolutional Model Tuning via Filter Subspace
Wei Chen
Zichen Miao
Qiang Qiu
229
4
0
01 Mar 2024
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Penghao Zhao
Hailin Zhang
Qinhan Yu
Zhengren Wang
Yunteng Geng
Fangcheng Fu
Ling Yang
Wentao Zhang
Jie Jiang
Tengjiao Wang
3DV
290
286
0
29 Feb 2024
Curiosity-driven Red-teaming for Large Language Models
Zhang-Wei Hong
Idan Shenfeld
Tsun-Hsuan Wang
Yung-Sung Chuang
Aldo Pareja
James R. Glass
Akash Srivastava
Pulkit Agrawal
LRM
116
45
0
29 Feb 2024
Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge
Ansh Arora
Xuanli He
Maximilian Mozes
Srinibas Swain
Mark Dras
Xingliang Yuan
SILM
MoMe
AAML
121
14
0
29 Feb 2024
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
Hao Li
Ying Chen
Yifei Chen
Wenxian Yang
Bowen Ding
Yuchen Han
Liansheng Wang
Rongshan Yu
107
19
0
29 Feb 2024
SEED: Customize Large Language Models with Sample-Efficient Adaptation for Code Generation
Xue Jiang
Yihong Dong
Zhi Jin
Ge Li
VLM
117
6
0
29 Feb 2024
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition
Feng Lu
Xiangyuan Lan
Lijun Zhang
Dongmei Jiang
Yaowei Wang
Chun Yuan
101
37
0
29 Feb 2024
Improving Legal Judgement Prediction in Romanian with Long Text Encoders
Mihai Masala
Traian Rebedea
Horia Velicu
AILaw
84
3
0
29 Feb 2024
Teaching Large Language Models an Unseen Language on the Fly
Chen Zhang
Xiao Liu
Jiuheng Lin
Yansong Feng
96
21
0
29 Feb 2024
AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging
Yiran Zhao
Wenxuan Zhang
Huiming Wang
Kenji Kawaguchi
Lidong Bing
MoMe
99
23
0
29 Feb 2024
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning
Weijieying Ren
Xinlong Li
Lei Wang
Tianxiang Zhao
Wei Qin
CLL
KELM
117
39
0
29 Feb 2024
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
Xupeng Miao
Gabriele Oliaro
Xinhao Cheng
Vineeth Kada
Ruohan Gao
...
April Yang
Yingcheng Wang
Mengdi Wu
Colin Unger
Zhihao Jia
MoE
180
11
0
29 Feb 2024
Grounding Language Models for Visual Entity Recognition
Zilin Xiao
Ming Gong
Paola Cascante-Bonilla
Xingyao Zhang
Jie Wu
Vicente Ordonez
VLM
97
10
0
28 Feb 2024
The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA
Yiming Li
Zhao Zhang
31
1
0
28 Feb 2024
FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes
Ziying Pan
Kun Wang
Gang Li
Feihong He
Yongxuan Lai
92
1
0
28 Feb 2024
Exploration of Adapter for Noise Robust Automatic Speech Recognition
Hao Shi
Tatsuya Kawahara
84
5
0
28 Feb 2024
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation
Shicheng Xu
Liang Pang
Mo Yu
Fandong Meng
Huawei Shen
Xueqi Cheng
Jie Zhou
RALM
79
15
0
28 Feb 2024
Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis
Zhenxiao Cheng
Jie Zhou
Wen Wu
Qin Chen
Liang He
77
0
0
28 Feb 2024
Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models
Derong Xu
Ziheng Zhang
Zhihong Zhu
Zhenxi Lin
Qidong Liu
...
Wanyu Wang
Yuyang Ye
Xiangyu Zhao
Yefeng Zheng
Enhong Chen
KELM
86
10
0
28 Feb 2024
TroubleLLM: Align to Red Team Expert
Zhuoer Xu
Jianping Zhang
Shiwen Cui
Changhua Meng
Weiqiang Wang
90
1
0
28 Feb 2024
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization
Yoshiki Masuyama
Gordon Wichern
François Germain
Zexu Pan
Sameer Khurana
Chiori Hori
Jonathan Le Roux
74
3
0
27 Feb 2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Zekun Qi
Runpei Dong
Shaochen Zhang
Haoran Geng
Chunrui Han
Zheng Ge
Li Yi
Kaisheng Ma
204
63
0
27 Feb 2024
Structure-Guided Adversarial Training of Diffusion Models
Ling Yang
Haotian Qian
Zhilong Zhang
Jingwei Liu
Tengjiao Wang
96
12
0
27 Feb 2024
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
Shyam Marjit
Harshit Singh
Nityanand Mathur
Sayak Paul
Chia-Mu Yu
Pin-Yu Chen
DiffM
77
7
0
27 Feb 2024
Unsupervised multiple choices question answering via universal corpus
Qin Zhang
Hao Ge
Xiaojun Chen
Menglu Fang
OffRL
91
2
0
27 Feb 2024
ACTrack: Adding Spatio-Temporal Condition for Visual Object Tracking
Yushan Han
Kaer Huang
78
1
0
27 Feb 2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Biao Zhang
Zhongtao Liu
Colin Cherry
Orhan Firat
LRM
119
160
0
27 Feb 2024
Transparent Image Layer Diffusion using Latent Transparency
Lvmin Zhang
Maneesh Agrawala
128
51
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
263
103
0
27 Feb 2024
Benchmarking LLMs on the Semantic Overlap Summarization Task
John Salvador
Naman Bansal
Mousumi Akter
Souvik Sarkar
Anupam Das
S. Karmaker
83
2
0
26 Feb 2024
A Survey of Large Language Models in Cybersecurity
Gabriel de Jesus Coelho da Silva
Carlos Becker Westphall
69
6
0
26 Feb 2024
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Benjamin Bergner
Andrii Skliar
Amelie Royer
Tijmen Blankevoort
Yuki Markus Asano
B. Bejnordi
127
7
0
26 Feb 2024
Previous
1
2
3
...
94
95
96
...
137
138
139
Next