Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.09685
Cited By
v1
v2 (latest)
LoRA: Low-Rank Adaptation of Large Language Models
17 June 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (11998★)
Papers citing
"LoRA: Low-Rank Adaptation of Large Language Models"
50 / 6,911 papers shown
Title
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore
Mariana-Iuliana Georgescu
J. A. Justo
T. Johansen
Andreea-Iuliana Ionescu
Radu Tudor Ionescu
84
0
0
14 Apr 2024
JaFIn: Japanese Financial Instruction Dataset
Kota Tanabe
Masahiro Suzuki
Hiroki Sakaji
Itsuki Noda
76
1
0
14 Apr 2024
TransformerFAM: Feedback attention is working memory
Dongseong Hwang
Weiran Wang
Zhuoyuan Huo
K. Sim
P. M. Mengibar
121
12
0
14 Apr 2024
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts
Yusheng Liao
Shuyang Jiang
Yu Wang
Yanfeng Wang
MoE
116
5
0
13 Apr 2024
Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies
Benjue Weng
LM&MA
130
10
0
13 Apr 2024
Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives
Yidan Liu
Jun Yue
Shaobo Xia
Pedram Ghamisi
Weiying Xie
Leyuan Fang
DiffM
97
17
0
13 Apr 2024
PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos
Qi Zhao
M. Salman Asif
Zhan Ma
65
4
0
13 Apr 2024
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
Junchi Wang
Lei Ke
MLLM
LRM
VLM
85
29
0
12 Apr 2024
CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models
Je-Yong Lee
Donghyun Lee
Genghan Zhang
Mo Tiwari
Azalia Mirhoseini
73
21
0
12 Apr 2024
LaSagnA: Language-based Segmentation Assistant for Complex Queries
Cong Wei
Haoxian Tan
Yujie Zhong
Yujiu Yang
Lin Ma
123
17
0
12 Apr 2024
Dataset Reset Policy Optimization for RLHF
Jonathan D. Chang
Wenhao Zhan
Owen Oertell
Kianté Brantley
Dipendra Kumar Misra
Jason D. Lee
Wen Sun
OffRL
119
24
0
12 Apr 2024
MoE-FFD: Mixture of Experts for Generalized and Parameter-Efficient Face Forgery Detection
Chenqi Kong
Anwei Luo
Song Xia
Yi Yu
Haoliang Li
Zengwei Zheng
Shiqi Wang
Alex C. Kot
MoE
142
8
0
12 Apr 2024
The Integration of Semantic and Structural Knowledge in Knowledge Graph Entity Typing
Muzhi Li
Minda Hu
Irwin King
Ho-fung Leung
78
8
0
12 Apr 2024
Struggle with Adversarial Defense? Try Diffusion
Yujie Li
Yanbin Wang
Peiyue Li
Bin Liu
Jianguo Sun
Yifan Jia
Wenrui Ma
DiffM
70
1
0
12 Apr 2024
Improving Continuous Sign Language Recognition with Adapted Image Models
Lianyu Hu
Tongkai Shi
Liqing Gao
Zekang Liu
Wei Feng
VLM
93
5
0
12 Apr 2024
Reducing hallucination in structured outputs via Retrieval-Augmented Generation
Patrice Béchard
Orlando Marquez Ayala
LLMAG
106
61
0
12 Apr 2024
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
Sina Hajimiri
Ismail Ben Ayed
Jose Dolz
VLM
123
25
0
12 Apr 2024
FLoRA: Enhancing Vision-Language Models with Parameter-Efficient Federated Learning
Duy Phuong Nguyen
J. P. Muñoz
Ali Jannesari
VLM
77
9
0
12 Apr 2024
AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees
William Fleshman
Aleem Khan
Marc Marone
Benjamin Van Durme
CLL
KELM
124
4
0
12 Apr 2024
Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
Yiwen Tang
Ray Zhang
Jiaming Liu
Zoey Guo
Dong Wang
...
Bin Zhao
Shanghang Zhang
Peng Gao
Hongsheng Li
Xuelong Li
101
13
0
11 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
Chong Chen
92
82
0
11 Apr 2024
Taming Stable Diffusion for Text to 360° Panorama Image Generation
Cheng Zhang
Qianyi Wu
Camilo Cruz Gambardella
Xiaoshui Huang
Dinh Q. Phung
Wanli Ouyang
Jianfei Cai
MDE
92
9
0
11 Apr 2024
DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation
Anna C. Doris
Daniele Grandi
Ryan Tomich
Md Ferdous Alam
Hyunmin Cheong
Faez Ahmed
71
20
0
11 Apr 2024
MindBridge: A Cross-Subject Brain Decoding Framework
Shizun Wang
Songhua Liu
Zhenxiong Tan
Xinchao Wang
AI4CE
145
29
0
11 Apr 2024
On Training Data Influence of GPT Models
Qingyi Liu
Yekun Chai
Shuohuan Wang
Yu Sun
Qiwei Peng
Keze Wang
Hua Wu
TDI
AI4CE
82
7
0
11 Apr 2024
Realistic Continual Learning Approach using Pre-trained Models
Nadia Nasri
Carlos Gutiérrez-Álvarez
Sergio Lafuente-Arroyo
Saturnino Maldonado-Bascón
Roberto J. López-Sastre
CLL
69
0
0
11 Apr 2024
Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain
Iker García-Ferrero
Rodrigo Agerri
Aitziber Atutxa Salazar
Elena Cabrio
Iker de la Iglesia
...
Johana Ramirez-Romero
German Rigau
J. M. Villa-Gonzalez
S. Villata
Andrea Zaninello
129
21
0
11 Apr 2024
PromptSync: Bridging Domain Gaps in Vision-Language Models through Class-Aware Prototype Alignment and Discrimination
Anant Khandelwal
VLM
77
1
0
11 Apr 2024
Remembering Transformer for Continual Learning
Yuwei Sun
Ippei Fujisawa
Arthur Juliani
Jun Sakuma
Ryota Kanai
CLL
73
1
0
11 Apr 2024
MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting
Avinash Anand
Janak Kapuriya
Apoorv Singh
Jay Saraf
Naman Lal
Astha Verma
Rushali Gupta
R. Shah
LRM
46
15
0
11 Apr 2024
Scalable Language Model with Generalized Continual Learning
Bohao Peng
Zhuotao Tian
Shu Liu
Mingchang Yang
Jiaya Jia
ALM
CLL
KELM
92
18
0
11 Apr 2024
Transferable and Principled Efficiency for Open-Vocabulary Segmentation
Jingxuan Xu
Wuyang Chen
Yao-Min Zhao
Yunchao Wei
VLM
109
2
0
11 Apr 2024
ST-LoRA: Low-rank Adaptation for Spatio-Temporal Forecasting
Weilin Ruan
Wei Chen
Xilin Dang
Jianxiang Zhou
Weichuang Li
Xu Liu
Yuxuan Liang
AI4TS
47
2
0
11 Apr 2024
Flatness Improves Backbone Generalisation in Few-shot Classification
Rui Li
Martin Trapp
Talal Alrawajfeh
Arno Solin
124
0
0
11 Apr 2024
LLMs in Biomedicine: A study on clinical Named Entity Recognition
Masoud Monajatipoor
Jiaxin Yang
Joel Stremmel
Melika Emami
Fazlolah Mohaghegh
Mozhdeh Rouhsedaghat
Kai-Wei Chang
LM&MA
59
18
0
10 Apr 2024
BRAVE: Broadening the visual encoding of vision-language models
Ouguzhan Fatih Kar
A. Tonioni
Petra Poklukar
Achin Kulshrestha
Amir Zamir
Federico Tombari
MLLM
VLM
80
32
0
10 Apr 2024
Continuous Language Model Interpolation for Dynamic and Controllable Text Generation
Sara Kangaslahti
David Alvarez-Melis
KELM
142
0
0
10 Apr 2024
Dynamic Generation of Personalities with Large Language Models
Jianzhi Liu
Hexiang Gu
Tianyu Zheng
Liuyu Xiang
Huijia Wu
Jie Fu
Zhaofeng He
100
3
0
10 Apr 2024
Identification of Fine-grained Systematic Errors via Controlled Scene Generation
Valentyn Boreiko
Matthias Hein
J. H. Metzen
88
1
0
10 Apr 2024
ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling
Ege Özsoy
Chantal Pellegrini
Matthias Keicher
Nassir Navab
VLM
87
4
0
10 Apr 2024
Improving Language Model Reasoning with Self-motivated Learning
Yunlong Feng
Yang Xu
Libo Qin
Yasheng Wang
Wanxiang Che
LRM
ReLM
84
7
0
10 Apr 2024
Enhancing Question Answering for Enterprise Knowledge Bases using Large Language Models
Feihu Jiang
Chuan Qin
Kaichun Yao
Chuyu Fang
Fuzhen Zhuang
Hengshu Zhu
Hui Xiong
79
7
0
10 Apr 2024
Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior
Fan Lu
Kwan-Yee Lin
Yan Xu
Hongsheng Li
Guang Chen
Changjun Jiang
73
8
0
10 Apr 2024
We're Calling an Intervention: Exploring Fundamental Hurdles in Adapting Language Models to Nonstandard Text
Aarohi Srivastava
David Chiang
129
0
0
10 Apr 2024
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
Jaidev Shriram
Alex Trevithick
Lingjie Liu
Ravi Ramamoorthi
DiffM
3DGS
163
59
0
10 Apr 2024
Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation
Sidra Aleem
Fangyijie Wang
Mayug Maniparambil
Eric Arazo
J. Dietlmeier
Guénolé Silvestre
Kathleen M. Curran
Noel E. O'Connor
Suzanne Little
VLM
MedIm
92
14
0
09 Apr 2024
Understanding Cross-Lingual Alignment -- A Survey
Katharina Hämmerl
Jindvrich Libovický
Alexander Fraser
87
14
0
09 Apr 2024
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
Zhihao Lin
Wei Ma
Tao Lin
Yaowen Zheng
Jingquan Ge
Jun Wang
Jacques Klein
Tegawende F. Bissyande
Yang Liu
Li Li
VLM
69
6
0
09 Apr 2024
Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles
Andrea Zugarini
Kamyar Zeinalipour
Surya Sai Kadali
Marco Maggini
Marco Gori
Leonardo Rigutini
AI4Ed
65
6
0
09 Apr 2024
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models
Zhuohao Yu
Chang Gao
Wenjin Yao
Yidong Wang
Zhengran Zeng
Wei Ye
Jindong Wang
Yue Zhang
Shikun Zhang
68
3
0
09 Apr 2024
Previous
1
2
3
...
85
86
87
...
137
138
139
Next