2106.09685
Cited By
v1
v2 (latest)
LoRA: Low-Rank Adaptation of Large Language Models
17 June 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
Github (11998★)
Papers citing "LoRA: Low-Rank Adaptation of Large Language Models"
50 / 6,910 papers shown
Title
Can LLMs Learn New Concepts Incrementally without Forgetting?
Junhao Zheng
Shengjie Qiu
Qianli Ma
CLL
90
0
0
13 Feb 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
126
30
0
13 Feb 2024
Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation
AprilPyone Maungmaung
H. Nguyen
Hitoshi Kiya
Isao Echizen
83
6
0
13 Feb 2024
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents
Jae-Woo Choi
Youngwoo Yoon
Hyobin Ong
Jaehong Kim
Minsu Jang
72
18
0
13 Feb 2024
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Mateusz Lajszczak
Guillermo Cámbara
Yang Li
Fatih Beyhan
Arent van Korlaar
...
Bartosz Putrycz
Soledad López Gambino
Kayeon Yoo
Elena Sokolova
Thomas Drugman
LM&MA
113
88
0
12 Feb 2024
Efficient and Scalable Fine-Tune of Language Models for Genome Understanding
Huixin Zhan
Ying Nian Wu
Zijun Zhang
ALM
42
1
0
12 Feb 2024
Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets
Israel Abebe Azime
A. Tonja
Tadesse Destaw Belay
Mitiku Yohannes Fuge
A. Wassie
Eyasu Shiferaw Jada
Yonas Chanie
W. Sewunetie
Seid Muhie Yimam
42
3
0
12 Feb 2024
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
Víctor Gallego
SyDa
55
7
0
12 Feb 2024
Mercury: A Code Efficiency Benchmark for Code Large Language Models
Mingzhe Du
Anh Tuan Luu
Bin Ji
Qian Liu
See-Kiong Ng
ALM
ELM
OffRL
96
13
0
12 Feb 2024
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
Ahmet Üstün
Viraat Aryabumi
Zheng-Xin Yong
Wei-Yin Ko
Daniel D'souza
...
Shayne Longpre
Niklas Muennighoff
Marzieh Fadaee
Julia Kreutzer
Sara Hooker
ALM
ELM
SyDa
LRM
100
231
0
12 Feb 2024
Empowering Federated Learning for Massive Models with NVIDIA FLARE
Holger R. Roth
Ziyue Xu
Yuan-Ting Hsieh
Adithya Renduchintala
Isaac Yang
...
Camir Ricketts
Daguang Xu
Chester Chen
Yan Cheng
Andrew Feng
AI4CE
46
5
0
12 Feb 2024
Task-conditioned adaptation of visual features in multi-task policy learning
Pierre Marza
L. Matignon
Olivier Simonin
Christian Wolf
99
3
0
12 Feb 2024
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering
Xiaoxin He
Yijun Tian
Yifei Sun
Nitesh Chawla
T. Laurent
Yann LeCun
Xavier Bresson
Bryan Hooi
RALM
197
96
0
12 Feb 2024
T-RAG: Lessons from the LLM Trenches
M. Fatehkia
J. Lucas
Sanjay Chawla
LLMAG
87
22
0
12 Feb 2024
VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization
Dongsheng Zhu
Xunzhu Tang
Weidong Han
Jinghui Lu
Yukun Zhao
Guoliang Xing
Junfeng Wang
D. Yin
VLM
MLLM
90
10
0
12 Feb 2024
Sophia-in-Audition: Virtual Production with a Robot Performer
Taotao Zhou
Teng Xu
Dong Zhang
Yuyang Jiao
Peijun Xu
Yaoyu He
Lan Xu
Jingyi Yu
79
1
0
10 Feb 2024
Synthesizing CTA Image Data for Type-B Aortic Dissection using Stable Diffusion Models
Ayman Abaid
Muhammad Ali Farooq
N. Hynes
Peter Corcoran
Ihsan Ullah
MedIm
DiffM
21
2
0
10 Feb 2024
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
Jian Wang
Chak Tou Leong
Jiashuo Wang
Dongding Lin
Wenjie Li
Xiao-Yong Wei
87
9
0
10 Feb 2024
OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning
Rui Ye
Wenhao Wang
Jingyi Chai
Dihan Li
Zexi Li
Yinda Xu
Yaxin Du
Yanfeng Wang
Siheng Chen
ALM
FedML
AIFin
101
98
0
10 Feb 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu
Chen Chen
Chao-Han Huck Yang
Ruizhe Li
Dong Zhang
Zhehuai Chen
Eng Siong Chng
91
21
0
10 Feb 2024
UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction
Yansong Ning
Hao Liu
LLMAG
94
7
0
10 Feb 2024
Reasoning Grasping via Multimodal Large Language Model
Shiyu Jin
Jinxuan Xu
Yutian Lei
Liangjun Zhang
LRM
106
21
0
09 Feb 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
Shivalika Singh
Freddie Vargus
Daniel D'souza
Börje F. Karlsson
Abinaya Mahendiran
...
Max Bartolo
Julia Kreutzer
Ahmet Üstün
Marzieh Fadaee
Sara Hooker
231
127
0
09 Feb 2024
Calibrating Long-form Generations from Large Language Models
Yukun Huang
Yixin Liu
Raghuveer Thirukovalluru
Arman Cohan
Bhuwan Dhingra
69
15
0
09 Feb 2024
Large Language Models for Captioning and Retrieving Remote Sensing Images
João Daniel Silva
João Magalhães
D. Tuia
Bruno Martins
89
29
0
09 Feb 2024
Explaining Veracity Predictions with Evidence Summarization: A Multi-Task Model Approach
R. Çekinel
Pinar Senkul
AAML
108
3
0
09 Feb 2024
Human Aesthetic Preference-Based Large Text-to-Image Model Personalization: Kandinsky Generation as an Example
Aven Le Zhou
Yu-Ao Wang
Wei Wu
Kang Zhang
50
1
0
09 Feb 2024
CultureLLM: Incorporating Cultural Differences into Large Language Models
Cheng-rong Li
Mengzhou Chen
Jindong Wang
Sunayana Sitaram
Xing Xie
VLM
127
23
0
09 Feb 2024
ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling
Siming Yan
Min Bai
Weifeng Chen
Xiong Zhou
Qixing Huang
Erran L. Li
VLM
50
20
0
09 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomas Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
248
426
0
09 Feb 2024
Randomness Is All You Need: Semantic Traversal of Problem-Solution Spaces with Large Language Models
Thomas Sandholm
Sayandev Mukherjee
Bernardo A. Huberman
29
2
0
08 Feb 2024
OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models
Hainiu Xu
Runcong Zhao
Lixing Zhu
Bin Liang
Yulan He
158
25
0
08 Feb 2024
Collaborative Control for Geometry-Conditioned PBR Image Generation
Shimon Vainer
Mark Boss
Mathias Parger
Konstantin Kutsy
Dante De Nigris
Ciara Rowles
Nicolas Perony
Simon Donné
DiffM
65
13
0
08 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
65
6
0
08 Feb 2024
FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs
Eun Cheol Choi
Emilio Ferrara
HILM
101
27
0
08 Feb 2024
Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation
Xianghe Pang
Shuo Tang
Rui Ye
Yuxin Xiong
Bolun Zhang
Yanfeng Wang
Siheng Chen
194
36
0
08 Feb 2024
Question Aware Vision Transformer for Multimodal Reasoning
Roy Ganz
Yair Kittenplon
Aviad Aberdam
Elad Ben Avraham
Oren Nuriel
Shai Mazor
Ron Litman
108
23
0
08 Feb 2024
Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes
Lucio Dery
Steven Kolawole
Jean-Francois Kagey
Virginia Smith
Graham Neubig
Ameet Talwalkar
112
36
0
08 Feb 2024
Scaling Up LLM Reviews for Google Ads Content Moderation
Wei Qiao
Tushar Dogra
Otilia Stretcu
Yu-Han Lyu
Tiantian Fang
...
Chih-Chun Chia
Ariel Fuxman
Fangzhou Wang
Ranjay Krishna
Mehmet Tek
75
13
0
07 Feb 2024
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models
Lijun Li
Bowen Dong
Ruohui Wang
Xuhao Hu
Wangmeng Zuo
Dahua Lin
Yu Qiao
Jing Shao
ELM
129
106
0
07 Feb 2024
Asymptotics of feature learning in two-layer networks after one gradient-step
Hugo Cui
Luca Pesce
Yatin Dandi
Florent Krzakala
Yue M. Lu
Lenka Zdeborová
Bruno Loureiro
MLT
135
19
0
07 Feb 2024
ConvLoRA and AdaBN based Domain Adaptation via Self-Training
Sidra Aleem
J. Dietlmeier
Eric Arazo
Suzanne Little
57
10
0
07 Feb 2024
L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models
Hyesung Jeon
Yulhwa Kim
Jae-Joon Kim
MQ
62
5
0
07 Feb 2024
RA-Rec: An Efficient ID Representation Alignment Framework for LLM-based Recommendation
Xiaohan Yu
Li Zhang
Xin Zhao
Yue Wang
Zhongrui Ma
76
11
0
07 Feb 2024
Dual-View Visual Contextualization for Web Navigation
Jihyung Kil
Chan Hee Song
Boyuan Zheng
Xiang Deng
Yu-Chuan Su
Wei-Lun Chao
EgoV
54
15
0
06 Feb 2024
Learning to Extract Structured Entities Using Language Models
Haolun Wu
Ye Yuan
Liana Mikaelyan
Alexander Meulemans
Xue Liu
James Hensman
Bhaskar Mitra
100
4
0
06 Feb 2024
Fine-Tuned Language Models Generate Stable Inorganic Materials as Text
Nate Gruver
Anuroop Sriram
Andrea Madotto
A. Wilson
C. L. Zitnick
Zachary W. Ulissi
69
67
0
06 Feb 2024
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
Michael Zhang
Kush S. Bhatia
Hermann Kumbong
Christopher Ré
80
54
0
06 Feb 2024
LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text
Dor Bernsohn
Gil Semo
Yaron Vazana
Gila Hayat
Ben Hagag
Joel Niklaus
Rohit Saha
Kyryl Truskovskyi
AILaw
68
18
0
06 Feb 2024
Training Language Models to Generate Text with Citations via Fine-grained Rewards
Chengyu Huang
Zeqiu Wu
Yushi Hu
Wenya Wang
HILM
LRM
138
30
0
06 Feb 2024