Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.09685
Cited By
v1
v2 (latest)
LoRA: Low-Rank Adaptation of Large Language Models
17 June 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (11998★)
Papers citing
"LoRA: Low-Rank Adaptation of Large Language Models"
50 / 6,911 papers shown
Title
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought
James Chua
Edward Rees
Hunar Batra
Samuel R. Bowman
Julian Michael
Ethan Perez
Miles Turpin
LRM
127
13
0
08 Mar 2024
ConstitutionalExperts: Training a Mixture of Principle-based Prompts
S. Petridis
Ben Wedin
Ann Yuan
James Wexler
Nithum Thain
60
8
0
07 Mar 2024
AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors
Kaishen Yuan
Zitong Yu
Xin Liu
Weicheng Xie
Huanjing Yue
Jingyu Yang
ViT
79
18
0
07 Mar 2024
Teaching Large Language Models to Reason with Reinforcement Learning
Alex Havrilla
Yuqing Du
Sharath Chandra Raparthy
Christoforos Nalmpantis
Jane Dwivedi-Yu
Maksym Zhuravinskyi
Eric Hambro
Sainbayar Sukhbaatar
Roberta Raileanu
ReLM
LRM
113
94
0
07 Mar 2024
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Qilang Ye
Zitong Yu
Rui Shao
Xinyu Xie
Philip Torr
Xiaochun Cao
MLLM
112
30
0
07 Mar 2024
Embodied Understanding of Driving Scenarios
Yunsong Zhou
Linyan Huang
Qingwen Bu
Jia Zeng
Tianyu Li
Hang Qiu
Hongzi Zhu
Minyi Guo
Yu Qiao
Hongyang Li
LM&Ro
103
33
0
07 Mar 2024
Enhancing Data Quality in Federated Fine-Tuning of Foundation Models
Wanru Zhao
Yaxin Du
Nicholas D. Lane
Siheng Chen
Yanfeng Wang
94
4
0
07 Mar 2024
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Yuliang Liu
Biao Yang
Qiang Liu
Zhang Li
Zhiyin Ma
Shuo Zhang
Xiang Bai
MLLM
VLM
126
109
0
07 Mar 2024
Exploring Continual Learning of Compositional Generalization in NLI
Xiyan Fu
Anette Frank
CLL
LRM
66
3
0
07 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Chak Tou Leong
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
87
8
0
07 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
96
5
0
07 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
137
38
0
07 Mar 2024
Can Small Language Models be Good Reasoners for Sequential Recommendation?
Yuling Wang
Changxin Tian
Binbin Hu
Yanhua Yu
Ziqi Liu
Qing Cui
Jun Zhou
Liang Pang
Xiao Wang
LRM
115
31
0
07 Mar 2024
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Yusheng Dai
Hang Chen
Jun Du
Ruoyu Wang
Shihao Chen
Jie Ma
Haotian Wang
Chin-Hui Lee
98
4
0
07 Mar 2024
DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning
Xingwei Qu
Yiming Liang
Yucheng Wang
Tianyu Zheng
Tommy Yue
...
Jiajun Zhang
Wenhu Chen
Chenghua Lin
Jie Fu
Ge Zhang
66
2
0
07 Mar 2024
Generative AI for Synthetic Data Generation: Methods, Challenges and the Future
Xu Guo
Yiqiang Chen
SyDa
88
37
0
07 Mar 2024
Large Language Models are In-Context Molecule Learners
Jiatong Li
Wei Liu
Zhihao Ding
Wenqi Fan
Yuqiang Li
Qing Li
124
6
0
07 Mar 2024
Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery
Wei Zhang
Miaoxin Cai
Tong Zhang
Guoqiang Lei
Zhuang Yin
Xuerui Mao
76
8
0
06 Mar 2024
Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese
Yikun Sun
Zhen Wan
Nobuhiro Ueda
Sakiko Yahata
Fei Cheng
Chenhui Chu
Sadao Kurohashi
ALM
ELM
60
5
0
06 Mar 2024
Mixture-of-LoRAs: An Efficient Multitask Tuning for Large Language Models
Wenfeng Feng
Chuzhan Hao
Yuewei Zhang
Yu Han
Hao Wang
ALM
MoE
66
15
0
06 Mar 2024
Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Bingyan Liu
Chengyu Wang
Tingfeng Cao
Kui Jia
Jun Huang
DiffM
87
63
0
06 Mar 2024
Japanese-English Sentence Translation Exercises Dataset for Automatic Grading
Naoki Miura
Hiroaki Funayama
Seiya Kikuchi
Yuichiroh Matsubayashi
Yuya Iwase
Kentaro Inui
70
0
0
06 Mar 2024
DINOv2 based Self Supervised Learning For Few Shot Medical Image Segmentation
Lev Ayzenberg
Raja Giryes
H. Greenspan
66
4
0
05 Mar 2024
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Nathaniel Li
Alexander Pan
Anjali Gopal
Summer Yue
Daniel Berrios
...
Yan Shoshitaishvili
Jimmy Ba
K. Esvelt
Alexandr Wang
Dan Hendrycks
ELM
134
195
0
05 Mar 2024
Learning to Use Tools via Cooperative and Interactive Agents
Zhengliang Shi
Shen Gao
Xiuyi Chen
Zhumin Chen
Lingyong Yan
Haibo Shi
D. Yin
Fajie Yuan
Suzan Verberne
Zhaochun Ren
LLMAG
96
32
0
05 Mar 2024
Doubly Abductive Counterfactual Inference for Text-based Image Editing
Xue Song
Jiequan Cui
Hanwang Zhang
Jingjing Chen
Richang Hong
Yu-Gang Jiang
DiffM
71
13
0
05 Mar 2024
Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Jun-Yan He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Jinpeng Lan
Bin Luo
Xuansong Xie
MLLM
VLM
98
22
0
05 Mar 2024
Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
Rui Wang
Fei Mi
Yi Chen
Boyang Xue
Hongru Wang
Qi Zhu
Kam-Fai Wong
Rui-Lan Xu
CLL
73
7
0
05 Mar 2024
Towards Training A Chinese Large Language Model for Anesthesiology
Zhonghai Wang
Jie Jiang
Yibing Zhan
Bohao Zhou
Yanhong Li
...
Liang Ding
Hua Jin
Jun Peng
Xu Lin
Weifeng Liu
LM&MA
71
4
0
05 Mar 2024
Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Jiwen Zhang
Jihao Wu
Yihua Teng
Minghui Liao
Nuo Xu
Xiao Xiao
Zhongyu Wei
Duyu Tang
LLMAG
LM&Ro
125
75
0
05 Mar 2024
Few-shot Learner Parameterization by Diffusion Time-steps
Zhongqi Yue
Pan Zhou
Richang Hong
Hanwang Zhang
Qianru Sun
109
12
0
05 Mar 2024
Interactive Continual Learning: Fast and Slow Thinking
Biqing Qi
Xingquan Chen
Junqi Gao
Dong Li
Jianxing Liu
Ligang Wu
Bowen Zhou
CLL
66
19
0
05 Mar 2024
Training Machine Learning models at the Edge: A Survey
Aymen Rayane Khouas
Mohamed Reda Bouadjenek
Hakim Hacid
Sunil Aryal
113
12
0
05 Mar 2024
Eliciting Better Multilingual Structured Reasoning from LLMs through Code
Bryan Li
Tamer Alkhouli
Daniele Bonadiman
Nikolaos Pappas
Saab Mansour
LRM
73
9
0
05 Mar 2024
Socratic Reasoning Improves Positive Text Rewriting
Anmol Goel
Nico Daheim
Iryna Gurevych
Iryna Gurevych
LRM
126
4
0
05 Mar 2024
Design2Code: Benchmarking Multimodal Code Generation for Automated Front-End Engineering
Chenglei Si
Yanzhe Zhang
Zhengyuan Yang
Zhengyuan Yang
Ruibo Liu
Diyi Yang
82
6
0
05 Mar 2024
A Tutorial on the Pretrain-Finetune Paradigm for Natural Language Processing
Yu Wang
Wen Qu
92
0
0
04 Mar 2024
Enhancing LLM Safety via Constrained Direct Preference Optimization
Zixuan Liu
Xiaolin Sun
Zizhan Zheng
91
29
0
04 Mar 2024
Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review
Iryna Hartsock
Ghulam Rasool
102
82
0
04 Mar 2024
Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground
Adil Soubki
John Murzaku
Arash Yousefi Jordehi
Peter Zeng
Magdalena Markowska
Seyed Abolghasem Mirroshandel
Owen Rambow
VLM
70
7
0
04 Mar 2024
A Decade of Privacy-Relevant Android App Reviews: Large Scale Trends
Omer Akgul
Sai Teja Peddinti
Nina Taft
Michelle L. Mazurek
Hamza Harkous
Animesh Srivastava
Benoit Seguin
86
6
0
04 Mar 2024
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
Saeed Najafi
Alona Fyshe
78
2
0
04 Mar 2024
Large language models surpass human experts in predicting neuroscience results
Xiaoliang Luo
Akilles Rechardt
Guangzhi Sun
Kevin K. Nejad
Felipe Y´a˜nez
...
Anna Behler
Chloe M. Hall
J. Dafflon
Sherry Dongqi Bao
Bradley C. Love
91
58
0
04 Mar 2024
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
Lukas Höllein
Aljavz Bovzivc
Norman Muller
David Novotny
Hung-Yu Tseng
Christian Richardt
Michael Zollhöfer
Matthias Nießner
DiffM
91
45
0
04 Mar 2024
AtomoVideo: High Fidelity Image-to-Video Generation
Litong Gong
Yiran Zhu
Weijie Li
Xiaoyang Kang
Biao Wang
Tiezheng Ge
Bo Zheng
DiffM
VGen
198
12
0
04 Mar 2024
Derivative-Free Optimization for Low-Rank Adaptation in Large Language Models
Feihu Jin
Yin Liu
Ying Tan
68
4
0
04 Mar 2024
Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection
Jieren Deng
Haojian Zhang
Kun Ding
Jianhua Hu
Xingxuan Zhang
Yunkuan Wang
VLM
ObjD
185
7
0
04 Mar 2024
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
Jiaxiang Cheng
Pan Xie
Xin Xia
Jiashi Li
Jie Wu
Yuxi Ren
Huixia Li
Xuefeng Xiao
Min Zheng
Lean Fu
116
12
0
04 Mar 2024
Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey
Qizhi Pei
Lijun Wu
Kaiyuan Gao
Jinhua Zhu
Yue Wang
Zun Wang
Tao Qin
Rui Yan
AI4CE
127
18
0
03 Mar 2024
Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis
Xin Zhou
Dingkang Liang
Wei Xu
Xingkui Zhu
Yihan Xu
Zhikang Zou
Xiang Bai
96
28
0
03 Mar 2024
Previous
1
2
3
...
93
94
95
...
137
138
139
Next