1903.05987
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks
14 March 2019
Matthew E. Peters
Sebastian Ruder
Noah A. Smith
Papers citing
"To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks"
50 / 229 papers shown
Chronocept: Instilling a Sense of Time in Machines
Krish Goel
Sanskar Pandey
KS Mahadevan
Harsh Kumar
Vishesh Khadaria
28
0
0
12 May 2025
Fine-Tuning without Performance Degradation
Han Wang
Adam White
Martha White
OnRL
166
0
0
01 May 2025
Overcoming Vocabulary Constraints with Pixel-level Fallback
Jonas F. Lotz
Hendra Setiawan
Stephan Peitz
Yova Kementchedjhieva
43
0
0
02 Apr 2025
MobiFuse: Learning Universal Human Mobility Patterns through Cross-domain Data Fusion
Haoxuan Ma
Xishun Liao
Yifan Liu
Qinhua Jiang
Chris Stanford
Shangqing Cao
Jiaqi Ma
47
0
0
20 Mar 2025
Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization
Ziqing Xu
Hancheng Min
Lachlan Ewen MacDonald
Jinqi Luo
Salma Tarmoun
Enrique Mallada
René Vidal
AI4CE
56
0
0
10 Mar 2025
Mixtraining: A Better Trade-Off Between Compute and Performance
Zexin Li
Jiancheng Zhang
Yufei Li
Yinglun Zhu
Cong Liu
51
0
0
26 Feb 2025
Fine-Tuning Games: Bargaining and Adaptation for General-Purpose Models
Benjamin Laufer
Jon M. Kleinberg
Hoda Heidari
55
8
0
03 Jan 2025
IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks
Yaming Zhang
Chenqiang Gao
Fangcen Liu
Junjie Guo
Lan Wang
Xinggan Peng
Deyu Meng
106
0
0
21 Dec 2024
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
Claudia Cuttano
Gabriele Trivigno
Gabriele Rosi
Carlo Masone
Giuseppe Averta
VOS
106
2
0
26 Nov 2024
Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Qingyu Yin
Xuzheng He
Luoao Deng
Chak Tou Leong
Fan Wang
Yanzhao Yan
Xiaoyu Shen
Qiang Zhang
39
2
0
07 Oct 2024
Reconstructing Human Mobility Pattern: A Semi-Supervised Approach for Cross-Dataset Transfer Learning
Xishun Liao
Yifan Liu
Chenchen Kuai
Haoxuan Ma
Yueshuai He
Shangqing Cao
Chris Stanford
Jiaqi Ma
35
1
0
03 Oct 2024
Transfer Learning with Clinical Concept Embeddings from Large Language Models
Yuhe Gao
Runxue Bao
Yuelyu Ji
Yiming Sun
Chenxi Song
Jeffrey P. Ferraro
Ye Ye
38
1
0
20 Sep 2024
Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models
Sonam Gupta
Yatin Nandwani
Asaf Yehudai
Mayank Mishra
Gaurav Pandey
Dinesh Raghu
Sachindra Joshi
LRM
22
1
0
07 Sep 2024
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks
Dongshuo Yin
Leiyi Hu
Bin Li
Youqun Zhang
Xue Yang
46
6
0
15 Aug 2024
ST-SACLF: Style Transfer Informed Self-Attention Classifier for Bias-Aware Painting Classification
Mridula Vijendran
Frederick W. B. Li
Jingjing Deng
Hubert P. H. Shum
53
0
0
03 Aug 2024
Aligning Programming Language and Natural Language: Exploring Design Choices in Multi-Modal Transformer-Based Embedding for Bug Localization
Partha Chakraborty
Venkatraman Arumugam
M. Nagappan
31
0
0
25 Jun 2024
OLoRA: Orthonormal Low-Rank Adaptation of Large Language Models
Kerim Büyükakyüz
AI4CE
21
5
0
03 Jun 2024
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Justin Zhao
Timothy Wang
Wael Abid
Geoffrey Angus
Arnav Garg
Jeffery Kinnison
Alex Sherstinsky
Piero Molino
Travis Addair
Devvret Rishi
ALM
48
28
0
29 Apr 2024
Understanding Optimal Feature Transfer via a Fine-Grained Bias-Variance Analysis
Yufan Li
Subhabrata Sen
Ben Adlam
MLT
51
1
0
18 Apr 2024
Improving Pre-trained Language Model Sensitivity via Mask Specific losses: A case study on Biomedical NER
Micheal Abaho
Danushka Bollegala
Gary Leeming
Dan Joyce
Iain E Buchan
28
0
0
26 Mar 2024
Applied Causal Inference Powered by ML and AI
Victor Chernozhukov
Christian Hansen
Nathan Kallus
Martin Spindler
Vasilis Syrgkanis
CML
36
29
0
04 Mar 2024
Derivative-Free Optimization for Low-Rank Adaptation in Large Language Models
Feihu Jin
Yin Liu
Ying Tan
35
3
0
04 Mar 2024
NeuralSI: Neural Design of Semantic Interaction for Interactive Deep Learning
Yali Bian
Rebecca Faust
Chris North
HAI
22
0
0
27 Feb 2024
Code Representation Learning At Scale
Dejiao Zhang
W. Ahmad
Ming Tan
Hantian Ding
Ramesh Nallapati
Dan Roth
Xiaofei Ma
Bing Xiang
OffRL
21
9
0
02 Feb 2024
Breaking Free Transformer Models: Task-specific Context Attribution Promises Improved Generalizability Without Fine-tuning Pre-trained LLMs
Stepan Tytarenko
Mohammad Ruhul Amin
11
3
0
30 Jan 2024
Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data
Leonardo Castro-Gonzalez
Yi-Ling Chung
Hannah Rose Kirk
John Francis
Angus R. Williams
Pica Johansson
Jonathan Bright
50
1
0
22 Jan 2024
On the Necessity of Metalearning: Learning Suitable Parameterizations for Learning Processes
Massinissa Hamidi
A. Osmani
35
0
0
31 Dec 2023
Dynamic Corrective Self-Distillation for Better Fine-Tuning of Pretrained Models
Ibtihel Amara
Vinija Jain
Aman Chadha
32
0
0
12 Dec 2023
LLM-TAKE: Theme Aware Keyword Extraction Using Large Language Models
Reza Yousefi Maragheh
Chenhao Fang
Charan Chand Irugu
Parth Parikh
Jason H. D. Cho
...
Saranyan Sukumar
Malay Patel
Evren Körpeoglu
Sushant Kumar
Kannan Achan
30
6
0
01 Dec 2023
How much data do I need? A case study on medical data
Ayse Betul Cengiz
A. Mcgough
16
2
0
26 Nov 2023
Adapter is All You Need for Tuning Visual Tasks
Dongshuo Yin
Leiyi Hu
Bin Li
Youqun Zhang
16
15
0
25 Nov 2023
Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition
Isaac Slaughter
Craig Greenberg
Reva Schwartz
Aylin Caliskan
27
4
0
29 Oct 2023
Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Zihao Lin
Yan Sun
Yifan Shi
Xueqian Wang
Lifu Huang
Li Shen
Dacheng Tao
36
11
0
04 Oct 2023
Detecting Natural Language Biases with Prompt-based Learning
Md Abdul Aowal
Maliha T Islam
P. Mammen
Sandesh Shetty
14
1
0
11 Sep 2023
Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
Guangyi Chen
Xiao Liu
Guangrun Wang
Anton van den Hengel
Philip H.S. Torr
Xiaoping Zhang
Yansong Tang
24
18
0
16 Aug 2023
MultiSChuBERT: Effective Multimodal Fusion for Scholarly Document Quality Prediction
Gideon Maillette de Buy Wenniger
Thomas van Dongen
Lambert Schomaker
13
4
0
15 Aug 2023
You Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic Content
Xinlei He
Savvas Zannettou
Yun Shen
Yang Zhang
CLL
21
37
0
10 Aug 2023
Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity
Charlie Hou
K. K. Thekumparampil
Michael Shavlovsky
Giulia Fanti
Yesh Dattatreya
Sujay Sanghavi
LMTD
21
1
0
31 Jul 2023
AViT: Adapting Vision Transformers for Small Skin Lesion Segmentation Datasets
Siyi Du
Nourhan Bayasi
Ghassan Hamarneh
Rafeef Garbi
ViT
36
3
0
26 Jul 2023
Gradient Sparsification For Masked Fine-Tuning of Transformers
J. O'Neill
Sourav Dutta
24
0
0
19 Jul 2023
Analyzing Dataset Annotation Quality Management in the Wild
Jan-Christoph Klie
Richard Eckart de Castilho
Iryna Gurevych
16
17
0
16 Jul 2023
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
23
0
0
23 Jun 2023
Parameter-efficient is not sufficient: Exploring Parameter, Memory, and Time Efficient Adapter Tuning for Dense Predictions
Dongshuo Yin
Xueting Han
Bin Li
Hao Feng
Jinghua Bai
VPVLM
31
17
0
16 Jun 2023
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models
Myles Foley
Ambrish Rawat
Taesung Lee
Yufang Hou
Gabriele Picco
Giulio Zizzo
DeLMO
32
5
0
15 Jun 2023
Deep Model Compression Also Helps Models Capture Ambiguity
Hancheol Park
Jong C. Park
27
1
0
12 Jun 2023
Diagnosing Transformers: Illuminating Feature Spaces for Clinical Decision-Making
Aliyah R. Hsu
Yeshwanth Cherapanamjeri
Briton Park
Tristan Naumann
A. Odisho
Bin-Xia Yu
MedIm
26
0
0
27 May 2023
Weaker Than You Think: A Critical Look at Weakly Supervised Learning
D. Zhu
Xiaoyu Shen
Marius Mosbach
Andreas Stephan
Dietrich Klakow
NoLa
31
9
0
27 May 2023
DeepSI: Interactive Deep Learning for Semantic Interaction
Yali Bian
Chris North
HAI
10
15
0
26 May 2023
To Revise or Not to Revise: Learning to Detect Improvable Claims for Argumentative Writing Support
Gabriella Skitalinskaya
Henning Wachsmuth
24
9
0
26 May 2023
Adversarial Multi-task Learning for End-to-end Metaphor Detection
Shenglong Zhang
Yong-Jin Liu
6
11
0
26 May 2023