2104.08835
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP
18 April 2021
Qinyuan Ye
Bill Yuchen Lin
Xiang Ren
Papers citing
"CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP"
50 / 139 papers shown
Title
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
Ming Li
Yongqian Li
Ziyue Li
Tianyi Zhou
LRM
27
1
0
14 Apr 2025
RAISE: Reinforced Adaptive Instruction Selection For Large Language Models
Lv Qingsong
Yangning Li
Zihua Lan
Zishan Xu
Jiwei Tang
Hai-Tao Zheng
Wenhao Jiang
Philip S. Yu
32
0
0
09 Apr 2025
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning
Jia Zhang
Chen-Xi Zhang
Yong-Jin Liu
Yi-Xuan Jin
Xiao-Wen Yang
Bo Zheng
Yi Liu
Lan-Zhe Guo
47
2
0
14 Mar 2025
From Selection to Generation: A Survey of LLM-based Active Learning
Yu Xia
Subhojyoti Mukherjee
Zhouhang Xie
Junda Wu
Xintong Li
...
Namyong Park
T. Nguyen
Jiebo Luo
Ryan Rossi
Julian McAuley
55
0
0
17 Feb 2025
Improving Instruction-Following in Language Models through Activation Steering
Alessandro Stolfo
Vidhisha Balachandran
Safoora Yousefi
Eric Horvitz
Besmira Nushi
LLMSV
62
14
0
15 Oct 2024
Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data
Can Wang
Dianbo Sui
Hongliang Sun
Hao Ding
Bolin Zhang
Zhiying Tu
26
0
0
10 Oct 2024
Document-Level Event Extraction with Definition-Driven ICL
Zhuoyuan Liu
Yilin Luo
78
1
0
10 Aug 2024
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALM
KELM
44
22
0
02 Jul 2024
Factual Dialogue Summarization via Learning from Large Language Models
Rongxin Zhu
Jey Han Lau
Jianzhong Qi
HILM
52
1
0
20 Jun 2024
Data-Centric AI in the Age of Large Language Models
Xinyi Xu
Zhaoxuan Wu
Rui Qiao
Arun Verma
Yao Shu
...
Xiaoqiang Lin
Wenyang Hu
Zhongxiang Dai
Pang Wei Koh
Bryan Kian Hsiang Low
ALM
46
2
0
20 Jun 2024
Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference
Jihwan Bang
Juntae Lee
Kyuhong Shim
Seunghan Yang
Simyung Chang
34
5
0
11 Jun 2024
From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers
Dylan Zhang
Justin Wang
Francois Charton
38
0
0
30 May 2024
MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning
Sanchit Sinha
Yuguang Yue
Victor Soto
Mayank Kulkarni
Jianhua Lu
Aidong Zhang
LRM
46
4
0
19 May 2024
Evaluation of Few-Shot Learning for Classification Tasks in the Polish Language
Tsimur Hadeliya
D. Kajtoch
46
0
0
27 Apr 2024
Empowering Large Language Models for Textual Data Augmentation
Yichuan Li
Kaize Ding
Jianling Wang
Kyumin Lee
26
10
0
26 Apr 2024
Automated Data Curation for Robust Language Model Fine-Tuning
Jiuhai Chen
Jonas W. Mueller
ALM
39
19
0
19 Mar 2024
Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks
Yuncheng Huang
Qi He
Yipei Xu
Jiaqing Liang
Yanghua Xiao
LRM
41
1
0
14 Mar 2024
MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning
Yichuan Li
Xiyao Ma
Sixing Lu
Kyumin Lee
Xiaohu Liu
Chenlei Guo
21
6
0
11 Mar 2024
X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification
Hanzi Xu
Muhao Chen
Lifu Huang
Slobodan Vučetić
Wenpeng Yin
BDL
VLM
47
2
0
06 Mar 2024
ICE-SEARCH: A Language Model-Driven Feature Selection Approach
T. Yang
Tim Tianyi Yang
Fuyuan Lyu
Shaoshan Liu
Xue Liu
36
4
0
28 Feb 2024
Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy?
Nader Asadi
Mahdi Beitollahi
Yasser H. Khalil
Yinchuan Li
Guojun Zhang
Xi Chen
MoMe
35
8
0
23 Feb 2024
Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond
Xinyu Wang
Hainiu Xu
Lin Gui
Yulan He
MoMe
AIFin
36
1
0
22 Feb 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
Yizhi Li
Ge Zhang
Xingwei Qu
Jiali Li
Zhaoqun Li
...
Jiajun Zhang
Zuowen Li
Stephen W. Huang
Chenghua Lin
Jie Fu
ALM
ELM
34
3
0
20 Feb 2024
On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices
Branislav Pecher
Ivan Srba
M. Bieliková
69
3
0
20 Feb 2024
Instruction Diversity Drives Generalization To Unseen Tasks
Dylan Zhang
Justin Wang
Francois Charton
ALM
31
6
0
16 Feb 2024
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
Ming Li
Jiuhai Chen
Lichang Chen
Dinesh Manocha
71
17
0
16 Feb 2024
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Ming Li
Lichang Chen
Jiuhai Chen
Shwai He
Jiuxiang Gu
Dinesh Manocha
26
51
0
15 Feb 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
Shivalika Singh
Freddie Vargus
Daniel D'souza
Börje F. Karlsson
Abinaya Mahendiran
...
Max Bartolo
Julia Kreutzer
A. Ustun
Marzieh Fadaee
Sara Hooker
119
117
0
09 Feb 2024
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Ming Li
Yong Zhang
Shwai He
Zhitao Li
Hongyu Zhao
Jianzong Wang
Ning Cheng
Dinesh Manocha
32
64
0
01 Feb 2024
Improving In-context Learning via Bidirectional Alignment
Chengwei Qin
Wenhan Xia
Fangkai Jiao
Chen Chen
Yuchen Hu
Bosheng Ding
Shafiq R. Joty
35
7
0
28 Dec 2023
A Survey of Reasoning with Foundation Models
Jiankai Sun
Chuanyang Zheng
E. Xie
Zhengying Liu
Ruihang Chu
...
Xipeng Qiu
Yi-Chen Guo
Hui Xiong
Qun Liu
Zhenguo Li
ReLM
LRM
AI4CE
24
76
0
17 Dec 2023
Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning
Haowen Wang
Tao Sun
Cong Fan
Jinjie Gu
MoE
16
7
0
06 Dec 2023
FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models
Rui Xiao
Yiwen Dong
Junbo Zhao
Runze Wu
Minmin Lin
Gang Chen
Haobo Wang
26
7
0
27 Nov 2023
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
Prateek Yadav
Leshem Choshen
Colin Raffel
Mohit Bansal
32
13
0
22 Nov 2023
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Ashim Gupta
Rishanth Rajendhran
Nathan Stringham
Vivek Srikumar
Ana Marasović
AAML
31
3
0
16 Nov 2023
Selecting Shots for Demographic Fairness in Few-Shot Learning with Large Language Models
Carlos Alejandro Aguirre
Kuleen Sasse
Isabel Cachola
Mark Dredze
30
1
0
14 Nov 2023
Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains
Joshua Clymer
Garrett Baker
Rohan Subramani
Sam Wang
22
6
0
13 Nov 2023
Can LLMs Grade Short-Answer Reading Comprehension Questions: An Empirical Study with a Novel Dataset
Owen Henkel
Libby Hills
Bill Roberts
Joshua A. McGrane
AI4Ed
29
1
0
26 Oct 2023
The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks
Kaiser Sun
Adina Williams
Dieuwke Hupkes
CoGe
16
6
0
26 Oct 2023
Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models
Pierre Colombo
Victor Pellegrain
Malik Boudiaf
Victor Storchan
Myriam Tami
Ismail Ben Ayed
Céline Hudelot
Pablo Piantanida
35
8
0
21 Oct 2023
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
Ke Wang
Houxing Ren
Aojun Zhou
Zimu Lu
Sichun Luo
Weikang Shi
Renrui Zhang
Linqi Song
Mingjie Zhan
Hongsheng Li
ReLM
LRM
SyDa
22
94
0
05 Oct 2023
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
Xiang Yue
Xingwei Qu
Ge Zhang
Yao Fu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
AIMat
LRM
62
361
0
11 Sep 2023
Donkii: Can Annotation Error Detection Methods Find Errors in Instruction-Tuning Datasets?
Leon Weber-Genzel
Robert Litschko
Ekaterina Artemova
Barbara Plank
18
2
0
04 Sep 2023
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
Ming Li
Yong Zhang
Zhitao Li
Jiuhai Chen
Lichang Chen
Ning Cheng
Jianzong Wang
Dinesh Manocha
Jing Xiao
38
170
0
23 Aug 2023
Generative Models as a Complex Systems Science: How can we make sense of large language model behavior?
Ari Holtzman
Peter West
Luke Zettlemoyer
AI4CE
30
14
0
31 Jul 2023
Exploring Format Consistency for Instruction Tuning
Shi Liang
Runchu Tian
Kunlun Zhu
Yujia Qin
Huadong Wang
Xin Cong
Zhiyuan Liu
Xiaojiang Liu
Maosong Sun
ALM
16
12
0
28 Jul 2023
LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Chengsong Huang
Qian Liu
Bill Yuchen Lin
Tianyu Pang
Chao Du
Min-Bin Lin
MoMe
38
185
0
25 Jul 2023
A Comprehensive Overview of Large Language Models
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Ajmal Saeed Mian
OffRL
70
525
0
12 Jul 2023
Scaling In-Context Demonstrations with Structured Attention
Tianle Cai
Kaixuan Huang
Jason D. Lee
Mengdi Wang
LRM
31
8
0
05 Jul 2023
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Aaron Mueller
Kanika Narang
Lambert Mathias
Qifan Wang
Hamed Firooz
RALM
16
3
0
30 Jun 2023