Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.00705
Cited By
SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning
23 April 2024
Yexiao He
Ziyao Wang
Zheyu Shen
Guoheng Sun
Yucong Dai
Yongkai Wu
Hongyi Wang
Ang Li
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning"
4 / 4 papers shown
Title
T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning
Yanjun Fu
Faisal Hamman
Sanghamitra Dutta
ALM
79
0
0
02 Jun 2025
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning
Jia Zhang
Chen-Xi Zhang
Yang Liu
Yi-Xuan Jin
Xiao-Wen Yang
Bo Zheng
Yi Liu
Lan-Zhe Guo
143
3
0
14 Mar 2025
Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models
Ziche Liu
Rui Ke
Feng Jiang
Feng Jiang
Haizhou Li
146
2
0
20 Jun 2024
Contextual Diversity for Active Learning
Sharat Agarwal
H. Arora
Saket Anand
Chetan Arora
185
172
0
13 Aug 2020
1