arXiv: 2503.00808
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
2 March 2025
Kashun Shum, Yuanmin Huang, Hongjian Zou, Qi Ding, Yixuan Liao, Xiao Chen, Qian Liu, Junxian He
Papers citing "Predictive Data Selection: The Data That Predicts Is the Data That Teaches"
3 / 3 papers shown
DataRater: Meta-Learned Dataset Curation
Dan A. Calian, Gregory Farquhar, Iurii Kemaev, Luisa M. Zintgraf, Matteo Hessel, ..., András Gyorgy, Tom Schaul, Jeffrey Dean, Hado van Hasselt, David Silver
10
0
0
23 May 2025
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao, Yu Yang, Y. Fu, Xin Dong, Dan Su, ..., Hongxu Yin, M. Patwary, Yingyan, Jan Kautz, Pavlo Molchanov
52
0
0
17 Apr 2025
Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets
H. Kniesel, Pedro Hermosilla, Timo Ropinski
86
0
0
12 Mar 2025