2309.14563
Cited By
Towards a statistical theory of data selection under weak supervision
25 September 2023
Germain Kolossov
Andrea Montanari
Pulkit Tandon
Papers citing "Towards a statistical theory of data selection under weak supervision"
8 / 8 papers shown
Title
Most Influential Subset Selection: Challenges, Promises, and Beyond
Yuzheng Hu
Pingbang Hu
Han Zhao
Jiaqi W. Ma
TDI
142
2
0
10 Jan 2025
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws
M. E. Ildiz
Halil Alperen Gozeten
Ege Onur Taga
Marco Mondelli
Samet Oymak
56
2
0
24 Oct 2024
Balancing Label Quantity and Quality for Scalable Elicitation
Alex Troy Mallen
Nora Belrose
34
1
0
17 Oct 2024
The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective
Zhen Qin
Daoyuan Chen
Wenhao Zhang
Liuyi Yao
Yilun Huang
Bolin Ding
Yaliang Li
Shuiguang Deng
57
5
0
11 Jul 2024
Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning
Yijun Dong
Hoang Phan
Xiang Pan
Qi Lei
48
5
0
08 Jul 2024
Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement
Yunzhen Feng
Elvis Dohmatob
Pu Yang
Francois Charton
Julia Kempe
53
17
0
11 Jun 2024
High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization
Yihang Chen
Fanghui Liu
Taiji Suzuki
V. Cevher
40
1
0
05 Jun 2024
SelMatch: Effectively Scaling Up Dataset Distillation via Selection-Based Initialization and Partial Updates by Trajectory Matching
Yongmin Lee
Hye Won Chung
31
6
0
28 May 2024