
Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection
Ziqing Fan
Siyuan Du
Shengchao Hu
Pingjie Wang
Li Shen
Ya Zhang
Dacheng Tao
Yanfeng Wang
Papers citing "Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection"
No papers.