Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection

Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection

Ziqing Fan
Siyuan Du
Shengchao Hu
Pingjie Wang
Li Shen
Ya Zhang
Dacheng Tao
Yanfeng Wang

Papers citing "Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection"

Title
No papers