
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions
Papers citing "What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions"
Title |
---|
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. Luca Soldaini, Rodney Michael Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, ..., Hanna Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo |
Citation: A Key to Building Responsible and Accountable Large Language Models. Jie Huang, Kevin Chen-Chuan Chang |