
v1v2 (latest)
Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training
Papers citing "Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training"
50 / 61 papers shown
Title |
---|
![]() OLMo: Accelerating the Science of Language Models Dirk Groeneveld Iz Beltagy Pete Walsh Akshita Bhagia Rodney Michael Kinney ...Jesse Dodge Kyle Lo Luca Soldaini Noah A. Smith Hanna Hajishirzi |
![]() Llama 2: Open Foundation and Fine-Tuned Chat Models Hugo Touvron Louis Martin Kevin R. Stone Peter Albert Amjad Almahairi ...Sharan Narang Aurelien Rodriguez Robert Stojnic Sergey Edunov Thomas Scialom |