
ComplexityNet: Increasing LLM Inference Efficiency by Learning Task Complexity
Papers citing "ComplexityNet: Increasing LLM Inference Efficiency by Learning Task Complexity"
15 / 15 papers shown
Title |
---|
Llama 2: Open Foundation and Fine-Tuned Chat Models. Hugo Touvron, Louis Martin, Kevin R. Stone, Peter Albert, Amjad Almahairi, ... Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom |