
Do LLMs "know" internally when they follow instructions?
Juyeon Heo
Christina Heinze-Deml
Jaya Narain
Papers citing "Do LLMs "know" internally when they follow instructions?"
Title | Authors |
---|---|
Unlearn What You Want to Forget: Efficient Unlearning for LLMs | Jiaao Chen, Diyi Yang |
Mistral 7B | Albert Q. Jiang, Alexandre Sablayrolles, A. Mensch, Chris Bamford, Devendra Singh Chaplot, ... Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed |
Llama 2: Open Foundation and Fine-Tuned Chat Models | Hugo Touvron, Louis Martin, Kevin R. Stone, Peter Albert, Amjad Almahairi, ... Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom |