Title |
---|
![]() Selective Annotation Makes Language Models Better Few-Shot Learners Hongjin Su Jungo Kasai Chen Henry Wu Weijia Shi Tianlu Wang ...Rui Zhang Mari Ostendorf Luke Zettlemoyer Noah A. Smith Tao Yu |
![]() What Matters In On-Policy Reinforcement Learning? A Large-Scale
Empirical Study Marcin Andrychowicz Anton Raichuk Piotr Stańczyk Manu Orsini Sertan Girgin ...Matthieu Geist Olivier Pietquin Marcin Michalski Sylvain Gelly Olivier Bachem |