arXiv:2309.02632
Deep Reinforcement Learning from Hierarchical Preference Design
6 September 2023
Alexander Bukharin, Yixiao Li, Pengcheng He, Tuo Zhao
Papers citing "Deep Reinforcement Learning from Hierarchical Preference Design" (4 / 4 papers shown)
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Silvio Savarese, Steven C. H. Hoi
SyDa, ALM
135 | 241 | 0
05 Jul 2022
Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe
OSLM, ALM
375 | 12,081 | 0
04 Mar 2022
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang, Weishi Wang, Shafiq Joty, Steven C. H. Hoi
246 | 1,506 | 0
02 Sep 2021
Measuring Coding Challenge Competence With APPS
Dan Hendrycks, Steven Basart, Saurav Kadavath, Mantas Mazeika, Akul Arora, ..., Collin Burns, Samir Puranik, Horace He, D. Song, Jacob Steinhardt
ELM, AIMat, ALM
208 | 631 | 0
20 May 2021