2408.16032
Cited By
An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders
28 August 2024
Shuang Feng
Grace Feng
OffRL
Papers citing
"An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders"
6 / 6 papers shown
Title
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
886
13,207
0
04 Mar 2022
Boosting Search Engines with Interactive Agents
Leonard Adolphs
Benjamin Boerschinger
Christian Buck
Michelle Chen Huebscher
Massimiliano Ciaramita
...
Thomas Hofmann
Yannic Kilcher
Sascha Rothe
Pier Giuseppe Sessa
Lierni Sestorain Saralegui
LLMAG
126
24
0
01 Sep 2021
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,229
0
11 Oct 2018
RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
D. Rohde
Stephen Bonner
Travis Dunlop
Flavian Vasile
Alexandros Karatzoglou
OffRL
57
150
0
02 Aug 2018
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning
Jing-Cheng Shi
Yang Yu
Qing Da
Shi-Yong Chen
Anxiang Zeng
OffRL
81
187
0
25 May 2018
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,377
0
12 Jun 2017