ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.16032
  4. Cited By
An Extremely Data-efficient and Generative LLM-based Reinforcement
  Learning Agent for Recommenders

An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders

28 August 2024
Shuang Feng
Grace Feng
    OffRL
ArXiv (abs)PDFHTML

Papers citing "An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders"

6 / 6 papers shown
Title
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
886
13,207
0
04 Mar 2022
Boosting Search Engines with Interactive Agents
Boosting Search Engines with Interactive Agents
Leonard Adolphs
Benjamin Boerschinger
Christian Buck
Michelle Chen Huebscher
Massimiliano Ciaramita
...
Thomas Hofmann
Yannic Kilcher
Sascha Rothe
Pier Giuseppe Sessa
Lierni Sestorain Saralegui
LLMAG
126
24
0
01 Sep 2021
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,229
0
11 Oct 2018
RecoGym: A Reinforcement Learning Environment for the problem of Product
  Recommendation in Online Advertising
RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
D. Rohde
Stephen Bonner
Travis Dunlop
Flavian Vasile
Alexandros Karatzoglou
OffRL
57
150
0
02 Aug 2018
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for
  Reinforcement Learning
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning
Jing-Cheng Shi
Yang Yu
Qing Da
Shi-Yong Chen
Anxiang Zeng
OffRL
81
187
0
25 May 2018
Deep reinforcement learning from human preferences
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,377
0
12 Jun 2017
1