ResearchTrend.AI
PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning

23 February 2024
Simon Holk
Daniel Marta
Iolanda Leite

Papers citing "PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning"

24 / 24 papers shown
GRACE: Generating Socially Appropriate Robot Actions Leveraging LLMs and Human Explanations
Fethiye Irmak Dogan
Umut Ozyurt
Gizem Cinar
Hatice Gunes
LLMAG
77
4
0
25 Sep 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
Matthew E. Taylor
OffRL
108
2
0
30 Apr 2024
Multimodal Deep Learning
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
...
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Matthias Aßenmacher
120
3,174
0
12 Jan 2023
Large Language Models Are Reasoning Teachers
Namgyu Ho
Laura Schmid
Se-Young Yun
ReLM, ELM, LRM
111
350
0
20 Dec 2022
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
107
100
0
06 Dec 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro, LRM, AI4CE, ReLM
823
9,576
0
28 Jan 2022
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
104
44
0
11 Aug 2021
Multimodal Few-Shot Learning with Frozen Language Models
Maria Tsimpoukelli
Jacob Menick
Serkan Cabi
S. M. Ali Eslami
Oriol Vinyals
Felix Hill
MLLM
178
788
0
25 Jun 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
Kimin Lee
Laura M. Smith
Pieter Abbeel
OffRL
65
288
0
09 Jun 2021
Active Preference-Based Gaussian Process Regression for Reward Learning
Erdem Biyik
Nicolas Huynh
Mykel J. Kochenderfer
Dorsa Sadigh
GP
72
109
0
06 May 2020
Meta-Transfer Learning for Zero-Shot Super-Resolution
Jae Woong Soh
Sunwoo Cho
N. Cho
SupR
77
284
0
27 Feb 2020
Reward-rational (implicit) choice: A unifying formalism for reward learning
Hong Jun Jeon
S. Milli
Anca Dragan
76
177
0
12 Feb 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
523
42,559
0
03 Dec 2019
Self-training with Noisy Student improves ImageNet classification
Qizhe Xie
Minh-Thang Luong
Eduard H. Hovy
Quoc V. Le
NoLa
312
2,392
0
11 Nov 2019
Self-Supervised Correspondence in Visuomotor Policy Learning
Peter R. Florence
Lucas Manuelli
Russ Tedrake
SSL
83
163
0
16 Sep 2019
Unsupervised Learning of Object Keypoints for Perception and Control
Tejas D. Kulkarni
Ankush Gupta
Catalin Ionescu
Sebastian Borgeaud
Malcolm Reynolds
Andrew Zisserman
Volodymyr Mnih
SSL, OCL
55
196
0
19 Jun 2019
Few-Shot Goal Inference for Visuomotor Learning and Planning
Annie Xie
Avi Singh
Sergey Levine
Chelsea Finn
OffRL
91
70
0
30 Sep 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
J. Matas
Stephen James
Andrew J. Davison
AI4CE
69
360
0
20 Jun 2018
Know What You Don't Know: Unanswerable Questions for SQuAD
Pranav Rajpurkar
Robin Jia
Percy Liang
RALM, ELM
290
2,853
0
11 Jun 2018
Inverse Reward Design
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan
81
399
0
08 Nov 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
526
19,237
0
20 Jul 2017
Loss is its own Reward: Self-Supervision for Reinforcement Learning
Evan Shelhamer
Parsa Mahmoudieh
Max Argus
Trevor Darrell
SSL
83
186
0
21 Dec 2016
Learning to Navigate in Complex Environments
Piotr Wojciech Mirowski
Razvan Pascanu
Fabio Viola
Hubert Soyer
Andy Ballard
...
Ross Goroshin
Laurent Sifre
Koray Kavukcuoglu
D. Kumaran
R. Hadsell
107
880
0
11 Nov 2016
Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future Tasks
E. Gavves
Thomas Mensink
Tatiana Tommasi
Cees G. M. Snoek
Tinne Tuytelaars
VLM
45
69
0
06 Oct 2015