Learning Rewards from Linguistic Feedback

30 September 2020

Papers citing "Learning Rewards from Linguistic Feedback"


Increasing happiness through conversations with artificial intelligence Joseph Heffner Chongyu Qin Martin Chadwick Chris Knutsen Christopher Summerfield Zeb Kurth-Nelson Robb B. Rutledge AI4MH 42 0 0 02 Apr 2025
Retrospective Learning from Interactions Zizhao Chen Mustafa Omer Gul Yiwei Chen Gloria Geng Anne Wu Yoav Artzi LRM 28 1 0 17 Oct 2024
Problem Solving Through Human-AI Preference-Based Cooperation Subhabrata Dutta Timo Kaufmann Goran Glavas Ivan Habernal Kristian Kersting Frauke Kreuter Mira Mezini Iryna Gurevych Eyke Hüllermeier Hinrich Schuetze 98 1 0 14 Aug 2024
Social Learning through Interactions with Other Agents: A Survey Dylan Hillier Cheston Tan Jing Jiang 40 0 0 31 Jul 2024
Building Machines that Learn and Think with People Katherine M. Collins Ilia Sucholutsky Umang Bhatt Kartik Chandra Lionel Wong ... Mark K. Ho Vikash K. Mansinghka Adrian Weller Joshua B. Tenenbaum Thomas L. Griffiths 54 30 0 22 Jul 2024
Representational Alignment Supports Effective Machine Teaching Ilia Sucholutsky Katherine M. Collins Maya Malaviya Nori Jacoby Weiyang Liu ... J. Tenenbaum Brad Love Z. Pardos Adrian Weller Thomas L. Griffiths 54 3 0 06 Jun 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods Yuji Cao Huan Zhao Yuheng Cheng Ting Shu Guolong Liu Gaoqi Liang Junhua Zhao Yun Li LLMAG KELM OffRL LM&Ro 35 50 0 30 Mar 2024
Learning with Language-Guided State Abstractions Andi Peng Ilia Sucholutsky Belinda Z. Li T. Sumers Thomas L. Griffiths Jacob Andreas Julie A. Shah LM&Ro 49 13 0 28 Feb 2024
Natural Language Reinforcement Learning Xidong Feng Bo Liu Mengyue Yang Ziyan Wang Girish A. Koushiks Yali Du Ying Wen Jun Wang OffRL 35 3 0 11 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies Zhixuan Chu Yan Wang Feng Zhu Lu Yu Longfei Li Jinjie Gu LLMAG 23 8 0 06 Feb 2024
Trajectory-Oriented Policy Optimization with Sparse Rewards Guojian Wang Faguo Wu Xiao Zhang OffRL 17 1 0 04 Jan 2024
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning Md Saiful Islam Srijita Das S. Gottipati William Duguay Clodéric Mars Jalal Arabneydi Antoine Fagette Matthew J. Guzdial Matthew E. Taylor 35 1 0 23 Dec 2023
LLF-Bench: Benchmark for Interactive Learning from Language Feedback Ching-An Cheng Andrey Kolobov Dipendra Kumar Misra Allen Nie Adith Swaminathan 29 19 0 11 Dec 2023
Reinforcement Learning from Statistical Feedback: the Journey from AB Testing to ANT Testing Feiyang Han Yimin Wei Zhaofeng Liu Yanxing Qi 30 1 0 24 Nov 2023
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language Zhourui Guo Meng Yao Yang Yu Qiyue Yin OnRL 28 1 0 21 Sep 2023
Cognitive Architectures for Language Agents T. Sumers Shunyu Yao Karthik Narasimhan Thomas L. Griffiths LLMAG LM&Ro 54 153 0 05 Sep 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback Stephen Casper Xander Davies Claudia Shi T. Gilbert Jérémy Scheurer ... Erdem Biyik Anca Dragan David M. Krueger Dorsa Sadigh Dylan Hadfield-Menell ALM OffRL 47 472 0 27 Jul 2023
Learning to Generate Better Than Your LLM Jonathan D. Chang Kianté Brantley Rajkumar Ramamurthy Dipendra Kumar Misra Wen Sun 19 41 0 20 Jun 2023
Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning E. Liu S. Suri Tong Mu Allan Zhou Chelsea Finn LLMAG LM&Ro 21 2 0 14 Jun 2023
Curricular Subgoals for Inverse Reinforcement Learning Shunyu Liu Yunpeng Qing Shuqi Xu Hongyan Wu Jiangtao Zhang Jingyuan Cong Tianhao Chen Yunfu Liu Mingli Song 21 1 0 14 Jun 2023
Training Language Models with Language Feedback at Scale Jérémy Scheurer Jon Ander Campos Tomasz Korbak Jun Shern Chan Angelica Chen Kyunghyun Cho Ethan Perez ALM 39 103 0 28 Mar 2023
Distilling Internet-Scale Vision-Language Models into Embodied Agents T. Sumers Kenneth Marino Arun Ahuja Rob Fergus Ishita Dasgupta LM&Ro 28 24 0 29 Jan 2023
Large Language Models as Fiduciaries: A Case Study Toward Robustly Communicating With Artificial Intelligence Through Legal Standards John J. Nay ELM AILaw 29 15 0 24 Jan 2023
AugCSE: Contrastive Sentence Embedding with Diverse Augmentations Zilu Tang Muhammed Yusuf Kocyigit Derry Wijaya 35 8 0 20 Oct 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans John J. Nay ELM AILaw 88 27 0 14 Sep 2022
RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents Rafael Rodríguez-Sánchez Benjamin A. Spiegel Jenny Wang Roma Patel Stefanie Tellex George Konidaris 13 2 0 12 Aug 2022
Leveraging Language for Accelerated Learning of Tool Manipulation Allen Z. Ren Bharat Govil Tsung-Yen Yang Karthik Narasimhan Anirudha Majumdar LM&Ro 22 37 0 27 Jun 2022
How to talk so AI will learn: Instructions, descriptions, and autonomy T. Sumers Robert D. Hawkins Mark K. Ho Thomas L. Griffiths Dylan Hadfield-Menell LM&Ro 32 20 0 16 Jun 2022
Training Language Models with Language Feedback Jérémy Scheurer Jon Ander Campos Jun Shern Chan Angelica Chen Kyunghyun Cho Ethan Perez ALM 38 47 0 29 Apr 2022
Linguistic communication as (inverse) reward design T. Sumers Robert D. Hawkins Mark K. Ho Thomas L. Griffiths Dylan Hadfield-Menell 12 4 0 11 Apr 2022
Teaching Drones on the Fly: Can Emotional Feedback Serve as Learning Signal for Training Artificial Agents? M. Pollak Andrea Salfinger K. Hummel 14 2 0 19 Feb 2022
A Framework for Learning to Request Rich and Contextually Useful Information from Humans Khanh Nguyen Yonatan Bisk Hal Daumé 47 16 0 14 Oct 2021
Neural Abstructions: Abstractions that Support Construction for Grounded Language Learning Kaylee Burns Christopher D. Manning Li Fei-Fei 19 0 0 20 Jul 2021
Communicating Natural Programs to Humans and Machines Samuel Acquaviva Yewen Pu Marta Kryven Theo Sechopoulos Catherine Wong Gabrielle Ecanow Maxwell Nye Michael Henry Tessler J. Tenenbaum 25 40 0 15 Jun 2021
Interactive Learning from Activity Description Khanh Nguyen Dipendra Kumar Misra Robert Schapire Miroslav Dudík Patrick Shafto 47 34 0 13 Feb 2021
Adapting a Language Model for Controlled Affective Text Generation Ishika Singh Ahsan Barkati Tushar Goswamy Ashutosh Modi 10 30 0 08 Nov 2020
Dialogue Learning With Human-In-The-Loop Jiwei Li Alexander H. Miller S. Chopra Marc'Aurelio Ranzato Jason Weston OffRL 227 134 0 29 Nov 2016
Convolutional Neural Networks for Sentence Classification Yoon Kim AILaw VLM 255 13,364 0 25 Aug 2014