ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.09823
  4. Cited By
Dialogue Learning With Human-In-The-Loop

Dialogue Learning With Human-In-The-Loop

29 November 2016
Jiwei Li
Alexander H. Miller
S. Chopra
MarcÁurelio Ranzato
Jason Weston
    OffRL
ArXivPDFHTML

Papers citing "Dialogue Learning With Human-In-The-Loop"

24 / 24 papers shown
Title
It Takes Two: On the Seamlessness between Reward and Policy Model in
  RLHF
It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF
Taiming Lu
Lingfeng Shen
Xinyu Yang
Weiting Tan
Beidi Chen
Huaxiu Yao
55
2
0
12 Jun 2024
Reasons to Reject? Aligning Language Models with Judgments
Reasons to Reject? Aligning Language Models with Judgments
Weiwen Xu
Deng Cai
Zhisong Zhang
Wai Lam
Shuming Shi
ALM
21
14
0
22 Dec 2023
Let Me Teach You: Pedagogical Foundations of Feedback for Language
  Models
Let Me Teach You: Pedagogical Foundations of Feedback for Language Models
Beatriz Borges
Niket Tandon
Tanja Kaser
Antoine Bosselut
22
3
0
01 Jul 2023
LeTI: Learning to Generate from Textual Interactions
LeTI: Learning to Generate from Textual Interactions
Xingyao Wang
Hao Peng
Reyhaneh Jabbarvand
Heng Ji
35
30
0
17 May 2023
Training Language Models with Language Feedback at Scale
Training Language Models with Language Feedback at Scale
Jérémy Scheurer
Jon Ander Campos
Tomasz Korbak
Jun Shern Chan
Angelica Chen
Kyunghyun Cho
Ethan Perez
ALM
39
101
0
28 Mar 2023
When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad
  Responses into Good Labels
When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good Labels
Weiyan Shi
Emily Dinan
Kurt Shuster
Jason Weston
Jing Xu
46
19
0
28 Oct 2022
Learning New Skills after Deployment: Improving open-domain
  internet-driven dialogue with human feedback
Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback
Jing Xu
Megan Ung
M. Komeili
Kushal Arora
Y-Lan Boureau
Jason Weston
24
37
0
05 Aug 2022
Using Interactive Feedback to Improve the Accuracy and Explainability of
  Question Answering Systems Post-Deployment
Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment
Zichao Li
Prakhar Sharma
Xing Han Lù
Jackie C.K. Cheung
Siva Reddy
HAI
25
26
0
06 Apr 2022
Improving mathematical questioning in teacher training
Improving mathematical questioning in teacher training
Debajyoti Datta
Maria Phillips
J. Bywater
Jennifer L. Chiu
G. Watson
Laura E. Barnes
Donald E. Brown
21
0
0
02 Dec 2021
Building and Evaluating Open-Domain Dialogue Corpora with Clarifying
  Questions
Building and Evaluating Open-Domain Dialogue Corpora with Clarifying Questions
Mohammad Aliannejadi
Julia Kiseleva
A. Chuklin
Jeffrey Stephen Dalton
Mikhail Burtsev
73
96
0
13 Sep 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic
  Survey
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Erik Cambria
54
268
0
10 May 2021
Potential Impacts of Smart Homes on Human Behavior: A Reinforcement
  Learning Approach
Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach
Shashi Suman
Ali Etemad
F. Rivest
27
15
0
26 Feb 2021
Reinforcement Learning with Subspaces using Free Energy Paradigm
Reinforcement Learning with Subspaces using Free Energy Paradigm
Milad Ghorbani
Reshad Hosseini
Seyed Pooya Shariatpanahi
M. N. Ahmadabadi
16
0
0
13 Dec 2020
Learning Rewards from Linguistic Feedback
Learning Rewards from Linguistic Feedback
T. Sumers
Mark K. Ho
Robert D. Hawkins
Karthik Narasimhan
Thomas L. Griffiths
19
51
0
30 Sep 2020
Open-Domain Conversational Agents: Current Progress, Open Problems, and
  Future Directions
Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions
Stephen Roller
Y-Lan Boureau
Jason Weston
Antoine Bordes
Emily Dinan
...
Kurt Shuster
Eric Michael Smith
Arthur Szlam
Jack Urbanek
Mary Williamson
LLMAG
AI4CE
20
51
0
22 Jun 2020
Speak to your Parser: Interactive Text-to-SQL with Natural Language
  Feedback
Speak to your Parser: Interactive Text-to-SQL with Natural Language Feedback
Ahmed Elgohary
Saghar Hosseini
Ahmed Hassan Awadallah
13
67
0
05 May 2020
Teaching Machines to Converse
Teaching Machines to Converse
Jiwei Li
19
4
0
31 Jan 2020
Grounding Human-to-Vehicle Advice for Self-driving Vehicles
Grounding Human-to-Vehicle Advice for Self-driving Vehicles
Jinkyu Kim
Teruhisa Misu
Yi-Ting Chen
Ashish Tawari
John F. Canny
24
100
0
16 Nov 2019
Deep Learning Based Chatbot Models
Deep Learning Based Chatbot Models
Richard Csaky
29
46
0
23 Aug 2019
Entity-Relation Extraction as Multi-Turn Question Answering
Entity-Relation Extraction as Multi-Turn Question Answering
Xiaoya Li
Fan Yin
Zijun Sun
Xiayu Li
Arianna Yuan
Duo Chai
Mingxin Zhou
Jiwei Li
30
346
0
14 May 2019
Neural Approaches to Conversational AI
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
37
668
0
21 Sep 2018
A Benchmarking Environment for Reinforcement Learning Based Task
  Oriented Dialogue Management
A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management
I. Casanueva
Paweł Budzianowski
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
Stefan Ultes
L. Rojas-Barahona
S. Young
Milica Gasic
OffRL
25
54
0
29 Nov 2017
A Survey on Dialogue Systems: Recent Advances and New Frontiers
A Survey on Dialogue Systems: Recent Advances and New Frontiers
Hongshen Chen
Xiaorui Liu
Dawei Yin
Jiliang Tang
VLM
LLMAG
38
695
0
06 Nov 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,502
0
25 Jan 2017
1