ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.14165
  4. Cited By
Towards Boosting the Open-Domain Chatbot with Human Feedback

Towards Boosting the Open-Domain Chatbot with Human Feedback

30 August 2022
Hua Lu
Siqi Bao
H. He
Fan Wang
Hua Wu
Haifeng Wang
    ALM
ArXivPDFHTML

Papers citing "Towards Boosting the Open-Domain Chatbot with Human Feedback"

34 / 34 papers shown
Title
Improving alignment of dialogue agents via targeted human judgements
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trkebacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
304
528
0
28 Sep 2022
Learning from data in the mixed adversarial non-adversarial case:
  Finding the helpers and ignoring the trolls
Learning from data in the mixed adversarial non-adversarial case: Finding the helpers and ignoring the trolls
Da Ju
Jing Xu
Y-Lan Boureau
Jason Weston
AAML
71
18
0
05 Aug 2022
Learning New Skills after Deployment: Improving open-domain
  internet-driven dialogue with human feedback
Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback
Jing Xu
Megan Ung
M. Komeili
Kushal Arora
Y-Lan Boureau
Jason Weston
66
37
0
05 Aug 2022
Link the World: Improving Open-domain Conversation with Dynamic
  Spatiotemporal-aware Knowledge
Link the World: Improving Open-domain Conversation with Dynamic Spatiotemporal-aware Knowledge
Han Zhou
Xinchao Xu
Wenquan Wu
Zheng-Yu Niu
Hua Wu
Siqi Bao
Fan Wang
Haifeng Wang
KELM
56
7
0
28 Jun 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning
  from Human Feedback
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
249
2,561
0
12 Apr 2022
PaLM: Scaling Language Modeling with Pathways
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
483
6,240
0
05 Apr 2022
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with
  Large-Scale Pre-Training
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training
Yuxian Gu
Jiaxin Wen
Hao Sun
Yi Song
Pei Ke
...
Zheng Zhang
Jianzhu Yao
Lei Liu
Xiaoyan Zhu
Minlie Huang
76
55
0
17 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
874
12,973
0
04 Mar 2022
LaMDA: Language Models for Dialog Applications
LaMDA: Language Models for Dialog Applications
R. Thoppilan
Daniel De Freitas
Jamie Hall
Noam M. Shazeer
Apoorv Kulshreshtha
...
Blaise Aguera-Arcas
Claire Cui
M. Croak
Ed H. Chi
Quoc Le
ALM
137
1,595
0
20 Jan 2022
Human Evaluation of Conversations is an Open Problem: comparing the
  sensitivity of various methods for evaluating dialogue agents
Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents
Eric Michael Smith
Orion Hsu
Rebecca Qian
Stephen Roller
Y-Lan Boureau
Jason Weston
64
67
0
12 Jan 2022
WebGPT: Browser-assisted question-answering with human feedback
WebGPT: Browser-assisted question-answering with human feedback
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
...
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
182
1,275
0
17 Dec 2021
A General Language Assistant as a Laboratory for Alignment
A General Language Assistant as a Laboratory for Alignment
Amanda Askell
Yuntao Bai
Anna Chen
Dawn Drain
Deep Ganguli
...
Tom B. Brown
Jack Clark
Sam McCandlish
C. Olah
Jared Kaplan
ALM
118
779
0
01 Dec 2021
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation
Siqi Bao
H. He
Fan Wang
Hua Wu
Haifeng Wang
...
Xinxian Huang
Xin Tian
Xinchao Xu
Yingzhan Lin
Zhengyu Niu
VLM
ALM
54
63
0
20 Sep 2021
EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative
  Pre-Training
EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training
Hao Zhou
Pei Ke
Zheng Zhang
Yuxian Gu
Yinhe Zheng
...
Xiaocong Yang
Bosi Wen
Xiaoyan Zhu
Minlie Huang
Jie Tang
55
54
0
03 Aug 2021
Anticipating Safety Issues in E2E Conversational AI: Framework and
  Tooling
Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling
Emily Dinan
Gavin Abercrombie
A. S. Bergman
Shannon L. Spruit
Dirk Hovy
Y-Lan Boureau
Verena Rieser
64
107
0
07 Jul 2021
Human-centric Dialog Training via Offline Reinforcement Learning
Human-centric Dialog Training via Offline Reinforcement Learning
Natasha Jaques
J. Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
77
95
0
12 Oct 2020
Deploying Lifelong Open-Domain Dialogue Learning
Deploying Lifelong Open-Domain Dialogue Learning
Kurt Shuster
Jack Urbanek
Emily Dinan
Arthur Szlam
Jason Weston
60
22
0
18 Aug 2020
A Large-Scale Chinese Short-Text Conversation Dataset
A Large-Scale Chinese Short-Text Conversation Dataset
Yida Wang
Pei Ke
Yinhe Zheng
Kaili Huang
Yong Jiang
Xiaoyan Zhu
Minlie Huang
46
136
0
10 Aug 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
792
42,055
0
28 May 2020
Recipes for building an open-domain chatbot
Recipes for building an open-domain chatbot
Stephen Roller
Emily Dinan
Naman Goyal
Da Ju
Mary Williamson
...
Myle Ott
Kurt Shuster
Eric Michael Smith
Y-Lan Boureau
Jason Weston
ALM
117
1,009
0
28 Apr 2020
Can You Put it All Together: Evaluating Conversational Agents' Ability
  to Blend Skills
Can You Put it All Together: Evaluating Conversational Agents' Ability to Blend Skills
Eric Michael Smith
Mary Williamson
Kurt Shuster
Jason Weston
Y-Lan Boureau
62
223
0
17 Apr 2020
Towards a Human-like Open-Domain Chatbot
Towards a Human-like Open-Domain Chatbot
Daniel De Freitas
Minh-Thang Luong
David R. So
Jamie Hall
Noah Fiedel
...
Zi Yang
Apoorv Kulshreshtha
Gaurav Nemade
Yifeng Lu
Quoc V. Le
114
938
0
27 Jan 2020
Unified Language Model Pre-training for Natural Language Understanding
  and Generation
Unified Language Model Pre-training for Natural Language Understanding and Generation
Li Dong
Nan Yang
Wenhui Wang
Furu Wei
Xiaodong Liu
Yu Wang
Jianfeng Gao
M. Zhou
H. Hon
ELM
AI4CE
220
1,556
0
08 May 2019
Towards Coherent and Engaging Spoken Dialog Response Generation Using
  Automatic Conversation Evaluators
Towards Coherent and Engaging Spoken Dialog Response Generation Using Automatic Conversation Evaluators
Sanghyun Yi
Rahul Goel
Chandra Khatri
Alessandra Cervone
Tagyoung Chung
Behnam Hedayatnia
Anu Venkatesh
Raefer Gabriel
Dilek Z. Hakkani-Tür
45
60
0
30 Apr 2019
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!
Braden Hancock
Antoine Bordes
Pierre-Emmanuel Mazaré
Jason Weston
107
193
0
16 Jan 2019
The Design and Implementation of XiaoIce, an Empathetic Social Chatbot
The Design and Implementation of XiaoIce, an Empathetic Social Chatbot
Li Zhou
Jianfeng Gao
Di Li
Harry Shum
63
601
0
21 Dec 2018
Wizard of Wikipedia: Knowledge-Powered Conversational agents
Wizard of Wikipedia: Knowledge-Powered Conversational agents
Emily Dinan
Stephen Roller
Kurt Shuster
Angela Fan
Michael Auli
Jason Weston
RALM
KELM
124
950
0
03 Nov 2018
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Saizheng Zhang
Emily Dinan
Jack Urbanek
Arthur Szlam
Douwe Kiela
Jason Weston
105
1,459
0
22 Jan 2018
DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset
DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset
Yanran Li
Hui Su
Xiaoyu Shen
Wenjie Li
Ziqiang Cao
Shuzi Niu
60
1,302
0
11 Oct 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
701
131,652
0
12 Jun 2017
Overcoming catastrophic forgetting in neural networks
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
367
7,518
0
02 Dec 2016
How NOT To Evaluate Your Dialogue System: An Empirical Study of
  Unsupervised Evaluation Metrics for Dialogue Response Generation
How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation
Chia-Wei Liu
Ryan J. Lowe
Iulian Serban
Michael Noseworthy
Laurent Charlin
Joelle Pineau
104
1,297
0
25 Mar 2016
A Diversity-Promoting Objective Function for Neural Conversation Models
A Diversity-Promoting Objective Function for Neural Conversation Models
Jiwei Li
Michel Galley
Chris Brockett
Jianfeng Gao
W. Dolan
143
2,392
0
11 Oct 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.8K
150,115
0
22 Dec 2014
1