ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.09050
  4. Cited By
Ethical Challenges in Data-Driven Dialogue Systems

Ethical Challenges in Data-Driven Dialogue Systems

24 November 2017
Peter Henderson
Koustuv Sinha
Nicolas Angelard-Gontier
Nan Rosemary Ke
G. Fried
Ryan J. Lowe
Joelle Pineau
ArXivPDFHTML

Papers citing "Ethical Challenges in Data-Driven Dialogue Systems"

44 / 44 papers shown
Title
Building Trustworthy Multimodal AI: A Review of Fairness, Transparency, and Ethics in Vision-Language Tasks
Building Trustworthy Multimodal AI: A Review of Fairness, Transparency, and Ethics in Vision-Language Tasks
Mohammad Saleha
Azadeh Tabatabaeib
52
0
0
14 Apr 2025
From Pixels to Personas: Investigating and Modeling
  Self-Anthropomorphism in Human-Robot Dialogues
From Pixels to Personas: Investigating and Modeling Self-Anthropomorphism in Human-Robot Dialogues
Yu Li
Devamanyu Hazarika
Di Jin
Julia Hirschberg
Yang Liu
30
0
0
04 Oct 2024
Undesirable Memorization in Large Language Models: A Survey
Undesirable Memorization in Large Language Models: A Survey
Ali Satvaty
Suzan Verberne
Fatih Turkmen
ELM
PILM
89
7
0
03 Oct 2024
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Tomer Ashuach
Martin Tutek
Yonatan Belinkov
KELM
MU
71
4
0
13 Jun 2024
The Mosaic Memory of Large Language Models
The Mosaic Memory of Large Language Models
Igor Shilov
Matthieu Meeus
Yves-Alexandre de Montjoye
47
3
0
24 May 2024
Navigating LLM Ethics: Advancements, Challenges, and Future Directions
Navigating LLM Ethics: Advancements, Challenges, and Future Directions
Junfeng Jiao
S. Afroogh
Yiming Xu
Connor Phillips
AILaw
68
20
0
14 May 2024
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language
  Models
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models
Xinwei Wu
Junzhuo Li
Minghui Xu
Weilong Dong
Shuangzhi Wu
Chao Bian
Deyi Xiong
MU
KELM
32
46
0
31 Oct 2023
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language
  Models
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models
Mingfeng Xue
Dayiheng Liu
Kexin Yang
Guanting Dong
Wenqiang Lei
Zheng Yuan
Chang Zhou
Jingren Zhou
LLMAG
27
2
0
25 Oct 2023
Evaluating Chatbots to Promote Users' Trust -- Practices and Open
  Problems
Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems
Biplav Srivastava
Kausik Lakkaraju
T. Koppel
Vignesh Narayanan
Ashish Kundu
Sachindra Joshi
37
2
0
09 Sep 2023
Reducing Sensitivity on Speaker Names for Text Generation from Dialogues
Reducing Sensitivity on Speaker Names for Text Generation from Dialogues
Qi Jia
Haifeng Tang
Kenny Q. Zhu
24
2
0
23 May 2023
CAMEL: Communicative Agents for "Mind" Exploration of Large Language
  Model Society
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society
Ge Li
Hasan Hammoud
Hani Itani
Dmitrii Khizbullin
Guohao Li
SyDa
ALM
50
413
0
31 Mar 2023
Conversational AI-Powered Design: ChatGPT as Designer, User, and Product
Conversational AI-Powered Design: ChatGPT as Designer, User, and Product
A. Kocaballi
24
38
0
15 Feb 2023
Advances in Automatically Rating the Trustworthiness of Text Processing
  Services
Advances in Automatically Rating the Trustworthiness of Text Processing Services
Biplav Srivastava
Kausik Lakkaraju
Mariana Bernagozzi
Marco Valtorta
30
6
0
04 Feb 2023
Extracting Training Data from Diffusion Models
Extracting Training Data from Diffusion Models
Nicholas Carlini
Jamie Hayes
Milad Nasr
Matthew Jagielski
Vikash Sehwag
Florian Tramèr
Borja Balle
Daphne Ippolito
Eric Wallace
DiffM
68
572
0
30 Jan 2023
On Safe and Usable Chatbots for Promoting Voter Participation
On Safe and Usable Chatbots for Promoting Voter Participation
Bharath Muppasani
Vishal Pallagani
Kausik Lakkaraju
Shuge Lei
Biplav Srivastava
Brett W. Robertson
Andrea A. Hickerson
Vignesh Narayanan
32
2
0
16 Dec 2022
An Empathetic AI Coach for Self-Attachment Therapy
An Empathetic AI Coach for Self-Attachment Therapy
Lisa Alazraki
Ali Ghachem
Neophytos Polydorou
Foaad Khosmood
A. Edalat
24
9
0
17 Sep 2022
In conversation with Artificial Intelligence: aligning language models
  with human values
In conversation with Artificial Intelligence: aligning language models with human values
Atoosa Kasirzadeh
Iason Gabriel
24
98
0
01 Sep 2022
Target-Guided Dialogue Response Generation Using Commonsense and Data
  Augmentation
Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation
Prakhar Gupta
Harsh Jhamtani
Jeffrey P. Bigham
49
12
0
19 May 2022
State-of-the-art in Open-domain Conversational AI: A Survey
State-of-the-art in Open-domain Conversational AI: A Survey
Tosin P. Adewumi
F. Liwicki
Marcus Liwicki
32
15
0
02 May 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning
  from Human Feedback
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
95
2,352
0
12 Apr 2022
PanGu-Bot: Efficient Generative Dialogue Pre-training from Pre-trained
  Language Model
PanGu-Bot: Efficient Generative Dialogue Pre-training from Pre-trained Language Model
Fei Mi
Yitong Li
Yulong Zeng
Jingyan Zhou
Yasheng Wang
Chuanfei Xu
Lifeng Shang
Xin Jiang
Shiqi Zhao
Qun Liu
ALM
45
18
0
31 Mar 2022
Do Language Models Plagiarize?
Do Language Models Plagiarize?
Jooyoung Lee
Thai Le
Jinghui Chen
Dongwon Lee
38
74
0
15 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
372
12,081
0
04 Mar 2022
What Does it Mean for a Language Model to Preserve Privacy?
What Does it Mean for a Language Model to Preserve Privacy?
Hannah Brown
Katherine Lee
Fatemehsadat Mireshghallah
Reza Shokri
Florian Tramèr
PILM
55
232
0
11 Feb 2022
Red Teaming Language Models with Language Models
Red Teaming Language Models with Language Models
Ethan Perez
Saffron Huang
Francis Song
Trevor Cai
Roman Ring
John Aslanides
Amelia Glaese
Nat McAleese
G. Irving
AAML
13
611
0
07 Feb 2022
Does Entity Abstraction Help Generative Transformers Reason?
Does Entity Abstraction Help Generative Transformers Reason?
Nicolas Angelard-Gontier
Siva Reddy
C. Pal
34
5
0
05 Jan 2022
A Survey on Gender Bias in Natural Language Processing
A Survey on Gender Bias in Natural Language Processing
Karolina Stañczak
Isabelle Augenstein
30
110
0
28 Dec 2021
Counterfactual Memorization in Neural Language Models
Counterfactual Memorization in Neural Language Models
Chiyuan Zhang
Daphne Ippolito
Katherine Lee
Matthew Jagielski
Florian Tramèr
Nicholas Carlini
32
129
0
24 Dec 2021
Trustworthy AI: A Computational Perspective
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
104
196
0
12 Jul 2021
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of
  Conversational Language Models
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models
Soumya Barikeri
Anne Lauscher
Ivan Vulić
Goran Glavaš
45
178
0
07 Jun 2021
Evaluating Gender Bias in Natural Language Inference
Evaluating Gender Bias in Natural Language Inference
Shanya Sharma
Manan Dey
Koustuv Sinha
28
41
0
12 May 2021
Detoxifying Language Models Risks Marginalizing Minority Voices
Detoxifying Language Models Risks Marginalizing Minority Voices
Albert Xu
Eshaan Pathak
Eric Wallace
Suchin Gururangan
Maarten Sap
Dan Klein
24
123
0
13 Apr 2021
Detecting and Classifying Malevolent Dialogue Responses: Taxonomy, Data
  and Methodology
Detecting and Classifying Malevolent Dialogue Responses: Taxonomy, Data and Methodology
Yangjun Zhang
Pengjie Ren
Maarten de Rijke
26
11
0
21 Aug 2020
Chat as Expected: Learning to Manipulate Black-box Neural Dialogue
  Models
Chat as Expected: Learning to Manipulate Black-box Neural Dialogue Models
Haochen Liu
Zhiwei Wang
Tyler Derr
Jiliang Tang
AAML
22
15
0
27 May 2020
Personalized Chatbot Trustworthiness Ratings
Personalized Chatbot Trustworthiness Ratings
Biplav Srivastava
F. Rossi
Sheema Usmani
Mariana Bernagozzi
13
20
0
13 May 2020
Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation
Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation
Emily Dinan
Angela Fan
Adina Williams
Jack Urbanek
Douwe Kiela
Jason Weston
32
206
0
10 Nov 2019
A Crowd-based Evaluation of Abuse Response Strategies in Conversational
  Agents
A Crowd-based Evaluation of Abuse Response Strategies in Conversational Agents
Amanda Cercas Curry
Verena Rieser
30
31
0
10 Sep 2019
Build it Break it Fix it for Dialogue Safety: Robustness from
  Adversarial Human Attack
Build it Break it Fix it for Dialogue Safety: Robustness from Adversarial Human Attack
Emily Dinan
Samuel Humeau
Bharath Chintagunta
Jason Weston
22
243
0
17 Aug 2019
A Virtual Conversational Agent for Teens with Autism: Experimental
  Results and Design Lessons
A Virtual Conversational Agent for Teens with Autism: Experimental Results and Design Lessons
M. R. Ali
Zahra Razavi
A. Mamun
Raina Langevin
Benjamin Kane
Reza Rawassizadeh
Lenhart Schubert
M Ehsan Hoque
21
25
0
07 Nov 2018
The RLLChatbot: a solution to the ConvAI challenge
The RLLChatbot: a solution to the ConvAI challenge
Nicolas Angelard-Gontier
Koustuv Sinha
Peter Henderson
Iulian Serban
Michael Noseworthy
Prasanna Parthasarathi
Joelle Pineau
OffRL
33
0
0
07 Nov 2018
Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue
  Models
Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models
Tong Niu
Joey Tianyi Zhou
AAML
21
85
0
06 Sep 2018
Fighting Offensive Language on Social Media with Unsupervised Text Style
  Transfer
Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer
Cicero Nogueira dos Santos
Igor Melnyk
Inkit Padhi
22
153
0
20 May 2018
A Review of Evaluation Techniques for Social Dialogue Systems
A Review of Evaluation Techniques for Social Dialogue Systems
Amanda Cercas Curry
H. Hastie
Verena Rieser
126
13
0
13 Sep 2017
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
220
1,328
0
05 Jun 2016
1