Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.04492
Cited By
Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization
10 October 2022
Zonghan Yang
Xiaoyuan Yi
Peng Li
Yang Liu
Xing Xie
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization"
13 / 13 papers shown
Title
Towards Effective Discrimination Testing for Generative AI
Thomas P. Zollo
Nikita Rajaneesh
Richard Zemel
Talia B. Gillis
Emily Black
30
1
0
31 Dec 2024
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
Guangliang Liu
Milad Afshari
Xitong Zhang
Zhiyu Xue
Avrajit Ghosh
Bidhan Bashyal
Rongrong Wang
K. Johnson
27
0
0
06 Jun 2024
Controlled Text Generation via Language Model Arithmetic
Jasper Dekoninck
Marc Fischer
Luca Beurer-Kellner
Martin Vechev
31
36
0
24 Nov 2023
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models
Luiza Amador Pozzobon
B. Ermiş
Patrick Lewis
Sara Hooker
30
20
0
11 Oct 2023
MAUVE Scores for Generative Models: Theory and Practice
Krishna Pillutla
Lang Liu
John Thickstun
Sean Welleck
Swabha Swayamdipta
Rowan Zellers
Sewoong Oh
Yejin Choi
Zaïd Harchaoui
EGVM
35
21
0
30 Dec 2022
DuNST: Dual Noisy Self Training for Semi-Supervised Controllable Text Generation
Yuxi Feng
Xiaoyuan Yi
Xiting Wang
L. Lakshmanan
Xing Xie
DiffM
35
5
0
16 Dec 2022
An Analysis of the Effects of Decoding Algorithms on Fairness in Open-Ended Language Generation
Jwala Dhamala
Varun Kumar
Rahul Gupta
Kai-Wei Chang
Aram Galstyan
32
7
0
07 Oct 2022
Challenges in Measuring Bias via Open-Ended Language Generation
Afra Feyza Akyürek
Muhammed Yusuf Kocyigit
Sejin Paik
Derry Wijaya
38
22
0
23 May 2022
Text Detoxification using Large Pre-trained Neural Models
David Dale
Anton Voronov
Daryna Dementieva
V. Logacheva
Olga Kozlova
Nikita Semenov
Alexander Panchenko
39
71
0
18 Sep 2021
Challenges in Detoxifying Language Models
Johannes Welbl
Amelia Glaese
J. Uesato
Sumanth Dathathri
John F. J. Mellor
Lisa Anne Hendricks
Kirsty Anderson
Pushmeet Kohli
Ben Coppin
Po-Sen Huang
LM&MA
250
193
0
15 Sep 2021
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP
Timo Schick
Sahana Udupa
Hinrich Schütze
259
374
0
28 Feb 2021
The Woman Worked as a Babysitter: On Biases in Language Generation
Emily Sheng
Kai-Wei Chang
Premkumar Natarajan
Nanyun Peng
214
616
0
03 Sep 2019
A Survey on Bias and Fairness in Machine Learning
Ninareh Mehrabi
Fred Morstatter
N. Saxena
Kristina Lerman
Aram Galstyan
SyDa
FaML
323
4,212
0
23 Aug 2019
1