Assessing Social and Intersectional Biases in Contextualized Word Representations

4 November 2019

Papers citing "Assessing Social and Intersectional Biases in Contextualized Word Representations"

40 / 40 papers shown

Title
Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model Qiyuan Deng X. Bai Kehai Chen Yaowei Wang Liqiang Nie Min Zhang OffRL 66 0 0 13 Mar 2025
Fair Text Classification via Transferable Representations Thibaud Leteno Michael Perrot Charlotte Laclau Antoine Gourru Christophe Gravier FaML 88 0 0 10 Mar 2025
Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings Carolin M. Schuster Maria-Alexandra Dinisor Shashwat Ghatiwala Georg Groh 79 1 0 25 Nov 2024
Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT Shucheng Zhu Weikang Wang Ying Liu 37 5 0 11 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes Damin Zhang Yi Zhang Geetanjali Bihani Julia Taylor Rayz 53 2 0 06 May 2024
A Survey on Fairness in Large Language Models Yingji Li Mengnan Du Rui Song Xin Wang Ying Wang ALM 52 60 0 20 Aug 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models Tarek Naous Michael Joseph Ryan Alan Ritter Wei-ping Xu 37 85 0 23 May 2023
Language-Agnostic Bias Detection in Language Models with Bias Probing Abdullatif Köksal Omer F. Yalcin Ahmet Akbiyik M. Kilavuz Anna Korhonen Hinrich Schütze 41 1 0 22 May 2023
ChatGPT and a New Academic Reality: Artificial Intelligence-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing Brady Lund Ting Wang Nishith Reddy Mannuru Bing Nie S. Shimray Ziang Wang AI4CE 15 498 0 21 Mar 2023
Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets Tosin P. Adewumi Isabella Sodergren Lama Alkhaled Sana Sabah Sabry F. Liwicki Marcus Liwicki 35 4 0 28 Jan 2023
Trustworthy Social Bias Measurement Rishi Bommasani Percy Liang 27 10 0 20 Dec 2022
Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting Su Wang Chitwan Saharia Ceslee Montgomery Jordi Pont-Tuset Shai Noy ... Radu Soricut Jason Baldridge Mohammad Norouzi Peter Anderson William Chan 35 176 0 13 Dec 2022
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models Silke Husse Andreas Spitz 25 6 0 15 Nov 2022
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation Tianxiang Sun Junliang He Xipeng Qiu Xuanjing Huang 24 44 0 14 Oct 2022
Toxicity in Multilingual Machine Translation at Scale Marta R. Costa-jussá Eric Michael Smith C. Ropers Daniel Licht Jean Maillard Javier Ferrando Carlos Escolano 30 25 0 06 Oct 2022
Debiasing Word Embeddings with Nonlinear Geometry Lu Cheng Nayoung Kim Huan Liu 24 5 0 29 Aug 2022
FairDistillation: Mitigating Stereotyping in Language Models Pieter Delobelle Bettina Berendt 23 8 0 10 Jul 2022
Markedness in Visual Semantic AI Robert Wolfe Aylin Caliskan VLM 30 35 0 23 May 2022
"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset Eric Michael Smith Melissa Hall Melanie Kambadur Eleonora Presani Adina Williams 79 130 0 18 May 2022
Towards Intersectionality in Machine Learning: Including More Identities, Handling Underrepresentation, and Performing Evaluation Angelina Wang V. V. Ramaswamy Olga Russakovsky FaML 26 92 0 10 May 2022
How Gender Debiasing Affects Internal Model Representations, and Why It Matters Hadas Orgad Seraphina Goldfarb-Tarrant Yonatan Belinkov 26 18 0 14 Apr 2022
Fair and Argumentative Language Modeling for Computational Argumentation Carolin Holtermann Anne Lauscher Simone Paolo Ponzetto 16 21 0 08 Apr 2022
Mapping the Multilingual Margins: Intersectional Biases of Sentiment Analysis Systems in English, Spanish, and Arabic Antonio Camara Nina Taneja Tamjeed Azad Emily Allaway R. Zemel 21 21 0 07 Apr 2022
Speciesist Language and Nonhuman Animal Bias in English Masked Language Models Masashi Takeshita Rafal Rzepka K. Araki 29 6 0 10 Mar 2022
iSEA: An Interactive Pipeline for Semantic Error Analysis of NLP Models Jun Yuan Jesse Vig Nazneen Rajani 16 13 0 08 Mar 2022
CM3: A Causal Masked Multimodal Model of the Internet Armen Aghajanyan Po-Yao (Bernie) Huang Candace Ross Vladimir Karpukhin Hu Xu ... Dmytro Okhonko Mandar Joshi Gargi Ghosh M. Lewis Luke Zettlemoyer 15 155 0 19 Jan 2022
A Survey on Gender Bias in Natural Language Processing Karolina Stañczak Isabelle Augenstein 30 110 0 28 Dec 2021
Measuring Fairness with Biased Rulers: A Survey on Quantifying Biases in Pretrained Language Models Pieter Delobelle E. Tokpo T. Calders Bettina Berendt 19 24 0 14 Dec 2021
SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets Ann Yuan Daphne Ippolito Vitaly Nikolaev Chris Callison-Burch Andy Coenen Sebastian Gehrmann SyDa 112 20 0 11 Nov 2021
Simple Entity-Centric Questions Challenge Dense Retrievers Christopher Sciavolino Zexuan Zhong Jinhyuk Lee Danqi Chen RALM 27 160 0 17 Sep 2021
Boosting Search Engines with Interactive Agents Leonard Adolphs Benjamin Boerschinger Christian Buck Michelle Chen Huebscher Massimiliano Ciaramita ... Thomas Hofmann Yannic Kilcher Sascha Rothe Pier Giuseppe Sessa Lierni Sestorain Saralegui LLMAG 26 24 0 01 Sep 2021
Harms of Gender Exclusivity and Challenges in Non-Binary Representation in Language Technologies Sunipa Dev Masoud Monajatipoor Anaelia Ovalle Arjun Subramonian J. M. Phillips Kai-Wei Chang 33 164 0 27 Aug 2021
A Survey of Race, Racism, and Anti-Racism in NLP Anjalie Field Su Lin Blodgett Zeerak Talat Yulia Tsvetkov 42 122 0 21 Jun 2021
Large Pre-trained Language Models Contain Human-like Biases of What is Right and Wrong to Do P. Schramowski Cigdem Turan Nico Andersen Constantin Rothkopf Kristian Kersting 33 281 0 08 Mar 2021
WordBias: An Interactive Visual Tool for Discovering Intersectional Biases Encoded in Word Embeddings Bhavya Ghai Md. Naimul Hoque Klaus Mueller 29 26 0 05 Mar 2021
Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models Hannah Rose Kirk Yennie Jun Haider Iqbal Elias Benussi Filippo Volpin F. Dreyer Aleksandar Shtedritski Yuki M. Asano 22 179 0 08 Feb 2021
Debiasing Pre-trained Contextualised Embeddings Masahiro Kaneko Danushka Bollegala 218 138 0 23 Jan 2021
Image Representations Learned With Unsupervised Pre-Training Contain Human-like Biases Ryan Steed Aylin Caliskan SSL 27 156 0 28 Oct 2020
BERTology Meets Biology: Interpreting Attention in Protein Language Models Jesse Vig Ali Madani L. Varshney Caiming Xiong R. Socher Nazneen Rajani 29 288 0 26 Jun 2020
On the Relationships Between the Grammatical Genders of Inanimate Nouns and Their Co-Occurring Adjectives and Verbs Adina Williams Ryan Cotterell Lawrence Wolf-Sonkin Damián E. Blasi Hanna M. Wallach 34 18 0 03 May 2020