Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases

6 June 2020

Papers citing "Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases"

40 / 40 papers shown

Title
Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings Carolin M. Schuster Maria-Alexandra Dinisor Shashwat Ghatiwala Georg Groh 79 1 0 25 Nov 2024
LLMScan: Causal Scan for LLM Misbehavior Detection Mengdi Zhang Kai Kiat Goh Peixin Zhang Jun Sun Rose Lin Xin Hongyu Zhang 25 0 0 22 Oct 2024
No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users Mengxuan Hu Hongyi Wu Zihan Guan Ronghang Zhu Dongliang Guo Daiqing Qi Sheng Li SILM 38 3 0 10 Oct 2024
Collapsed Language Models Promote Fairness Jingxuan Xu Wuyang Chen Linyi Li Yao Zhao Yunchao Wei 46 0 0 06 Oct 2024
Analyzing Correlations Between Intrinsic and Extrinsic Bias Metrics of Static Word Embeddings With Their Measuring Biases Aligned Taisei Katô Yusuke Miyao 19 0 0 14 Sep 2024
A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers Valentin Barriere Sebastian Cifuentes 28 0 0 01 Jul 2024
Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models Jisu Shin Hoyun Song Huije Lee Soyeong Jeong Jong C. Park 38 6 0 06 Jun 2024
Navigating LLM Ethics: Advancements, Challenges, and Future Directions Junfeng Jiao S. Afroogh Yiming Xu Connor Phillips AILaw 65 20 0 14 May 2024
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation Kristian Lum Jacy Reese Anthis Chirag Nagpal Alex DÁmour Alexander D’Amour 31 14 0 20 Feb 2024
A Survey on Fairness in Large Language Models Yingji Li Mengnan Du Rui Song Xin Wang Ying Wang ALM 52 59 0 20 Aug 2023
Intersectionality and Testimonial Injustice in Medical Records Kenya Andrews Bhuvani Shah Lu Cheng 28 0 0 20 Jun 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models Tarek Naous Michael Joseph Ryan Alan Ritter Wei-ping Xu 37 85 0 23 May 2023
ChatGPT Perpetuates Gender Bias in Machine Translation and Ignores Non-Gendered Pronouns: Findings across Bengali and Five other Low-Resource Languages Sourojit Ghosh Aylin Caliskan 33 69 0 17 May 2023
Surfacing Biases in Large Language Models using Contrastive Input Decoding G. Yona Or Honovich Itay Laish Roee Aharoni 27 11 0 12 May 2023
Trustworthy Social Bias Measurement Rishi Bommasani Percy Liang 27 10 0 20 Dec 2022
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods Chao Zhou Cheng Qiu Daniel Ernesto Acuna 32 25 0 13 Dec 2022
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models Silke Husse Andreas Spitz 22 6 0 15 Nov 2022
Choose Your Lenses: Flaws in Gender Bias Evaluation Hadas Orgad Yonatan Belinkov 27 35 0 20 Oct 2022
SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models Haozhe An Zongxia Li Jieyu Zhao Rachel Rudinger 27 25 0 13 Oct 2022
Debiasing isn't enough! -- On the Effectiveness of Debiasing MLMs and their Social Biases in Downstream Tasks Masahiro Kaneko Danushka Bollegala Naoaki Okazaki 26 41 0 06 Oct 2022
Debiasing Word Embeddings with Nonlinear Geometry Lu Cheng Nayoung Kim Huan Liu 24 5 0 29 Aug 2022
Large scale analysis of gender bias and sexism in song lyrics L. Betti Carlo Abrate Andreas Kaltenbrunner 30 18 0 03 Aug 2022
Mimetic Models: Ethical Implications of AI that Acts Like You Reid McIlroy-Young Jon M. Kleinberg S. Sen Solon Barocas Ashton Anderson 13 16 0 19 Jul 2022
A methodology to characterize bias and harmful stereotypes in natural language processing in Latin America Laura Alonso Alemany Luciana Benotti Hernán Maina Lucía González Mariela Rajngewerc ... Guido Ivetta Alexia Halvorsen Amanda Rojo M. Bordone Beatriz Busaniche 32 3 0 14 Jul 2022
Markedness in Visual Semantic AI Robert Wolfe Aylin Caliskan VLM 27 35 0 23 May 2022
Towards Intersectionality in Machine Learning: Including More Identities, Handling Underrepresentation, and Performing Evaluation Angelina Wang V. V. Ramaswamy Olga Russakovsky FaML 26 92 0 10 May 2022
How Gender Debiasing Affects Internal Model Representations, and Why It Matters Hadas Orgad Seraphina Goldfarb-Tarrant Yonatan Belinkov 26 18 0 14 Apr 2022
Mapping the Multilingual Margins: Intersectional Biases of Sentiment Analysis Systems in English, Spanish, and Arabic Antonio Camara Nina Taneja Tamjeed Azad Emily Allaway R. Zemel 18 21 0 07 Apr 2022
Probing Pre-Trained Language Models for Cross-Cultural Differences in Values Arnav Arora Lucie-Aimée Kaffee Isabelle Augenstein VLM 31 123 0 25 Mar 2022
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal Umang Gupta Jwala Dhamala Varun Kumar Apurv Verma Yada Pruksachatkun Satyapriya Krishna Rahul Gupta Kai-Wei Chang Greg Ver Steeg Aram Galstyan 18 50 0 23 Mar 2022
Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations Robert Wolfe Aylin Caliskan VLM 21 13 0 14 Mar 2022
Speciesist Language and Nonhuman Animal Bias in English Masked Language Models Masashi Takeshita Rafal Rzepka K. Araki 26 6 0 10 Mar 2022
Survey of Generative Methods for Social Media Analysis Stan Matwin Aristides Milios P. Prałat Amílcar Soares Franccois Théberge 27 3 0 13 Dec 2021
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models Robert Wolfe Aylin Caliskan 87 51 0 01 Oct 2021
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens Saad Hassan Matt Huenerfauth Cecilia Ovesdotter Alm 46 38 0 01 Oct 2021
Intersectional Bias in Causal Language Models Liam Magee Lida Ghahremanlou K. Soldatić S. Robertson 191 31 0 16 Jul 2021
Semantic maps and metrics for science Semantic maps and metrics for science using deep transformer encoders Brendan Chambers James A. Evans MedIm 13 0 0 13 Apr 2021
WordBias: An Interactive Visual Tool for Discovering Intersectional Biases Encoded in Word Embeddings Bhavya Ghai Md. Naimul Hoque Klaus Mueller 29 26 0 05 Mar 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling Leo Gao Stella Biderman Sid Black Laurence Golding Travis Hoppe ... Horace He Anish Thite Noa Nabeshima Shawn Presser Connor Leahy AIMat 279 1,996 0 31 Dec 2020
Image Representations Learned With Unsupervised Pre-Training Contain Human-like Biases Ryan Steed Aylin Caliskan SSL 24 156 0 28 Oct 2020