ResearchTrend.AI

© 2025 ResearchTrend.AI, All rights reserved.

arXiv: 2309.00770
Bias and Fairness in Large Language Models: A Survey


2 September 2023
Isabel O. Gallegos, Ryan A. Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, Nesreen Ahmed

Papers citing "Bias and Fairness in Large Language Models: A Survey"

50 / 87 papers shown
- Justified Evidence Collection for Argument-based AI Fairness Assurance. Alpay Sabuncuoglu, Christopher Burr, Carsten Maple. 12 May 2025.
- Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design. Elena Musi, Nadin Kokciyan, Khalid Al Khatib, Davide Ceolin, Emmanuelle Dietz, ..., Jodi Schneider, Jonas Scholz, Cor Steging, Jacky Visser, Henning Wachsmuth. 08 May 2025.
- Retrieval Augmented Generation Evaluation for Health Documents. Mario Ceresa, Lorenzo Bertolini, Valentin Comte, Nicholas Spadaro, Barbara Raffael, ..., Sergio Consoli, Amalia Muñoz Piñeiro, Alex Patak, Maddalena Querci, Tobias Wiesenthal. 07 May 2025.
- Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text. Jennifer Healey, Laurie Byrum, Md Nadeem Akhtar, Surabhi Bhargava, Moumita Sinha. 05 May 2025.
- Interpretable graph-based models on multimodal biomedical data integration: A technical review and benchmarking. Alireza Sadeghi, F. Hajati, A. Argha, Nigel H Lovell, Min Yang, Hamid Alinejad-Rokny. 03 May 2025.
- On the Limitations of Steering in Language Model Alignment. Chebrolu Niranjan, Kokil Jaidka, G. Yeo. 02 May 2025.
- Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods. Mahdi Dhaini, Ege Erdogan, Nils Feldhus, Gjergji Kasneci. 02 May 2025.
- Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors. Nicy Scaria, Silvester John Joseph Kennedy, Diksha Seth, Ananya Thakur, Deepak N. Subramani. 02 May 2025.
- BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models. Zhiting Fan, Ruizhe Chen, Zuozhu Liu. 30 Apr 2025.
- Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers. Quentin Guimard, Moreno D'Incà, Massimiliano Mancini, Elisa Ricci. 29 Apr 2025.
- AI Awareness. Xianrui Li, Haoyuan Shi, Rongwu Xu, Wei Xu. 25 Apr 2025.
- aiXamine: Simplified LLM Safety and Security. Fatih Deniz, Dorde Popovic, Yazan Boshmaf, Euisuh Jeong, M. Ahmad, Sanjay Chawla, Issa M. Khalil. 21 Apr 2025.
- Building Trustworthy Multimodal AI: A Review of Fairness, Transparency, and Ethics in Vision-Language Tasks. Mohammad Saleh, Azadeh Tabatabaei. 14 Apr 2025.
- Enhancements for Developing a Comprehensive AI Fairness Assessment Standard. Avinash Agarwal, Mayashankar Kumar, Manisha J Nene. 10 Apr 2025.
- Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups. Rijul Magu, Arka Dutta, Sean Kim, Ashiqur R. KhudaBukhsh, Munmun De Choudhury. 08 Apr 2025.
- Investigating and Mitigating Stereotype-aware Unfairness in LLM-based Recommendations. Zihuai Zhao, Wenqi Fan, Yao Wu, Qing Li. 05 Apr 2025.
- Unequal Opportunities: Examining the Bias in Geographical Recommendations by Large Language Models. Shiran Dudy, Thulasi Tholeti, R. Ramachandranpillai, Muhammad Ali, Toby Jia-Jun Li, Ricardo Baeza-Yates. 16 Mar 2025.
- PEO: Improving Bi-Factorial Preference Alignment with Post-Training Policy Extrapolation. Yuxuan Liu. 03 Mar 2025.
- Evaluating Large Language Models for Public Health Classification and Extraction Tasks. Joshua Harris, Timothy Laurence, Leo Loman, Fan Grayson, Toby Nonnenmacher, ..., Hamish Mohammed, Thomas Finnie, Luke Hounsome, Michael Borowitz, Steven Riley. 20 Feb 2025.
- Investigating Non-Transitivity in LLM-as-a-Judge. Yi Xu, Laura Ruis, Tim Rocktaschel, Robert Kirk. 19 Feb 2025.
- Unbiased Evaluation of Large Language Models from a Causal Perspective. Meilin Chen, Jian Tian, Liang Ma, Di Xie, Weijie Chen, Jiang Zhu. 10 Feb 2025.
- Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs. Angelina Wang, Michelle Phan, Daniel E. Ho, Sanmi Koyejo. 04 Feb 2025.
- Dissecting Submission Limit in Desk-Rejections: A Mathematical Analysis of Fairness in AI Conference Policies. Yuefan Cao, Xiaoyu Li, Yingyu Liang, Zhizhou Sha, Zhenmei Shi, Zhao-quan Song, Jiahao Zhang. 02 Feb 2025.
- Hands-On Tutorial: Labeling with LLM and Human-in-the-Loop. Ekaterina Artemova, Akim Tsvigun, Dominik Schlechtweg, Natalia Fedorova, Konstantin Chernyshev, Sergei Tilga, Boris Obmoroshev. 28 Jan 2025.
- Option-ID Based Elimination For Multiple Choice Questions. Zhenhao Zhu, Bulou Liu, Qingyao Ai, Yong-Jin Liu. 25 Jan 2025.
- Unmasking Conversational Bias in AI Multiagent Systems. Simone Mungari, Giuseppe Manco, Luca Maria Aiello. 24 Jan 2025.
- Addressing Bias in Generative AI: Challenges and Research Opportunities in Information Management. Xiahua Wei, Naveen Kumar, Han Zhang. 22 Jan 2025.
- An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts. Dhia Elhaq Rzig, Dhruba Jyoti Paul, Kaiser Pister, Jordan Henkel, Foyzul Hassan. 21 Jan 2025.
- Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s). Brian E. Perron, Lauri Goldkind, Zia Qi, Bryan G. Victor. 20 Jan 2025.
- Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude. Yile Yan, Bo Li, Wentao Xu. 17 Jan 2025.
- A Comprehensive Survey of Foundation Models in Medicine. Wasif Khan, Seowung Leem, Kyle B. See, Joshua K. Wong, Shaoting Zhang, R. Fang. 17 Jan 2025.
- INFELM: In-depth Fairness Evaluation of Large Text-To-Image Models. Di Jin, Xing Liu, Yu Liu, Jia Qing Yap, Andrea Wong, Adriana Crespo, Qi Lin, Zhiyuan Yin, Qiang Yan, Ryan Ye. 10 Jan 2025.
- Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine. Yishen Liu, Shengda Luo, Zishao Zhong, Tongtong Wu, Jingyang Zhang, Peiyao Ou, Yong Liang, Liang Liu, Hudan Pan. 05 Jan 2025.
- ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning. Wonduk Seo, Zonghao Yuan, Yi Bu. 02 Jan 2025.
- Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings. Carolin M. Schuster, Maria-Alexandra Dinisor, Shashwat Ghatiwala, Georg Groh. 25 Nov 2024.
- Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs). Leander Girrbach, Yiran Huang, Stephan Alaniz, Trevor Darrell, Zeynep Akata. 25 Oct 2024.
- AUTALIC: A Dataset for Anti-AUTistic Ableist Language In Context. Naba Rizvi, Harper Strickland, Daniel Gitelman, Tristan Cooper, Alexis Morales-Flores, ..., Haaset Owens, Saleha Ahmedi, Isha Khirwadkar, Imani Munyaka, Nedjma Ousidhoum. 21 Oct 2024.
- Enabling Scalable Evaluation of Bias Patterns in Medical LLMs. Hamed Fayyaz, Raphael Poulain, Rahmatollah Beheshti. 18 Oct 2024.
- Bias Similarity Across Large Language Models. Hyejun Jeong, Shiqing Ma, Amir Houmansadr. 15 Oct 2024.
- No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users. Mengxuan Hu, Hongyi Wu, Zihan Guan, Ronghang Zhu, Dongliang Guo, Daiqing Qi, Sheng Li. 10 Oct 2024.
- Steering Large Language Models using Conceptors: Improving Addition-Based Activation Engineering. Joris Postmus, Steven Abreu. 09 Oct 2024.
- Collapsed Language Models Promote Fairness. Jingxuan Xu, Wuyang Chen, Linyi Li, Yao Zhao, Yunchao Wei. 06 Oct 2024.
- Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers. Shijie Chen, Bernal Jiménez Gutiérrez, Yu Su. 03 Oct 2024.
- Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis. Zeping Yu, Sophia Ananiadou. 21 Sep 2024.
- Neural embedding of beliefs reveals the role of relative dissonance in human decision-making. Byunghwee Lee, Rachith Aiyappa, Yong-Yeol Ahn, Haewoon Kwak, Jisun An. 13 Aug 2024.
- Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts. Tingchen Fu, Yupeng Hou, Julian McAuley, Rui Yan. 09 Aug 2024.
- Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation. Riccardo Cantini, Giada Cosenza, A. Orsino, Domenico Talia. 11 Jul 2024.
- On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards. Zhimin Zhao, A. A. Bangash, F. Côgo, Bram Adams, Ahmed E. Hassan. 04 Jul 2024.
- CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models. Song Wang, Peng Wang, Tong Zhou, Yushun Dong, Zhen Tan, Jundong Li. 02 Jul 2024.
- An Investigation of Prompt Variations for Zero-shot LLM-based Rankers. Shuoqi Sun, Shengyao Zhuang, Shuai Wang, Guido Zuccon. 20 Jun 2024.