Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.05783
Cited By
Persistent Anti-Muslim Bias in Large Language Models
14 January 2021
Abubakar Abid
Maheen Farooqi
James Zou
AILaw
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Persistent Anti-Muslim Bias in Large Language Models"
50 / 295 papers shown
Title
REFINE-LM: Mitigating Language Model Stereotypes via Reinforcement Learning
Rameez Qureshi
Naim Es-Sebbani
Luis Galárraga
Yvette Graham
Miguel Couceiro
Zied Bouraoui
33
1
0
18 Aug 2024
Misrepresented Technological Solutions in Imagined Futures: The Origins and Dangers of AI Hype in the Research Community
Savannah Thais
44
3
0
08 Aug 2024
Are Social Sentiments Inherent in LLMs? An Empirical Study on Extraction of Inter-demographic Sentiments
Kunitomo Tanaka
Ryohei Sasano
Koichi Takeda
38
0
0
08 Aug 2024
Fairness in Large Language Models in Three Hours
Thang Doan Viet
Zichong Wang
Minh Nhat Nguyen
Wenbin Zhang
56
9
0
02 Aug 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
63
10
0
26 Jul 2024
Exploring Bengali Religious Dialect Biases in Large Language Models with Evaluation Perspectives
Azmine Toushik Wasi
Raima Islam
Mst Rafia Islam
Taki Hasan Rafi
Dong-Kyu Chae
56
3
0
25 Jul 2024
A Framework for Evaluating Appropriateness, Trustworthiness, and Safety in Mental Wellness AI Chatbots
Lucia Chen
David A. Preece
P. Sikka
James J. Gross
Ben Krause
AI4MH
43
1
0
16 Jul 2024
Evaluating Large Language Models with fmeval
Pola Schwöbel
Luca Franceschi
Muhammad Bilal Zafar
Keerthan Vasist
Aman Malhotra
Tomer Shenhar
Pinal Tailor
Pinar Yilmaz
Michael Diamond
Michele Donini
LM&MA
ELM
27
2
0
15 Jul 2024
Evaluating Nuanced Bias in Large Language Model Free Response Answers
Jennifer Healey
Laurie Byrum
Md Nadeem Akhtar
Moumita Sinha
41
1
0
11 Jul 2024
Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation
Riccardo Cantini
Giada Cosenza
A. Orsino
Domenico Talia
AAML
65
5
0
11 Jul 2024
Probability of Differentiation Reveals Brittleness of Homogeneity Bias in Large Language Models
Messi H.J. Lee
Calvin K. Lai
28
0
0
10 Jul 2024
Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
Flor Miriam Plaza del Arco
Amanda Cercas Curry
Susanna Paoli
Alba Curry
Dirk Hovy
34
2
0
09 Jul 2024
Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias
Jayanta Sadhu
Maneesha Rani Saha
Rifat Shahriyar
45
3
0
03 Jul 2024
Generative Monoculture in Large Language Models
Fan Wu
Emily Black
Varun Chandrasekaran
SyDa
40
3
0
02 Jul 2024
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models
Song Wang
Peng Wang
Tong Zhou
Yushun Dong
Zhen Tan
Jundong Li
CoGe
63
7
0
02 Jul 2024
Characterizing Stereotypical Bias from Privacy-preserving Pre-Training
Stefan Arnold
Rene Gröbner
Annika Schreiner
47
0
0
30 Jun 2024
Aligning Large Language Models with Diverse Political Viewpoints
Dominik Stammbach
Philine Widmer
Eunjung Cho
Çağlar Gülçehre
Elliott Ash
47
3
0
20 Jun 2024
Exploring Safety-Utility Trade-Offs in Personalized Language Models
Anvesh Rao Vijjini
Somnath Basu Roy Chowdhury
Snigdha Chaturvedi
59
7
0
17 Jun 2024
Evaluation of Large Language Models: STEM education and Gender Stereotypes
Smilla Due
Sneha Das
Marianne Andersen
Berta Plandolit López
Sniff Andersen Nexø
Line Clemmensen
39
1
0
14 Jun 2024
ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions
Xu Zhang
Xunjian Yin
Xiaojun Wan
55
3
0
13 Jun 2024
An Empirical Design Justice Approach to Identifying Ethical Considerations in the Intersection of Large Language Models and Social Robotics
Alva Markelius
45
1
0
10 Jun 2024
Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Xi Li
Yusen Zhang
Renze Lou
Chen Wu
Jiaqi Wang
LRM
AAML
45
12
0
10 Jun 2024
Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas
Chengyuan Deng
Yiqun Duan
Xin Jin
Heng Chang
Yijun Tian
...
Kuofeng Gao
Sihong He
Jun Zhuang
Lu Cheng
Haohan Wang
AILaw
51
16
0
08 Jun 2024
MoralBench: Moral Evaluation of LLMs
Jianchao Ji
Yutong Chen
Mingyu Jin
Wujiang Xu
Wenyue Hua
Yongfeng Zhang
ELM
49
6
0
06 Jun 2024
GPT-4's One-Dimensional Mapping of Morality: How the Accuracy of Country-Estimates Depends on Moral Domain
P. Strimling
Joel Krueger
Simon Karlsson
47
0
0
05 Jun 2024
A Robot Walks into a Bar: Can Language Models Serve as Creativity Support Tools for Comedy? An Evaluation of LLMs' Humour Alignment with Comedians
Piotr Wojciech Mirowski
Juliette Love
K. Mathewson
Shakir Mohamed
32
20
0
31 May 2024
Efficient Indirect LLM Jailbreak via Multimodal-LLM Jailbreak
Zhenxing Niu
Yuyao Sun
Haoxuan Ji
Zheng Lin
Haichang Gao
Xinbo Gao
Gang Hua
Rong Jin
44
2
0
30 May 2024
AI Risk Management Should Incorporate Both Safety and Security
Xiangyu Qi
Yangsibo Huang
Yi Zeng
Edoardo Debenedetti
Jonas Geiping
...
Chaowei Xiao
Bo Li
Dawn Song
Peter Henderson
Prateek Mittal
AAML
56
11
0
29 May 2024
Why Algorithms Remain Unjust: Power Structures Surrounding Algorithmic Activity
Andrew Balch
38
0
0
28 May 2024
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
Bilgehan Sel
Priya Shanmugasundaram
Mohammad Kachuee
Kun Zhou
Ruoxi Jia
Ming Jin
LRM
42
2
0
21 May 2024
Sociotechnical Implications of Generative Artificial Intelligence for Information Access
Bhaskar Mitra
Henriette Cramer
Olya Gurevich
52
2
0
19 May 2024
Assessing Political Bias in Large Language Models
Luca Rettenberger
Markus Reischl
Mark Schutera
28
7
0
17 May 2024
A survey on fairness of large language models in e-commerce: progress, application, and challenge
Qingyang Ren
Zilin Jiang
Jinghan Cao
Sijia Li
Chiqu Li
Yiyang Liu
Shuning Huo
Tiange He
Yuan Chen
AILaw
FaML
45
6
0
15 May 2024
Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT
Shucheng Zhu
Weikang Wang
Ying Liu
47
5
0
11 May 2024
The Silicon Ceiling: Auditing GPT's Race and Gender Biases in Hiring
Lena Armstrong
Abbey Liu
Stephen MacNeil
D. Metaxa
45
12
0
07 May 2024
Data Feminism for AI
Lauren Klein
C. D’Ignazio
50
17
0
02 May 2024
Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLP
Sanjana Gautam
Mukund Srinath
42
6
0
29 Apr 2024
More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness
Aaron Jiaxun Li
Satyapriya Krishna
Himabindu Lakkaraju
48
3
0
29 Apr 2024
Lazy Data Practices Harm Fairness Research
Jan Simson
Alessandro Fabris
Christoph Kern
28
5
0
26 Apr 2024
Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models
Sunhao Dai
Chen Xu
Shicheng Xu
Liang Pang
Zhenhua Dong
Jun Xu
53
67
0
17 Apr 2024
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs
Bowen Jin
Chulin Xie
Jiawei Zhang
Kashob Kumar Roy
Yu Zhang
...
Ruirui Li
Xianfeng Tang
Suhang Wang
Yu Meng
Jiawei Han
LRM
RALM
53
40
0
10 Apr 2024
Frontier AI Ethics: Anticipating and Evaluating the Societal Impacts of Generative Agents
Seth Lazar
SILM
39
0
0
10 Apr 2024
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming
Simone Tedeschi
Felix Friedrich
P. Schramowski
Kristian Kersting
Roberto Navigli
Huu Nguyen
Bo Li
ELM
43
46
0
06 Apr 2024
Do Large Language Models Rank Fairly? An Empirical Study on the Fairness of LLMs as Rankers
Yuan Wang
Xuyang Wu
Hsin-Tai Wu
Zhiqiang Tao
Yi Fang
ALM
39
7
0
04 Apr 2024
The Impact of Unstated Norms in Bias Analysis of Language Models
Farnaz Kohankhaki
D. B. Emerson
David B. Emerson
Laleh Seyyed-Kalantari
Faiza Khan Khattak
62
1
0
04 Apr 2024
Fairness in Large Language Models: A Taxonomic Survey
Zhibo Chu
Zichong Wang
Wenbin Zhang
AILaw
48
33
0
31 Mar 2024
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context
Nihar Ranjan Sahoo
Pranamya Prashant Kulkarni
Narjis Asad
Arif Ahmad
Tanu Goyal
Aparna Garimella
Pushpak Bhattacharyya
38
10
0
29 Mar 2024
Debiasing Sentence Embedders through Contrastive Word Pairs
Philip Kenneweg
Sarah Schröder
Alexander Schulz
Barbara Hammer
49
0
0
27 Mar 2024
Recourse for reclamation: Chatting with generative language models
Jennifer Chien
Kevin R. McKee
Jackie Kay
William S. Isaac
27
0
0
21 Mar 2024
A Design Space for Intelligent and Interactive Writing Assistants
Mina Lee
Katy Ilonka Gero
John Joon Young Chung
S. Buckingham Shum
Vipul Raheja
...
Joonsuk Park
Roy Pea
Eugenia H Rho
Shannon Zejiang Shen
Pao Siangliulue
44
83
0
21 Mar 2024
Previous
1
2
3
4
5
6
Next