Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.05783
Cited By
Persistent Anti-Muslim Bias in Large Language Models
14 January 2021
Abubakar Abid
Maheen Farooqi
James Zou
AILaw
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Persistent Anti-Muslim Bias in Large Language Models"
50 / 295 papers shown
Title
Protected group bias and stereotypes in Large Language Models
Hadas Kotek
David Q. Sun
Zidi Xiu
Margit Bowler
Christopher Klein
AILaw
ALM
33
3
0
21 Mar 2024
From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards
Khaoula Chehbouni
Megha Roshan
Emmanuel Ma
Futian Andrew Wei
Afaf Taik
Jackie CK Cheung
G. Farnadi
39
7
0
20 Mar 2024
Evaluating LLMs for Gender Disparities in Notable Persons
L. Rhue
Sofie Goethals
Arun Sundararajan
52
5
0
14 Mar 2024
Emergence of Social Norms in Generative Agent Societies: Principles and Architecture
Siyue Ren
Zhiyao Cui
Ruiqi Song
Zhen Wang
Shuyue Hu
LLMAG
40
10
0
13 Mar 2024
KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts
Adam Joseph Coscia
Alex Endert
VLM
38
9
0
07 Mar 2024
MEGAnno+: A Human-LLM Collaborative Annotation System
H. Kim
Kushan Mitra
Rafael Li Chen
Sajjadur Rahman
Dan Zhang
51
23
0
28 Feb 2024
FairBelief -- Assessing Harmful Beliefs in Language Models
Mattia Setzu
Marta Marchiori Manerba
Pasquale Minervini
Debora Nozza
29
0
0
27 Feb 2024
Foundation Model Transparency Reports
Rishi Bommasani
Kevin Klyman
Shayne Longpre
Betty Xiong
Sayash Kapoor
Nestor Maslej
Arvind Narayanan
Percy Liang
40
15
0
26 Feb 2024
Potential and Challenges of Model Editing for Social Debiasing
Jianhao Yan
Futing Wang
Yafu Li
Yue Zhang
KELM
75
9
0
21 Feb 2024
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Wenlin Yao
Lu Cheng
Huan Liu
SyDa
56
53
0
21 Feb 2024
Investigating Cultural Alignment of Large Language Models
Badr AlKhamissi
Muhammad N. ElNokrashy
Mai AlKhamissi
Mona T. Diab
44
44
0
20 Feb 2024
Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
Tianlin Li
Xiaoyu Zhang
Chao Du
Tianyu Pang
Qian Liu
Qing Guo
Chao Shen
Yang Liu
ALM
45
11
0
19 Feb 2024
Search Engines Post-ChatGPT: How Generative Artificial Intelligence Could Make Search Less Reliable
Shahan Ali Memon
Jevin D. West
31
6
0
18 Feb 2024
Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models
Smriti Singh
Shuvam Keshari
Vinija Jain
Aman Chadha
11
2
0
16 Feb 2024
Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues
Zhiyuan Chang
Mingyang Li
Yi Liu
Junjie Wang
Qing Wang
Yang Liu
96
38
0
14 Feb 2024
Generative Echo Chamber? Effects of LLM-Powered Search Systems on Diverse Information Seeking
Nikhil Sharma
Q. V. Liao
Ziang Xiao
45
19
0
08 Feb 2024
A Hypothesis-Driven Framework for the Analysis of Self-Rationalising Models
Marc Braun
Jenny Kunz
18
2
0
07 Feb 2024
De-amplifying Bias from Differential Privacy in Language Model Fine-tuning
Sanjari Srivastava
Piotr (Peter) Mardziel
Zhikhun Zhang
Archana Ahlawat
Anupam Datta
John C. Mitchell
42
1
0
07 Feb 2024
Measuring Implicit Bias in Explicitly Unbiased Large Language Models
Xuechunzi Bai
Angelina Wang
Ilia Sucholutsky
Thomas Griffiths
100
30
0
06 Feb 2024
Large Language Models are Geographically Biased
Rohin Manvi
Samar Khanna
Marshall Burke
David B. Lobell
Stefano Ermon
49
43
0
05 Feb 2024
Jailbreaking Attack against Multimodal Large Language Model
Zhenxing Niu
Haoxuan Ji
Xinbo Gao
Gang Hua
Rong Jin
50
61
0
04 Feb 2024
Self-Debiasing Large Language Models: Zero-Shot Recognition and Reduction of Stereotypes
Isabel O. Gallegos
Ryan A. Rossi
Joe Barrow
Md Mehrab Tanjim
Tong Yu
Hanieh Deilamsalehy
Ruiyi Zhang
Sungchul Kim
Franck Dernoncourt
24
19
0
03 Feb 2024
Redefining "Hallucination" in LLMs: Towards a psychology-informed framework for mitigating misinformation
Elijah Berberette
Jack Hutchins
Amir Sadovnik
21
9
0
01 Feb 2024
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
Pardis Sadat Zahraei
Ali Emami
27
6
0
31 Jan 2024
UnMASKed: Quantifying Gender Biases in Masked Language Models through Linguistically Informed Job Market Prompts
Inigo Parra
13
1
0
28 Jan 2024
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
37
27
0
28 Jan 2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
Zhen Xiang
Fengqing Jiang
Zidi Xiong
Bhaskar Ramasubramanian
Radha Poovendran
Bo Li
LRM
SILM
42
40
0
20 Jan 2024
Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans
Messi H.J. Lee
Jacob M. Montgomery
Calvin K. Lai
25
17
0
16 Jan 2024
From Bytes to Biases: Investigating the Cultural Self-Perception of Large Language Models
Wolfgang Messner
Tatum Greene
Josephine Matalone
35
4
0
21 Dec 2023
Quantifying Bias in Text-to-Image Generative Models
J. Vice
Naveed Akhtar
Richard I. Hartley
Ajmal Mian
38
10
0
20 Dec 2023
Saturn Platform: Foundation Model Operations and Generative AI for Financial Services
Antonio Busson
Rennan Gaio
Rafael H. Rocha
Francisco Evangelista
Bruno Rizzi
Luan Carvalho
Rafael Miceli
Marcos Rabaioli
David Favaro
28
1
0
12 Dec 2023
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
Anay Mehrotra
Manolis Zampetakis
Paul Kassianik
Blaine Nelson
Hyrum Anderson
Yaron Singer
Amin Karbasi
46
211
0
04 Dec 2023
Developing Linguistic Patterns to Mitigate Inherent Human Bias in Offensive Language Detection
Toygar Tanyel
Besher Alkurdi
S. Ayvaz
18
0
0
04 Dec 2023
Tackling Bias in Pre-trained Language Models: Current Trends and Under-represented Societies
Vithya Yogarajan
Gillian Dobbie
Te Taka Keegan
R. Neuwirth
ALM
54
11
0
03 Dec 2023
Fair Text-to-Image Diffusion via Fair Mapping
Jia Li
Lijie Hu
Jingfeng Zhang
Tianhang Zheng
Hua Zhang
Di Wang
59
14
0
29 Nov 2023
Unveiling the Implicit Toxicity in Large Language Models
Jiaxin Wen
Pei Ke
Hao Sun
Zhexin Zhang
Chengfei Li
Jinfeng Bai
Minlie Huang
42
26
0
29 Nov 2023
SoUnD Framework: Analyzing (So)cial Representation in (Un)structured (D)ata
Mark Díaz
Sunipa Dev
Emily Reif
Remi Denton
Vinodkumar Prabhakaran
38
3
0
28 Nov 2023
Justifiable Artificial Intelligence: Engineering Large Language Models for Legal Applications
Sabine Wehnert
AILaw
59
4
0
27 Nov 2023
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
39
8
0
21 Nov 2023
LePaRD: A Large-Scale Dataset of Judges Citing Precedents
Robert Mahari
Dominik Stammbach
Elliott Ash
Alex Pentland
ELM
AILaw
22
2
0
15 Nov 2023
Understanding Users' Dissatisfaction with ChatGPT Responses: Types, Resolving Tactics, and the Effect of Knowledge Level
Yoonsu Kim
Jueon Lee
Seoyoung Kim
Jaehyuk Park
Juho Kim
44
37
0
13 Nov 2023
First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models
Naomi Saphra
Eve Fleisig
Kyunghyun Cho
Adam Lopez
LRM
32
8
0
08 Nov 2023
Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications
Fengqing Jiang
Zhangchen Xu
Luyao Niu
Wei Ping
Jinyuan Jia
Bo Li
Radha Poovendran
AAML
21
20
0
07 Nov 2023
"Honey, Tell Me What's Wrong", Global Explanation of Textual Discriminative Models through Cooperative Generation
Antoine Chaffin
Julien Delaunay
18
0
0
27 Oct 2023
Knowledge Editing for Large Language Models: A Survey
Song Wang
Yaochen Zhu
Haochen Liu
Zaiyi Zheng
Chen Chen
Wenlin Yao
KELM
81
138
0
24 Oct 2023
Generative Language Models Exhibit Social Identity Biases
Tiancheng Hu
Yara Kyrychenko
Steve Rathje
Nigel Collier
S. V. D. Linden
Jon Roozenbeek
38
38
0
24 Oct 2023
Moral Foundations of Large Language Models
Marwa Abdulhai
Gregory Serapio-Garcia
Clément Crepy
Daria Valter
John Canny
Natasha Jaques
LRM
62
42
0
23 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models
Matthieu Meeus
Shubham Jain
Marek Rei
Yves-Alexandre de Montjoye
MIALM
34
29
0
23 Oct 2023
Confronting LLMs with Traditional ML: Rethinking the Fairness of Large Language Models in Tabular Classifications
Yanchen Liu
Srishti Gautam
Jiaqi Ma
Himabindu Lakkaraju
LMTD
29
12
0
23 Oct 2023
LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Da Song
Xuan Xie
Jiayang Song
Derui Zhu
Yuheng Huang
Felix Juefei Xu
Lei Ma
ALM
40
3
0
22 Oct 2023
Previous
1
2
3
4
5
6
Next