ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.05783
  4. Cited By
Persistent Anti-Muslim Bias in Large Language Models

Persistent Anti-Muslim Bias in Large Language Models

14 January 2021
Abubakar Abid
Maheen Farooqi
James Zou
    AILaw
ArXivPDFHTML

Papers citing "Persistent Anti-Muslim Bias in Large Language Models"

50 / 295 papers shown
Title
Protected group bias and stereotypes in Large Language Models
Protected group bias and stereotypes in Large Language Models
Hadas Kotek
David Q. Sun
Zidi Xiu
Margit Bowler
Christopher Klein
AILaw
ALM
33
3
0
21 Mar 2024
From Representational Harms to Quality-of-Service Harms: A Case Study on
  Llama 2 Safety Safeguards
From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards
Khaoula Chehbouni
Megha Roshan
Emmanuel Ma
Futian Andrew Wei
Afaf Taik
Jackie CK Cheung
G. Farnadi
39
7
0
20 Mar 2024
Evaluating LLMs for Gender Disparities in Notable Persons
Evaluating LLMs for Gender Disparities in Notable Persons
L. Rhue
Sofie Goethals
Arun Sundararajan
52
5
0
14 Mar 2024
Emergence of Social Norms in Generative Agent Societies: Principles and
  Architecture
Emergence of Social Norms in Generative Agent Societies: Principles and Architecture
Siyue Ren
Zhiyao Cui
Ruiqi Song
Zhen Wang
Shuyue Hu
LLMAG
40
10
0
13 Mar 2024
KnowledgeVIS: Interpreting Language Models by Comparing
  Fill-in-the-Blank Prompts
KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts
Adam Joseph Coscia
Alex Endert
VLM
38
9
0
07 Mar 2024
MEGAnno+: A Human-LLM Collaborative Annotation System
MEGAnno+: A Human-LLM Collaborative Annotation System
H. Kim
Kushan Mitra
Rafael Li Chen
Sajjadur Rahman
Dan Zhang
51
23
0
28 Feb 2024
FairBelief -- Assessing Harmful Beliefs in Language Models
FairBelief -- Assessing Harmful Beliefs in Language Models
Mattia Setzu
Marta Marchiori Manerba
Pasquale Minervini
Debora Nozza
29
0
0
27 Feb 2024
Foundation Model Transparency Reports
Foundation Model Transparency Reports
Rishi Bommasani
Kevin Klyman
Shayne Longpre
Betty Xiong
Sayash Kapoor
Nestor Maslej
Arvind Narayanan
Percy Liang
40
15
0
26 Feb 2024
Potential and Challenges of Model Editing for Social Debiasing
Potential and Challenges of Model Editing for Social Debiasing
Jianhao Yan
Futing Wang
Yafu Li
Yue Zhang
KELM
75
9
0
21 Feb 2024
Large Language Models for Data Annotation: A Survey
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Wenlin Yao
Lu Cheng
Huan Liu
SyDa
56
53
0
21 Feb 2024
Investigating Cultural Alignment of Large Language Models
Investigating Cultural Alignment of Large Language Models
Badr AlKhamissi
Muhammad N. ElNokrashy
Mai AlKhamissi
Mona T. Diab
44
44
0
20 Feb 2024
Your Large Language Model is Secretly a Fairness Proponent and You
  Should Prompt it Like One
Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
Tianlin Li
Xiaoyu Zhang
Chao Du
Tianyu Pang
Qian Liu
Qing Guo
Chao Shen
Yang Liu
ALM
45
11
0
19 Feb 2024
Search Engines Post-ChatGPT: How Generative Artificial Intelligence
  Could Make Search Less Reliable
Search Engines Post-ChatGPT: How Generative Artificial Intelligence Could Make Search Less Reliable
Shahan Ali Memon
Jevin D. West
31
6
0
18 Feb 2024
Born With a Silver Spoon? Investigating Socioeconomic Bias in Large
  Language Models
Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models
Smriti Singh
Shuvam Keshari
Vinija Jain
Aman Chadha
11
2
0
16 Feb 2024
Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit
  Clues
Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues
Zhiyuan Chang
Mingyang Li
Yi Liu
Junjie Wang
Qing Wang
Yang Liu
96
38
0
14 Feb 2024
Generative Echo Chamber? Effects of LLM-Powered Search Systems on
  Diverse Information Seeking
Generative Echo Chamber? Effects of LLM-Powered Search Systems on Diverse Information Seeking
Nikhil Sharma
Q. V. Liao
Ziang Xiao
45
19
0
08 Feb 2024
A Hypothesis-Driven Framework for the Analysis of Self-Rationalising
  Models
A Hypothesis-Driven Framework for the Analysis of Self-Rationalising Models
Marc Braun
Jenny Kunz
18
2
0
07 Feb 2024
De-amplifying Bias from Differential Privacy in Language Model
  Fine-tuning
De-amplifying Bias from Differential Privacy in Language Model Fine-tuning
Sanjari Srivastava
Piotr (Peter) Mardziel
Zhikhun Zhang
Archana Ahlawat
Anupam Datta
John C. Mitchell
42
1
0
07 Feb 2024
Measuring Implicit Bias in Explicitly Unbiased Large Language Models
Measuring Implicit Bias in Explicitly Unbiased Large Language Models
Xuechunzi Bai
Angelina Wang
Ilia Sucholutsky
Thomas Griffiths
100
30
0
06 Feb 2024
Large Language Models are Geographically Biased
Large Language Models are Geographically Biased
Rohin Manvi
Samar Khanna
Marshall Burke
David B. Lobell
Stefano Ermon
49
43
0
05 Feb 2024
Jailbreaking Attack against Multimodal Large Language Model
Jailbreaking Attack against Multimodal Large Language Model
Zhenxing Niu
Haoxuan Ji
Xinbo Gao
Gang Hua
Rong Jin
50
61
0
04 Feb 2024
Self-Debiasing Large Language Models: Zero-Shot Recognition and
  Reduction of Stereotypes
Self-Debiasing Large Language Models: Zero-Shot Recognition and Reduction of Stereotypes
Isabel O. Gallegos
Ryan A. Rossi
Joe Barrow
Md Mehrab Tanjim
Tong Yu
Hanieh Deilamsalehy
Ruiyi Zhang
Sungchul Kim
Franck Dernoncourt
24
19
0
03 Feb 2024
Redefining "Hallucination" in LLMs: Towards a psychology-informed
  framework for mitigating misinformation
Redefining "Hallucination" in LLMs: Towards a psychology-informed framework for mitigating misinformation
Elijah Berberette
Jack Hutchins
Amir Sadovnik
21
9
0
01 Feb 2024
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
Pardis Sadat Zahraei
Ali Emami
27
6
0
31 Jan 2024
UnMASKed: Quantifying Gender Biases in Masked Language Models through
  Linguistically Informed Job Market Prompts
UnMASKed: Quantifying Gender Biases in Masked Language Models through Linguistically Informed Job Market Prompts
Inigo Parra
13
1
0
28 Jan 2024
Evaluating Gender Bias in Large Language Models via Chain-of-Thought
  Prompting
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
37
27
0
28 Jan 2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
Zhen Xiang
Fengqing Jiang
Zidi Xiong
Bhaskar Ramasubramanian
Radha Poovendran
Bo Li
LRM
SILM
42
40
0
20 Jan 2024
Large Language Models Portray Socially Subordinate Groups as More
  Homogeneous, Consistent with a Bias Observed in Humans
Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans
Messi H.J. Lee
Jacob M. Montgomery
Calvin K. Lai
25
17
0
16 Jan 2024
From Bytes to Biases: Investigating the Cultural Self-Perception of
  Large Language Models
From Bytes to Biases: Investigating the Cultural Self-Perception of Large Language Models
Wolfgang Messner
Tatum Greene
Josephine Matalone
35
4
0
21 Dec 2023
Quantifying Bias in Text-to-Image Generative Models
Quantifying Bias in Text-to-Image Generative Models
J. Vice
Naveed Akhtar
Richard I. Hartley
Ajmal Mian
38
10
0
20 Dec 2023
Saturn Platform: Foundation Model Operations and Generative AI for
  Financial Services
Saturn Platform: Foundation Model Operations and Generative AI for Financial Services
Antonio Busson
Rennan Gaio
Rafael H. Rocha
Francisco Evangelista
Bruno Rizzi
Luan Carvalho
Rafael Miceli
Marcos Rabaioli
David Favaro
28
1
0
12 Dec 2023
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
Anay Mehrotra
Manolis Zampetakis
Paul Kassianik
Blaine Nelson
Hyrum Anderson
Yaron Singer
Amin Karbasi
46
211
0
04 Dec 2023
Developing Linguistic Patterns to Mitigate Inherent Human Bias in
  Offensive Language Detection
Developing Linguistic Patterns to Mitigate Inherent Human Bias in Offensive Language Detection
Toygar Tanyel
Besher Alkurdi
S. Ayvaz
18
0
0
04 Dec 2023
Tackling Bias in Pre-trained Language Models: Current Trends and
  Under-represented Societies
Tackling Bias in Pre-trained Language Models: Current Trends and Under-represented Societies
Vithya Yogarajan
Gillian Dobbie
Te Taka Keegan
R. Neuwirth
ALM
54
11
0
03 Dec 2023
Fair Text-to-Image Diffusion via Fair Mapping
Fair Text-to-Image Diffusion via Fair Mapping
Jia Li
Lijie Hu
Jingfeng Zhang
Tianhang Zheng
Hua Zhang
Di Wang
59
14
0
29 Nov 2023
Unveiling the Implicit Toxicity in Large Language Models
Unveiling the Implicit Toxicity in Large Language Models
Jiaxin Wen
Pei Ke
Hao Sun
Zhexin Zhang
Chengfei Li
Jinfeng Bai
Minlie Huang
42
26
0
29 Nov 2023
SoUnD Framework: Analyzing (So)cial Representation in (Un)structured
  (D)ata
SoUnD Framework: Analyzing (So)cial Representation in (Un)structured (D)ata
Mark Díaz
Sunipa Dev
Emily Reif
Remi Denton
Vinodkumar Prabhakaran
38
3
0
28 Nov 2023
Justifiable Artificial Intelligence: Engineering Large Language Models
  for Legal Applications
Justifiable Artificial Intelligence: Engineering Large Language Models for Legal Applications
Sabine Wehnert
AILaw
59
4
0
27 Nov 2023
Compositional Capabilities of Autoregressive Transformers: A Study on
  Synthetic, Interpretable Tasks
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
39
8
0
21 Nov 2023
LePaRD: A Large-Scale Dataset of Judges Citing Precedents
LePaRD: A Large-Scale Dataset of Judges Citing Precedents
Robert Mahari
Dominik Stammbach
Elliott Ash
Alex Pentland
ELM
AILaw
22
2
0
15 Nov 2023
Understanding Users' Dissatisfaction with ChatGPT Responses: Types,
  Resolving Tactics, and the Effect of Knowledge Level
Understanding Users' Dissatisfaction with ChatGPT Responses: Types, Resolving Tactics, and the Effect of Knowledge Level
Yoonsu Kim
Jueon Lee
Seoyoung Kim
Jaehyuk Park
Juho Kim
44
37
0
13 Nov 2023
First Tragedy, then Parse: History Repeats Itself in the New Era of
  Large Language Models
First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models
Naomi Saphra
Eve Fleisig
Kyunghyun Cho
Adam Lopez
LRM
32
8
0
08 Nov 2023
Identifying and Mitigating Vulnerabilities in LLM-Integrated
  Applications
Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications
Fengqing Jiang
Zhangchen Xu
Luyao Niu
Wei Ping
Jinyuan Jia
Bo Li
Radha Poovendran
AAML
21
20
0
07 Nov 2023
"Honey, Tell Me What's Wrong", Global Explanation of Textual
  Discriminative Models through Cooperative Generation
"Honey, Tell Me What's Wrong", Global Explanation of Textual Discriminative Models through Cooperative Generation
Antoine Chaffin
Julien Delaunay
18
0
0
27 Oct 2023
Knowledge Editing for Large Language Models: A Survey
Knowledge Editing for Large Language Models: A Survey
Song Wang
Yaochen Zhu
Haochen Liu
Zaiyi Zheng
Chen Chen
Wenlin Yao
KELM
81
138
0
24 Oct 2023
Generative Language Models Exhibit Social Identity Biases
Generative Language Models Exhibit Social Identity Biases
Tiancheng Hu
Yara Kyrychenko
Steve Rathje
Nigel Collier
S. V. D. Linden
Jon Roozenbeek
38
38
0
24 Oct 2023
Moral Foundations of Large Language Models
Moral Foundations of Large Language Models
Marwa Abdulhai
Gregory Serapio-Garcia
Clément Crepy
Daria Valter
John Canny
Natasha Jaques
LRM
62
42
0
23 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for
  Large Language Models
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models
Matthieu Meeus
Shubham Jain
Marek Rei
Yves-Alexandre de Montjoye
MIALM
34
29
0
23 Oct 2023
Confronting LLMs with Traditional ML: Rethinking the Fairness of Large
  Language Models in Tabular Classifications
Confronting LLMs with Traditional ML: Rethinking the Fairness of Large Language Models in Tabular Classifications
Yanchen Liu
Srishti Gautam
Jiaqi Ma
Himabindu Lakkaraju
LMTD
29
12
0
23 Oct 2023
LUNA: A Model-Based Universal Analysis Framework for Large Language
  Models
LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Da Song
Xuan Xie
Jiayang Song
Derui Zhu
Yuheng Huang
Felix Juefei Xu
Lei Ma
ALM
40
3
0
22 Oct 2023
Previous
123456
Next