Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.08283
Cited By
From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models
15 May 2023
Shangbin Feng
Chan Young Park
Yuhan Liu
Yulia Tsvetkov
Re-assign community
ArXiv
PDF
HTML
Papers citing
"From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models"
50 / 142 papers shown
Title
Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks
Yun-Shiuan Chuang
Zach Studdiford
Krirk Nirunwiroj
Agam Goyal
Vincent V. Frigo
Sijia Yang
Dhavan Shah
Junjie Hu
Timothy T. Rogers
AI4CE
37
12
0
25 Jun 2024
CAVE: Controllable Authorship Verification Explanations
Sahana Ramnath
Kartik Pandey
Elizabeth Boschee
Xiang Ren
61
1
0
24 Jun 2024
OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants
Jaspreet Ranjit
Brihi Joshi
Rebecca Dorn
Laura Petry
Olga Koumoundouros
Jayne Bottarini
Peichen Liu
Eric Rice
Swabha Swayamdipta
32
1
0
21 Jun 2024
Aligning Large Language Models with Diverse Political Viewpoints
Dominik Stammbach
Philine Widmer
Eunjung Cho
Çağlar Gülçehre
Elliott Ash
45
3
0
20 Jun 2024
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting
Sagnik Mukherjee
Muhammad Farid Adilazuarda
Sunayana Sitaram
Kalika Bali
Alham Fikri Aji
Monojit Choudhury
46
5
0
17 Jun 2024
Cultural Value Differences of LLMs: Prompt, Language, and Model Size
Qishuai Zhong
Yike Yun
Aixin Sun
42
2
0
17 Jun 2024
The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Bolei Ma
Xinpeng Wang
Tiancheng Hu
Anna Haensch
Michael A. Hedderich
Barbara Plank
Frauke Kreuter
ALM
37
2
0
16 Jun 2024
Quantifying Generative Media Bias with a Corpus of Real-world and Generated News Articles
Filip Trhlik
Pontus Stenetorp
29
6
0
16 Jun 2024
The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models
Yan Liu
Yu Liu
Xiaokang Chen
Pin-Yu Chen
Daoguang Zan
Min-Yen Kan
Tsung-Yi Ho
46
2
0
14 Jun 2024
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Ryo Kamoi
Yusen Zhang
Nan Zhang
Jiawei Han
Rui Zhang
LRM
50
57
0
03 Jun 2024
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets
Peter Devine
ALM
19
3
0
29 May 2024
An Empirical Analysis on Large Language Models in Debate Evaluation
Xinyi Liu
Pinxin Liu
Hangfeng He
ELM
37
4
0
28 May 2024
Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Abhishek Kumar
Sarfaroz Yunusov
Ali Emami
41
3
0
23 May 2024
CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models
Giada Pistilli
Alina Leidinger
Yacine Jernite
Atoosa Kasirzadeh
A. Luccioni
Margaret Mitchell
26
2
0
22 May 2024
Tagengo: A Multilingual Chat Dataset
P. Devine
42
3
0
21 May 2024
Relative Counterfactual Contrastive Learning for Mitigating Pretrained Stance Bias in Stance Detection
Jiarui Zhang
Shaojuan Wu
Xiaowang Zhang
Zhiyong Feng
39
0
0
16 May 2024
Facilitating Opinion Diversity through Hybrid NLP Approaches
Michiel van der Meer
47
0
0
15 May 2024
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias
Shan Chen
Jack Gallifant
Mingye Gao
Pedro Moreira
Nikolaj Munch
...
Hugo J. W. L. Aerts
Brian Anthony
Leo Anthony Celi
William G. La Cava
Danielle S. Bitterman
40
8
0
09 May 2024
Building a Large Japanese Web Corpus for Large Language Models
Naoaki Okazaki
Kakeru Hattori
Hirai Shota
Hiroki Iida
Masanari Ohi
Kazuki Fujii
Taishi Nakamura
Mengsay Loem
Rio Yokota
Sakae Mizuki
55
7
0
27 Apr 2024
CrossIn: An Efficient Instruction Tuning Approach for Cross-Lingual Knowledge Alignment
Geyu Lin
Bin Wang
Zhengyuan Liu
Nancy F. Chen
37
7
0
18 Apr 2024
LLeMpower: Understanding Disparities in the Control and Access of Large Language Models
Vishwas Sathish
Hannah Lin
Aditya K Kamath
Anish Nyayachavadi
32
5
0
14 Apr 2024
Best Practices and Lessons Learned on Synthetic Data for Language Models
Ruibo Liu
Jerry W. Wei
Fangyu Liu
Chenglei Si
Yanzhe Zhang
...
Steven Zheng
Daiyi Peng
Diyi Yang
Denny Zhou
Andrew M. Dai
SyDa
EgoV
43
86
0
11 Apr 2024
Analyzing the Impact of Data Selection and Fine-Tuning on Economic and Political Biases in LLMs
Ahmed A. Agiza
Mohamed Mostagir
Sherief Reda
25
5
0
10 Apr 2024
Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness
Xincan Feng
A. Yoshimoto
41
2
0
10 Apr 2024
Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data
Tim Baumgärtner
Yang Gao
Dana Alon
Donald Metzler
AAML
33
18
0
08 Apr 2024
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Xinrun Du
Zhouliang Yu
Songyang Gao
Ding Pan
Yuyang Cheng
...
Tianyu Zheng
Xinchen Luo
Guorui Zhou
Wenhu Chen
Ge Zhang
48
17
0
05 Apr 2024
Attributions toward Artificial Agents in a modified Moral Turing Test
Eyal Aharoni
Sharlene Fernandes
Daniel J Brady
Caelan Alexander
Michael Criner
Kara Queen
Javier Rando
Eddy Nahmias
Victor Crespo
ELM
40
12
0
03 Apr 2024
Fairness in Large Language Models: A Taxonomic Survey
Zhibo Chu
Zichong Wang
Wenbin Zhang
AILaw
43
33
0
31 Mar 2024
Measuring Political Bias in Large Language Models: What Is Said and How It Is Said
Yejin Bang
Delong Chen
Nayeon Lee
Pascale Fung
34
26
0
27 Mar 2024
Llama meets EU: Investigating the European Political Spectrum through the Lens of LLMs
Ilias Chalkidis
Stephanie Brandl
34
7
0
20 Mar 2024
How Gender Interacts with Political Values: A Case Study on Czech BERT Models
Adnan Al Ali
Jindvrich Libovický
27
0
0
20 Mar 2024
Correcting misinformation on social media with a large language model
Xinyi Zhou
Ashish Sharma
Amy X. Zhang
Tim Althoff
KELM
51
2
0
17 Mar 2024
Towards Measuring and Modeling "Culture" in LLMs: A Survey
Muhammad Farid Adilazuarda
Sagnik Mukherjee
Pradhyumna Lavania
Siddhant Singh
Alham Fikri Aji
Jacki OÑeill
Ashutosh Modi
Monojit Choudhury
67
55
0
05 Mar 2024
Think Fast, Think Slow, Think Critical: Designing an Automated Propaganda Detection Tool
L. Zavolokina
Kilian Sprenkamp
Zoya Katashinskaya
Daniel Gordon Jones
Gerhard Schwabe
45
13
0
29 Feb 2024
Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information
Seungjong Sun
Eungu Lee
Dongyan Nan
Xiangying Zhao
Wonbyung Lee
Bernard J. Jansen
Jang Hyun Kim
56
17
0
28 Feb 2024
Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs
Tanise Ceron
Neele Falk
Ana Barić
Dmitry Nikolaev
Sebastian Padó
44
15
0
27 Feb 2024
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
Paul Röttger
Valentin Hofmann
Valentina Pyatkin
Musashi Hinck
Hannah Rose Kirk
Hinrich Schütze
Dirk Hovy
ELM
26
53
0
26 Feb 2024
COBIAS: Assessing the Contextual Reliability of Bias Benchmarks for Language Models
Priyanshul Govil
Hemang Jain
Vamshi Krishna Bonagiri
Aman Chadha
Ponnurangam Kumaraguru
Manas Gaur
Sanorita Dey
53
2
0
22 Feb 2024
Data-driven Discovery with Large Generative Models
Bodhisattwa Prasad Majumder
Harshit Surana
Dhruv Agarwal
Sanchaita Hazra
Ashish Sabharwal
Peter Clark
43
9
0
21 Feb 2024
How Susceptible are Large Language Models to Ideological Manipulation?
Kai Chen
Zihao He
Jun Yan
Taiwei Shi
Kristina Lerman
40
10
0
18 Feb 2024
A Note on Bias to Complete
Jia Xu
Mona Diab
49
2
0
18 Feb 2024
Don't Go To Extremes: Revealing the Excessive Sensitivity and Calibration Limitations of LLMs in Implicit Hate Speech Detection
Min Zhang
Jianfeng He
Taoran Ji
Chang-Tien Lu
30
11
0
18 Feb 2024
Whose Emotions and Moral Sentiments Do Language Models Reflect?
Zihao He
Siyi Guo
Ashwin Rao
Kristina Lerman
47
12
0
16 Feb 2024
I Am Not Them: Fluid Identities and Persistent Out-group Bias in Large Language Models
Wenchao Dong
Assem Zhunis
Hyojin Chin
Jiyoung Han
Meeyoung Cha
32
2
0
16 Feb 2024
DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Herun Wan
Shangbin Feng
Zhaoxuan Tan
Heng Wang
Yulia Tsvetkov
Minnan Luo
72
29
0
16 Feb 2024
OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning
Rui Ye
Wenhao Wang
Jingyi Chai
Dihan Li
Zexi Li
Yinda Xu
Yaxin Du
Yanfeng Wang
Siheng Chen
ALM
FedML
AIFin
11
78
0
10 Feb 2024
A Roadmap to Pluralistic Alignment
Taylor Sorensen
Jared Moore
Jillian R. Fisher
Mitchell L. Gordon
Niloofar Mireshghallah
...
Liwei Jiang
Ximing Lu
Nouha Dziri
Tim Althoff
Yejin Choi
65
80
0
07 Feb 2024
The Political Preferences of LLMs
David Rozado
38
36
0
02 Feb 2024
Reading Between the Tweets: Deciphering Ideological Stances of Interconnected Mixed-Ideology Communities
Zihao He
Ashwin Rao
Siyi Guo
Negar Mokhberian
Kristina Lerman
27
6
0
02 Feb 2024
What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection
Shangbin Feng
Herun Wan
Ningnan Wang
Zhaoxuan Tan
Minnan Luo
Yulia Tsvetkov
AAML
DeLMO
25
16
0
01 Feb 2024
Previous
1
2
3
Next