Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.10395
Cited By
Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads
17 November 2023
Yi Yang
Hanyu Duan
Ahmed Abbasi
John P. Lalor
Kar Yan Tam
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads"
5 / 5 papers shown
Title
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
339
12,003
0
04 Mar 2022
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models
Robert Wolfe
Aylin Caliskan
87
51
0
01 Oct 2021
Debiasing Pre-trained Contextualised Embeddings
Masahiro Kaneko
Danushka Bollegala
218
138
0
23 Jan 2021
The Woman Worked as a Babysitter: On Biases in Language Generation
Emily Sheng
Kai-Wei Chang
Premkumar Natarajan
Nanyun Peng
223
618
0
03 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,984
0
20 Apr 2018
1