Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.09010
Cited By
Datasheets for Datasets
23 March 2018
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Datasheets for Datasets"
50 / 966 papers shown
Title
Impact of Pretraining Term Frequencies on Few-Shot Reasoning
Yasaman Razeghi
Robert L Logan IV
Matt Gardner
Sameer Singh
ReLM
LRM
32
150
0
15 Feb 2022
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text
Sebastian Gehrmann
Elizabeth Clark
Thibault Sellam
ELM
AI4CE
71
184
0
14 Feb 2022
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?
P. Schramowski
Christopher Tauchmann
Kristian Kersting
FaML
25
87
0
14 Feb 2022
Accountability in an Algorithmic Society: Relationality, Responsibility, and Robustness in Machine Learning
A. Feder Cooper
Emanuel Moss
Benjamin Laufer
Helen Nissenbaum
MLAU
32
85
0
10 Feb 2022
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
Jack Hessel
Jena D. Hwang
Jinho Park
Rowan Zellers
Chandra Bhagavatula
Anna Rohrbach
Kate Saenko
Yejin Choi
ReLM
153
48
0
10 Feb 2022
The craft and coordination of data curation: complicating "workflow" views of data science
A. Thomer
Dharma Akmon
J. York
Allison R. B. Tyler
Faye O. Polasek
Sara Lafia
Libby Hemphill
E. Yakel
21
20
0
09 Feb 2022
Towards a consistent interpretation of AIOps models
Yingzhe Lyu
Gopi Krishnan Rajbahadur
Dayi Lin
Boyuan Chen
Zhen Ming
Z. Jiang
AI4CE
22
19
0
04 Feb 2022
Towards Training Reproducible Deep Learning Models
Boyuan Chen
Mingzhi Wen
Yong Shi
Dayi Lin
Gopi Krishnan Rajbahadur
Zhen Ming
Z. Jiang
SyDa
20
37
0
04 Feb 2022
Net benefit, calibration, threshold selection, and training objectives for algorithmic fairness in healthcare
Stephen R. Pfohl
Yizhe Xu
Agata Foryciarz
Nikolaos Ignatiadis
Julian Z. Genkins
N. Shah
25
29
0
03 Feb 2022
Adaptive Sampling Strategies to Construct Equitable Training Datasets
William Cai
R. Encarnación
Bobbie Chern
S. Corbett-Davies
Miranda Bogen
Stevie Bergman
Sharad Goel
89
30
0
31 Jan 2022
Fair ranking: a critical review, challenges, and future directions
Gourab K. Patro
Lorenzo Porcaro
Laura Mitchell
Qiuyue Zhang
Meike Zehlike
Nikhil Garg
26
51
0
29 Jan 2022
IMACS: Image Model Attribution Comparison Summaries
E. Schoop
Benjamin D. Wedin
A. Kapishnikov
Tolga Bolukbasi
Michael Terry
FAtt
26
1
0
26 Jan 2022
Natural Language Descriptions of Deep Visual Features
Evan Hernandez
Sarah Schwettmann
David Bau
Teona Bagashvili
Antonio Torralba
Jacob Andreas
MILM
206
117
0
26 Jan 2022
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
Suchin Gururangan
Dallas Card
Sarah K. Drier
E. K. Gade
Leroy Z. Wang
Zeyu Wang
Luke Zettlemoyer
Noah A. Smith
175
74
0
25 Jan 2022
An Algorithmic Framework for Bias Bounties
Ira Globus-Harris
Michael Kearns
Aaron Roth
FedML
102
24
0
25 Jan 2022
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
Angelina McMillan-Major
Zaid Alyafeai
Stella Biderman
Kimbo Chen
F. Toni
...
Aitor Soroa Etxabe
Pedro Ortiz Suarez
Zeerak Talat
Daniel Alexander van Strien
Yacine Jernite
40
14
0
25 Jan 2022
Evaluating a Methodology for Increasing AI Transparency: A Case Study
David Piorkowski
John T. Richards
Michael Hind
45
5
0
24 Jan 2022
Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals
Kathrin Blagec
J. Kraiger
Wolfgang Frühwirt
Matthias Samwald
AI4MH
30
26
0
18 Jan 2022
OmniPrint: A Configurable Printed Character Synthesizer
Haozhe Sun
Wei-Wei Tu
Isabelle M Guyon
SyDa
46
7
0
17 Jan 2022
The Dataset Nutrition Label (2nd Gen): Leveraging Context to Mitigate Harms in Artificial Intelligence
Kasia Chmielinski
S. Newman
Matt Taylor
Joshua Joseph
Kemi Thomas
Jessica Yurkofsky
Yue Qiu
30
51
0
10 Jan 2022
MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Rowan Zellers
Jiasen Lu
Ximing Lu
Youngjae Yu
Yanpeng Zhao
Mohammadreza Salehi
Aditya Kusupati
Jack Hessel
Ali Farhadi
Yejin Choi
48
207
0
07 Jan 2022
Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation
Zoey Liu
Emily Tucker Prudhommeaux
45
4
0
05 Jan 2022
STEREO: Scientific Text Reuse in Open Access Publications
Lukas Gienapp
Wolfgang Kircheis
Bjarne Sievers
Benno Stein
Martin Potthast
25
8
0
22 Dec 2021
Validation and Transparency in AI systems for pharmacovigilance: a case study applied to the medical literature monitoring of adverse events
Bruno Ohana
Jack D. Sullivan
Nicole L. Baker
11
0
0
21 Dec 2021
AI Ethics Principles in Practice: Perspectives of Designers and Developers
Conrad Sanderson
David M. Douglas
Qinghua Lu
Emma Schleiger
Jon Whittle
J. Lacey
G. Newnham
S. Hajkowicz
Cathy J. Robinson
David Hansen
FaML
31
46
0
14 Dec 2021
A Framework for Fairness: A Systematic Review of Existing Fair AI Solutions
Brianna Richardson
J. Gilbert
FaML
29
35
0
10 Dec 2021
Whose Ground Truth? Accounting for Individual and Collective Identities Underlying Dataset Annotation
Emily L. Denton
Mark Díaz
Ian D Kivlichan
Vinodkumar Prabhakaran
Rachel Rosen
26
66
0
08 Dec 2021
Dataset Geography: Mapping Language Data to Language Users
Fahim Faisal
Yinkai Wang
Antonios Anastasopoulos
72
23
0
07 Dec 2021
Text2Mesh: Text-Driven Neural Stylization for Meshes
O. Michel
Roi Bar-On
Richard Liu
Sagie Benaim
Rana Hanocka
CLIP
AI4CE
226
353
0
06 Dec 2021
Thinking Beyond Distributions in Testing Machine Learned Models
Negar Rostamzadeh
B. Hutchinson
Christina Greer
Vinodkumar Prabhakaran
TTA
40
6
0
06 Dec 2021
Toward a Taxonomy of Trust for Probabilistic Machine Learning
Tamara Broderick
Andrew Gelman
Rachael Meager
Anna L. Smith
Tian Zheng
34
9
0
05 Dec 2021
Could AI Democratise Education? Socio-Technical Imaginaries of an EdTech Revolution
Sahan Bulathwela
Maria Perez-Ortiz
C. Holloway
John Shawe-Taylor
28
19
0
03 Dec 2021
Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research
Bernard Koch
Emily L. Denton
A. Hanna
J. Foster
53
140
0
03 Dec 2021
CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer
Moein Sorkhei
Yue Liu
Hossein Azizpour
E. Azavedo
Karin Dembrower
Dimitra Ntoula
Athanasios Zouzos
Fredrik Strand
Kevin Smith
28
8
0
02 Dec 2021
A Causal Approach for Unfair Edge Prioritization and Discrimination Removal
Pavan Ravishankar
Pranshu Malviya
Balaraman Ravindran
33
1
0
29 Nov 2021
AI and the Everything in the Whole Wide World Benchmark
Inioluwa Deborah Raji
Emily M. Bender
Amandalynne Paullada
Emily L. Denton
A. Hanna
30
292
0
26 Nov 2021
RedCaps: web-curated image-text data created by the people, for the people
Karan Desai
Gaurav Kaul
Zubin Aysola
Justin Johnson
31
162
0
22 Nov 2021
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions
Hongwei Xue
Tiankai Hang
Yanhong Zeng
Yuchong Sun
Bei Liu
Huan Yang
Jianlong Fu
B. Guo
AI4TS
VLM
31
189
0
19 Nov 2021
ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation
Laurynas Karazija
Iro Laina
Christian Rupprecht
3DV
VOS
47
84
0
19 Nov 2021
A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling
Eustache Diemert
Artem Betlei
Christophe Renaudin
Massih-Reza Amini
T. Gregoir
Thibaud Rahier
CML
33
10
0
19 Nov 2021
Software Engineering for Responsible AI: An Empirical Study and Operationalised Patterns
Qinghua Lu
Liming Zhu
Xiwei Xu
Jon Whittle
David M. Douglas
Conrad Sanderson
28
35
0
18 Nov 2021
Who Decides if AI is Fair? The Labels Problem in Algorithmic Auditing
Abhilash Mishra
Yash Gorana
29
3
0
16 Nov 2021
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection
Maarten Sap
Swabha Swayamdipta
Laura Vianna
Xuhui Zhou
Yejin Choi
Noah A. Smith
46
268
0
15 Nov 2021
A Word on Machine Ethics: A Response to Jiang et al. (2021)
Zeerak Talat
Hagen Blix
Josef Valvoda
M. I. Ganesh
Ryan Cotterell
Adina Williams
SyDa
FaML
96
38
0
07 Nov 2021
EEGEyeNet: a Simultaneous Electroencephalography and Eye-tracking Dataset and Benchmark for Eye Movement Prediction
Ard Kastrati
M. Płomecka
Damian Pascual
L. Wolf
Victor Gillioz
Roger Wattenhofer
N. Langer
44
40
0
06 Nov 2021
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models
Wei Ping
Chejian Xu
Shuohang Wang
Zhe Gan
Yu Cheng
Jianfeng Gao
Ahmed Hassan Awadallah
Yangqiu Song
VLM
ELM
AAML
33
216
0
04 Nov 2021
Benchmarking Multimodal AutoML for Tabular Data with Text Fields
Xingjian Shi
Jonas W. Mueller
Nick Erickson
Mu Li
Alexander J. Smola
LMTD
48
29
0
04 Nov 2021
Feature and Label Embedding Spaces Matter in Addressing Image Classifier Bias
William Thong
Cees G. M. Snoek
25
14
0
27 Oct 2021
IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning
Pan Lu
Liang Qiu
Jiaqi Chen
Tony Xia
Yizhou Zhao
Wei Zhang
Zhou Yu
Xiaodan Liang
Song-Chun Zhu
AIMat
41
184
0
25 Oct 2021
What Would Jiminy Cricket Do? Towards Agents That Behave Morally
Dan Hendrycks
Mantas Mazeika
Andy Zou
Sahil Patel
Christine Zhu
Jesus Navarro
D. Song
Bo-wen Li
Jacob Steinhardt
16
58
0
25 Oct 2021
Previous
1
2
3
...
14
15
16
...
18
19
20
Next