Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.03004
Cited By
Show Your Work: Improved Reporting of Experimental Results
6 September 2019
Jesse Dodge
Suchin Gururangan
Dallas Card
Roy Schwartz
Noah A. Smith
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show Your Work: Improved Reporting of Experimental Results"
15 / 65 papers shown
Title
Utility is in the Eye of the User: A Critique of NLP Leaderboards
Kawin Ethayarajh
Dan Jurafsky
ELM
24
51
0
29 Sep 2020
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
Samuel Gehman
Suchin Gururangan
Maarten Sap
Yejin Choi
Noah A. Smith
37
1,132
0
24 Sep 2020
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
Swabha Swayamdipta
Roy Schwartz
Nicholas Lourie
Yizhong Wang
Hannaneh Hajishirzi
Noah A. Smith
Yejin Choi
44
429
0
22 Sep 2020
On the Effectiveness of Image Rotation for Open Set Domain Adaptation
S. Bucci
Mohammad Reza Loghmani
Tatiana Tommasi
57
142
0
24 Jul 2020
Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning
Pieter Delobelle
Paul Temple
Gilles Perrouin
Benoit Frénay
P. Heymans
Bettina Berendt
AAML
FaML
13
14
0
14 May 2020
Showing Your Work Doesn't Always Work
Raphael Tang
Jaejun Lee
Ji Xin
Xinyu Liu
Yaoliang Yu
Jimmy J. Lin
17
5
0
28 Apr 2020
The Cost of Training NLP Models: A Concise Overview
Or Sharir
Barak Peleg
Y. Shoham
40
210
0
19 Apr 2020
The Right Tool for the Job: Matching Model and Instance Complexities
Roy Schwartz
Gabriel Stanovsky
Swabha Swayamdipta
Jesse Dodge
Noah A. Smith
38
168
0
16 Apr 2020
Knowledge Fusion and Semantic Knowledge Ranking for Open Domain Question Answering
Pratyay Banerjee
Chitta Baral
RALM
22
24
0
07 Apr 2020
A Hierarchy of Limitations in Machine Learning
M. Malik
15
55
0
12 Feb 2020
oLMpics -- On what Language Model Pre-training Captures
Alon Talmor
Yanai Elazar
Yoav Goldberg
Jonathan Berant
LRM
34
300
0
31 Dec 2019
E-BERT: Efficient-Yet-Effective Entity Embeddings for BERT
Nina Poerner
Ulli Waltinger
Hinrich Schütze
13
156
0
09 Nov 2019
What Question Answering can Learn from Trivia Nerds
Jordan L. Boyd-Graber
Benjamin Borschinger
24
36
0
31 Oct 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
A Decomposable Attention Model for Natural Language Inference
Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
213
1,367
0
06 Jun 2016
Previous
1
2