Show Your Work: Improved Reporting of Experimental Results

Show Your Work: Improved Reporting of Experimental Results

6 September 2019

Suchin Gururangan

Papers citing "Show Your Work: Improved Reporting of Experimental Results"

15 / 65 papers shown

Title
Utility is in the Eye of the User: A Critique of NLP Leaderboards Kawin Ethayarajh Dan Jurafsky ELM 24 51 0 29 Sep 2020
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models Samuel Gehman Suchin Gururangan Maarten Sap Yejin Choi Noah A. Smith 37 1,132 0 24 Sep 2020
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics Swabha Swayamdipta Roy Schwartz Nicholas Lourie Yizhong Wang Hannaneh Hajishirzi Noah A. Smith Yejin Choi 44 429 0 22 Sep 2020
On the Effectiveness of Image Rotation for Open Set Domain Adaptation S. Bucci Mohammad Reza Loghmani Tatiana Tommasi 57 142 0 24 Jul 2020
Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning Pieter Delobelle Paul Temple Gilles Perrouin Benoit Frénay P. Heymans Bettina Berendt AAML FaML 13 14 0 14 May 2020
Showing Your Work Doesn't Always Work Raphael Tang Jaejun Lee Ji Xin Xinyu Liu Yaoliang Yu Jimmy J. Lin 17 5 0 28 Apr 2020
The Cost of Training NLP Models: A Concise Overview Or Sharir Barak Peleg Y. Shoham 40 210 0 19 Apr 2020
The Right Tool for the Job: Matching Model and Instance Complexities Roy Schwartz Gabriel Stanovsky Swabha Swayamdipta Jesse Dodge Noah A. Smith 38 168 0 16 Apr 2020
Knowledge Fusion and Semantic Knowledge Ranking for Open Domain Question Answering Pratyay Banerjee Chitta Baral RALM 22 24 0 07 Apr 2020
A Hierarchy of Limitations in Machine Learning M. Malik 15 55 0 12 Feb 2020
oLMpics -- On what Language Model Pre-training Captures Alon Talmor Yanai Elazar Yoav Goldberg Jonathan Berant LRM 34 300 0 31 Dec 2019
E-BERT: Efficient-Yet-Effective Entity Embeddings for BERT Nina Poerner Ulli Waltinger Hinrich Schütze 13 156 0 09 Nov 2019
What Question Answering can Learn from Trivia Nerds Jordan L. Boyd-Graber Benjamin Borschinger 24 36 0 31 Oct 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 299 6,984 0 20 Apr 2018
A Decomposable Attention Model for Natural Language Inference Ankur P. Parikh Oscar Täckström Dipanjan Das Jakob Uszkoreit 213 1,367 0 06 Jun 2016