arXiv:1905.00537, v3 (latest)
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
2 May 2019
Alex Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Papers citing
"SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"
50 / 1,500 papers shown
ColdGANs: Taming Language GANs with Cautious Sampling Strategies
Thomas Scialom
Paul-Alexis Dray
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
GAN
SyDa
68
18
0
08 Jun 2020
Probing Neural Dialog Models for Conversational Understanding
Abdelrhman Saleh
Tovly Deutsch
Stephen Casper
Yonatan Belinkov
Stuart M. Shieber
65
13
0
07 Jun 2020
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
173
2,770
0
05 Jun 2020
Sponge Examples: Energy-Latency Attacks on Neural Networks
Ilia Shumailov
Yiren Zhao
Daniel Bates
Nicolas Papernot
Robert D. Mullins
Ross J. Anderson
SILM
79
138
0
05 Jun 2020
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning
Alessandro Suglia
Ioannis Konstas
Andrea Vanzo
E. Bastianelli
Desmond Elliott
Stella Frank
Oliver Lemon
62
16
0
03 Jun 2020
Interpretable Meta-Measure for Model Performance
Alicja Gosiewska
Katarzyna Woźnica
P. Biecek
37
5
0
02 Jun 2020
Subjective Question Answering: Deciphering the inner workings of Transformers in the realm of subjectivity
Lukas Muttenthaler
41
3
0
02 Jun 2020
WikiBERT models: deep transfer learning for many languages
S. Pyysalo
Jenna Kanerva
Antti Virtanen
Filip Ginter
KELM
89
38
0
02 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
91
75
0
31 May 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
991
42,651
0
28 May 2020
NILE : Natural Language Inference with Faithful Natural Language Explanations
Sawan Kumar
Partha P. Talukdar
XAI
LRM
113
163
0
25 May 2020
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
Anne Lauscher
Olga Majewska
Leonardo F. R. Ribeiro
Iryna Gurevych
Nikolai Rozanov
Goran Glavaš
KELM
82
81
0
24 May 2020
(Re)construing Meaning in NLP
Sean Trott
Tiago Timponi Torrent
Nancy Chang
Nathan Schneider
AI4CE
48
30
0
18 May 2020
Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks
Swaroop Mishra
Arindam Mitra
Neeraj Varshney
Bhavdeep Singh Sachdeva
Chitta Baral
AIMat
45
13
0
18 May 2020
INFOTABS: Inference on Tables as Semi-structured Data
Vivek Gupta
Maitrey Mehta
Pegah Nokhiz
Vivek Srikumar
LMTD
73
112
0
13 May 2020
A Dataset for Statutory Reasoning in Tax Law Entailment and Question Answering
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
ELM
AILaw
69
69
0
11 May 2020
How Context Affects Language Models' Factual Predictions
Fabio Petroni
Patrick Lewis
Aleksandra Piktus
Tim Rocktäschel
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
73
239
0
10 May 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Marco Tulio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
ELM
214
1,110
0
08 May 2020
A Systematic Assessment of Syntactic Generalization in Neural Language Models
Jennifer Hu
Jon Gauthier
Peng Qian
Ethan Gotlieb Wilcox
R. Levy
ELM
110
221
0
07 May 2020
Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words
Josef Klafka
Allyson Ettinger
80
43
0
04 May 2020
To Test Machine Comprehension, Start by Defining Comprehension
Jesse Dunietz
Greg Burnham
Akash Bharadwaj
Owen Rambow
Jennifer Chu-Carroll
D. Ferrucci
FaML
115
65
0
04 May 2020
The Sensitivity of Language Models and Humans to Winograd Schema Perturbations
Mostafa Abdou
Vinit Ravishankar
Maria Barrett
Yonatan Belinkov
Desmond Elliott
Anders Søgaard
ReLM
LRM
113
35
0
04 May 2020
From SPMRL to NMRL: What Did We Learn (and Unlearn) in a Decade of Parsing Morphologically-Rich Languages (MRLs)?
Reut Tsarfaty
Dan Bareket
Stav Klein
Amit Seker
74
40
0
04 May 2020
Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering
Vikas Yadav
Steven Bethard
Mihai Surdeanu
RALM
119
51
0
04 May 2020
Out of the Echo Chamber: Detecting Countering Debate Speeches
Matan Orbach
Yonatan Bilu
Assaf Toledo
Dan Lahav
Michal Jacovi
R. Aharonov
Noam Slonim
60
22
0
03 May 2020
How Can We Accelerate Progress Towards Human-like Linguistic Generalization?
Tal Linzen
284
195
0
03 May 2020
Probing Contextual Language Models for Common Ground with Visual Representations
Gabriel Ilharco
Rowan Zellers
Ali Farhadi
Hannaneh Hajishirzi
105
14
0
01 May 2020
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
Edoardo Ponti
Goran Glavaš
Olga Majewska
Qianchu Liu
Ivan Vulić
Anna Korhonen
LRM
126
329
0
01 May 2020
Recurrent Neural Network Language Models Always Learn English-Like Relative Clause Attachment
Forrest Davis
Marten van Schijndel
55
23
0
01 May 2020
WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context
Anna Breit
Artem Revenko
Kiamehr Rezaee
Mohammad Taher Pilehvar
Jose Camacho-Collados
107
26
0
30 Apr 2020
WT5?! Training Text-to-Text Models to Explain their Predictions
Sharan Narang
Colin Raffel
Katherine Lee
Adam Roberts
Noah Fiedel
Karishma Malkan
80
201
0
30 Apr 2020
Don't Neglect the Obvious: On the Role of Unambiguous Words in Word Sense Disambiguation
Daniel Loureiro
Jose Camacho-Collados
88
12
0
29 Apr 2020
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning
Alexandre Tamborrino
Nicola Pellicanò
B. Pannier
Pascal Voitot
Louise Naudin
LRM
81
63
0
29 Apr 2020
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models
Mengjie Zhao
Tao R. Lin
Fei Mi
Martin Jaggi
Hinrich Schütze
77
121
0
26 Apr 2020
GLUECoS : An Evaluation Benchmark for Code-Switched NLP
Simran Khanuja
Sandipan Dandapat
A. Srinivasan
Sunayana Sitaram
Monojit Choudhury
ELM
73
148
0
26 Apr 2020
MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization
Canwen Xu
Jiaxin Pei
Hongtao Wu
Yiyu Liu
Chenliang Li
MLLM
VLM
59
14
0
26 Apr 2020
Quantifying the Contextualization of Word Representations with Semantic Class Probing
Mengjie Zhao
Philipp Dufter
Yadollah Yaghoobzadeh
Hinrich Schütze
83
27
0
25 Apr 2020
New Protocols and Negative Results for Textual Entailment Data Collection
Samuel R. Bowman
J. Palomaki
Livio Baldini Soares
Emily Pitler
70
7
0
24 Apr 2020
A Review of Winograd Schema Challenge Datasets and Approaches
Vid Kocijan
Thomas Lukasiewicz
E. Davis
G. Marcus
L. Morgenstern
70
44
0
23 Apr 2020
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLM
AI4CE
CLL
180
2,450
0
23 Apr 2020
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
102
360
0
21 Apr 2020
DIET: Lightweight Language Understanding for Dialogue Systems
Tanja Bunk
Daksh Varshneya
Vladimir Vlasov
Alan Nichol
74
162
0
21 Apr 2020
Are we pretraining it right? Digging deeper into visio-linguistic pretraining
Amanpreet Singh
Vedanuj Goswami
Devi Parikh
VLM
78
48
0
19 Apr 2020
CLUE: A Chinese Language Understanding Evaluation Benchmark
Liang Xu
Hai Hu
Xuanwei Zhang
Lu Li
Chenjie Cao
...
Cong Yue
Xinrui Zhang
Zhen-Yi Yang
Kyle Richardson
Zhenzhong Lan
ELM
108
388
0
13 Apr 2020
A New Dataset for Natural Language Inference from Code-mixed Conversations
Simran Khanuja
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
92
42
0
10 Apr 2020
Telling BERT's full story: from Local Attention to Global Aggregation
Damian Pascual
Gino Brunner
Roger Wattenhofer
57
19
0
10 Apr 2020
More Bang for Your Buck: Natural Perturbation for Robust Question Answering
Daniel Khashabi
Tushar Khot
Ashish Sabharwal
AAML
OOD
71
4
0
09 Apr 2020
Calibrating Structured Output Predictors for Natural Language Processing
Abhyuday N. Jagannatha
Hong-ye Yu
103
28
0
09 Apr 2020
TuringAdvice: A Generative and Dynamic Evaluation of Language Use
Rowan Zellers
Ari Holtzman
Elizabeth Clark
Lianhui Qin
Ali Farhadi
Yejin Choi
ELM
LRM
65
14
0
07 Apr 2020
KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
Jiyeon Ham
Yo Joong Choe
Kyubyong Park
Ilji Choi
Hyungjoon Soh
67
78
0
07 Apr 2020