ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.00537
  4. Cited By
SuperGLUE: A Stickier Benchmark for General-Purpose Language
  Understanding Systems
v1v2v3 (latest)

SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems

2 May 2019
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
    ELM
ArXiv (abs)PDFHTML

Papers citing "SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"

50 / 1,500 papers shown
Title
ColdGANs: Taming Language GANs with Cautious Sampling Strategies
ColdGANs: Taming Language GANs with Cautious Sampling Strategies
Thomas Scialom
Paul-Alexis Dray
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
GANSyDa
68
18
0
08 Jun 2020
Probing Neural Dialog Models for Conversational Understanding
Probing Neural Dialog Models for Conversational Understanding
Abdelrhman Saleh
Tovly Deutsch
Stephen Casper
Yonatan Belinkov
Stuart M. Shieber
65
13
0
07 Jun 2020
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
173
2,770
0
05 Jun 2020
Sponge Examples: Energy-Latency Attacks on Neural Networks
Sponge Examples: Energy-Latency Attacks on Neural Networks
Ilia Shumailov
Yiren Zhao
Daniel Bates
Nicolas Papernot
Robert D. Mullins
Ross J. Anderson
SILM
79
138
0
05 Jun 2020
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language
  Learning
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning
Alessandro Suglia
Ioannis Konstas
Andrea Vanzo
E. Bastianelli
Desmond Elliott
Stella Frank
Oliver Lemon
62
16
0
03 Jun 2020
Interpretable Meta-Measure for Model Performance
Interpretable Meta-Measure for Model Performance
Alicja Gosiewska
Katarzyna Wo'znica
P. Biecek
37
5
0
02 Jun 2020
Subjective Question Answering: Deciphering the inner workings of
  Transformers in the realm of subjectivity
Subjective Question Answering: Deciphering the inner workings of Transformers in the realm of subjectivity
Lukas Muttenthaler
41
3
0
02 Jun 2020
WikiBERT models: deep transfer learning for many languages
WikiBERT models: deep transfer learning for many languages
S. Pyysalo
Jenna Kanerva
Antti Virtanen
Filip Ginter
KELM
89
38
0
02 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
91
75
0
31 May 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
991
42,651
0
28 May 2020
NILE : Natural Language Inference with Faithful Natural Language
  Explanations
NILE : Natural Language Inference with Faithful Natural Language Explanations
Sawan Kumar
Partha P. Talukdar
XAILRM
113
163
0
25 May 2020
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge
  Injection into Pretrained Transformers
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
Anne Lauscher
Olga Majewska
Leonardo F. R. Ribeiro
Iryna Gurevych
Nikolai Rozanov
Goran Glavaš
KELM
82
81
0
24 May 2020
(Re)construing Meaning in NLP
(Re)construing Meaning in NLP
Sean Trott
Tiago Timponi Torrent
Nancy Chang
Nathan Schneider
AI4CE
48
30
0
18 May 2020
Towards Question Format Independent Numerical Reasoning: A Set of
  Prerequisite Tasks
Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks
Swaroop Mishra
Arindam Mitra
Neeraj Varshney
Bhavdeep Singh Sachdeva
Chitta Baral
AIMat
45
13
0
18 May 2020
INFOTABS: Inference on Tables as Semi-structured Data
INFOTABS: Inference on Tables as Semi-structured Data
Vivek Gupta
Maitrey Mehta
Pegah Nokhiz
Vivek Srikumar
LMTD
73
112
0
13 May 2020
A Dataset for Statutory Reasoning in Tax Law Entailment and Question
  Answering
A Dataset for Statutory Reasoning in Tax Law Entailment and Question Answering
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
ELMAILaw
69
69
0
11 May 2020
How Context Affects Language Models' Factual Predictions
How Context Affects Language Models' Factual Predictions
Fabio Petroni
Patrick Lewis
Aleksandra Piktus
Tim Rocktaschel
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
73
239
0
10 May 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Marco Tulio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
ELM
214
1,110
0
08 May 2020
A Systematic Assessment of Syntactic Generalization in Neural Language
  Models
A Systematic Assessment of Syntactic Generalization in Neural Language Models
Jennifer Hu
Jon Gauthier
Peng Qian
Ethan Gotlieb Wilcox
R. Levy
ELM
110
221
0
07 May 2020
Spying on your neighbors: Fine-grained probing of contextual embeddings
  for information about surrounding words
Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words
Josef Klafka
Allyson Ettinger
80
43
0
04 May 2020
To Test Machine Comprehension, Start by Defining Comprehension
To Test Machine Comprehension, Start by Defining Comprehension
Jesse Dunietz
Greg Burnham
Akash Bharadwaj
Owen Rambow
Jennifer Chu-Carroll
D. Ferrucci
FaML
115
65
0
04 May 2020
The Sensitivity of Language Models and Humans to Winograd Schema
  Perturbations
The Sensitivity of Language Models and Humans to Winograd Schema Perturbations
Mostafa Abdou
Vinit Ravishankar
Maria Barrett
Yonatan Belinkov
Desmond Elliott
Anders Søgaard
ReLMLRM
113
35
0
04 May 2020
From SPMRL to NMRL: What Did We Learn (and Unlearn) in a Decade of
  Parsing Morphologically-Rich Languages (MRLs)?
From SPMRL to NMRL: What Did We Learn (and Unlearn) in a Decade of Parsing Morphologically-Rich Languages (MRLs)?
Reut Tsarfaty
Dan Bareket
Stav Klein
Amit Seker
74
40
0
04 May 2020
Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop
  Question Answering
Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering
Vikas Yadav
Steven Bethard
Mihai Surdeanu
RALM
119
51
0
04 May 2020
Out of the Echo Chamber: Detecting Countering Debate Speeches
Out of the Echo Chamber: Detecting Countering Debate Speeches
Matan Orbach
Yonatan Bilu
Assaf Toledo
Dan Lahav
Michal Jacovi
R. Aharonov
Noam Slonim
60
22
0
03 May 2020
How Can We Accelerate Progress Towards Human-like Linguistic
  Generalization?
How Can We Accelerate Progress Towards Human-like Linguistic Generalization?
Tal Linzen
284
195
0
03 May 2020
Probing Contextual Language Models for Common Ground with Visual
  Representations
Probing Contextual Language Models for Common Ground with Visual Representations
Gabriel Ilharco
Rowan Zellers
Ali Farhadi
Hannaneh Hajishirzi
105
14
0
01 May 2020
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
Edoardo Ponti
Goran Glavaš
Olga Majewska
Qianchu Liu
Ivan Vulić
Anna Korhonen
LRM
126
329
0
01 May 2020
Recurrent Neural Network Language Models Always Learn English-Like
  Relative Clause Attachment
Recurrent Neural Network Language Models Always Learn English-Like Relative Clause Attachment
Forrest Davis
Marten van Schijndel
55
23
0
01 May 2020
WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words
  in Context
WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context
Anna Breit
Artem Revenko
Kiamehr Rezaee
Mohammad Taher Pilehvar
Jose Camacho-Collados
107
26
0
30 Apr 2020
WT5?! Training Text-to-Text Models to Explain their Predictions
WT5?! Training Text-to-Text Models to Explain their Predictions
Sharan Narang
Colin Raffel
Katherine Lee
Adam Roberts
Noah Fiedel
Karishma Malkan
80
201
0
30 Apr 2020
Don't Neglect the Obvious: On the Role of Unambiguous Words in Word
  Sense Disambiguation
Don't Neglect the Obvious: On the Role of Unambiguous Words in Word Sense Disambiguation
Daniel Loureiro
Jose Camacho-Collados
88
12
0
29 Apr 2020
Pre-training Is (Almost) All You Need: An Application to Commonsense
  Reasoning
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning
Alexandre Tamborrino
Nicola Pellicanò
B. Pannier
Pascal Voitot
Louise Naudin
LRM
81
63
0
29 Apr 2020
Masking as an Efficient Alternative to Finetuning for Pretrained
  Language Models
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models
Mengjie Zhao
Tao R. Lin
Fei Mi
Martin Jaggi
Hinrich Schütze
77
121
0
26 Apr 2020
GLUECoS : An Evaluation Benchmark for Code-Switched NLP
GLUECoS : An Evaluation Benchmark for Code-Switched NLP
Simran Khanuja
Sandipan Dandapat
A. Srinivasan
Sunayana Sitaram
Monojit Choudhury
ELM
73
148
0
26 Apr 2020
MATINF: A Jointly Labeled Large-Scale Dataset for Classification,
  Question Answering and Summarization
MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization
Canwen Xu
Jiaxin Pei
Hongtao Wu
Yiyu Liu
Chenliang Li
MLLMVLM
59
14
0
26 Apr 2020
Quantifying the Contextualization of Word Representations with Semantic
  Class Probing
Quantifying the Contextualization of Word Representations with Semantic Class Probing
Mengjie Zhao
Philipp Dufter
Yadollah Yaghoobzadeh
Hinrich Schütze
83
27
0
25 Apr 2020
New Protocols and Negative Results for Textual Entailment Data
  Collection
New Protocols and Negative Results for Textual Entailment Data Collection
Samuel R. Bowman
J. Palomaki
Livio Baldini Soares
Emily Pitler
70
7
0
24 Apr 2020
A Review of Winograd Schema Challenge Datasets and Approaches
A Review of Winograd Schema Challenge Datasets and Approaches
Vid Kocijan
Thomas Lukasiewicz
E. Davis
G. Marcus
L. Morgenstern
70
44
0
23 Apr 2020
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLMAI4CECLL
180
2,450
0
23 Apr 2020
Experience Grounds Language
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
102
360
0
21 Apr 2020
DIET: Lightweight Language Understanding for Dialogue Systems
DIET: Lightweight Language Understanding for Dialogue Systems
Tanja Bunk
Daksh Varshneya
Vladimir Vlasov
Alan Nichol
74
162
0
21 Apr 2020
Are we pretraining it right? Digging deeper into visio-linguistic
  pretraining
Are we pretraining it right? Digging deeper into visio-linguistic pretraining
Amanpreet Singh
Vedanuj Goswami
Devi Parikh
VLM
78
48
0
19 Apr 2020
CLUE: A Chinese Language Understanding Evaluation Benchmark
CLUE: A Chinese Language Understanding Evaluation Benchmark
Liang Xu
Hai Hu
Xuanwei Zhang
Lu Li
Chenjie Cao
...
Cong Yue
Xinrui Zhang
Zhen-Yi Yang
Kyle Richardson
Zhenzhong Lan
ELM
108
388
0
13 Apr 2020
A New Dataset for Natural Language Inference from Code-mixed
  Conversations
A New Dataset for Natural Language Inference from Code-mixed Conversations
Simran Khanuja
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
92
42
0
10 Apr 2020
Telling BERT's full story: from Local Attention to Global Aggregation
Telling BERT's full story: from Local Attention to Global Aggregation
Damian Pascual
Gino Brunner
Roger Wattenhofer
57
19
0
10 Apr 2020
More Bang for Your Buck: Natural Perturbation for Robust Question
  Answering
More Bang for Your Buck: Natural Perturbation for Robust Question Answering
Daniel Khashabi
Tushar Khot
Ashish Sabharwal
AAMLOOD
71
4
0
09 Apr 2020
Calibrating Structured Output Predictors for Natural Language Processing
Calibrating Structured Output Predictors for Natural Language Processing
Abhyuday N. Jagannatha
Hong-ye Yu
103
28
0
09 Apr 2020
TuringAdvice: A Generative and Dynamic Evaluation of Language Use
TuringAdvice: A Generative and Dynamic Evaluation of Language Use
Rowan Zellers
Ari Holtzman
Elizabeth Clark
Lianhui Qin
Ali Farhadi
Yejin Choi
ELMLRM
65
14
0
07 Apr 2020
KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language
  Understanding
KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
Jiyeon Ham
Yo Joong Choe
Kyubyong Park
Ilji Choi
Hyungjoon Soh
67
78
0
07 Apr 2020
Previous
123...282930
Next