Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.00537
Cited By
v1
v2
v3 (latest)
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
2 May 2019
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"
50 / 1,500 papers shown
Title
Dissociating language and thought in large language models
Kyle Mahowald
Anna A. Ivanova
I. Blank
Nancy Kanwisher
J. Tenenbaum
Evelina Fedorenko
ELM
ReLM
111
215
0
16 Jan 2023
NarrowBERT: Accelerating Masked Language Model Pretraining and Inference
Haoxin Li
Phillip Keung
Daniel Cheng
Jungo Kasai
Noah A. Smith
63
4
0
11 Jan 2023
Towards Answering Climate Questionnaires from Unstructured Climate Reports
Daniel M. Spokoyny
Tanmay Laud
Thomas W. Corringham
Taylor Berg-Kirkpatrick
79
7
0
11 Jan 2023
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Steven H. Wang
Antoine Scardigli
Leonard Tang
Wei Chen
D.M. Levkin
Anya Chen
Spencer Ball
Thomas Woodside
Oliver Zhang
Dan Hendrycks
AILaw
ELM
70
22
0
02 Jan 2023
A Survey on In-context Learning
Qingxiu Dong
Lei Li
Damai Dai
Ce Zheng
Jingyuan Ma
...
Zhiyong Wu
Baobao Chang
Xu Sun
Lei Li
Zhifang Sui
ReLM
AIMat
152
546
0
31 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
117
91
0
28 Dec 2022
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
M Saiful Bari
Aston Zhang
Shuai Zheng
Xingjian Shi
Yi Zhu
Shafiq Joty
Mu Li
RALM
VLM
VPVLM
LRM
92
5
0
21 Dec 2022
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models
Dheeraj Mekala
Jason Wolfe
Subhro Roy
95
9
0
21 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
109
45
0
21 Dec 2022
Self-Instruct: Aligning Language Models with Self-Generated Instructions
Yizhong Wang
Yeganeh Kordi
Swaroop Mishra
Alisa Liu
Noah A. Smith
Daniel Khashabi
Hannaneh Hajishirzi
ALM
SyDa
LRM
204
2,264
0
20 Dec 2022
DISCO: Distilling Counterfactuals with Large Language Models
Zeming Chen
Qiyue Gao
Antoine Bosselut
Ashish Sabharwal
Kyle Richardson
92
31
0
20 Dec 2022
Evaluation for Change
Rishi Bommasani
ELM
64
0
0
20 Dec 2022
Evaluating Human-Language Model Interaction
Mina Lee
Megha Srivastava
Amelia Hardy
John Thickstun
Esin Durmus
...
Hancheng Cao
Tony Lee
Rishi Bommasani
Michael S. Bernstein
Percy Liang
LM&MA
ALM
108
102
0
19 Dec 2022
Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
Hritik Bansal
Karthik Gopalakrishnan
Saket Dingliwal
S. Bodapati
Katrin Kirchhoff
Dan Roth
LRM
87
51
0
18 Dec 2022
Plansformer: Generating Symbolic Plans using Transformers
Vishal Pallagani
Bharath Muppasani
K. Murugesan
F. Rossi
L. Horesh
Biplav Srivastava
F. Fabiano
Andrea Loreggia
LM&Ro
LLMAG
OffRL
74
38
0
16 Dec 2022
ReCo: Reliable Causal Chain Reasoning via Structural Causal Recurrent Neural Networks
Kai Xiong
Xiao Ding
Zhongyang Li
Li Du
Bing Qin
Yi Zheng
Baoxing Huai
LRM
BDL
CML
117
4
0
16 Dec 2022
ALERT: Adapting Language Models to Reasoning Tasks
Ping Yu
Tianlu Wang
O. Yu. Golovneva
Badr AlKhamissi
Siddharth Verma
Zhijing Jin
Gargi Ghosh
Mona T. Diab
Asli Celikyilmaz
ReLM
LRM
83
19
0
16 Dec 2022
Structured Prompting: Scaling In-Context Learning to 1,000 Examples
Y. Hao
Yutao Sun
Li Dong
Zhixiong Han
Yuxian Gu
Furu Wei
LRM
59
75
0
13 Dec 2022
Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages
Sumanth Doddapaneni
Rahul Aralikatte
Gowtham Ramesh
Shreyansh Goyal
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
ELM
107
86
0
11 Dec 2022
Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
Aran Komatsuzaki
J. Puigcerver
James Lee-Thorp
Carlos Riquelme Ruiz
Basil Mustafa
Joshua Ainslie
Yi Tay
Mostafa Dehghani
N. Houlsby
MoMe
MoE
106
124
0
09 Dec 2022
Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning Benchmarks
Jiaqi Ma
Xingjian Zhang
Hezheng Fan
Jin Huang
Tianyue Li
Tinghong Li
Yiwen Tu
Chen Zhu
Qiaozhu Mei
114
5
0
08 Dec 2022
CySecBERT: A Domain-Adapted Language Model for the Cybersecurity Domain
Markus Bayer
Philip D. . Kuehn
Ramin Shanehsaz
Christian A. Reuter
62
49
0
06 Dec 2022
Improving Few-Shot Performance of Language Models via Nearest Neighbor Calibration
Feng Nie
Meixi Chen
Zhirui Zhang
Xuan Cheng
65
33
0
05 Dec 2022
Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding
Negar Nejatishahidin
Pooya Fayyazsanavi
3DPC
61
0
0
04 Dec 2022
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE
Qihuang Zhong
Liang Ding
Yibing Zhan
Yu Qiao
Yonggang Wen
...
Yixin Chen
Xinbo Gao
Steven C. H. Hoi
Xiaoou Tang
Dacheng Tao
VLM
ELM
124
35
0
04 Dec 2022
Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Hamish Ivison
Noah A. Smith
Hannaneh Hajishirzi
Pradeep Dasigi
122
22
0
01 Dec 2022
ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT
Boyao Wang
Shizhe Diao
Jianlin Chen
Tong Zhang
VLM
79
8
0
30 Nov 2022
AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning
Jiaxin Wen
Yeshuang Zhu
Jinchao Zhang
Jie Zhou
Minlie Huang
CML
AAML
107
9
0
29 Nov 2022
On the Effectiveness of Parameter-Efficient Fine-Tuning
Z. Fu
Haoran Yang
Anthony Man-Cho So
Wai Lam
Lidong Bing
Nigel Collier
76
162
0
28 Nov 2022
Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation
Jiangyong Huang
William Zhu
Baoxiong Jia
Zan Wang
Xiaojian Ma
Qing Li
Siyuan Huang
123
5
0
28 Nov 2022
X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications
Junyuan Shang
Shuohuan Wang
Yu Sun
Yanjun Yu
Yue Zhou
Li Xiang
Guixiu Yang
69
2
0
27 Nov 2022
Deep representation learning: Fundamentals, Perspectives, Applications, and Open Challenges
K. T. Baghaei
Amirreza Payandeh
Pooya Fayyazsanavi
Shahram Rahimi
Zhiqian Chen
Somayeh Bakhtiari Ramezani
FaML
AI4TS
69
6
0
27 Nov 2022
TRAC: A Textual Benchmark for Reasoning about Actions and Change
Weinan He
Canming Huang
Zhanhao Xiao
Yongmei Liu
LLMAG
ReLM
LRM
45
0
0
25 Nov 2022
TESSP: Text-Enhanced Self-Supervised Speech Pre-training
Zhuoyuan Yao
Shuo Ren
Sanyuan Chen
Ziyang Ma
Pengcheng Guo
Linfu Xie
82
5
0
24 Nov 2022
SciRepEval: A Multi-Format Benchmark for Scientific Document Representations
Amanpreet Singh
Mike DÁrcy
Arman Cohan
Doug Downey
Sergey Feldman
102
92
0
23 Nov 2022
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
Lukasz Augustyniak
Kamil Tagowski
Albert Sawczyn
Denis Janiak
Roman Bartusiak
...
Arkadiusz Janz
Piotr Szymañski
M. Morzy
Tomasz Kajdanowicz
Maciej Piasecki
62
12
0
23 Nov 2022
TEMPERA: Test-Time Prompting via Reinforcement Learning
Tianjun Zhang
Xuezhi Wang
Denny Zhou
Dale Schuurmans
Joseph E. Gonzalez
VLM
63
39
0
21 Nov 2022
Validating Large Language Models with ReLM
Michael Kuchnik
Virginia Smith
George Amvrosiadis
126
31
0
21 Nov 2022
CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation
Yinpei Dai
Wanwei He
Bowen Li
Yuchuan Wu
Zhen Cao
Zhongqi An
Jian Sun
Yongbin Li
ELM
ALM
92
13
0
21 Nov 2022
Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness
A. Zayed
Prasanna Parthasarathi
Gonçalo Mordido
Hamid Palangi
Samira Shabanian
Sarath Chandar
54
22
0
20 Nov 2022
Towards Explaining Subjective Ground of Individuals on Social Media
Younghun Lee
Dan Goldwasser
61
1
0
18 Nov 2022
On Measuring the Intrinsic Few-Shot Hardness of Datasets
Xinran Zhao
Shikhar Murty
Christopher D. Manning
30
5
0
16 Nov 2022
Parameter-Efficient Tuning on Layer Normalization for Pre-trained Language Models
Wang Qi
Yu-Ping Ruan
Y. Zuo
Taihao Li
69
19
0
16 Nov 2022
Self-supervised remote sensing feature learning: Learning Paradigms, Challenges, and Future Works
Chao Tao
Ji Qi
Mingning Guo
Qing Zhu
Haifeng Li
SSL
104
59
0
15 Nov 2022
A Universal Discriminator for Zero-Shot Generalization
Haike Xu
Zongyu Lin
Jing Zhou
Yanan Zheng
Zhilin Yang
AI4CE
64
16
0
15 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Jindong Wang
Xingxu Xie
Yue Zhang
ELM
181
82
0
15 Nov 2022
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktaschel
Edward Grefenstette
LRM
79
20
0
15 Nov 2022
A Survey of Knowledge Enhanced Pre-trained Language Models
Linmei Hu
Zeyi Liu
Ziwang Zhao
Lei Hou
Liqiang Nie
Juanzi Li
KELM
VLM
158
137
0
11 Nov 2022
Towards Human-Centred Explainability Benchmarks For Text Classification
Viktor Schlegel
Erick Mendez Guzman
Riza Batista-Navarro
91
5
0
10 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
468
2,398
0
09 Nov 2022
Previous
1
2
3
...
16
17
18
...
28
29
30
Next