ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.00537
  4. Cited By
SuperGLUE: A Stickier Benchmark for General-Purpose Language
  Understanding Systems
v1v2v3 (latest)

SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems

2 May 2019
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
    ELM
ArXiv (abs)PDFHTML

Papers citing "SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"

50 / 1,500 papers shown
Title
Dissociating language and thought in large language models
Dissociating language and thought in large language models
Kyle Mahowald
Anna A. Ivanova
I. Blank
Nancy Kanwisher
J. Tenenbaum
Evelina Fedorenko
ELMReLM
111
215
0
16 Jan 2023
NarrowBERT: Accelerating Masked Language Model Pretraining and Inference
NarrowBERT: Accelerating Masked Language Model Pretraining and Inference
Haoxin Li
Phillip Keung
Daniel Cheng
Jungo Kasai
Noah A. Smith
63
4
0
11 Jan 2023
Towards Answering Climate Questionnaires from Unstructured Climate
  Reports
Towards Answering Climate Questionnaires from Unstructured Climate Reports
Daniel M. Spokoyny
Tanmay Laud
Thomas W. Corringham
Taylor Berg-Kirkpatrick
79
7
0
11 Jan 2023
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement
  Understanding
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Steven H. Wang
Antoine Scardigli
Leonard Tang
Wei Chen
D.M. Levkin
Anya Chen
Spencer Ball
Thomas Woodside
Oliver Zhang
Dan Hendrycks
AILawELM
70
22
0
02 Jan 2023
A Survey on In-context Learning
A Survey on In-context Learning
Qingxiu Dong
Lei Li
Damai Dai
Ce Zheng
Jingyuan Ma
...
Zhiyong Wu
Baobao Chang
Xu Sun
Lei Li
Zhifang Sui
ReLMAIMat
152
546
0
31 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
117
91
0
28 Dec 2022
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
M Saiful Bari
Aston Zhang
Shuai Zheng
Xingjian Shi
Yi Zhu
Shafiq Joty
Mu Li
RALMVLMVPVLMLRM
92
5
0
21 Dec 2022
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language
  Models
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models
Dheeraj Mekala
Jason Wolfe
Subhro Roy
95
9
0
21 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
109
45
0
21 Dec 2022
Self-Instruct: Aligning Language Models with Self-Generated Instructions
Self-Instruct: Aligning Language Models with Self-Generated Instructions
Yizhong Wang
Yeganeh Kordi
Swaroop Mishra
Alisa Liu
Noah A. Smith
Daniel Khashabi
Hannaneh Hajishirzi
ALMSyDaLRM
204
2,264
0
20 Dec 2022
DISCO: Distilling Counterfactuals with Large Language Models
DISCO: Distilling Counterfactuals with Large Language Models
Zeming Chen
Qiyue Gao
Antoine Bosselut
Ashish Sabharwal
Kyle Richardson
92
31
0
20 Dec 2022
Evaluation for Change
Evaluation for Change
Rishi Bommasani
ELM
64
0
0
20 Dec 2022
Evaluating Human-Language Model Interaction
Evaluating Human-Language Model Interaction
Mina Lee
Megha Srivastava
Amelia Hardy
John Thickstun
Esin Durmus
...
Hancheng Cao
Tony Lee
Rishi Bommasani
Michael S. Bernstein
Percy Liang
LM&MAALM
108
102
0
19 Dec 2022
Rethinking the Role of Scale for In-Context Learning: An
  Interpretability-based Case Study at 66 Billion Scale
Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
Hritik Bansal
Karthik Gopalakrishnan
Saket Dingliwal
S. Bodapati
Katrin Kirchhoff
Dan Roth
LRM
87
51
0
18 Dec 2022
Plansformer: Generating Symbolic Plans using Transformers
Plansformer: Generating Symbolic Plans using Transformers
Vishal Pallagani
Bharath Muppasani
K. Murugesan
F. Rossi
L. Horesh
Biplav Srivastava
F. Fabiano
Andrea Loreggia
LM&RoLLMAGOffRL
74
38
0
16 Dec 2022
ReCo: Reliable Causal Chain Reasoning via Structural Causal Recurrent
  Neural Networks
ReCo: Reliable Causal Chain Reasoning via Structural Causal Recurrent Neural Networks
Kai Xiong
Xiao Ding
Zhongyang Li
Li Du
Bing Qin
Yi Zheng
Baoxing Huai
LRMBDLCML
117
4
0
16 Dec 2022
ALERT: Adapting Language Models to Reasoning Tasks
ALERT: Adapting Language Models to Reasoning Tasks
Ping Yu
Tianlu Wang
O. Yu. Golovneva
Badr AlKhamissi
Siddharth Verma
Zhijing Jin
Gargi Ghosh
Mona T. Diab
Asli Celikyilmaz
ReLMLRM
83
19
0
16 Dec 2022
Structured Prompting: Scaling In-Context Learning to 1,000 Examples
Structured Prompting: Scaling In-Context Learning to 1,000 Examples
Y. Hao
Yutao Sun
Li Dong
Zhixiong Han
Yuxian Gu
Furu Wei
LRM
59
75
0
13 Dec 2022
Towards Leaving No Indic Language Behind: Building Monolingual Corpora,
  Benchmark and Models for Indic Languages
Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages
Sumanth Doddapaneni
Rahul Aralikatte
Gowtham Ramesh
Shreyansh Goyal
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
ELM
107
86
0
11 Dec 2022
Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
Aran Komatsuzaki
J. Puigcerver
James Lee-Thorp
Carlos Riquelme Ruiz
Basil Mustafa
Joshua Ainslie
Yi Tay
Mostafa Dehghani
N. Houlsby
MoMeMoE
106
124
0
09 Dec 2022
Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich
  Platform for Graph Learning Benchmarks
Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning Benchmarks
Jiaqi Ma
Xingjian Zhang
Hezheng Fan
Jin Huang
Tianyue Li
Tinghong Li
Yiwen Tu
Chen Zhu
Qiaozhu Mei
114
5
0
08 Dec 2022
CySecBERT: A Domain-Adapted Language Model for the Cybersecurity Domain
CySecBERT: A Domain-Adapted Language Model for the Cybersecurity Domain
Markus Bayer
Philip D. . Kuehn
Ramin Shanehsaz
Christian A. Reuter
62
49
0
06 Dec 2022
Improving Few-Shot Performance of Language Models via Nearest Neighbor
  Calibration
Improving Few-Shot Performance of Language Models via Nearest Neighbor Calibration
Feng Nie
Meixi Chen
Zhirui Zhang
Xuan Cheng
65
33
0
05 Dec 2022
Review on 6D Object Pose Estimation with the focus on Indoor Scene
  Understanding
Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding
Negar Nejatishahidin
Pooya Fayyazsanavi
3DPC
61
0
0
04 Dec 2022
Toward Efficient Language Model Pretraining and Downstream Adaptation
  via Self-Evolution: A Case Study on SuperGLUE
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE
Qihuang Zhong
Liang Ding
Yibing Zhan
Yu Qiao
Yonggang Wen
...
Yixin Chen
Xinbo Gao
Steven C. H. Hoi
Xiaoou Tang
Dacheng Tao
VLMELM
124
35
0
04 Dec 2022
Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Hamish Ivison
Noah A. Smith
Hannaneh Hajishirzi
Pradeep Dasigi
122
22
0
01 Dec 2022
ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT
ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT
Boyao Wang
Shizhe Diao
Jianlin Chen
Tong Zhang
VLM
79
8
0
30 Nov 2022
AutoCAD: Automatically Generating Counterfactuals for Mitigating
  Shortcut Learning
AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning
Jiaxin Wen
Yeshuang Zhu
Jinchao Zhang
Jie Zhou
Minlie Huang
CMLAAML
107
9
0
29 Nov 2022
On the Effectiveness of Parameter-Efficient Fine-Tuning
On the Effectiveness of Parameter-Efficient Fine-Tuning
Z. Fu
Haoran Yang
Anthony Man-Cho So
Wai Lam
Lidong Bing
Nigel Collier
76
162
0
28 Nov 2022
Perceive, Ground, Reason, and Act: A Benchmark for General-purpose
  Visual Representation
Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation
Jiangyong Huang
William Zhu
Baoxiong Jia
Zan Wang
Xiaojian Ma
Qing Li
Siyuan Huang
123
5
0
28 Nov 2022
X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task
  Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible
  Clarifications
X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications
Junyuan Shang
Shuohuan Wang
Yu Sun
Yanjun Yu
Yue Zhou
Li Xiang
Guixiu Yang
69
2
0
27 Nov 2022
Deep representation learning: Fundamentals, Perspectives, Applications,
  and Open Challenges
Deep representation learning: Fundamentals, Perspectives, Applications, and Open Challenges
K. T. Baghaei
Amirreza Payandeh
Pooya Fayyazsanavi
Shahram Rahimi
Zhiqian Chen
Somayeh Bakhtiari Ramezani
FaMLAI4TS
69
6
0
27 Nov 2022
TRAC: A Textual Benchmark for Reasoning about Actions and Change
TRAC: A Textual Benchmark for Reasoning about Actions and Change
Weinan He
Canming Huang
Zhanhao Xiao
Yongmei Liu
LLMAGReLMLRM
45
0
0
25 Nov 2022
TESSP: Text-Enhanced Self-Supervised Speech Pre-training
TESSP: Text-Enhanced Self-Supervised Speech Pre-training
Zhuoyuan Yao
Shuo Ren
Sanyuan Chen
Ziyang Ma
Pengcheng Guo
Linfu Xie
82
5
0
24 Nov 2022
SciRepEval: A Multi-Format Benchmark for Scientific Document
  Representations
SciRepEval: A Multi-Format Benchmark for Scientific Document Representations
Amanpreet Singh
Mike DÁrcy
Arman Cohan
Doug Downey
Sergey Feldman
102
92
0
23 Nov 2022
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP
  benchmark for Polish
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
Lukasz Augustyniak
Kamil Tagowski
Albert Sawczyn
Denis Janiak
Roman Bartusiak
...
Arkadiusz Janz
Piotr Szymañski
M. Morzy
Tomasz Kajdanowicz
Maciej Piasecki
62
12
0
23 Nov 2022
TEMPERA: Test-Time Prompting via Reinforcement Learning
TEMPERA: Test-Time Prompting via Reinforcement Learning
Tianjun Zhang
Xuezhi Wang
Denny Zhou
Dale Schuurmans
Joseph E. Gonzalez
VLM
63
39
0
21 Nov 2022
Validating Large Language Models with ReLM
Validating Large Language Models with ReLM
Michael Kuchnik
Virginia Smith
George Amvrosiadis
126
31
0
21 Nov 2022
CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog
  Evaluation
CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation
Yinpei Dai
Wanwei He
Bowen Li
Yuchuan Wu
Zhen Cao
Zhongqi An
Jian Sun
Yongbin Li
ELMALM
92
13
0
21 Nov 2022
Deep Learning on a Healthy Data Diet: Finding Important Examples for
  Fairness
Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness
A. Zayed
Prasanna Parthasarathi
Gonçalo Mordido
Hamid Palangi
Samira Shabanian
Sarath Chandar
54
22
0
20 Nov 2022
Towards Explaining Subjective Ground of Individuals on Social Media
Towards Explaining Subjective Ground of Individuals on Social Media
Younghun Lee
Dan Goldwasser
61
1
0
18 Nov 2022
On Measuring the Intrinsic Few-Shot Hardness of Datasets
On Measuring the Intrinsic Few-Shot Hardness of Datasets
Xinran Zhao
Shikhar Murty
Christopher D. Manning
30
5
0
16 Nov 2022
Parameter-Efficient Tuning on Layer Normalization for Pre-trained
  Language Models
Parameter-Efficient Tuning on Layer Normalization for Pre-trained Language Models
Wang Qi
Yu-Ping Ruan
Y. Zuo
Taihao Li
69
19
0
16 Nov 2022
Self-supervised remote sensing feature learning: Learning Paradigms,
  Challenges, and Future Works
Self-supervised remote sensing feature learning: Learning Paradigms, Challenges, and Future Works
Chao Tao
Ji Qi
Mingning Guo
Qing Zhu
Haifeng Li
SSL
104
59
0
15 Nov 2022
A Universal Discriminator for Zero-Shot Generalization
A Universal Discriminator for Zero-Shot Generalization
Haike Xu
Zongyu Lin
Jing Zhou
Yanan Zheng
Zhilin Yang
AI4CE
64
16
0
15 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an
  Out-of-distribution Generalization Perspective
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Jindong Wang
Xingxu Xie
Yue Zhang
ELM
181
82
0
15 Nov 2022
General Intelligence Requires Rethinking Exploration
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktaschel
Edward Grefenstette
LRM
79
20
0
15 Nov 2022
A Survey of Knowledge Enhanced Pre-trained Language Models
A Survey of Knowledge Enhanced Pre-trained Language Models
Linmei Hu
Zeyi Liu
Ziwang Zhao
Lei Hou
Liqiang Nie
Juanzi Li
KELMVLM
158
137
0
11 Nov 2022
Towards Human-Centred Explainability Benchmarks For Text Classification
Towards Human-Centred Explainability Benchmarks For Text Classification
Viktor Schlegel
Erick Mendez Guzman
Riza Batista-Navarro
91
5
0
10 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
468
2,398
0
09 Nov 2022
Previous
123...161718...282930
Next