Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.00537
Cited By
v1
v2
v3 (latest)
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
2 May 2019
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"
50 / 1,500 papers shown
Title
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models
Chunyuan Li
Haotian Liu
Liunian Harold Li
Pengchuan Zhang
J. Aneja
...
Ping Jin
Houdong Hu
Zicheng Liu
Yong Jae Lee
Jianfeng Gao
86
152
0
19 Apr 2022
DecBERT: Enhancing the Language Understanding of BERT with Causal Attention Masks
Ziyang Luo
Yadong Xi
Jing Ma
Zhiwei Yang
Xiaoxi Mao
Changjie Fan
Rongsheng Zhang
35
3
0
19 Apr 2022
Zero-shot Entity and Tweet Characterization with Designed Conditional Prompts and Contexts
S. Srivatsa
Tushar Mohan
Kumari Neha
Nishchay Malakar
Ponnurangam Kumaraguru
Srinath Srinivasa
55
0
0
18 Apr 2022
Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey
Kento Nozawa
Issei Sato
AI4TS
139
5
0
18 Apr 2022
AfriWOZ: Corpus for Exploiting Cross-Lingual Transferability for Generation of Dialogues in Low-Resource, African Languages
Tosin Adewumi
Mofetoluwa Adeyemi
Aremu Anuoluwapo
Bukola Peters
Happy Buzaaba
...
Phylis Ngigi
Orevaoghene Ahia
Ruqayya Nasir
F. Liwicki
Marcus Liwicki
35
1
0
17 Apr 2022
Evaluation Benchmarks for Spanish Sentence Representations
Vladimir Araujo
Andrés Carvallo
Souvik Kundu
J. Canete
Marcelo Mendoza
Robert E. Mercer
Felipe Bravo-Marquez
Marie-Francine Moens
Alvaro Soto
ELM
60
10
0
15 Apr 2022
mGPT: Few-Shot Learners Go Multilingual
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
116
155
0
15 Apr 2022
Characterizing the Efficiency vs. Accuracy Trade-off for Long-Context NLP Models
Phyllis Ang
Bhuwan Dhingra
Lisa Wu Wills
60
6
0
15 Apr 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
...
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
189
841
0
14 Apr 2022
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals
Payal Bajaj
Chenyan Xiong
Guolin Ke
Xiaodong Liu
Di He
Saurabh Tiwary
Tie-Yan Liu
Paul N. Bennett
Xia Song
Jianfeng Gao
116
32
0
13 Apr 2022
Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding
Zeming Chen
Qiyue Gao
ELM
67
4
0
13 Apr 2022
Probing for Constituency Structure in Neural Language Models
David Arps
Younes Samih
Laura Kallmeyer
Hassan Sajjad
57
14
0
13 Apr 2022
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
Swaroop Mishra
Arindam Mitra
Neeraj Varshney
Bhavdeep Singh Sachdeva
Peter Clark
Chitta Baral
Ashwin Kalyan
AIMat
ReLM
ELM
LRM
98
110
0
12 Apr 2022
Metaethical Perspectives on 'Benchmarking' AI Ethics
Travis LaCroix
A. Luccioni
57
8
0
11 Apr 2022
Parameter-Efficient Tuning by Manipulating Hidden States of Pretrained Language Models For Classification Tasks
Haoran Yang
Piji Li
Wai Lam
68
4
0
10 Apr 2022
KOBEST: Korean Balanced Evaluation of Significant Tasks
Dohyeong Kim
Myeongjun Jang
D. Kwon
Eric Davis
ALM
70
27
0
09 Apr 2022
Testing the limits of natural language models for predicting human language judgments
Tal Golan
Matthew Siegelman
N. Kriegeskorte
Christopher A. Baldassano
70
15
0
07 Apr 2022
Fusing finetuned models for better pretraining
Leshem Choshen
Elad Venezian
Noam Slonim
Yoav Katz
FedML
AI4CE
MoMe
130
96
0
06 Apr 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
557
6,320
0
05 Apr 2022
Fact Checking with Insufficient Evidence
Pepa Atanasova
J. Simonsen
Christina Lioma
Isabelle Augenstein
116
15
0
05 Apr 2022
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models
Rabeeh Karimi Mahabadi
Luke Zettlemoyer
James Henderson
Marzieh Saeidi
Lambert Mathias
Ves Stoyanov
Majid Yazdani
VLM
81
72
0
03 Apr 2022
A Survey on Aspect-Based Sentiment Classification
Gianni Brauwers
Flavius Frasincar
LLMAG
99
120
0
27 Mar 2022
Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View
Boxi Cao
Hongyu Lin
Xianpei Han
Fangchao Liu
Le Sun
ELM
AAML
57
43
0
23 Mar 2022
A Theoretically Grounded Benchmark for Evaluating Machine Commonsense
Henrique M. Dinis Santos
Ke Shen
Alice M. Mulvehill
Yasaman Razeghi
D. McGuinness
Mayank Kejriwal
ELM
LRM
70
4
0
23 Mar 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
208
1,654
0
23 Mar 2022
Towards Explainable Evaluation Metrics for Natural Language Generation
Christoph Leiter
Piyawat Lertvittayakumjorn
M. Fomicheva
Wei Zhao
Yang Gao
Steffen Eger
AAML
ELM
76
20
0
21 Mar 2022
Word Order Does Matter (And Shuffled Language Models Know It)
Vinit Ravishankar
Mostafa Abdou
Artur Kulmizev
Anders Søgaard
76
45
0
21 Mar 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
Alexis Conneau
Ankur Bapna
Yu Zhang
Min Ma
Patrick von Platen
...
Orhan Firat
Michael Auli
Sebastian Ruder
Jason Riesa
Melvin Johnson
VLM
AILaw
ELM
155
22
0
21 Mar 2022
Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges
Shikib Mehri
Jinho Choi
L. F. D’Haro
Jan Deriu
M. Eskénazi
...
David Traum
Yi-Ting Yeh
Zhou Yu
Yizhe Zhang
Chen Zhang
99
22
0
18 Mar 2022
CrossAligner & Co: Zero-Shot Transfer Methods for Task-Oriented Cross-lingual Natural Language Understanding
Milan Gritta
Ruoyu Hu
Ignacio Iacobacci
102
12
0
18 Mar 2022
Towards Lithuanian grammatical error correction
Lukas Stankevivcius
Mantas Lukovsevivcius
3DV
48
4
0
18 Mar 2022
An Analysis of Negation in Natural Language Understanding Corpora
Md Mosharaf Hossain
Dhivya Chinnappa
Eduardo Blanco
116
43
0
16 Mar 2022
FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing
Ilias Chalkidis
Tommaso Pasini
Shenmin Zhang
Letizia Tomada
Sebastian Felix Schwemer
Anders Søgaard
AILaw
96
54
0
14 Mar 2022
SciNLI: A Corpus for Natural Language Inference on Scientific Text
Mobashir Sadat
Cornelia Caragea
AILaw
89
37
0
13 Mar 2022
CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment
Lutfi Kerem Senel
Timo Schick
Hinrich Schütze
ELM
ALM
58
5
0
11 Mar 2022
HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks
Zhengkun Zhang
Wenya Guo
Xiaojun Meng
Yasheng Wang
Yadao Wang
Xin Jiang
Qun Liu
Zhenglu Yang
80
17
0
08 Mar 2022
ILDAE: Instance-Level Difficulty Analysis of Evaluation Data
Neeraj Varshney
Swaroop Mishra
Chitta Baral
69
19
0
07 Mar 2022
HEAR: Holistic Evaluation of Audio Representations
Joseph P. Turian
Jordie Shier
H. Khan
Bhiksha Raj
Björn W. Schuller
...
P. Esling
Pranay Manocha
Shinji Watanabe
Zeyu Jin
Yonatan Bisk
135
108
0
06 Mar 2022
Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents
Yicheng Zou
Hongwei Liu
Tao Gui
Junzhe Wang
Qi Zhang
M. Tang
Haixiang Li
Dan Wang
DRL
103
31
0
06 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
927
13,266
0
04 Mar 2022
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models
Mojan Javaheripi
Gustavo de Rosa
Subhabrata Mukherjee
S. Shah
Tomasz Religa
C. C. T. Mendes
Sébastien Bubeck
F. Koushanfar
Debadeepta Dey
57
20
0
04 Mar 2022
Mukayese: Turkish NLP Strikes Back
Ali Safaya
Emirhan Kurtulucs
Arda Goktougan
Deniz Yuret
77
23
0
02 Mar 2022
HyperPrompt: Prompt-based Task-Conditioning of Transformers
Yun He
H. Zheng
Yi Tay
Jai Gupta
Yu Du
...
Yaguang Li
Zhaoji Chen
Donald Metzler
Heng-Tze Cheng
Ed H. Chi
LRM
VLM
93
93
0
01 Mar 2022
E-LANG: Energy-Based Joint Inferencing of Super and Swift Language Models
Mohammad Akbari
Amin Banitalebi-Dehkordi
Yong Zhang
69
8
0
01 Mar 2022
KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models
Daniel Gao
Yantao Jia
Lei Li
Chengzhen Fu
Zhicheng Dou
Hao Jiang
Xinyu Zhang
Lei Chen
Bo Zhao
KELM
74
8
0
28 Feb 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
Sewon Min
Xinxi Lyu
Ari Holtzman
Mikel Artetxe
M. Lewis
Hannaneh Hajishirzi
Luke Zettlemoyer
LLMAG
LRM
196
1,504
0
25 Feb 2022
Capturing Failures of Large Language Models via Human Cognitive Biases
Erik Jones
Jacob Steinhardt
74
93
0
24 Feb 2022
Prompt-Learning for Short Text Classification
Yi Zhu
Xinke Zhou
Jipeng Qiang
Yun Li
Yunhao Yuan
Xindong Wu
VLM
59
36
0
23 Feb 2022
Y
\mathcal{Y}
Y
-Tuning: An Efficient Tuning Paradigm for Large-Scale Pre-Trained Models via Label Representation Learning
Yitao Liu
Chen An
Xipeng Qiu
93
17
0
20 Feb 2022
Mixture-of-Experts with Expert Choice Routing
Yan-Quan Zhou
Tao Lei
Han-Chu Liu
Nan Du
Yanping Huang
Vincent Zhao
Andrew M. Dai
Zhifeng Chen
Quoc V. Le
James Laudon
MoE
312
376
0
18 Feb 2022
Previous
1
2
3
...
20
21
22
...
28
29
30
Next