1905.00537
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
2 May 2019
Alex Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Papers citing
"SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"
50 / 1,500 papers shown
Title
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
386
1,101
0
05 Oct 2022
Lexical semantics enhanced neural word embeddings
Dongqiang Yang
Ning Li
Li Zou
Hongwei Ma
67
4
0
03 Oct 2022
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
Leandro von Werra
Lewis Tunstall
A. Thakur
A. Luccioni
Tristan Thrush
...
Julien Chaumond
Margaret Mitchell
Alexander M. Rush
Thomas Wolf
Douwe Kiela
ELM
101
26
0
30 Sep 2022
On the Impossible Safety of Large AI Models
El-Mahdi El-Mhamdi
Sadegh Farhadkhani
R. Guerraoui
Nirupam Gupta
L. Hoang
Rafael Pinot
Sébastien Rouault
John Stephan
110
33
0
30 Sep 2022
Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification
Muhammad N. ElNokrashy
Badr AlKhamissi
Mona T. Diab
MoMe
90
5
0
30 Sep 2022
Clinical Language Understanding Evaluation (CLUE)
Travis R. Goodwin
Dina Demner-Fushman
ELM
LM&MA
28
1
0
28 Sep 2022
ArNLI: Arabic Natural Language Inference for Entailment and Contradiction Detection
Khloud Al Jallad
Nada Ghneim
52
4
0
28 Sep 2022
EditEval: An Instruction-Based Benchmark for Text Improvements
Jane Dwivedi-Yu
Timo Schick
Zhengbao Jiang
Maria Lomeli
Patrick Lewis
Gautier Izacard
Edouard Grave
Sebastian Riedel
Fabio Petroni
104
28
0
27 Sep 2022
Towards Faithful Model Explanation in NLP: A Survey
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
XAI
237
120
0
22 Sep 2022
Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
Zichun Yu
Tianyu Gao
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Maosong Sun
Jie Zhou
VLM
LRM
51
1
0
20 Sep 2022
How to Adapt Pre-trained Vision-and-Language Models to a Text-only Input?
Lovisa Hagström
Richard Johansson
VLM
59
4
0
19 Sep 2022
Psychologically-informed chain-of-thought prompts for metaphor understanding in large language models
Ben Prystawski
P. Thibodeau
Christopher Potts
Noah D. Goodman
ReLM
LRM
AI4CE
76
21
0
16 Sep 2022
Machine Reading, Fast and Slow: When Do Models "Understand" Language?
Sagnik Ray Choudhury
Anna Rogers
Isabelle Augenstein
LRM
77
18
0
15 Sep 2022
TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media
Daniel Loureiro
Aminette D'Souza
Areej Muhajab
Isabella A. White
Gabriel Wong
Luis Espinosa Anke
Leonardo Neves
Francesco Barbieri
Jose Camacho-Collados
77
26
0
15 Sep 2022
VIPHY: Probing "Visible" Physical Commonsense Knowledge
Shikhar Singh
Ehsan Qasemi
Muhao Chen
92
7
0
15 Sep 2022
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Xi Chen
Tianlin Li
Soravit Changpinyo
A. Piergiovanni
Piotr Padlewski
...
Andreas Steiner
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
MLLM
VLM
205
741
0
14 Sep 2022
iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images
Peng Yin
Ivan Cisneros
Ji Zhang
Howie Choset
Sebastian Scherer
58
17
0
14 Sep 2022
Simple and Effective Gradient-Based Tuning of Sequence-to-Sequence Models
Jared Lichtarge
Chris Alberti
Shankar Kumar
88
4
0
10 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
81
1
0
08 Sep 2022
A Review of Sparse Expert Models in Deep Learning
W. Fedus
J. Dean
Barret Zoph
MoE
129
154
0
04 Sep 2022
Generalization in Neural Networks: A Broad Survey
Chris Rohlfs
OOD
AI4CE
67
7
0
04 Sep 2022
FOLIO: Natural Language Reasoning with First-Order Logic
Simeng Han
Hailey Schoelkopf
Yilun Zhao
Zhenting Qi
Martin Riddell
...
Yingbo Zhou
Caiming Xiong
Rex Ying
Arman Cohan
Dragomir R. Radev
ReLM
LRM
131
109
0
02 Sep 2022
Making Intelligence: Ethical Values in IQ and ML Benchmarks
Borhane Blili-Hamelin
Leif Hancox-Li
69
18
0
01 Sep 2022
Why Do Neural Language Models Still Need Commonsense Knowledge to Handle Semantic Variations in Question Answering?
Sunjae Kwon
Cheongwoong Kang
Jiyeon Han
Jaesik Choi
59
0
0
01 Sep 2022
SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance
Li Zhang
Youkow Homma
Yujing Wang
Min-man Wu
Mao Yang
Ruofei Zhang
Ting Cao
Wei Shen
OffRL
76
5
0
30 Aug 2022
On Reality and the Limits of Language Data: Aligning LLMs with Human Norms
Nigel Collier
Fangyu Liu
Ehsan Shareghi
48
3
0
25 Aug 2022
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
VLM
CLL
94
44
0
22 Aug 2022
Cognitive Modeling of Semantic Fluency Using Transformers
Animesh Nighojkar
Anna Khlyzova
John Licato
42
3
0
20 Aug 2022
Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Yile Wang
Yue Zhang
59
5
0
20 Aug 2022
Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding
Zhaoye Fei
Yu Tian
Yongkang Wu
Xinyu Zhang
Yutao Zhu
...
Dejiang Kong
Ruofei Lai
Bo Zhao
Zhicheng Dou
Xipeng Qiu
282
1
0
19 Aug 2022
Pathway to Future Symbiotic Creativity
Yi-Ting Guo
Qi-fei Liu
Jie Chen
Wei Xue
Jie Fu
...
Fernando Rosas
Jeffrey Shaw
Xing Wu
Jiji Zhang
Jianliang Xu
66
0
0
18 Aug 2022
ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset
Zhihua Jin
Xingbo Wang
Furui Cheng
Chunhui Sun
Qun Liu
Huamin Qu
56
9
0
17 Aug 2022
Entity Anchored ICD Coding
Jay DeYoung
Han-Chin Shing
Luyang Kong
C. Winestock
Chaitanya P. Shivade
108
5
0
15 Aug 2022
Social Simulacra: Creating Populated Prototypes for Social Computing Systems
J. Park
Lindsay Popowski
Carrie J. Cai
Meredith Ringel Morris
Percy Liang
Michael S. Bernstein
85
298
0
08 Aug 2022
Fusing Sentence Embeddings Into LSTM-based Autoregressive Language Models
Vilém Zouhar
Marius Mosbach
Dietrich Klakow
57
1
0
04 Aug 2022
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model
Saleh Soltan
Shankar Ananthakrishnan
Jack G. M. FitzGerald
Rahul Gupta
Wael Hamza
...
Mukund Sridhar
Fabian Triefenbach
Apurv Verma
Gokhan Tur
Premkumar Natarajan
129
83
0
02 Aug 2022
Sequence to sequence pretraining for a less-resourced Slovenian language
Matej Ulčar
Marko Robnik-Šikonja
AIMat
68
17
0
28 Jul 2022
Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Yi Tay
Mostafa Dehghani
Samira Abnar
Hyung Won Chung
W. Fedus
J. Rao
Sharan Narang
Vinh Q. Tran
Dani Yogatama
Donald Metzler
AI4CE
122
107
0
21 Jul 2022
Analyzing Bagging Methods for Language Models
Pranab Islam
Shaan Khosla
Arthur Lok
Mudit Saxena
UQCV
MoE
ELM
42
1
0
19 Jul 2022
Why do tree-based models still outperform deep learning on tabular data?
Léo Grinsztajn
Edouard Oyallon
Gaël Varoquaux
LMTD
99
373
0
18 Jul 2022
STT: Soft Template Tuning for Few-Shot Adaptation
Ping Yu
Wei Wang
Chunyuan Li
Ruiyi Zhang
Zhanpeng Jin
Changyou Chen
VLM
31
0
0
18 Jul 2022
N-Grammer: Augmenting Transformers with latent n-grams
Aurko Roy
Rohan Anil
Guangda Lai
Benjamin Lee
Jeffrey Zhao
...
Yu
Phuong Dao
Christopher Fifty
Zhiwen Chen
Yonghui Wu
77
8
0
13 Jul 2022
Rationale-Augmented Ensembles in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Denny Zhou
ReLM
LRM
119
126
0
02 Jul 2022
FL-Tuning: Layer Tuning for Feed-Forward Network in Transformer
Jingping Liu
Yuqiu Song
Kui Xue
Hongli Sun
Chao Wang
Lihan Chen
Haiyun Jiang
Jiaqing Liang
Tong Ruan
69
2
0
30 Jun 2022
Endowing Language Models with Multimodal Knowledge Graph Representations
Ningyuan Huang
Y. Deshpande
Yibo Liu
Houda Alberts
Kyunghyun Cho
Clara Vania
Iacer Calixto
VLM
72
16
0
27 Jun 2022
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Sebastian Gehrmann
Abhik Bhattacharjee
Abinaya Mahendiran
Alex Wang
Alexandros Papangelis
...
Yacine Jernite
Yi Xu
Yisi Sang
Yixin Liu
Yufang Hou
114
39
0
22 Jun 2022
BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic Parsing
Subhro Roy
Sam Thomson
Tongfei Chen
Richard Shin
Adam Pauls
Jason Eisner
Benjamin Van Durme
ALM
111
13
0
21 Jun 2022
Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender Bias
Yarden Tal
Inbal Magar
Roy Schwartz
77
36
0
20 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
171
412
0
17 Jun 2022
PInKS: Preconditioned Commonsense Inference with Minimal Supervision
Ehsan Qasemi
Piyush Khanna
Qiang Ning
Muhao Chen
ReLM
LRM
87
8
0
16 Jun 2022