ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.00537
  4. Cited By
SuperGLUE: A Stickier Benchmark for General-Purpose Language
  Understanding Systems
v1v2v3 (latest)

SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems

2 May 2019
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
    ELM
ArXiv (abs)PDFHTML

Papers citing "SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"

50 / 1,500 papers shown
Title
Inherent Trade-Offs between Diversity and Stability in Multi-Task
  Benchmarks
Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks
Guanhua Zhang
Moritz Hardt
100
11
0
02 May 2024
AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of
  Low-Rank Adaptation Experts
AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation Experts
Zefang Liu
Jiahua Luo
MoEKELM
82
13
0
01 May 2024
Towards a Search Engine for Machines: Unified Ranking for Multiple
  Retrieval-Augmented Large Language Models
Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language Models
Alireza Salemi
Hamed Zamani
61
5
0
30 Apr 2024
StablePT: Towards Stable Prompting for Few-shot Learning via Input
  Separation
StablePT: Towards Stable Prompting for Few-shot Learning via Input Separation
Xiaoming Liu
Chen Liu
Zhaohan Zhang
Chengzhengxu Li
Longtian Wang
Y. Lan
Chao Shen
VLM
87
4
0
30 Apr 2024
Benchmarking Benchmark Leakage in Large Language Models
Benchmarking Benchmark Leakage in Large Language Models
Ruijie Xu
Zengzhi Wang
Run-Ze Fan
Pengfei Liu
126
54
0
29 Apr 2024
Text Quality-Based Pruning for Efficient Training of Language Models
Text Quality-Based Pruning for Efficient Training of Language Models
Vasu Sharma
Karthik Padthe
Newsha Ardalani
Kushal Tirumala
Russell Howes
...
Po-Yao Huang
Shang-Wen Li
Armen Aghajanyan
Gargi Ghosh
Luke Zettlemoyer
120
6
0
26 Apr 2024
HYPE: Hyperbolic Entailment Filtering for Underspecified Images and
  Texts
HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
Wonjae Kim
Sanghyuk Chun
Taekyung Kim
Dongyoon Han
Sangdoo Yun
99
9
0
26 Apr 2024
Exploring Internal Numeracy in Language Models: A Case Study on ALBERT
Exploring Internal Numeracy in Language Models: A Case Study on ALBERT
Ulme Wennberg
G. Henter
MILM
71
1
0
25 Apr 2024
Evaluating Large Language Models for Material Selection
Evaluating Large Language Models for Material Selection
Daniele Grandi
Yash Jain
Allin Groom
Brandon Cramer
Christopher McComb
66
9
0
23 Apr 2024
Q-Tuning: Queue-based Prompt Tuning for Lifelong Few-shot Language
  Learning
Q-Tuning: Queue-based Prompt Tuning for Lifelong Few-shot Language Learning
Yanhui Guo
Shaoyuan Xu
Jinmiao Fu
Jia-Wei Liu
Chaosheng Dong
Bryan Wang
VLMCLL
80
8
0
22 Apr 2024
Stronger Random Baselines for In-Context Learning
Stronger Random Baselines for In-Context Learning
Gregory Yauney
David M. Mimno
71
2
0
19 Apr 2024
TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and
  Historical Languages
TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages
Aleksei Dorkin
Kairit Sirts
45
2
0
19 Apr 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models
  Using Multisense Consistency
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
113
7
0
18 Apr 2024
A Survey on Retrieval-Augmented Text Generation for Large Language
  Models
A Survey on Retrieval-Augmented Text Generation for Large Language Models
Yizheng Huang
Jimmy X. Huang
3DVRALM
154
51
0
17 Apr 2024
Fewer Truncations Improve Language Modeling
Fewer Truncations Improve Language Modeling
Hantian Ding
Zijian Wang
Giovanni Paolini
Varun Kumar
Anoop Deoras
Dan Roth
Stefano Soatto
111
14
0
16 Apr 2024
Language Model Cascades: Token-level uncertainty and beyond
Language Model Cascades: Token-level uncertainty and beyond
Neha Gupta
Harikrishna Narasimhan
Wittawat Jitkrittum
A. S. Rawat
A. Menon
Sanjiv Kumar
UQLM
140
56
0
15 Apr 2024
Modelling Language
Modelling Language
J. Grindrod
49
5
0
15 Apr 2024
Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models
Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models
Tanmay Gautam
Youngsuk Park
Hao Zhou
Parameswaran Raman
Wooseok Ha
100
17
0
11 Apr 2024
MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference
MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference
Mobashir Sadat
Cornelia Caragea
80
5
0
11 Apr 2024
XNLIeu: a dataset for cross-lingual NLI in Basque
XNLIeu: a dataset for cross-lingual NLI in Basque
Maite Heredia
Julen Etxaniz
Muitze Zulaika
X. Saralegi
Jeremy Barnes
A. Soroa
38
1
0
10 Apr 2024
FairPair: A Robust Evaluation of Biases in Language Models through
  Paired Perturbations
FairPair: A Robust Evaluation of Biases in Language Models through Paired Perturbations
Jane Dwivedi-Yu
Raaz Dwivedi
Timo Schick
60
2
0
09 Apr 2024
Language Models on a Diet: Cost-Efficient Development of Encoders for
  Closely-Related Languages via Additional Pretraining
Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining
Nikola Ljubesic
Vít Suchomel
Peter Rupnik
Taja Kuzman
Rik van Noord
CLL
63
5
0
08 Apr 2024
PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for
  the Neural Processing of Portuguese
PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese
T. Osório
Bernardo Leite
Henrique Lopes Cardoso
Luís Gomes
João Rodrigues
Rodrigo Santos
António Branco
78
3
0
08 Apr 2024
Chart What I Say: Exploring Cross-Modality Prompt Alignment in
  AI-Assisted Chart Authoring
Chart What I Say: Exploring Cross-Modality Prompt Alignment in AI-Assisted Chart Authoring
Nazar Ponochevnyi
Anastasia Kuzminykh
80
1
0
07 Apr 2024
Data Bias According to Bipol: Men are Naturally Right and It is the Role
  of Women to Follow Their Lead
Data Bias According to Bipol: Men are Naturally Right and It is the Role of Women to Follow Their Lead
Irene Pagliai
G. V. Boven
Tosin Adewumi
Lama Alkhaled
Namrata Gurung
Isabella Sodergren
Elisa Barney
75
1
0
07 Apr 2024
Eigenpruning: an Interpretability-Inspired PEFT Method
Eigenpruning: an Interpretability-Inspired PEFT Method
Tomás Vergara-Browne
Álvaro Soto
A. Aizawa
86
1
0
04 Apr 2024
PRobELM: Plausibility Ranking Evaluation for Language Models
PRobELM: Plausibility Ranking Evaluation for Language Models
Moy Yuan
Chenxi Whitehouse
Eric Chamoun
Rami Aly
Andreas Vlachos
185
5
0
04 Apr 2024
BAdam: A Memory Efficient Full Parameter Optimization Method for Large
  Language Models
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
Qi Luo
Hengxu Yu
Xiao Li
82
6
0
03 Apr 2024
Deconstructing In-Context Learning: Understanding Prompts via Corruption
Deconstructing In-Context Learning: Understanding Prompts via Corruption
Namrata Shivagunde
Vladislav Lialin
Sherin Muckatira
Anna Rumshisky
83
3
0
02 Apr 2024
Fairness in Large Language Models: A Taxonomic Survey
Fairness in Large Language Models: A Taxonomic Survey
Zhibo Chu
Zichong Wang
Wenbin Zhang
AILaw
127
42
0
31 Mar 2024
Benchmark Transparency: Measuring the Impact of Data on Evaluation
Benchmark Transparency: Measuring the Impact of Data on Evaluation
Venelin Kovatchev
Matthew Lease
59
4
0
31 Mar 2024
A Controlled Reevaluation of Coreference Resolution Models
A Controlled Reevaluation of Coreference Resolution Models
Ian Porada
Xiyuan Zou
Jackie Chi Kit Cheung
85
1
0
31 Mar 2024
ReALM: Reference Resolution As Language Modeling
ReALM: Reference Resolution As Language Modeling
Joel Ruben Antony Moniz
Soundarya Krishnan
Melis Ozyildirim
Prathamesh Saraf
Halim Cagri Ates
Yuan-kang Zhang
Hong-ye Yu
Nidhi Rajshree
77
7
0
29 Mar 2024
Measuring Taiwanese Mandarin Language Understanding
Measuring Taiwanese Mandarin Language Understanding
Po-Heng Chen
Sijia Cheng
Wei-Lin Chen
Yen-Ting Lin
Yun-Nung Chen
ELM
119
2
0
29 Mar 2024
MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities of
  Large Language Models
MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities of Large Language Models
Peng Ding
Jiading Fang
Peng Li
Kangrui Wang
Xiaochen Zhou
Mo Yu
Jing Li
Matthew R. Walter
Hongyuan Mei
RALMELM
97
6
0
29 Mar 2024
Img2Loc: Revisiting Image Geolocalization using Multi-modality
  Foundation Models and Image-based Retrieval-Augmented Generation
Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation
Zhongliang Zhou
Jielu Zhang
Zihan Guan
Mengxuan Hu
Ni Lao
Lan Mu
Sheng Li
Gengchen Mai
VLM
151
17
0
28 Mar 2024
A Two-Phase Recall-and-Select Framework for Fast Model Selection
A Two-Phase Recall-and-Select Framework for Fast Model Selection
Jianwei Cui
Wenhang Shi
Honglin Tao
Wei Lu
Xiaoyong Du
89
0
0
28 Mar 2024
Targeted Visualization of the Backbone of Encoder LLMs
Targeted Visualization of the Backbone of Encoder LLMs
Isaac Roberts
Alexander Schulz
L. Hermes
Barbara Hammer
49
0
0
26 Mar 2024
Language Models for Text Classification: Is In-Context Learning Enough?
Language Models for Text Classification: Is In-Context Learning Enough?
A. Edwards
Jose Camacho-Collados
LRM
87
24
0
26 Mar 2024
Naive Bayes-based Context Extension for Large Language Models
Naive Bayes-based Context Extension for Large Language Models
Jianlin Su
Murtadha Ahmed
Wenbo Luo
Abhishek Rao
Denny Zhou
Hyeontaek Lim
69
6
0
26 Mar 2024
A Study on How Attention Scores in the BERT Model are Aware of Lexical
  Categories in Syntactic and Semantic Tasks on the GLUE Benchmark
A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark
Dongjun Jang
Sungjoo Byun
Hyopil Shin
45
1
0
25 Mar 2024
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language
  Models
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models
Zequan Liu
Jiawen Lyn
Wei-wei Zhu
Xing Tian
Yvette Graham
OffRL
111
18
0
24 Mar 2024
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for
  Vietnamese Natural Language Understanding
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding
Phong Nguyen-Thuan Do
Son Quoc Tran
Phu Gia Hoang
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
ELM
69
5
0
23 Mar 2024
ChatGPT Alternative Solutions: Large Language Models Survey
ChatGPT Alternative Solutions: Large Language Models Survey
H. Alipour
Nick Pendar
Kohinoor Roy
LM&MA
49
7
0
21 Mar 2024
Extracting Emotion Phrases from Tweets using BART
Extracting Emotion Phrases from Tweets using BART
Mahdi Rezapour
45
2
0
21 Mar 2024
Chain-of-Interaction: Enhancing Large Language Models for Psychiatric
  Behavior Understanding by Dyadic Contexts
Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts
Guangzeng Han
Weisi Liu
Xiaolei Huang
Brian Borsari
76
22
0
20 Mar 2024
Defending Against Indirect Prompt Injection Attacks With Spotlighting
Defending Against Indirect Prompt Injection Attacks With Spotlighting
Keegan Hines
Gary Lopez
Matthew Hall
Federico Zarfati
Yonatan Zunger
Emre Kiciman
AAMLSILM
97
51
0
20 Mar 2024
Pragmatic Competence Evaluation of Large Language Models for Korean
Pragmatic Competence Evaluation of Large Language Models for Korean
Dojun Park
Jiwoo Lee
Hyeyun Jeong
Seohyun Park
Sungeun Lee
ELM
63
2
0
19 Mar 2024
LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation
  Benchmark for Chinese Large Language Models
LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Deyi Xiong
ELM
115
0
0
19 Mar 2024
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias
  in Factual Knowledge Extraction
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction
Ziyang Xu
Keqin Peng
Liang Ding
Dacheng Tao
Xiliang Lu
74
10
0
15 Mar 2024
Previous
123...678...282930
Next