ResearchTrend.AI
arXiv:1905.00537
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems

2 May 2019
Alex Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
    ELM

Papers citing "SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"

50 / 1,500 papers shown
Enhancing One-shot Pruned Pre-trained Language Models through Sparse-Dense-Sparse Mechanism
Guanchen Li
Xiandong Zhao
Lian Liu
Zeping Li
Dong Li
Lu Tian
Jie He
Ashish Sirasao
E. Barsoum
VLM
52
1
0
20 Aug 2024
A theory of understanding for artificial intelligence: composability, catalysts, and learning
Zijian Zhang
Sara Aronowitz
Alán Aspuru-Guzik
48
0
0
16 Aug 2024
Hermes 3 Technical Report
Ryan Teknium
Jeffrey Quesnelle
Chen Guang
77
13
0
15 Aug 2024
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference
R. Prabhakar
Hengrui Zhang
D. Wentzlaff
61
0
0
14 Aug 2024
Large Language Models Prompting With Episodic Memory
Dai Do
Quan Tran
Svetha Venkatesh
Hung Le
LLMAG
74
1
0
14 Aug 2024
Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
Verna Dankers
Ivan Titov
79
5
0
09 Aug 2024
Development of REGAI: Rubric Enabled Generative Artificial Intelligence
Zach Johnson
Jeremy Straub
93
1
0
05 Aug 2024
Long Input Benchmark for Russian Analysis
I. Churin
Murat Apishev
Maria Tikhonova
Denis Shevelev
Aydar Bulatov
Yuri Kuratov
Sergej Averkiev
Alena Fenogenova
45
1
0
05 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
81
0
0
04 Aug 2024
Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs
Afia Anjum
Maksim E. Eren
V. Setlur
Boian Alexandrov
Manish Bhattarai
67
2
0
02 Aug 2024
Distributed In-Context Learning under Non-IID Among Clients
Siqi Liang
Sumyeong Ahn
Jiayu Zhou
41
0
0
31 Jul 2024
Data Contamination Report from the 2024 CONDA Shared Task
Oscar Sainz
Iker García-Ferrero
Alon Jacovi
Jonas Hanselle
Yanai Elazar
...
Yu-Min Tseng
Vishaal Udandarao
Zengzhi Wang
Ruijie Xu
Jinglin Yang
109
6
0
31 Jul 2024
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian
S. Auriemma
Martina Miliani
Mauro Madeddu
Alessandro Bondielli
Lucia Passaro
Alessandro Lenci
63
0
0
30 Jul 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
127
16
0
27 Jul 2024
Benchmarks as Microscopes: A Call for Model Metrology
Michael Stephen Saxon
Ari Holtzman
Peter West
William Y. Wang
Naomi Saphra
112
13
0
22 Jul 2024
Compensate Quantization Errors+: Quantized Models Are Inquisitive Learners
Yifei Gao
Jie Ou
Lei Wang
Fanhua Shang
Jaji Wu
MQ
94
0
0
22 Jul 2024
TTSDS -- Text-to-Speech Distribution Score
Christoph Minixhofer
Ondřej Klejch
Peter Bell
85
0
0
17 Jul 2024
MSEval: A Dataset for Material Selection in Conceptual Design to Evaluate Algorithmic Models
Yash Jain
Daniele Grandi
Allin Groom
Brandon Cramer
Christopher McComb
65
0
0
12 Jul 2024
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Jessica Echterhoff
Fartash Faghri
Raviteja Vemulapalli
Ting-Yao Hu
Chun-Liang Li
Oncel Tuzel
Hadi Pouransari
KELM
97
2
0
12 Jul 2024
Evaluating AI Evaluation: Perils and Prospects
John Burden
ELM
98
9
0
12 Jul 2024
Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay
Gonçalo Hora de Carvalho
Oscar Knap
R. Pollice
ReLM ELM LRM
110
1
0
12 Jul 2024
Large Models of What? Mistaking Engineering Achievements for Human Linguistic Agency
Abeba Birhane
Marek McGann
61
7
0
11 Jul 2024
ROSA: Random Subspace Adaptation for Efficient Fine-Tuning
Marawan Gamal Abdel Hameed
Aristides Milios
Siva Reddy
Guillaume Rabusseau
CLL
52
3
0
10 Jul 2024
A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training
Michał Perełkiewicz
Rafał Poświata
71
3
0
10 Jul 2024
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules
Zhuocheng Gong
Ang Lv
Jian Guan
Junxi Yan
Wei Wu
Huishuai Zhang
Minlie Huang
Dongyan Zhao
Rui Yan
MoE
86
7
0
09 Jul 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
146
14
0
09 Jul 2024
Fostering Trust and Quantifying Value of AI and ML
Dalmo Cirne
Veena Calambur
26
0
0
08 Jul 2024
iSign: A Benchmark for Indian Sign Language Processing
Abhinav Joshi
Romit Mohanty
Mounika Kanakanti
Andesha Mangla
Sudeep Choudhary
Monali Barbate
Ashutosh Modi
VLM
68
4
0
07 Jul 2024
IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning
Abhinav Joshi
Shounak Paul
Akshat Sharma
Pawan Goyal
Saptarshi Ghosh
Ashutosh Modi
AILaw ELM
71
12
0
07 Jul 2024
ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models
Xiyuan Zhou
Huan Zhao
Yuheng Cheng
Yuji Cao
Gaoqi Liang
Guolong Liu
Wenxuan Liu
Yan Xu
Junhua Zhao
ELM
83
6
0
07 Jul 2024
MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
Yuyan Chen
Zhihao Wen
Ge Fan
Zhengyu Chen
Wei Wu
Dayiheng Liu
Zhixu Li
Bang Liu
Yanghua Xiao
100
20
0
04 Jul 2024
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Amit Parekh
Nikolas Vitsakis
Alessandro Suglia
Ioannis Konstas
AAML
88
6
0
04 Jul 2024
Cognitive Modeling with Scaffolded LLMs: A Case Study of Referential Expression Generation
Polina Tsvilodub
Michael Franke
Fausto Carcassi
68
1
0
04 Jul 2024
Efficient Training of Language Models with Compact and Consistent Next Token Distributions
Ashutosh Sathe
Sunita Sarawagi
62
0
0
03 Jul 2024
Evaluating the Robustness of Adverse Drug Event Classification Models Using Templates
Dorothea MacPhail
David Harbecke
Lisa Raithel
Sebastian Möller
40
1
0
02 Jul 2024
Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning
Siwei Li
Yifan Yang
Yifei Shen
Fangyun Wei
Zongqing Lu
L. Qiu
Yuqing Yang
AI4CE
91
3
0
01 Jul 2024
Exploring Advanced Large Language Models with LLMsuite
Giorgio Roffo
LLMAG
36
0
0
01 Jul 2024
Data Generation Using Large Language Models for Text Classification: An Empirical Case Study
Yinheng Li
Rogerio Bonatti
Sara Abdali
Justin Wagle
K. Koishida
SyDa
91
7
0
27 Jun 2024
LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models
Shouchang Guo
Sonam Damani
Keng-hao Chang
VLM
51
1
0
27 Jun 2024
From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data
Zheyang Xiong
Vasilis Papageorgiou
Kangwook Lee
Dimitris Papailiopoulos
SyDa RALM
94
13
0
27 Jun 2024
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning
Yifan Yang
Kai Zhen
Ershad Banijamal
Athanasios Mouchtaris
Zheng Zhang
71
9
0
26 Jun 2024
PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models
Huixuan Zhang
Yun Lin
Xiaojun Wan
132
0
0
26 Jun 2024
Autonomous Prompt Engineering in Large Language Models
Daan Kepel
Konstantina Valogianni
LLMAG
88
8
0
25 Jun 2024
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning
Somnath Basu Roy Chowdhury
Krzysztof Choromanski
Arijit Sehanobish
Avinava Dubey
Snigdha Chaturvedi
MU
106
10
0
24 Jun 2024
Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration
Zhongzhi Yu
Zheng Wang
Yonggan Fu
Huihong Shi
Khalid Shaikh
Yingyan Celine Lin
113
25
0
22 Jun 2024
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing
Han Jiang
Xiaoyuan Yi
Zhihua Wei
Ziang Xiao
Shu Wang
Xing Xie
ELM ALM
160
8
0
20 Jun 2024
BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation
Minchong Li
Feng Zhou
Xiaohui Song
56
3
0
19 Jun 2024
Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning
Chenyuan Wu
Gangwei Jiang
Defu Lian
CLL
47
0
0
18 Jun 2024
Causal Discovery Inspired Unsupervised Domain Adaptation for Emotion-Cause Pair Extraction
Yuncheng Hua
Yujin Huang
Shuo Huang
Tao Feng
Zhuang Li
Chris Bain
R. Bassed
Gholamreza Haffari
CML OOD
125
2
0
18 Jun 2024
Probing the Decision Boundaries of In-context Learning in Large Language Models
Siyan Zhao
Tung Nguyen
Aditya Grover
116
7
0
17 Jun 2024