Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.00537
Cited By
v1
v2
v3 (latest)
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
2 May 2019
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems"
50 / 1,500 papers shown
Title
Enhancing One-shot Pruned Pre-trained Language Models through Sparse-Dense-Sparse Mechanism
Guanchen Li
Xiandong Zhao
Lian Liu
Zeping Li
Dong Li
Lu Tian
Jie He
Ashish Sirasao
E. Barsoum
VLM
52
1
0
20 Aug 2024
A theory of understanding for artificial intelligence: composability, catalysts, and learning
Zijian Zhang
Sara Aronowitz
Alán Aspuru-Guzik
48
0
0
16 Aug 2024
Hermes 3 Technical Report
Ryan Teknium
Jeffrey Quesnelle
Chen Guang
77
13
0
15 Aug 2024
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference
R. Prabhakar
Hengrui Zhang
D. Wentzlaff
61
0
0
14 Aug 2024
Large Language Models Prompting With Episodic Memory
Dai Do
Quan Tran
Svetha Venkatesh
Hung Le
LLMAG
74
1
0
14 Aug 2024
Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
Verna Dankers
Ivan Titov
79
5
0
09 Aug 2024
Development of REGAI: Rubric Enabled Generative Artificial Intelligence
Zach Johnson
Jeremy Straub
93
1
0
05 Aug 2024
Long Input Benchmark for Russian Analysis
I. Churin
Murat Apishev
Maria Tikhonova
Denis Shevelev
Aydar Bulatov
Yuri Kuratov
Sergej Averkiev
Alena Fenogenova
45
1
0
05 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
81
0
0
04 Aug 2024
Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs
Afia Anjum
Maksim E. Eren
V. Setlur
Boian Alexandrov
Manish Bhattarai
67
2
0
02 Aug 2024
Distributed In-Context Learning under Non-IID Among Clients
Siqi Liang
Sumyeong Ahn
Jiayu Zhou
41
0
0
31 Jul 2024
Data Contamination Report from the 2024 CONDA Shared Task
Oscar Sainz
Iker García-Ferrero
Alon Jacovi
Jonas Hanselle
Yanai Elazar
...
Yu-Min Tseng
Vishaal Udandarao
Zengzhi Wang
Ruijie Xu
Jinglin Yang
109
6
0
31 Jul 2024
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian
S. Auriemma
Martina Miliani
Mauro Madeddu
Alessandro Bondielli
Lucia Passaro
Alessandro Lenci
63
0
0
30 Jul 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
127
16
0
27 Jul 2024
Benchmarks as Microscopes: A Call for Model Metrology
Michael Stephen Saxon
Ari Holtzman
Peter West
William Y. Wang
Naomi Saphra
112
13
0
22 Jul 2024
Compensate Quantization Errors+: Quantized Models Are Inquisitive Learners
Yifei Gao
Jie Ou
Lei Wang
Fanhua Shang
Jaji Wu
MQ
94
0
0
22 Jul 2024
TTSDS -- Text-to-Speech Distribution Score
Christoph Minixhofer
Ondˇrej Klejch
Peter Bell
85
0
0
17 Jul 2024
MSEval: A Dataset for Material Selection in Conceptual Design to Evaluate Algorithmic Models
Yash Jain
Daniele Grandi
Allin Groom
Brandon Cramer
Christopher McComb
65
0
0
12 Jul 2024
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Jessica Echterhoff
Fartash Faghri
Raviteja Vemulapalli
Ting-Yao Hu
Chun-Liang Li
Oncel Tuzel
Hadi Pouransari
KELM
97
2
0
12 Jul 2024
Evaluating AI Evaluation: Perils and Prospects
John Burden
ELM
98
9
0
12 Jul 2024
Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay
Gonçalo Hora de Carvalho
Oscar Knap
R. Pollice
ReLM
ELM
LRM
110
1
0
12 Jul 2024
Large Models of What? Mistaking Engineering Achievements for Human Linguistic Agency
Abeba Birhane
Marek McGann
61
7
0
11 Jul 2024
ROSA: Random Subspace Adaptation for Efficient Fine-Tuning
Marawan Gamal Abdel Hameed
Aristides Milios
Siva Reddy
Guillaume Rabusseau
CLL
52
3
0
10 Jul 2024
A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training
Michał Perełkiewicz
Rafał Poświata
71
3
0
10 Jul 2024
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules
Zhuocheng Gong
Ang Lv
Jian Guan
Junxi Yan
Wei Wu
Huishuai Zhang
Minlie Huang
Dongyan Zhao
Rui Yan
MoE
86
7
0
09 Jul 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
146
14
0
09 Jul 2024
Fostering Trust and Quantifying Value of AI and ML
Dalmo Cirne
Veena Calambur
26
0
0
08 Jul 2024
iSign: A Benchmark for Indian Sign Language Processing
Abhinav Joshi
Romit Mohanty
Mounika Kanakanti
Andesha Mangla
Sudeep Choudhary
Monali Barbate
Ashutosh Modi
VLM
68
4
0
07 Jul 2024
IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning
Abhinav Joshi
Shounak Paul
Akshat Sharma
Pawan Goyal
Saptarshi Ghosh
Ashutosh Modi
AILaw
ELM
71
12
0
07 Jul 2024
ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models
Xiyuan Zhou
Huan Zhao
Yuheng Cheng
Yuji Cao
Gaoqi Liang
Guolong Liu
Wenxuan Liu
Yan Xu
Junhua Zhao
ELM
83
6
0
07 Jul 2024
MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
Yuyan Chen
Zhihao Wen
Ge Fan
Zhengyu Chen
Wei Wu
Dayiheng Liu
Zhixu Li
Bang Liu
Yanghua Xiao
100
20
0
04 Jul 2024
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Amit Parekh
Nikolas Vitsakis
Alessandro Suglia
Ioannis Konstas
AAML
88
6
0
04 Jul 2024
Cognitive Modeling with Scaffolded LLMs: A Case Study of Referential Expression Generation
Polina Tsvilodub
Michael Franke
Fausto Carcassi
68
1
0
04 Jul 2024
Efficient Training of Language Models with Compact and Consistent Next Token Distributions
Ashutosh Sathe
Sunita Sarawagi
62
0
0
03 Jul 2024
Evaluating the Robustness of Adverse Drug Event Classification Models Using Templates
Dorothea MacPhail
David Harbecke
Lisa Raithel
Sebastian Möller
40
1
0
02 Jul 2024
Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning
Siwei Li
Yifan Yang
Yifei Shen
Fangyun Wei
Zongqing Lu
L. Qiu
Yuqing Yang
AI4CE
91
3
0
01 Jul 2024
Exploring Advanced Large Language Models with LLMsuite
Giorgio Roffo
LLMAG
36
0
0
01 Jul 2024
Data Generation Using Large Language Models for Text Classification: An Empirical Case Study
Yinheng Li
Rogerio Bonatti
Sara Abdali
Justin Wagle
K. Koishida
SyDa
91
7
0
27 Jun 2024
LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models
Shouchang Guo
Sonam Damani
Keng-hao Chang
VLM
51
1
0
27 Jun 2024
From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data
Zheyang Xiong
Vasilis Papageorgiou
Kangwook Lee
Dimitris Papailiopoulos
SyDa
RALM
94
13
0
27 Jun 2024
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning
Yifan Yang
Kai Zhen
Ershad Banijamal
Athanasios Mouchtaris
Zheng Zhang
71
9
0
26 Jun 2024
PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models
Huixuan Zhang
Yun Lin
Xiaojun Wan
132
0
0
26 Jun 2024
Autonomous Prompt Engineering in Large Language Models
Daan Kepel
Konstantina Valogianni
LLMAG
88
8
0
25 Jun 2024
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning
Somnath Basu Roy Chowdhury
Krzysztof Choromanski
Arijit Sehanobish
Avinava Dubey
Snigdha Chaturvedi
MU
106
10
0
24 Jun 2024
Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration
Zhongzhi Yu
Zheng Wang
Yonggan Fu
Huihong Shi
Khalid Shaikh
Yingyan Celine Lin
113
25
0
22 Jun 2024
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing
Han Jiang
Xiaoyuan Yi
Zhihua Wei
Ziang Xiao
Shu Wang
Xing Xie
ELM
ALM
160
8
0
20 Jun 2024
BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation
Minchong Li
Feng Zhou
Xiaohui Song
56
3
0
19 Jun 2024
Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning
Chenyuan Wu
Gangwei Jiang
Defu Lian
CLL
47
0
0
18 Jun 2024
Causal Discovery Inspired Unsupervised Domain Adaptation for Emotion-Cause Pair Extraction
Yuncheng Hua
Yujin Huang
Shuo Huang
Tao Feng
Zhuang Li
Chris Bain
R. Bassed
Gholamreza Haffari
CML
OOD
125
2
0
18 Jun 2024
Probing the Decision Boundaries of In-context Learning in Large Language Models
Siyan Zhao
Tung Nguyen
Aditya Grover
116
7
0
17 Jun 2024
Previous
1
2
3
4
5
6
...
28
29
30
Next