ResearchTrend.AI
Measuring Massive Multitask Language Understanding

7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELM, RALM

Papers citing "Measuring Massive Multitask Language Understanding"

50 / 3,408 papers shown
RNR: Teaching Large Language Models to Follow Roles and Rules
Kuan-Chieh Wang
Alexander Bukharin
Haoming Jiang
Qingyu Yin
Zhengyang Wang
...
Chao Zhang
Bing Yin
Xian Li
Jianshu Chen
Shiyang Li
ALM
84
2
0
10 Sep 2024
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
Jaeseong Lee
Seung-won Hwang
Aurick Qiao
Daniel F Campos
Z. Yao
Yuxiong He
65
3
0
10 Sep 2024
Accelerating Large Language Model Pretraining via LFR Pedagogy: Learn, Focus, and Review
Neha Prakriya
Jui-Nan Yen
Cho-Jui Hsieh
Jason Cong
KELM, AI4CE, LRM
107
1
0
10 Sep 2024
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation
Ilya Gusev
LLMAG
130
3
0
10 Sep 2024
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection
Joymallya Chakraborty
Wei Xia
Anirban Majumder
Dan Ma
Walid Chaabene
Naveed Janvekar
49
3
0
09 Sep 2024
FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations
Ziyao Wang
Zheyu Shen
Yexiao He
Guoheng Sun
Hongyi Wang
Lingjuan Lyu
Ang Li
93
49
0
09 Sep 2024
$\mathbb{USCD}$: Improving Code Generation of LLMs by Uncertainty-Aware Selective Contrastive Decoding
Shuai Wang
Liang Ding
Li Shen
Yong Luo
Zheng He
Wei Yu
Dacheng Tao
94
2
0
09 Sep 2024
Evaluating Large Language Models with Tests of Spanish as a Foreign Language: Pass or Fail?
Marina Mayor-Rocher
Nina Melero
Elena Merino-Gómez
María Grandury
Javier Conde
Pedro Reviriego
ELM
55
1
0
08 Sep 2024
ELMS: Elasticized Large Language Models On Mobile Devices
Wangsong Yin
Rongjie Yi
Daliang Xu
Gang Huang
Mengwei Xu
Xuanzhe Liu
80
6
0
08 Sep 2024
Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models
Sonam Gupta
Yatin Nandwani
Asaf Yehudai
Mayank Mishra
Gaurav Pandey
Dinesh Raghu
Sachindra Joshi
LRM
84
2
0
07 Sep 2024
Sparse Rewards Can Self-Train Dialogue Agents
B. Lattimer
Varun Gangal
Ryan McDonald
Yi Yang
LLMAG
96
2
0
06 Sep 2024
Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning
Xinyue Liu
Harshita Diddee
Daphne Ippolito
ALM
54
3
0
06 Sep 2024
Residual Stream Analysis with Multi-Layer SAEs
Tim Lawson
Lucy Farnik
Conor Houghton
Laurence Aitchison
95
5
0
06 Sep 2024
Attention Heads of Large Language Models: A Survey
Zifan Zheng
Yezhaohui Wang
Yuxin Huang
Shichao Song
Mingchuan Yang
Bo Tang
Feiyu Xiong
Zhiyu Li
LRM
119
29
0
05 Sep 2024
The representation landscape of few-shot learning and fine-tuning in large language models
Diego Doimo
Alessandro Serra
A. Ansuini
Alberto Cazzaniga
137
4
0
05 Sep 2024
Understanding LLM Development Through Longitudinal Study: Insights from the Open Ko-LLM Leaderboard
Chanjun Park
Hyeonwoo Kim
LRM
113
1
0
05 Sep 2024
Towards a Unified View of Preference Learning for Large Language Models: A Survey
Bofei Gao
Feifan Song
Yibo Miao
Zefan Cai
Zhiyong Yang
...
Houfeng Wang
Zhifang Sui
Peiyi Wang
Baobao Chang
163
14
0
04 Sep 2024
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation
Ingo Ziegler
Abdullatif Köksal
Desmond Elliott
Hinrich Schütze
80
6
0
03 Sep 2024
Interpreting and Improving Large Language Models in Arithmetic Calculation
Wei Zhang
Chaoqun Wan
Yonggang Zhang
Yiu-ming Cheung
Xinmei Tian
Xu Shen
Jieping Ye
LRM
109
22
0
03 Sep 2024
Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture
Chen-Chi Chang
Ching-Yuan Chen
Hung-Shin Lee
Chih-Cheng Lee
78
3
0
03 Sep 2024
From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning
Wei Chen
Zhen Huang
Liang Xie
Binbin Lin
Houqiang Li
...
Deng Cai
Yonggang Zhang
Wenxiao Wang
Xu Shen
Jieping Ye
152
10
0
03 Sep 2024
DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective Partitioning
Keer Lu
Xiaonan Nie
Zhuoran Zhang
Zheng Liang
Da Pan
...
Weipeng Chen
Guosheng Dong
Bin Cui
Wentao Zhang
109
0
0
02 Sep 2024
ToolACE: Winning the Points of LLM Function Calling
Weiwen Liu
Xiaolin Huang
Xingshan Zeng
Xinlong Hao
Shuai Yu
...
Xin Jiang
Ruiming Tang
Defu Lian
Qun Liu
Enhong Chen
LLMAG
112
48
0
02 Sep 2024
Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Blair Yang
Fuyang Cui
Keiran Paster
Jimmy Ba
Pashootan Vaezipoor
Silviu Pitis
Michael Ruogu Zhang
84
1
0
01 Sep 2024
Hyper-Compression: Model Compression via Hyperfunction
Fenglei Fan
Juntong Fan
Dayang Wang
Jingbo Zhang
Zelin Dong
Shijun Zhang
Ge Wang
Tieyong Zeng
116
0
0
01 Sep 2024
Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
Jasper Dekoninck
Maximilian Baader
Martin Vechev
ALM
189
0
0
01 Sep 2024
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
Zhiyuan Hu
Yuliang Liu
Jinman Zhao
Suyuchen Wang
Yan Wang
...
Qing Gu
Anh Tuan Luu
See-Kiong Ng
Zhiwei Jiang
Bryan Hooi
152
13
0
31 Aug 2024
Does Alignment Tuning Really Break LLMs' Internal Confidence?
Hongseok Oh
Wonseok Hwang
137
0
0
31 Aug 2024
CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models
Jonathan Bourne
130
4
0
30 Aug 2024
Modularity in Transformers: Investigating Neuron Separability & Specialization
Nicholas Pochinkov
Thomas Jones
Mohammed Rashidur Rahman
58
0
0
30 Aug 2024
Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts
Rhui Dih Lee
L. Wynter
R. Ganti
MoE
96
1
0
30 Aug 2024
Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning
Maxime Méloux
Christophe Cerisara
KELM, CLL
91
0
0
30 Aug 2024
InkubaLM: A small language model for low-resource African languages
A. Tonja
Bonaventure F. P. Dossou
Jessica Ojo
Jenalea Rajab
Fadel Thior
...
Anuoluwapo Aremu
Pelonomi Moiloa
Jade Z. Abbott
Vukosi Marivate
Benjamin Rosman
100
11
0
30 Aug 2024
A Survey for Large Language Models in Biomedicine
Chong Wang
Mengyao Li
Junjun He
Zhongruo Wang
Erfan Darzi
...
Yi Yu
Pietro Liò
Tianyun Wang
Yu Guang Wang
Yiqing Shen
LM&MA
136
13
0
29 Aug 2024
Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts
Nikolas Gritsch
Qizhen Zhang
Acyr Locatelli
Sara Hooker
Ahmet Üstün
MoE
91
3
0
28 Aug 2024
Language Adaptation on a Tight Academic Compute Budget: Tokenizer Swapping Works and Pure bfloat16 Is Enough
Konstantin Dobler
Gerard de Melo
85
1
0
28 Aug 2024
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Jiayi Gui
Yiming Liu
Jiale Cheng
Xiaotao Gu
Xiao-Yang Liu
Hongning Wang
Yuxiao Dong
Jie Tang
Minlie Huang
ELM, LLMAG, LRM
97
7
0
28 Aug 2024
LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models
Max Ploner
Jacek Wiland
Sebastian Pohl
Alan Akbik
KELM
73
2
0
28 Aug 2024
An Investigation of Warning Erroneous Chat Translations in Cross-lingual Communication
Yunmeng Li
Jun Suzuki
Makoto Morishita
Kaori Abe
Kentaro Inui
123
1
0
28 Aug 2024
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline
Bin Cui
Zheng Liang
Yiding Sun
Da Pan
Zhuoran Zhang
...
Bingning Wang
Wentao Zhang
Jiaxin Mao
Guosheng Dong
Weipeng Chen
ALM
69
3
0
27 Aug 2024
TourSynbio: A Multi-Modal Large Model and Agent Framework to Bridge Text and Protein Sequences for Protein Engineering
Yiqing Shen
Zan Chen
Michail Mamalakis
Yungeng Liu
Tianbin Li
Yanzhou Su
Junjun He
Pietro Liò
Yu Guang Wang
LLMAG
86
9
0
27 Aug 2024
AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems
Chi-Min Chan
Jianxuan Yu
Weize Chen
Chunyang Jiang
Xinyu Liu
Weijie Shi
Zhiyuan Liu
Wei Xue
Yike Guo
LLMAG
85
3
0
27 Aug 2024
IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering
Ruosen Li
Barry Wang
Ruochen Li
Xinya Du
ELM
86
6
0
24 Aug 2024
LalaEval: A Holistic Human Evaluation Framework for Domain-Specific Large Language Models
Chongyan Sun
Ken Lin
Shiwei Wang
Hulong Wu
Chengfei Fu
Zhen Wang
ALM
40
2
0
23 Aug 2024
LLM-PBE: Assessing Data Privacy in Large Language Models
Qinbin Li
Junyuan Hong
Chulin Xie
Jeffrey Tan
Rachel Xin
...
Dan Hendrycks
Zhangyang Wang
Bo Li
Bingsheng He
Dawn Song
ELM, PILM
118
18
0
23 Aug 2024
IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities
Bin Wang
Chunyu Xie
Dawei Leng
Yuhui Yin
MLLM
181
1
0
23 Aug 2024
Building and better understanding vision-language models: insights and future directions
Hugo Laurençon
Andrés Marafioti
Victor Sanh
Léo Tronchon
VLM
138
78
0
22 Aug 2024
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Jamba Team
Barak Lenz
Alan Arazi
Amir Bergman
Avshalom Manevich
...
Yehoshua Cohen
Yonatan Belinkov
Y. Globerson
Yuval Peleg Levy
Y. Shoham
114
33
0
22 Aug 2024
Large Language Models Are Self-Taught Reasoners: Enhancing LLM Applications via Tailored Problem-Solving Demonstrations
Kai Tzu-iunn Ong
Taeyoon Kwon
Jinyoung Yeo
LRM
61
1
0
22 Aug 2024
Toward the Evaluation of Large Language Models Considering Score Variance across Instruction Templates
Yusuke Sakai
Adam Nohejl
Jiangnan Hang
Hidetaka Kamigaito
Taro Watanabe
ELM
136
5
0
22 Aug 2024