ResearchTrend.AI
Measuring Massive Multitask Language Understanding

7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELM, RALM

Papers citing "Measuring Massive Multitask Language Understanding"

50 / 3,408 papers shown
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Zeqiang Lai
Xizhou Zhu
Jifeng Dai
Yu Qiao
Wenhai Wang
MLLM, DiffM
105
24
0
11 Oct 2023
Evaluating Large Language Models at Evaluating Instruction Following
Zhiyuan Zeng
Jiatong Yu
Tianyu Gao
Yu Meng
Tanya Goyal
Danqi Chen
ELM, ALM
148
192
0
11 Oct 2023
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models
Sumuk Shashidhar
Abhinav Chinta
Vaibhav Sahai
Zhenhailong Wang
Heng Ji
77
10
0
11 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Jindong Wang
Xing Xie
Zheng Zhang
Yue Zhang
HILM, KELM
172
201
0
11 Oct 2023
Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models
Yuchong Sun
Che Liu
Kun Zhou
Jinwen Huang
Ruihua Song
Xin Zhao
Fuzheng Zhang
Di Zhang
Kun Gai
LRM
76
11
0
11 Oct 2023
Exploring the landscape of large language models in medical question answering
Andrew M. Bean
Karolina Korgul
Felix H. Krones
Robert McCraith
Adam Mahdi
LM&MA, ELM
18
1
0
11 Oct 2023
OpsEval: A Comprehensive IT Operations Benchmark Suite for Large Language Models
Yuhe Liu
Changhua Pei
Longlong Xu
Bohan Chen
Mingze Sun
...
Gaogang Xie
Xidao Wen
Xiaohui Nie
Minghua Ma
Dan Pei
ELM
56
2
0
11 Oct 2023
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources
Zhikai Li
Xiaoxuan Liu
Banghua Zhu
Zhen Dong
Qingyi Gu
Kurt Keutzer
MQ
104
7
0
11 Oct 2023
Lemur: Harmonizing Natural Language and Code for Language Agents
Yiheng Xu
Hongjin Su
Chen Xing
Boyu Mi
Qian Liu
...
Siheng Zhao
Lingpeng Kong
Bailin Wang
Caiming Xiong
Tao Yu
99
74
0
10 Oct 2023
Mistral 7B
Albert Q. Jiang
Alexandre Sablayrolles
A. Mensch
Chris Bamford
Devendra Singh Chaplot
...
Teven Le Scao
Thibaut Lavril
Thomas Wang
Timothée Lacroix
William El Sayed
MoE, LRM
157
2,263
0
10 Oct 2023
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
Keiran Paster
Marco Dos Santos
Zhangir Azerbayev
Jimmy Ba
LRM
86
93
0
10 Oct 2023
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
Xiao Wang
Yuan Zhang
Tianze Chen
Songyang Gao
Senjie Jin
...
Rui Zheng
Yicheng Zou
Tao Gui
Qi Zhang
Xuanjing Huang
ALM, LRM, CLL
91
23
0
10 Oct 2023
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Mengzhou Xia
Tianyu Gao
Zhiyuan Zeng
Danqi Chen
125
311
0
10 Oct 2023
Making Large Language Models Perform Better in Knowledge Graph Completion
Yichi Zhang
Zhuo Chen
Lingbing Guo
Yajing Xu
Wen Zhang
Hua-zeng Chen
101
53
0
10 Oct 2023
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models
Letian Zhang
Xiaotong Zhai
Zhongkai Zhao
Yongshuo Zong
Xin Wen
Bingchen Zhao
LRM
38
0
0
10 Oct 2023
Let Models Speak Ciphers: Multiagent Debate through Embeddings
Chau Pham
Boyi Liu
Yingxiang Yang
Zhengyu Chen
Tianyi Liu
Jianbo Yuan
Bryan A. Plummer
Zhaoran Wang
Hongxia Yang
LLMAG
102
19
0
10 Oct 2023
Compressing Context to Enhance Inference Efficiency of Large Language Models
Yucheng Li
Bo Dong
Chenghua Lin
Frank Guerin
63
73
0
09 Oct 2023
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models
Huaixiu Steven Zheng
Swaroop Mishra
Xinyun Chen
Heng-Tze Cheng
Ed H. Chi
Quoc V. Le
Denny Zhou
RALM, LRM
91
126
0
09 Oct 2023
FireAct: Toward Language Agent Fine-tuning
Baian Chen
Chang Shu
Ehsan Shareghi
Nigel Collier
Karthik Narasimhan
Shunyu Yao
ALM, LLMAG
177
112
0
09 Oct 2023
NEFTune: Noisy Embeddings Improve Instruction Finetuning
Neel Jain
Ping Yeh-Chiang
Yuxin Wen
John Kirchenbauer
Hong-Min Chu
...
Avi Schwarzschild
Aniruddha Saha
Micah Goldblum
Jonas Geiping
Tom Goldstein
84
81
0
09 Oct 2023
Guiding Language Model Math Reasoning with Planning Tokens
Xinyi Wang
Lucas Caccia
O. Ostapenko
Xingdi Yuan
William Yang Wang
Alessandro Sordoni
LRM
91
2
0
09 Oct 2023
Scaling Laws of RoPE-based Extrapolation
Xiaoran Liu
Hang Yan
Shuo Zhang
Chen An
Xipeng Qiu
Dahua Lin
89
89
0
08 Oct 2023
Do Large Language Models Know about Facts?
Xuming Hu
Junzhe Chen
Xiaochuan Li
Yingxin Lai
Lijie Wen
Philip S. Yu
Zhijiang Guo
HILM, KELM
71
54
0
08 Oct 2023
Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models
Song Guo
Jiahang Xu
Li Zhang
Mao Yang
87
15
0
08 Oct 2023
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
Fajri Koto
Nurul Aisyah
Haonan Li
Timothy Baldwin
AI4Ed, LRM, ELM
104
46
0
07 Oct 2023
Critique Ability of Large Language Models
Liangchen Luo
Zi Lin
Yinxiao Liu
Lei Shu
Yun Zhu
Jingbo Shang
Lei Meng
AI4MH, LRM, ELM
65
16
0
07 Oct 2023
EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling
Siyu Ren
Zhiyong Wu
Kenny Q. Zhu
72
4
0
07 Oct 2023
ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models
Iman Mirzadeh
Keivan Alizadeh-Vahid
Sachin Mehta
C. C. D. Mundo
Oncel Tuzel
Golnoosh Samei
Mohammad Rastegari
Mehrdad Farajtabar
188
74
0
06 Oct 2023
Ada-Instruct: Adapting Instruction Generators for Complex Reasoning
Wanyun Cui
Qianle Wang
LRM
92
9
0
06 Oct 2023
Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models
Wenbei Xie
LRM
64
2
0
06 Oct 2023
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
Ke Wang
Houxing Ren
Aojun Zhou
Zimu Lu
Sichun Luo
Weikang Shi
Renrui Zhang
Linqi Song
Mingjie Zhan
Hongsheng Li
ReLM, LRM, SyDa
122
106
0
05 Oct 2023
InstructProtein: Aligning Human and Protein Language via Knowledge Instruction
Zeyuan Wang
Qiang Zhang
Keyan Ding
Ming Qin
Zhuang Xiang
Xiaotong Li
Huajun Chen
104
32
0
05 Oct 2023
Predicting Emergent Abilities with Infinite Resolution Evaluation
Shengding Hu
Xin Liu
Xu Han
Xinrong Zhang
Chaoqun He
...
Ning Ding
Zebin Ou
Guoyang Zeng
Zhiyuan Liu
Maosong Sun
ELM, LRM
69
15
0
05 Oct 2023
Deep Variational Multivariate Information Bottleneck -- A Framework for Variational Losses
Eslam Abdelaleem
I. Nemenman
K. M. Martini
122
6
0
05 Oct 2023
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
VLM
92
14
0
04 Oct 2023
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
Chang Gao
Wenxuan Zhang
Guizhen Chen
Wai Lam
224
6
0
04 Oct 2023
Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
Xianjun Yang
Xiao Wang
Qi Zhang
Linda R. Petzold
William Y. Wang
Xun Zhao
Dahua Lin
83
190
0
04 Oct 2023
ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Zejun Li
Ye Wang
Mengfei Du
Qingwen Liu
Binhao Wu
...
Zhihao Fan
Jie Fu
Jingjing Chen
Xuanjing Huang
Zhongyu Wei
118
15
0
04 Oct 2023
NOLA: Compressing LoRA using Linear Combination of Random Basis
Soroush Abbasi Koohpayegani
K. Navaneet
Parsa Nooralinejad
Soheil Kolouri
Hamed Pirsiavash
141
16
0
04 Oct 2023
Ask Again, Then Fail: Large Language Models' Vacillations in Judgment
Qiming Xie
Zengzhi Wang
Yi Feng
Rui Xia
AAML, HILM
111
9
0
03 Oct 2023
Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization
Zijun Liu
Yanzhe Zhang
Peng Li
Yang Liu
Diyi Yang
LLMAG
98
128
0
03 Oct 2023
Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View
Jintian Zhang
Xin Xu
Ningyu Zhang
Ruibo Liu
Bryan Hooi
Shumin Deng
LLMAG
123
149
0
03 Oct 2023
Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations
Yongshuo Zong
Tingyang Yu
Ruchika Chavhan
Bingchen Zhao
Timothy M. Hospedales
MLLM, AAML, LRM
76
20
0
02 Oct 2023
SmartPlay: A Benchmark for LLMs as Intelligent Agents
Yue Wu
Xuan Tang
Tom Michael Mitchell
Yuanzhi Li
ELM, LLMAG
133
73
0
02 Oct 2023
Fusing Models with Complementary Expertise
Hongyi Wang
Felipe Maia Polo
Yuekai Sun
Souvik Kundu
Eric Xing
Mikhail Yurochkin
FedML, MoMe
94
33
0
02 Oct 2023
Compressing LLMs: The Truth is Rarely Pure and Never Simple
Ajay Jaiswal
Zhe Gan
Xianzhi Du
Bowen Zhang
Zhangyang Wang
Yinfei Yang
MQ
128
50
0
02 Oct 2023
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Xi Lin
Xilun Chen
Mingda Chen
Weijia Shi
Maria Lomeli
...
Jacob Kahn
Gergely Szilvasy
Mike Lewis
Luke Zettlemoyer
Scott Yih
RALM
157
157
0
02 Oct 2023
Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games
Yizhe Zhang
Jiarui Lu
Navdeep Jaitly
LRM, ELM
73
13
0
02 Oct 2023
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
Qingqing Cao
Sewon Min
Yizhong Wang
Hannaneh Hajishirzi
MQ, RALM
81
4
0
02 Oct 2023
PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning
Tianci Xue
Ziqi Wang
Yixia Li
Yun-Nung Chen
Guanhua Chen
66
2
0
02 Oct 2023