Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.03300
Cited By
v1
v2
v3 (latest)
Measuring Massive Multitask Language Understanding
7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Measuring Massive Multitask Language Understanding"
50 / 3,408 papers shown
Title
Hybrid Student-Teacher Large Language Model Refinement for Cancer Toxicity Symptom Extraction
Reza Khanmohammadi
A. Ghanem
Kyle Verdecchia
Ryan Hall
Mohamed Elshaikh
...
Bing Luo
I. Chetty
Tuka Alhanai
Kundan Thind
Mohammad M. Ghassemi
91
0
0
08 Aug 2024
Better Alignment with Instruction Back-and-Forth Translation
Thao Nguyen
Jeffrey Li
Sewoong Oh
Ludwig Schmidt
Jason Weston
Luke Zettlemoyer
Xian Li
SyDa
88
7
0
08 Aug 2024
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation
Junde Wu
Jiayuan Zhu
Yunli Qi
72
41
0
08 Aug 2024
UNLEARN Efficient Removal of Knowledge in Large Language Models
Tyler Lizzo
Larry Heck
KELM
MoMe
MU
69
1
0
08 Aug 2024
Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation
Jingjing Xie
Yuxin Zhang
Mingbao Lin
Liujuan Cao
Rongrong Ji
MQ
82
5
0
07 Aug 2024
In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models
Ayrton San Joaquin
Bin Wang
Zhengyuan Liu
Nicholas Asher
Brian Lim
Philippe Muller
Nancy Chen
99
2
0
07 Aug 2024
EXAONE 3.0 7.8B Instruction Tuned Language Model
LG AI Research
:
Soyoung An
Kyunghoon Bae
Eunbi Choi
...
Boseong Seo
Sihoon Yang
Heuiyeen Yeen
Kyungjae Yoo
Hyeongu Yun
ELM
ALM
110
12
0
07 Aug 2024
MoExtend: Tuning New Experts for Modality and Task Extension
Shanshan Zhong
Shanghua Gao
Zhongzhan Huang
Wushao Wen
Marinka Zitnik
Pan Zhou
VLM
MLLM
MoE
113
7
0
07 Aug 2024
StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation
Boxi Cao
Mengjie Ren
Hongyu Lin
Xianpei Han
Feng Zhang
Junfeng Zhan
Le Sun
ELM
78
3
0
06 Aug 2024
Synthesizing Text-to-SQL Data from Weak and Strong LLMs
Jiaxi Yang
Binyuan Hui
Min Yang
Jian Yang
Junyang Lin
Chang Zhou
SyDa
102
34
0
06 Aug 2024
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
Le Yu
Bowen Yu
Haiyang Yu
Fei Huang
Yongbin Li
MoMe
79
8
0
06 Aug 2024
Accuracy and Consistency of LLMs in the Registered Dietitian Exam: The Impact of Prompt Engineering and Knowledge Retrieval
Iman Azimi
Mohan Qi
Li Wang
Amir M. Rahmani
Youlin Li
93
1
0
06 Aug 2024
Non-Determinism of "Deterministic" LLM Settings
Berk Atil
Alexa Chittams
Liseng Fu
Ferhan Ture
Lixinyu Xu
...
Tomasz Tudrej
Ferhan Ture
Zhe Wu
Lixinyu Xu
Breck Baldwin
119
6
0
06 Aug 2024
Development of REGAI: Rubric Enabled Generative Artificial Intelligence
Zach Johnson
Jeremy Straub
105
1
0
05 Aug 2024
Winning Amazon KDD Cup'24
Chris Deotte
Ivan Sorokin
Ahmet Erdem
Benedikt Schifferer
Gilberto Titericz Jr
Simon Jegou
32
0
0
05 Aug 2024
Pula: Training Large Language Models for Setswana
Nathan Brown
Vukosi Marivate
OSLM
63
0
0
05 Aug 2024
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios
Samuel Ackerman
Ella Rabinovich
E. Farchi
Ateret Anaby-Tavor
67
1
0
04 Aug 2024
Cross-layer Attention Sharing for Large Language Models
Yongyu Mu
Yuzhang Wu
Yuchun Fan
Chenglong Wang
Hengyu Li
Qiaozhi He
Murun Yang
Tong Xiao
Jingbo Zhu
87
5
0
04 Aug 2024
Coalitions of Large Language Models Increase the Robustness of AI Agents
Prattyush Mangal
Carol Mak
Theo Kanakis
Timothy Donovan
Dave Braines
Edward Pyzer-Knapp
53
1
0
02 Aug 2024
MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
Yunwen Xia
Hui Fang
Emmanouil Benetos
Jie Zhang
Chong Long
Dmitry Bogdanov
AuLLM
101
22
0
02 Aug 2024
FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only
He Zhu
Junyou Su
Tianle Lun
Yicheng Tao
Wenjia Zhang
Zipei Fan
Guanhua Chen
ALM
86
5
0
02 Aug 2024
BioRAG: A RAG-LLM Framework for Biological Question Reasoning
Chengrui Wang
Qingqing Long
Meng Xiao
Xunxin Cai
Chengjun Wu
Xuezhi Wang
Yuanchun Zhou
Yuanchun Zhou
110
30
0
02 Aug 2024
Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions
Jin Gao
Lei Gan
Yuankai Li
Yixin Ye
Dequan Wang
73
3
0
02 Aug 2024
Bridging Information Gaps in Dialogues With Grounded Exchanges Using Knowledge Graphs
Phillip Schneider
Nektarios Machner
Kristiina Jokinen
Florian Matthes
59
1
0
02 Aug 2024
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
Leo Micklem
Yan-Bin Shen
Wenjing Luo
Yan Zhang
Hao Liang
...
Weipeng Chen
Bin Cui
Blair Thornton
Wentao Zhang
Guosheng Dong
ELM
142
21
0
02 Aug 2024
Hybrid Querying Over Relational Databases and Large Language Models
T. Pham
Cody T. Reynolds
A. El Abbadi
93
1
0
01 Aug 2024
Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions
Guangzhi Xiong
Qiao Jin
Xiao Wang
Minjia Zhang
Zhiyong Lu
Aidong Zhang
RALM
144
36
0
01 Aug 2024
Intermittent Semi-working Mask: A New Masking Paradigm for LLMs
Mingcong Lu
Jiangcai Zhu
Wang Hao
Zheng Li
Shusheng Zhang
Kailai Shao
Chao Chen
Nan Li
Feng Wang
Xin Lu
67
0
0
01 Aug 2024
Tamper-Resistant Safeguards for Open-Weight LLMs
Rishub Tamirisa
Bhrugu Bharathi
Long Phan
Andy Zhou
Alice Gatti
...
Andy Zou
Dawn Song
Bo Li
Dan Hendrycks
Mantas Mazeika
AAML
MU
133
63
0
01 Aug 2024
Gemma 2: Improving Open Language Models at a Practical Size
Gemma Team
Gemma Team Morgane Riviere
Shreya Pathak
Pier Giuseppe Sessa
Cassidy Hardin
...
Noah Fiedel
Armand Joulin
Kathleen Kenealy
Robert Dadashi
Alek Andreev
VLM
MoE
OSLM
151
924
0
31 Jul 2024
Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
Richard Ren
Steven Basart
Adam Khoja
Alice Gatti
Long Phan
...
Alexander Pan
Gabriel Mukobi
Ryan H. Kim
Stephen Fitz
Dan Hendrycks
ELM
87
25
0
31 Jul 2024
PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning
Min Jae Jung
Romain Rouvoy
KELM
MoE
CLL
85
4
0
31 Jul 2024
Data Contamination Report from the 2024 CONDA Shared Task
Oscar Sainz
Iker García-Ferrero
Alon Jacovi
Jonas Hanselle
Yanai Elazar
...
Yu-Min Tseng
Vishaal Udandarao
Zengzhi Wang
Ruijie Xu
Jinglin Yang
121
6
0
31 Jul 2024
How to Measure the Intelligence of Large Language Models?
Nils Korber
Silvan Wehrli
Christopher Irrgang
ELM
ALM
90
0
0
30 Jul 2024
Meltemi: The first open Large Language Model for Greek
Leon Voukoutis
Dimitris Roussis
Georgios Paraskevopoulos
Sokratis Sofianopoulos
Prokopis Prokopidis
Vassilis Papavasileiou
Athanasios Katsamanis
Stelios Piperidis
Vassilis Katsouros
VLM
77
9
0
30 Jul 2024
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training
Weiyu Huang
Yuezhou Hu
Guohao Jian
Jun Zhu
Jianfei Chen
107
8
0
30 Jul 2024
Machine Unlearning in Generative AI: A Survey
Zheyuan Liu
Guangyao Dou
Zhaoxuan Tan
Yijun Tian
Meng Jiang
MU
109
19
0
30 Jul 2024
Automated Review Generation Method Based on Large Language Models
Shican Wu
Xiao Ma
Dehui Luo
Lulu Li
Xiangcheng Shi
...
Ran Luo
Chunlei Pei
Zhijian Zhao
Zhi-Jian Zhao
Jinlong Gong
173
0
0
30 Jul 2024
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
Yupeng Chen
Senmiao Wang
Zhihang Lin
Zhihang Lin
Yushun Zhang
Tian Ding
Ruoyu Sun
Ruoyu Sun
CLL
177
5
0
30 Jul 2024
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks
Marco AF Pimentel
Clément Christophe
Tathagata Raha
Prateek Munjal
Praveen K Kanithi
Shadab Khan
ELM
77
3
0
29 Jul 2024
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages
Wenxuan Zhang
Hou Pong Chan
Yiran Zhao
Mahani Aljunied
Jianyu Wang
...
Zhiqiang Hu
Weiwen Xu
Yew Ken Chia
Xin Li
Li Bing
LRM
145
15
0
29 Jul 2024
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain
Pierre Colombo
T. Pires
Malik Boudiaf
Rui Melo
Dominic Culver
Sofia Morgado
Etienne Malaboeuf
Gabriel Hautreux
Johanne Charpentier
Michael Desa
ELM
AILaw
ALM
98
17
0
28 Jul 2024
Parameter-Efficient Fine-Tuning via Circular Convolution
Aochuan Chen
Jiashun Cheng
Zijing Liu
Ziqi Gao
Fugee Tsung
Yu-Feng Li
Jia Li
151
3
0
27 Jul 2024
Effective Large Language Model Debugging with Best-first Tree Search
Jialin Song
Jonathan Raiman
Bryan Catanzaro
LRM
87
0
0
26 Jul 2024
Towards Effective and Efficient Continual Pre-training of Large Language Models
Jie Chen
Zhipeng Chen
Jiapeng Wang
Kun Zhou
Yutao Zhu
...
Rui Yan
Zhewei Wei
Di Hu
Wenbing Huang
Ji-Rong Wen
KELM
ALM
CLL
ELM
LRM
337
6
0
26 Jul 2024
Scaling Trends in Language Model Robustness
Nikolhaus Howe
Michal Zajac
I. R. McKenzie
Oskar Hollinsworth
Tom Tseng
Aaron David Tucker
Pierre-Luc Bacon
Adam Gleave
181
1
0
25 Jul 2024
Accurate and Efficient Fine-Tuning of Quantized Large Language Models Through Optimal Balance
Ao Shen
Qiang Wang
Zhiquan Lai
Xionglve Li
Dongsheng Li
ALM
MQ
61
1
0
24 Jul 2024
ScholarChemQA: Unveiling the Power of Language Models in Chemical Research Question Answering
Preslav Nakov
Tairan Wang
Taicheng Guo
Kehan Guo
Juexiao Zhou
Haoyang Li
Mingchen Zhuge
Jürgen Schmidhuber
Xin Gao
Xiangliang Zhang
95
3
0
24 Jul 2024
Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design
Jared Quincy Davis
Boris Hanin
Lingjiao Chen
Peter Bailis
Ion Stoica
Matei A. Zaharia
65
8
0
23 Jul 2024
Course-Correction: Safety Alignment Using Synthetic Preferences
Rongwu Xu
Yishuo Cai
Zhenhong Zhou
Renjie Gu
Haiqin Weng
Yan Liu
Tianwei Zhang
Wei Xu
Han Qiu
76
7
0
23 Jul 2024
Previous
1
2
3
...
33
34
35
...
67
68
69
Next