ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.03300
  4. Cited By
Measuring Massive Multitask Language Understanding
v1v2v3 (latest)

Measuring Massive Multitask Language Understanding

7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
    ELMRALM
ArXiv (abs)PDFHTML

Papers citing "Measuring Massive Multitask Language Understanding"

50 / 3,408 papers shown
Title
LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits
LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits
Duy Nguyen
Archiki Prasad
Elias Stengel-Eskin
Joey Tianyi Zhou
51
3
0
02 Oct 2024
Positional Attention: Expressivity and Learnability of Algorithmic Computation
Positional Attention: Expressivity and Learnability of Algorithmic Computation
George Giapitzakis
Artur Back de Luca
Shenghao Yang
Petar Veličković
Kimon Fountoulakis
159
0
0
02 Oct 2024
Endless Jailbreaks with Bijection Learning
Endless Jailbreaks with Bijection Learning
Brian R. Y. Huang
Maximilian Li
Leonard Tang
AAML
181
8
0
02 Oct 2024
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
Lucas Bandarkar
Benjamin Muller
Pritish Yuvraj
Rui Hou
Nayan Singhal
Hongjiang Lv
Bing-Quan Liu
KELMLRMMoMe
150
5
0
02 Oct 2024
DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models
DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models
Yuxuan Zhang
Ruizhe Li
MoMe
199
2
0
02 Oct 2024
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
Tung-Yu Wu
Pei-Yu Lo
ReLMLRM
131
2
0
02 Oct 2024
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and
  Reliability
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliability
Weitong Zhang
Chengqi Zang
Bernhard Kainz
70
0
0
01 Oct 2024
Addition is All You Need for Energy-efficient Language Models
Addition is All You Need for Energy-efficient Language Models
Hongyin Luo
Wei Sun
30
7
0
01 Oct 2024
Aligning Human and LLM Judgments: Insights from EvalAssist on
  Task-Specific Evaluations and AI-assisted Assessment Strategy Preferences
Aligning Human and LLM Judgments: Insights from EvalAssist on Task-Specific Evaluations and AI-assisted Assessment Strategy Preferences
Zahra Ashktorab
Michael Desmond
Qian Pan
James M. Johnson
Martin Santillan Cooper
Elizabeth M. Daly
Rahul Nair
Tejaswini Pedapati
Swapnaja Achintalwar
Werner Geyer
ELM
108
7
0
01 Oct 2024
BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and
  Multistructured Data
BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Xuwu Wang
Qiwen Cui
Yunzhe Tao
Yiran Wang
Ziwei Chai
...
Yufeng Zhang
Sirui Zheng
Quanzeng You
Yang Yang
Hongxia Yang
91
0
0
01 Oct 2024
FlipGuard: Defending Preference Alignment against Update Regression with
  Constrained Optimization
FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization
Mingye Zhu
Yi Liu
Quan Wang
Junbo Guo
Zhendong Mao
61
1
0
01 Oct 2024
MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture
  of Shards
MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards
Sheng Wang
Liheng Chen
Pengan Chen
Jingwei Dong
Boyang Xue
Jiyue Jiang
Lingpeng Kong
Chuan Wu
MoE
100
9
0
01 Oct 2024
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data
  Mining
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining
Vinayak Arannil
Neha Narwal
Sourav Sanjukta Bhabesh
Sai Nikhil Thirandas
Darren Yow-Bang Wang
Graham Horwood
Alex Anto Chirayath
Gouri Pandeshwar
106
0
0
30 Sep 2024
Are Large Language Models In-Context Personalized Summarizers? Get an
  iCOPERNICUS Test Done!
Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done!
Divya Patel
Pathik Patel
Ankush Chander
Sourish Dasgupta
Tanmoy Chakraborty
70
2
0
30 Sep 2024
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs
Mehdi Ali
Michael Fromm
Klaudia Thellmann
Jan Ebert
Alexander Arno Weber
...
René Jäkel
Georg Rehm
Stefan Kesselheim
Joachim Köhler
Nicolas Flores-Herr
100
7
0
30 Sep 2024
Instance-adaptive Zero-shot Chain-of-Thought Prompting
Instance-adaptive Zero-shot Chain-of-Thought Prompting
Xiaosong Yuan
Chen Shen
Shaotian Yan
Xiaofeng Zhang
Liang Xie
Wenxiao Wang
Renchu Guan
Ying Wang
Jieping Ye
ReLMLRM
103
8
0
30 Sep 2024
Wait, but Tylenol is Acetaminophen... Investigating and Improving
  Language Models' Ability to Resist Requests for Misinformation
Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation
Shan Chen
Mingye Gao
Kuleen Sasse
Thomas Hartvigsen
Brian Anthony
Lizhou Fan
Hugo J. W. L. Aerts
Jack Gallifant
Danielle S. Bitterman
LM&MA
86
1
0
30 Sep 2024
The Perfect Blend: Redefining RLHF with Mixture of Judges
The Perfect Blend: Redefining RLHF with Mixture of Judges
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
...
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang
139
14
0
30 Sep 2024
Reference Trustable Decoding: A Training-Free Augmentation Paradigm for
  Large Language Models
Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Luohe Shi
Yao Yao
Zuchao Li
Lefei Zhang
Hai Zhao
69
0
0
30 Sep 2024
Federated Instruction Tuning of LLMs with Domain Coverage Augmentation
Federated Instruction Tuning of LLMs with Domain Coverage Augmentation
Zezhou Wang
Yaxin Du
Zhuzhong Qian
Yugang Jiang
Zhuzhong Qian
Siheng Chen
FedML
526
1
0
30 Sep 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling
  Large Language Models
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Shuhao Chen
Weisen Jiang
Baijiong Lin
James T. Kwok
Yu Zhang
RALMMQ
113
13
0
30 Sep 2024
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
David Grangier
Simin Fan
Skyler Seto
Pierre Ablin
209
5
0
30 Sep 2024
Robust LLM safeguarding via refusal feature adversarial training
Robust LLM safeguarding via refusal feature adversarial training
L. Yu
Virginie Do
Karen Hambardzumyan
Nicola Cancedda
AAML
150
19
0
30 Sep 2024
SSR: Alignment-Aware Modality Connector for Speech Language Models
SSR: Alignment-Aware Modality Connector for Speech Language Models
Weiting Tan
Hirofumi Inaguma
Ning Dong
Paden Tomasello
Xutai Ma
128
6
0
30 Sep 2024
Calibrating Language Models with Adaptive Temperature Scaling
Calibrating Language Models with Adaptive Temperature Scaling
Johnathan Xie
Annie S. Chen
Yoonho Lee
Eric Mitchell
Chelsea Finn
65
17
0
29 Sep 2024
AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs
  for Astronomy
AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for Astronomy
Boyao Wang
Tuan Dung Nguyen
Hardik Arora
Alberto Accomazzi
Tirthankar Ghosal
Yuan-Sen Ting
58
1
0
29 Sep 2024
PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances
  Retrieval-Augmented Generation with Zero Inference Overhead
PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead
Tao Tan
Yining Qian
Ang Lv
Hongzhan Lin
Songhao Wu
Yongbo Wang
Feng Wang
Jingtong Wu
Xin Lu
Rui Yan
85
1
0
29 Sep 2024
Hyper-Connections
Hyper-Connections
Defa Zhu
Hongzhi Huang
Zihao Huang
Yutao Zeng
Yunyao Mao
Banggu Wu
Qiyang Min
Xun Zhou
89
6
0
29 Sep 2024
Responsible AI in Open Ecosystems: Reconciling Innovation with Risk
  Assessment and Disclosure
Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure
Mahasweta Chakraborti
Bert Joseph Prestoza
Nicholas Vincent
Seth Frey
86
1
0
27 Sep 2024
RepairBench: Leaderboard of Frontier Models for Program Repair
RepairBench: Leaderboard of Frontier Models for Program Repair
André Silva
Martin Monperrus
KELM
60
9
0
27 Sep 2024
Ruler: A Model-Agnostic Method to Control Generated Length for Large
  Language Models
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
Jiaming Li
Lei Zhang
Yunshui Li
Ziqiang Liu
Yuelin Bai
Run Luo
Longze Chen
Min Yang
ALM
44
0
0
27 Sep 2024
SciDFM: A Large Language Model with Mixture-of-Experts for Science
SciDFM: A Large Language Model with Mixture-of-Experts for Science
Liangtai Sun
Danyu Luo
Da Ma
Zihan Zhao
Baocai Chen
Zhennan Shen
Su Zhu
Lu Chen
Xin Chen
Kai Yu
MoE
56
2
0
27 Sep 2024
Mitigating Selection Bias with Node Pruning and Auxiliary Options
Mitigating Selection Bias with Node Pruning and Auxiliary Options
Hyeong Kyu Choi
Weijie Xu
Chi Xue
Stephanie Eckman
Chandan K. Reddy
92
2
0
27 Sep 2024
Predicting memorization within Large Language Models fine-tuned for classification
Predicting memorization within Large Language Models fine-tuned for classification
Jérémie Dentan
Davide Buscaldi
A. Shabou
Sonia Vanier
88
1
0
27 Sep 2024
AI Policy Projector: Grounding LLM Policy Design in Iterative Mapmaking
AI Policy Projector: Grounding LLM Policy Design in Iterative Mapmaking
Michelle S. Lam
Fred Hohman
Dominik Moritz
Jeffrey P. Bigham
Kenneth Holstein
Mary Beth Kery
78
1
0
26 Sep 2024
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A
  Survey
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey
Tiansheng Huang
Sihao Hu
Fatih Ilhan
Selim Furkan Tekin
Ling Liu
AAML
140
46
0
26 Sep 2024
DARE: Diverse Visual Question Answering with Robustness Evaluation
DARE: Diverse Visual Question Answering with Robustness Evaluation
Hannah Sterz
Jonas Pfeiffer
Ivan Vulić
OODVLM
41
2
0
26 Sep 2024
Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan
  Arabic Dialect
Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect
Guokan Shang
Hadi Abdine
Yousef Khoubrane
Amr Mohamed
Yassine Abbahaddou
...
Xuguang Ren
Eric Moulines
Preslav Nakov
Michalis Vazirgiannis
Eric Xing
88
6
0
26 Sep 2024
PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent
  Representation MOdification
PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent Representation MOdification
Tianfang Xie
Tianjing Li
Wei Zhu
Wei Han
Yi Zhao
85
5
0
26 Sep 2024
MIO: A Foundation Model on Multimodal Tokens
MIO: A Foundation Model on Multimodal Tokens
Zekun Wang
King Zhu
Chunpu Xu
Wangchunshu Zhou
Jiaheng Liu
...
Yuanxing Zhang
Ge Zhang
Ke Xu
Jie Fu
Wenhao Huang
MLLMAuLLM
175
12
0
26 Sep 2024
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Gongfan Fang
Hongxu Yin
Saurav Muralidharan
Greg Heinrich
Jeff Pool
Jan Kautz
Pavlo Molchanov
Xinchao Wang
73
10
0
26 Sep 2024
RED QUEEN: Safeguarding Large Language Models against Concealed Multi-Turn Jailbreaking
RED QUEEN: Safeguarding Large Language Models against Concealed Multi-Turn Jailbreaking
Yifan Jiang
Kriti Aggarwal
Tanmay Laud
Kashif Munir
Jay Pujara
Subhabrata Mukherjee
AAML
116
13
0
26 Sep 2024
An Adversarial Perspective on Machine Unlearning for AI Safety
An Adversarial Perspective on Machine Unlearning for AI Safety
Jakub Łucki
Boyi Wei
Yangsibo Huang
Peter Henderson
F. Tramèr
Javier Rando
MUAAML
206
53
0
26 Sep 2024
Post-hoc Reward Calibration: A Case Study on Length Bias
Post-hoc Reward Calibration: A Case Study on Length Bias
Zeyu Huang
Zihan Qiu
Zili Wang
Edoardo M. Ponti
Ivan Titov
94
6
0
25 Sep 2024
Data-Centric AI Governance: Addressing the Limitations of Model-Focused
  Policies
Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies
Ritwik Gupta
Leah Walker
Rodolfo Corona
Stephanie Fu
Suzanne Petryk
Janet Napolitano
Trevor Darrell
Andrew W. Reddie
ELM
83
5
0
25 Sep 2024
Harnessing Diversity for Important Data Selection in Pretraining Large
  Language Models
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models
Chi Zhang
Huaping Zhong
Kuan Zhang
Chengliang Chai
Rui Wang
...
Lei Cao
Ju Fan
Ye Yuan
Guoren Wang
Conghui He
TDI
105
10
0
25 Sep 2024
The Role of Language Models in Modern Healthcare: A Comprehensive Review
The Role of Language Models in Modern Healthcare: A Comprehensive Review
Amna Khalid
Ayma Khalid
Umar Khalid
LM&MA
68
0
0
25 Sep 2024
LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ
LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ
Marc-Antoine Allard
Matin Ansaripour
Maria Yuffa
Paul Teiletche
LRM
40
0
0
25 Sep 2024
CJEval: A Benchmark for Assessing Large Language Models Using Chinese
  Junior High School Exam Data
CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data
Qian-Wen Zhang
Haochen Wang
Fang Li
Siyu An
Lingfeng Qiao
Liangcai Gao
Di Yin
Xing Sun
ELMAI4Ed
61
0
0
24 Sep 2024
HelloBench: Evaluating Long Text Generation Capabilities of Large
  Language Models
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Haoran Que
Feiyu Duan
Liqun He
Yutao Mou
Wangchunshu Zhou
...
Ge Zhang
Junran Peng
Zhaoxiang Zhang
Songyang Zhang
Kai Chen
LM&MAELMVLM
106
16
0
24 Sep 2024
Previous
123...293031...676869
Next