ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.03300
  4. Cited By
Measuring Massive Multitask Language Understanding
v1v2v3 (latest)

Measuring Massive Multitask Language Understanding

7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
    ELMRALM
ArXiv (abs)PDFHTML

Papers citing "Measuring Massive Multitask Language Understanding"

50 / 3,408 papers shown
Title
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Ling Team
B. Zeng
Chenyu Huang
Chao Zhang
Changxin Tian
...
Zhaoxin Huan
Zujie Wen
Zhenhang Sun
Zhuoxuan Du
Z. He
MoEALM
196
5
0
07 Mar 2025
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance
Bryan Etzine
Masoud Hashemi
Nishanth Madhusudhan
Sagar Davasam
Roshnee Sharma
Sathwik Tejaswi Madhusudhan
Vikas Yadav
72
0
0
07 Mar 2025
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts
Shwai He
Weilin Cai
Jiayi Huang
Ang Li
MoE
189
2
0
07 Mar 2025
Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs
Zara Siddique
Irtaza Khalid
Liam D. Turner
Luis Espinosa-Anke
LLMSV
161
2
0
07 Mar 2025
Adding Alignment Control to Language Models
Wenhong Zhu
Weinan Zhang
Rui Wang
124
0
0
06 Mar 2025
Universality of Layer-Level Entropy-Weighted Quantization Beyond Model Architecture and Size
Alireza Behtash
Marijan Fofonjka
Ethan Baird
Tyler Mauer
Hossein Moghimifam
David Stout
Joel Dennison
MQ
135
1
0
06 Mar 2025
Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English
Runtao Zhou
Guangya Wan
Saadia Gabriel
Sheng Li
Alexander J Gates
Maarten Sap
Thomas Hartvigsen
LRM
172
2
0
06 Mar 2025
Efficient Algorithms for Verifying Kruskal Rank in Sparse Linear Regression and Related Applications
Fengqin Zhou
121
6
0
06 Mar 2025
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
Zhijian Zhuo
Yutao Zeng
Ya Wang
Sijun Zhang
Jian Yang
Xiaoqing Li
Xun Zhou
Jinwen Ma
114
0
0
06 Mar 2025
The Challenge of Identifying the Origin of Black-Box Large Language Models
Ziqing Yang
Yixin Wu
Yun Shen
Wei Dai
Michael Backes
Yang Zhang
AAML
79
1
0
06 Mar 2025
Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models
Benyamin Jamialahmadi
Parsa Kavehzadeh
Mehdi Rezagholizadeh
Parsa Farinneya
Hossein Rajabzadeh
A. Jafari
Boxing Chen
Marzieh S. Tahaei
92
0
0
06 Mar 2025
Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment
Wen Yang
Junhong Wu
Chen Wang
Chengqing Zong
J.N. Zhang
159
1
0
06 Mar 2025
Shaping Shared Languages: Human and Large Language Models' Inductive Biases in Emergent Communication
Shaping Shared Languages: Human and Large Language Models' Inductive Biases in Emergent Communication
Tom Kouwenhoven
Max Peeperkorn
R. D. Kleijn
Tessa Verhoef
110
0
0
06 Mar 2025
LLMs Can Generate a Better Answer by Aggregating Their Own Responses
LLMs Can Generate a Better Answer by Aggregating Their Own Responses
Zichong Li
Xinyu Feng
Yuheng Cai
Zixuan Zhang
Tianyi Liu
Chen Liang
Weizhu Chen
Haoyu Wang
Tiejun Zhao
LRM
115
2
0
06 Mar 2025
A Practical Memory Injection Attack against LLM Agents
Shen Dong
Shaocheng Xu
Pengfei He
Yuchen Li
Jiliang Tang
Tianming Liu
Hui Liu
Zhen Xiang
LLMAGAAML
94
4
0
05 Mar 2025
Improving LLM Safety Alignment with Dual-Objective Optimization
Improving LLM Safety Alignment with Dual-Objective Optimization
Xuandong Zhao
Will Cai
Tianneng Shi
David Huang
Licong Lin
Song Mei
Dawn Song
AAMLMU
207
5
0
05 Mar 2025
Framing the Game: How Context Shapes LLM Decision-Making
Isaac Robinson
John Burden
73
0
0
05 Mar 2025
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Guangyi Liu
Shuo Tang
Rui Ge
Yaxin Du
Zhenfei Yin
Tian Jin
Jing Shao
LLMAG
151
7
0
05 Mar 2025
Robust Learning of Diverse Code Edits
Robust Learning of Diverse Code Edits
Tushar Aggarwal
Swayam Singh
Abhijeet Awasthi
Aditya Kanade
Nagarajan Natarajan
SyDa
545
0
0
05 Mar 2025
Targeted Distillation for Sentiment Analysis
Yice Zhang
Guangyu Xie
Jingjie Lin
Jianzhu Bao
Qianlong Wang
Xi Zeng
Ruifeng Xu
87
0
0
05 Mar 2025
Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation
Hiroshi Takahashi
Tomoharu Iwata
Atsutoshi Kumagai
Yuuki Yamanaka
Tomoya Yamashita
DiffM
125
0
0
05 Mar 2025
Extrapolation Merging: Keep Improving With Extrapolation and Merging
Yiguan Lin
Bin Xu
Yinghao Li
Yang Gao
MoMe
95
1
0
05 Mar 2025
Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions
Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions
Emmy Liu
Amanda Bertsch
Lintang Sutawika
Lindia Tjuatja
Patrick Fernandes
...
Siyang Song
Carolin (Haas) Lawrence
Aditi Raghunathan
Kiril Gashteovski
Graham Neubig
275
3
0
05 Mar 2025
Token-Level Privacy in Large Language Models
Reém Harel
Niv Gilboa
Yuval Pinter
89
0
0
05 Mar 2025
Large Language Models as Natural Selector for Embodied Soft Robot Design
Changhe Chen
Xiaohao Xu
Xiangdong Wang
Xiaonan Huang
LLMAGLM&Ro
103
0
0
04 Mar 2025
OWLViz: An Open-World Benchmark for Visual Question Answering
OWLViz: An Open-World Benchmark for Visual Question Answering
T. Nguyen
Dang Nguyen
Hoang Nguyen
Thuan Luong
Long Hoang Dang
Viet Dac Lai
VLM
97
0
0
04 Mar 2025
LLM-Safety Evaluations Lack Robustness
Tim Beyer
Sophie Xhonneux
Simon Geisler
Gauthier Gidel
Leo Schwinn
Stephan Günnemann
ALMELM
485
2
0
04 Mar 2025
EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports
L. Moukheiber
Mira Moukheiber
Dana Moukheiiber
Jae-Woo Ju
Hyung-Chul Lee
LM&MA
136
0
0
04 Mar 2025
Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm
Zehan Li
Yuhao Du
Xiaoqi Jiao
Yiwen Guo
Yuege Feng
Xiang Wan
Anningzhe Gao
Jinpeng Hu
98
0
0
04 Mar 2025
Invisible Strings: Revealing Latent Dancer-to-Dancer Interactions with Graph Neural Networks
Luis Zerkowski
Zixuan Wang
I. Vidrin
M. Pettee
61
1
0
04 Mar 2025
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
Jianghao Chen
Junhong Wu
Yangyifan Xu
J.N. Zhang
105
1
0
04 Mar 2025
Answer, Refuse, or Guess? Investigating Risk-Aware Decision Making in Language Models
Cheng-Kuang Wu
Zhi Rui Tam
Chieh-Yen Lin
Yun-Nung Chen
Hung-yi Lee
76
0
0
03 Mar 2025
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
Abdelrahman Abouelenin
Atabak Ashfaq
Adam Atkinson
Hany Awadalla
Nguyen Bach
...
Ishmam Zabir
Yunan Zhang
Li Zhang
Yanzhe Zhang
Xiren Zhou
MoESyDa
122
70
0
03 Mar 2025
Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding
Yun Wang
Pei Zhang
Siyuan Huang
Baosong Yang
Zizhuo Zhang
Fei Huang
Rui Wang
BDLLRM
139
11
0
03 Mar 2025
Large-Scale Data Selection for Instruction Tuning
Large-Scale Data Selection for Instruction Tuning
Hamish Ivison
Muru Zhang
Faeze Brahman
Pang Wei Koh
Pradeep Dasigi
ALM
112
5
0
03 Mar 2025
What do Large Language Models Say About Animals? Investigating Risks of Animal Harm in Generated Text
What do Large Language Models Say About Animals? Investigating Risks of Animal Harm in Generated Text
Arturs Kanepajs
Aditi Basu
Sankalpa Ghose
Constance Li
Akshat Mehta
Ronak Mehta
Samuel David Tucker-Davis
Eric Zhou
Bob Fischer
Jacy Reese Anthis
ELMALM
149
1
0
03 Mar 2025
Enhancing Non-English Capabilities of English-Centric Large Language Models through Deep Supervision Fine-Tuning
Wenshuai Huo
Xiaocheng Feng
Yichong Huang
Chengpeng Fu
Baohang Li
...
Dandan Tu
Duyu Tang
Yunfei Lu
Hui Wang
Bing Qin
99
4
0
03 Mar 2025
KoWit-24: A Richly Annotated Dataset of Wordplay in News Headlines
Alexander Baranov
Anna Palatkina
Yulia Makovka
Pavel Braslavski
212
0
0
03 Mar 2025
AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Nicholas Carlini
Javier Rando
Edoardo Debenedetti
Milad Nasr
F. Tramèr
AAMLELM
92
3
0
03 Mar 2025
None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering
Zhi Rui Tam
Cheng-Kuang Wu
Chieh-Yen Lin
Yun-Nung Chen
100
2
0
03 Mar 2025
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation
Eliya Habba
Ofir Arviv
Itay Itzhak
Yotam Perlitz
Elron Bandel
Leshem Choshen
Michal Shmueli-Scheuer
Gabriel Stanovsky
129
5
0
03 Mar 2025
KurTail : Kurtosis-based LLM Quantization
Mohammad Sadegh Akhondzadeh
Aleksandar Bojchevski
E. Eleftheriou
M. Dazzi
MQ
79
0
0
03 Mar 2025
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
Lu Dai
Yijie Xu
Jinhui Ye
Hao Liu
Hui Xiong
3DVRALM
209
3
0
03 Mar 2025
TMIQ: Quantifying Test and Measurement Domain Intelligence in Large Language Models
Emmanuel A. Olowe
Danial Chitnis
LM&MA
64
0
0
03 Mar 2025
RSQ: Learning from Important Tokens Leads to Better Quantized LLMs
Yi-Lin Sung
Prateek Yadav
Jialu Li
Jaehong Yoon
Joey Tianyi Zhou
MQ
101
1
0
03 Mar 2025
Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh
Fajri Koto
Rituraj Joshi
Nurdaulet Mukhituly
Yanjie Wang
Zhuohan Xie
...
Avraham Sheinin
Natalia Vassilieva
Neha Sengupta
Larry Murray
Preslav Nakov
ALMKELM
132
0
0
03 Mar 2025
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
Kashun Shum
Yuanmin Huang
Hongjian Zou
Qi Ding
Yixuan Liao
Xiao Chen
Qian Liu
Junxian He
180
4
0
02 Mar 2025
Evaluating Polish linguistic and cultural competency in large language models
Sławomir Dadas
Małgorzata Grębowiec
Michał Perełkiewicz
Rafał Poświata
ELM
105
2
0
02 Mar 2025
Pseudo-Knowledge Graph: Meta-Path Guided Retrieval and In-Graph Text for RAG-Equipped LLM
Yuxin Yang
Haoyang Wu
Tao Wang
Jia-Qi Yang
Hao Ma
Guojie Luo
RALM
65
0
0
01 Mar 2025
LoR2C : Low-Rank Residual Connection Adaptation for Parameter-Efficient Fine-Tuning
Jiancheng Zhao
Xingda Yu
Yuxiang Zhang
Zhen Yang
OffRL
80
0
0
01 Mar 2025
Previous
123...151617...676869
Next