ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.03300
  4. Cited By
Measuring Massive Multitask Language Understanding
v1v2v3 (latest)

Measuring Massive Multitask Language Understanding

7 September 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
    ELMRALM
ArXiv (abs)PDFHTML

Papers citing "Measuring Massive Multitask Language Understanding"

50 / 3,408 papers shown
Title
GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models
GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models
Kunsheng Tang
Wenbo Zhou
Jie Zhang
Aishan Liu
Gelei Deng
Shuai Li
Peigui Qi
Weiming Zhang
Tianwei Zhang
Nenghai Yu
135
4
0
22 Aug 2024
Great Memory, Shallow Reasoning: Limits of $k$NN-LMs
Great Memory, Shallow Reasoning: Limits of kkkNN-LMs
Shangyi Geng
Wenting Zhao
Alexander M. Rush
RALMReLMLRM
93
2
0
21 Aug 2024
LLM Pruning and Distillation in Practice: The Minitron Approach
LLM Pruning and Distillation in Practice: The Minitron Approach
Sharath Turuvekere Sreenivas
Saurav Muralidharan
Raviraj Joshi
Marcin Chochowski
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
Jan Kautz
Pavlo Molchanov
98
36
0
21 Aug 2024
CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical
  Researcher
CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher
Derry Pratama
Naufal Suryanto
Andro Aprila Adiputra
Thi-Thu-Huong Le
Ahmada Yusril Kadiptya
Muhammad Iqbal
Howon Kim
82
9
0
21 Aug 2024
Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision
  and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring
  at Intersections
Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections
Ahmed S. Abdelrahman
Mohamed Abdel-Aty
Dongdong Wang
87
4
0
21 Aug 2024
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free
  Curricular Meaningful Learning
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Kai Xiong
Xiao Ding
Li Du
Jiahao Ying
Ting Liu
Bing Qin
Yixin Cao
97
2
0
21 Aug 2024
RAGLAB: A Modular and Research-Oriented Unified Framework for
  Retrieval-Augmented Generation
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
Xuanwang Zhang
Yunze Song
Yidong Wang
Shuyun Tang
Xinfeng Li
...
Wei Dong
Yue Zhang
Xinyu Dai
Shikun Zhang
Qingsong Wen
110
5
0
21 Aug 2024
SORSA: Singular Values and Orthonormal Regularized Singular Vectors
  Adaptation of Large Language Models
SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models
Yang Cao
154
2
0
21 Aug 2024
WeQA: A Benchmark for Retrieval Augmented Generation in Wind Energy Domain
WeQA: A Benchmark for Retrieval Augmented Generation in Wind Energy Domain
Rounak Meyur
Hung Phan
S. Wagle
Jan Strube
M. Halappanavar
Sameera Horawalavithana
Anurag Acharya
Sai Munikoti
93
1
0
21 Aug 2024
Beyond Labels: Aligning Large Language Models with Human-like Reasoning
Beyond Labels: Aligning Large Language Models with Human-like Reasoning
Muhammad Rafsan Kabir
Rafeed Mohammad Sultan
Ihsanul Haque Asif
Jawad Ibn Ahad
Fuad Rahman
Mohammad Ruhul Amin
Nabeel Mohammed
Shafin Rahman
LRM
89
2
0
20 Aug 2024
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications
J. Huang
Dong Li
Mengxi Xiao
Zihao Jiang
Yuzhe Yang
...
Benyou Wang
Alejandro Lopez-Lira
Qianqian Xie
Sophia Ananiadou
Junichi Tsujii
AIFinAI4TS
81
25
0
20 Aug 2024
Performance Law of Large Language Models
Performance Law of Large Language Models
Chuhan Wu
Ruiming Tang
LRM
82
2
0
19 Aug 2024
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for
  Efficient MoE Inference
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
Shuzhang Zhong
Ling Liang
Yuan Wang
Runsheng Wang
Ru Huang
Meng Li
MoE
60
12
0
19 Aug 2024
How to Make the Most of LLMs' Grammatical Knowledge for Acceptability Judgments
How to Make the Most of LLMs' Grammatical Knowledge for Acceptability Judgments
Yusuke Ide
Yuto Nishida
Miyu Oba
Miyu Oba
Justin Vasselli
Hidetaka Kamigaito
Taro Watanabe
131
0
0
19 Aug 2024
MoDeGPT: Modular Decomposition for Large Language Model Compression
MoDeGPT: Modular Decomposition for Large Language Model Compression
Chi-Heng Lin
Shangqian Gao
James Seale Smith
Abhishek Patel
Shikhar Tuli
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
168
13
0
19 Aug 2024
Activated Parameter Locating via Causal Intervention for Model Merging
Activated Parameter Locating via Causal Intervention for Model Merging
Fanshuang Kong
Richong Zhang
Ziqiao Wang
MoMe
44
2
0
18 Aug 2024
Fostering Natural Conversation in Large Language Models with NICO: a
  Natural Interactive COnversation dataset
Fostering Natural Conversation in Large Language Models with NICO: a Natural Interactive COnversation dataset
Renliang Sun
Mengyuan Liu
Shiping Yang
Rui Wang
Junqing He
Jiaxing Zhang
92
2
0
18 Aug 2024
CyberPal.AI: Empowering LLMs with Expert-Driven Cybersecurity
  Instructions
CyberPal.AI: Empowering LLMs with Expert-Driven Cybersecurity Instructions
Matan Levi
Yair Alluouche
Daniel Ohayon
Anton Puzanov
76
6
0
17 Aug 2024
Selective Prompt Anchoring for Code Generation
Selective Prompt Anchoring for Code Generation
Yuan Tian
Tianyi Zhang
267
3
0
17 Aug 2024
CogLM: Tracking Cognitive Development of Large Language Models
CogLM: Tracking Cognitive Development of Large Language Models
Xinglin Wang
Peiwen Yuan
Shaoxiong Feng
Yiwei Li
Boyuan Pan
Heda Wang
Yao Hu
Kan Li
ELM
160
1
0
17 Aug 2024
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering
  LLM Weaknesses
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Yulong Chen
Yang Liu
Jianhao Yan
X. Bai
Ming Zhong
Yinghao Yang
Ziyi Yang
Chenguang Zhu
Yue Zhang
ALMELM
83
11
0
16 Aug 2024
CIKMar: A Dual-Encoder Approach to Prompt-Based Reranking in Educational
  Dialogue Systems
CIKMar: A Dual-Encoder Approach to Prompt-Based Reranking in Educational Dialogue Systems
Joanito Agili Lopo
Marina Indah Prasasti
Alma Permatasari
68
0
0
16 Aug 2024
Reasoning Beyond Bias: A Study on Counterfactual Prompting and Chain of
  Thought Reasoning
Reasoning Beyond Bias: A Study on Counterfactual Prompting and Chain of Thought Reasoning
Kyle Moore
Jesse Roberts
Thao Pham
Douglas H. Fisher
LRM
51
4
0
16 Aug 2024
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
Do Xuan Long
Hai Nguyen Ngoc
Tiviatis Sim
Hieu Dao
Shafiq Joty
Kenji Kawaguchi
Nancy F. Chen
Min-Yen Kan
135
11
0
16 Aug 2024
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
Kaushal Kumar Maurya
KV Aditya Srivatsa
Ekaterina Kochmar
105
2
0
16 Aug 2024
Assessing and Enhancing Large Language Models in Rare Disease
  Question-answering
Assessing and Enhancing Large Language Models in Rare Disease Question-answering
Guanchu Wang
Junhao Ran
Ruixiang Tang
Chia-Yuan Chang
Chia-Yuan Chang
Yu-Neng Chuang
Zirui Liu
Vladimir Braverman
Zhandong Liu
Xia Hu
LM&MA
104
7
0
15 Aug 2024
Hermes 3 Technical Report
Hermes 3 Technical Report
Ryan Teknium
Jeffrey Quesnelle
Chen Guang
77
14
0
15 Aug 2024
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large
  Language Models
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Zhongyu Zhao
Menghang Dong
Rongyu Zhang
Wenzhao Zheng
Yunpeng Zhang
Huanrui Yang
Dalong Du
Kurt Keutzer
Shanghang Zhang
104
0
0
15 Aug 2024
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative
  Self-Enhancement Paradigm
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm
Yiming Liang
Ge Zhang
Xingwei Qu
Tianyu Zheng
Jiawei Guo
...
Jiaheng Liu
Chenghua Lin
Lei Ma
Wenhao Huang
Jiajun Zhang
ALM
126
11
0
15 Aug 2024
ArabLegalEval: A Multitask Benchmark for Assessing Arabic Legal
  Knowledge in Large Language Models
ArabLegalEval: A Multitask Benchmark for Assessing Arabic Legal Knowledge in Large Language Models
Faris Hijazi
Somayah Alharbi
Abdulaziz AlHussein
Harethah Shairah
Reem Alzahrani
Hebah Alshamlan
Omar Knio
G. Turkiyyah
AILawELM
87
4
0
15 Aug 2024
Can Large Language Models Understand Symbolic Graphics Programs?
Can Large Language Models Understand Symbolic Graphics Programs?
Zeju Qiu
Weiyang Liu
Haiwen Feng
Zhen Liu
Tim Z. Xiao
Katherine M. Collins
J. Tenenbaum
Adrian Weller
Michael J. Black
Bernhard Schölkopf
127
14
0
15 Aug 2024
Automated Design of Agentic Systems
Automated Design of Agentic Systems
Shengran Hu
Cong Lu
Jeff Clune
AI4CE
148
62
0
15 Aug 2024
Image-Based Leopard Seal Recognition: Approaches and Challenges in
  Current Automated Systems
Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
Jorge Yero Salazar
Pablo Rivas
Renato Borras-Chavez
Sarah Kienle
62
0
0
14 Aug 2024
Can Large Language Models Reason? A Characterization via 3-SAT
Can Large Language Models Reason? A Characterization via 3-SAT
Rishi Hazra
Gabriele Venturato
Pedro Zuidberg Dos Martires
Luc de Raedt
ELMReLMLRM
80
6
0
13 Aug 2024
A Survey on Model MoErging: Recycling and Routing Among Specialized
  Experts for Collaborative Learning
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
Prateek Yadav
Colin Raffel
Mohammed Muqeeth
Lucas Caccia
Haokun Liu
Tianlong Chen
Joey Tianyi Zhou
Leshem Choshen
Alessandro Sordoni
MoMe
120
25
0
13 Aug 2024
Layerwise Recurrent Router for Mixture-of-Experts
Layerwise Recurrent Router for Mixture-of-Experts
Zihan Qiu
Zeyu Huang
Shuang Cheng
Yizhi Zhou
Zili Wang
Ivan Titov
Jie Fu
MoE
155
2
0
13 Aug 2024
MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty
MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty
Yongjin Yang
Haneul Yoo
Hwaran Lee
165
4
0
13 Aug 2024
The advantages of context specific language models: the case of the Erasmian Language Model
The advantages of context specific language models: the case of the Erasmian Language Model
João Gonçalves
Nick Jelicic
Michele Murgia
Evert Stamhuis
101
0
0
13 Aug 2024
Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives
Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives
Zhihu Wang
Shiwan Zhao
Yu Wang
Heyuan Huang
Sitao Xie
Y. Zhang
Jiaxin Shi
Zhixing Wang
H. Li
Junchi Yan
LRM
124
6
0
13 Aug 2024
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced
  Data
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Haoran Sun
Renren Jin
Shaoyang Xu
Leiyu Pan
Supryadi
...
Lei Yang
Ling Shi
Juesi Xiao
Shaolin Zhu
Deyi Xiong
98
4
0
12 Aug 2024
Anchored Preference Optimization and Contrastive Revisions: Addressing
  Underspecification in Alignment
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
Karel DÓosterlinck
Winnie Xu
Chris Develder
Thomas Demeester
A. Singh
Christopher Potts
Douwe Kiela
Shikib Mehri
80
17
0
12 Aug 2024
Prompto: An open source library for asynchronous querying of LLM
  endpoints
Prompto: An open source library for asynchronous querying of LLM endpoints
Ryan Sze-Yin Chan
Federico Nanni
Edwin Brown
Ed Chapman
Angus R. Williams
Jonathan Bright
Evelina Gabasova
LRM
56
1
0
12 Aug 2024
Med42-v2: A Suite of Clinical LLMs
Med42-v2: A Suite of Clinical LLMs
Clément Christophe
Praveen K Kanithi
Tathagata Raha
Shadab Khan
Marco AF Pimentel
ELMLM&MAAI4MH
86
27
0
12 Aug 2024
LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference
LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference
Zhiwen Mo
Lei Wang
Jianyu Wei
Zhichen Zeng
Shijie Cao
...
Naifeng Jing
Ting Cao
Jilong Xue
Fan Yang
Mao Yang
120
0
0
12 Aug 2024
Post-Training Sparse Attention with Double Sparsity
Post-Training Sparse Attention with Double Sparsity
Shuo Yang
Ying Sheng
Joseph E. Gonzalez
Ion Stoica
Lianmin Zheng
103
12
0
11 Aug 2024
ProFuser: Progressive Fusion of Large Language Models
ProFuser: Progressive Fusion of Large Language Models
Tianyuan Shi
Fanqi Wan
Canbin Huang
Xiaojun Quan
Chenliang Li
Ming Yan
Ji Zhang
MoMe
67
3
0
09 Aug 2024
InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic
  Mathematical Reasoning
InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning
Bo-Wen Zhang
Yan Yan
Lin Li
Guang Liu
ReLMLRM
33
6
0
09 Aug 2024
GlitchProber: Advancing Effective Detection and Mitigation of Glitch
  Tokens in Large Language Models
GlitchProber: Advancing Effective Detection and Mitigation of Glitch Tokens in Large Language Models
Zhibo Zhang
Wuxia Bai
Yuxi Li
Max Meng
Kaidi Wang
Ling Shi
Li Li
Jun Wang
Haoyu Wang
71
4
0
09 Aug 2024
VITA: Towards Open-Source Interactive Omni Multimodal LLM
VITA: Towards Open-Source Interactive Omni Multimodal LLM
Chaoyou Fu
Haojia Lin
Zuwei Long
Yunhang Shen
Meng Zhao
...
Rongrong Ji
Xing Sun
Ran He
Caifeng Shan
Xing Sun
MLLM
140
96
0
09 Aug 2024
Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models
Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models
Zikai Xie
HILMLRM
154
7
0
09 Aug 2024
Previous
123...323334...676869
Next