Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
v1
v2
v3
v4
v5 (latest)
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks
Zihao Zhao
Chenxiao Fan
Chongming Gao
Fuli Feng
Xiangnan He
LM&MA
AI4MH
116
1
0
05 Mar 2025
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
Huang Huang
Fangchen Liu
Letian Fu
Tingfan Wu
Mustafa Mukadam
Jitendra Malik
Ken Goldberg
Pieter Abbeel
LM&Ro
VLM
184
10
0
05 Mar 2025
Implicit Bias in LLMs: A Survey
Xinru Lin
Luyang Li
101
3
0
04 Mar 2025
AugFL: Augmenting Federated Learning with Pretrained Models
Sheng Yue
Zerui Qin
Yongheng Deng
Ju Ren
Yaoxue Zhang
Junshan Zhang
FedML
135
0
0
04 Mar 2025
Mamba base PKD for efficient knowledge compression
José Medina
Amnir Hadachi
Paul Honeine
Abdelaziz Bensrhair
Mamba
79
0
0
03 Mar 2025
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
Yifang Chen
Xuyang Guo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
107
3
0
03 Mar 2025
Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace
Jia-Chen Zhang
Yu-Jie Xiong
Chun-Ming Xia
Dong-Hai Zhu
Xi-He Qiu
113
4
0
03 Mar 2025
AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Nicholas Carlini
Javier Rando
Edoardo Debenedetti
Milad Nasr
F. Tramèr
AAML
ELM
92
3
0
03 Mar 2025
SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity
Xiangyu Xi
Deyang Kong
Jian Yang
Jiawei Yang
Zheyu Chen
Wei Wang
Jinqiao Wang
Xunliang Cai
Shikun Zhang
Wei Ye
113
0
0
03 Mar 2025
Instruct-of-Reflection: Enhancing Large Language Models Iterative Reflection Capabilities via Dynamic-Meta Instruction
Liping Liu
Chunhong Zhang
Likang Wu
Chuang Zhao
Zheng Hu
Ming He
Jianping Fan
LLMAG
LRM
75
2
0
02 Mar 2025
From Vague Instructions to Task Plans: A Feedback-Driven HRC Task Planning Framework based on LLMs
Afagh Mehri Shervedani
Matthew R. Walter
Milos Zefran
88
0
0
02 Mar 2025
ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer
Omer Goldman
Uri Shaham
Dan Malkin
Sivan Eiger
Avinatan Hassidim
...
Shruti Rijhwani
Laura Rimell
Idan Szpektor
Reut Tsarfaty
Matan Eyal
125
7
0
28 Feb 2025
Adaptive Keyframe Sampling for Long Video Understanding
Xi Tang
Jihao Qiu
Lingxi Xie
Yunjie Tian
Jianbin Jiao
Qixiang Ye
125
5
0
28 Feb 2025
RTGen: Real-Time Generative Detection Transformer
Chi Ruan
ObjD
VLM
82
0
0
28 Feb 2025
ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs
Hao Ge
Junda Feng
Qi Huang
Fangcheng Fu
Xiaonan Nie
Lei Zuo
Yanghua Peng
Tengjiao Wang
Xin Liu
104
2
0
28 Feb 2025
NaijaNLP: A Survey of Nigerian Low-Resource Languages
Isa Inuwa-Dutse
109
0
0
27 Feb 2025
FedMentalCare: Towards Privacy-Preserving Fine-Tuned LLMs to Analyze Mental Health Status Using Federated Learning Framework
S M Sarwar
AI4MH
77
1
0
27 Feb 2025
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
Shalev Lifshitz
Sheila A. McIlraith
Yilun Du
LRM
138
8
0
27 Feb 2025
AoECR: AI-ization of Elderly Care Robot
Linkun Zhou
Jian Li
Yadong Mo
Wei Wei
Ying Zhang
Shimin Wei
58
0
0
27 Feb 2025
Towards Label-Only Membership Inference Attack against Pre-trained Large Language Models
Yu He
Boheng Li
Lu Liu
Zhongjie Ba
Wei Dong
Yiming Li
Zhan Qin
Kui Ren
Chong Chen
MIALM
183
3
0
26 Feb 2025
Can LLMs Help Uncover Insights about LLMs? A Large-Scale, Evolving Literature Analysis of Frontier LLMs
Jungsoo Park
Junmo Kang
Gabriel Stanovsky
Alan Ritter
134
0
0
26 Feb 2025
A Sliding Layer Merging Method for Efficient Depth-Wise Pruning in LLMs
Xuan Ding
Rui Sun
Yunjian Zhang
Xiu Yan
Yueqi Zhou
Kaihao Huang
Suzhong Fu
Angelica I Aviles-Rivero
Chuanlong Xie
Yao Zhu
260
1
0
26 Feb 2025
MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering
Teng Lin
RALM
115
2
0
26 Feb 2025
Kanana: Compute-efficient Bilingual Language Models
Kanana LLM Team
Yunju Bak
Hojin Lee
Minho Ryu
Jiyeon Ham
...
Daniel Lee
Minchul Lee
MinHyung Lee
Shinbok Lee
Gaeun Seo
192
1
0
26 Feb 2025
Binary Neural Networks for Large Language Model: A Survey
Liangdong Liu
Zhitong Zheng
Cong Wang
TianHuang Su
ZhenYu Yang
MQ
147
0
0
26 Feb 2025
Data-Efficient Multi-Agent Spatial Planning with LLMs
Huangyuan Su
Aaron Walsman
Daniel Garces
Sham Kakade
Stephanie Gil
LLMAG
Presented at
ResearchTrend Connect | LLMAG
on
28 Mar 2025
220
0
0
26 Feb 2025
CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning
Ping Zhang
Zhaorui Zhang
Sheng Di
Yao Xin
Benben Liu
103
2
0
26 Feb 2025
FaithUn: Toward Faithful Forgetting in Language Models by Investigating the Interconnectedness of Knowledge
Nakyeong Yang
Minsung Kim
Seunghyun Yoon
Joongbo Shin
Kyomin Jung
KELM
MU
98
1
0
26 Feb 2025
Learning to Generate Structured Output with Schema Reinforcement Learning
Yaojie Lu
Haolun Li
Xin Cong
Zhong Zhang
Yesai Wu
Yankai Lin
Zhiyuan Liu
Fangming Liu
Maosong Sun
107
1
0
26 Feb 2025
Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization
Ru Wang
Wei Huang
Selena Song
Haoyu Zhang
Yusuke Iwasawa
Y. Matsuo
Jiaxian Guo
OODD
LRM
130
3
0
25 Feb 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language Models
Radu Marinescu
D. Bhattacharjya
Junkyu Lee
T. Tchrakian
Javier Carnerero-Cano
Yufang Hou
Elizabeth M. Daly
Alessandra Pascale
HILM
LRM
83
0
0
25 Feb 2025
A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition
Zihan Wang
Ziqi Zhao
Yougang Lyu
Ziyang Chen
Maarten de Rijke
Zhaochun Ren
134
4
0
25 Feb 2025
CHBench: A Chinese Dataset for Evaluating Health in Large Language Models
Chenlu Guo
Nuo Xu
Yi-Ju Chang
Yuan Wu
AI4MH
LM&MA
118
2
0
24 Feb 2025
CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought
Boxuan Zhang
Ruqi Zhang
LRM
76
3
0
24 Feb 2025
When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models
Weilan Wang
Yu Mao
Dongdong Tang
Hongchao Du
Nan Guan
Chun Jason Xue
MQ
126
2
0
24 Feb 2025
Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam
Tianjin Huang
Haotian Hu
Zhenyu Zhang
Gaojie Jin
Xianrui Li
...
Tianlong Chen
Lu Liu
Qingsong Wen
Zhangyang Wang
Shiwei Liu
MQ
126
2
0
24 Feb 2025
Streaming Looking Ahead with Token-level Self-reward
Han Zhang
Ruixin Hong
Dong Yu
76
2
0
24 Feb 2025
Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation
Simin Chen
Yiming Chen
Zexin Li
Yifan Jiang
Zhongwei Wan
...
Dezhi Ran
Tianle Gu
Haoyang Li
Tao Xie
Baishakhi Ray
99
6
0
23 Feb 2025
Navigation-GPT: A Robust and Adaptive Framework Utilizing Large Language Models for Navigation Applications
Feng Ma
Xiang Wang
Chen Chen
Xiao-bin Xu
Xin-ping Yan
474
0
0
23 Feb 2025
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
Qianqi Yan
Yue Fan
Hongquan Li
Shan Jiang
Yang Zhao
Xinze Guan
Ching-Chen Kuo
Xinze Wang
VLM
LRM
236
2
0
22 Feb 2025
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Ranjan Sapkota
Shaina Raza
Manoj Karkee
109
7
0
21 Feb 2025
Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation
Yue Zhou
Yi-Ju Chang
Yuan Wu
MoMe
122
3
0
21 Feb 2025
PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System
Yintao He
Haiyu Mao
Christina Giannoula
Mohammad Sadrosadati
Juan Gómez Luna
Huawei Li
Xiaowei Li
Ying Wang
O. Mutlu
91
8
0
21 Feb 2025
Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework
Reza Averly
Xia Ning
183
2
0
21 Feb 2025
DP-MemArc: Differential Privacy Transfer Learning for Memory Efficient Language Models
Yanming Liu
Xinyue Peng
Yuwei Zhang
Xiaolan Ke
Songhang Deng
...
Sheng Cheng
Xun Wang
Yuxiang Cai
Tianyu Du
Xuhong Zhang
181
0
0
21 Feb 2025
Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques
Sangjun Han
Taeil Hur
Youngmi Hur
Kathy Sangkyung Lee
Myungyoon Lee
Hyojae Lim
501
0
0
20 Feb 2025
Stacking as Accelerated Gradient Descent
Naman Agarwal
Pranjal Awasthi
Satyen Kale
Eric Zhao
ODL
135
3
0
20 Feb 2025
EvoP: Robust LLM Inference via Evolutionary Pruning
Shangyu Wu
Hongchao Du
Ying Xiong
Shuai Chen
Tei-Wei Kuo
Nan Guan
Chun Jason Xue
102
1
0
19 Feb 2025
Q-STRUM Debate: Query-Driven Contrastive Summarization for Recommendation Comparison
George Saad
Scott Sanner
40
0
0
18 Feb 2025
Conditioning LLMs to Generate Code-Switched Text
Maite Heredia
Gorka Labaka
Jeremy Barnes
A. Soroa
44
1
0
18 Feb 2025
Previous
1
2
3
...
6
7
8
...
85
86
87
Next