2204.02311
Cited By
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Papers citing "PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
TreeRare: Syntax Tree-Guided Retrieval and Reasoning for Knowledge-Intensive Question Answering
Boyi Zhang
Zhuo Liu
Hangfeng He
LRM
29
0
0
31 May 2025
Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations
Pardis Sadat Zahraei
Ali Emami
35
0
0
31 May 2025
Stepsize anything: A unified learning rate schedule for budgeted-iteration training
Anda Tang
Yiming Dong
Yutao Zeng
Zhou Xun
Zhouchen Lin
378
0
0
30 May 2025
DLM-One: Diffusion Language Models for One-Step Sequence Generation
Tianqi Chen
Shujian Zhang
Mingyuan Zhou
39
0
0
30 May 2025
Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules
Y. Zhang
Peiwen Yuan
Shaoxiong Feng
Yiwei Li
Xinglin Wang
Jiayi Shi
Chuyi Tan
Boyuan Pan
Yao Hu
Kan Li
29
0
0
30 May 2025
A Mathematical Framework for AI-Human Integration in Work
Elisa Celis
Lingxiao Huang
Nisheeth K. Vishnoi
81
0
0
29 May 2025
Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data
Seohyeong Lee
Eunwon Kim
Hwaran Lee
Buru Chang
86
0
0
29 May 2025
Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts
Xuweiyi Chen
Wentao Zhou
Aruni RoyChowdhury
Zezhou Cheng
3DPC
67
0
0
29 May 2025
A Survey of Generative Categories and Techniques in Multimodal Large Language Models
Longzhen Han
Awes Mubarak
Almas Baimagambetov
Nikolaos Polatidis
Thar Baker
LRM
72
0
0
29 May 2025
EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse
Tianyu Guo
Hande Dong
Yichong Leng
Feng Liu
Cheater Lin
Nong Xiao
X. Zhang
RALM
35
0
0
28 May 2025
RISE: Reasoning Enhancement via Iterative Self-Exploration in Multi-hop Question Answering
Bolei He
Xinran He
Mengke Chen
Xianwei Xue
Ying Zhu
Zhenhua Ling
ReLM
LRM
56
0
0
28 May 2025
DES-LOC: Desynced Low Communication Adaptive Optimizers for Training Foundation Models
Alex Iacob
Lorenzo Sani
M. Safaryan
Paris Giampouras
Samuel Horváth
...
Meghdad Kurmanji
Preslav Aleksandrov
William F. Shen
Xinchi Qiu
Nicholas D. Lane
OffRL
112
0
0
28 May 2025
Taming Transformer Without Using Learning Rate Warmup
Xianbiao Qi
Yelin He
Jiaquan Ye
Chun-Guang Li
Bojia Zi
Xili Dai
Qin Zou
Rong Xiao
38
0
0
28 May 2025
Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
Shintaro Ozaki
Tatsuya Hiraoka
Hiroto Otake
Hiroki Ouchi
Masaru Isonuma
...
Kentaro Inui
Taro Watanabe
Yusuke Miyao
Yohei Oseki
Yu Takagi
LRM
54
0
0
27 May 2025
What happens when generative AI models train recursively on each others' generated outputs?
Hung Ahn Vu
Galen Reeves
Emily Wenger
71
0
0
27 May 2025
In Search of Adam's Secret Sauce
Antonio Orvieto
Robert Gower
49
1
0
27 May 2025
Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations
Zeinab Dehghani
Koorosh Aslansefat
Adil Khan
Mohammed Naveed Akram
MILM
LRM
141
0
0
27 May 2025
LLMs Think, But Not In Your Flow: Reasoning-Level Personalization for Black-Box Large Language Models
Jieyong Kim
Tongyoung Kim
Soojin Yoon
Jaehyung Kim
Dongha Lee
LRM
98
0
0
27 May 2025
Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment
Xiaojun Jia
Sensen Gao
Simeng Qin
Tianyu Pang
C. Du
Yihao Huang
Xinfeng Li
Yiming Li
Bo Li
Yang Liu
AAML
50
0
0
27 May 2025
Scrapers selectively respect robots.txt directives: evidence from a large-scale empirical study
Taein Kim
Karstan Bock
Claire Luo
Amanda Liswood
Emily Wenger
9
0
0
27 May 2025
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models
Xiaqiang Tang
Jian Li
Keyu Hu
Du Nan
Xiaolong Li
Xi Zhang
Weigao Sun
Sihong Xie
HILM
55
0
0
27 May 2025
Pretrained LLMs Learn Multiple Types of Uncertainty
Roi Cohen
Omri Fahn
Gerard de Melo
43
0
0
27 May 2025
Test-Time Learning for Large Language Models
Jinwu Hu
Zhitian Zhang
Guohao Chen
Xutao Wen
Chao Shuai
Wei Luo
Bin Xiao
Yuanqing Li
Mingkui Tan
57
0
0
27 May 2025
ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs
Pooneh Mousavi
Yingzhi Wang
Mirco Ravanelli
Cem Subakan
AuLLM
77
0
0
26 May 2025
Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers
Rihui Xin
Han Liu
Zecheng Wang
Yupeng Zhang
Dianbo Sui
Xiaolin Hu
Bingning Wang
SyDa
73
1
0
26 May 2025
LlamaSeg: Image Segmentation via Autoregressive Mask Generation
Jiru Deng
Tengjin Weng
Tianyu Yang
Wenhan Luo
Zhiheng Li
Wenhao Jiang
VLM
154
0
0
26 May 2025
Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks
Mohammad Mahdi Moradi
Walid Ahmed
Shuangyue Wen
Sudhir Mudur
Weiwei Zhang
Yang Liu
98
0
0
26 May 2025
DISRetrieval: Harnessing Discourse Structure for Long Document Retrieval
H. Chen
Yi Yang
Yinghui Li
Meishan Zhang
Min Zhang
RALM
24
0
0
26 May 2025
GenKI: Enhancing Open-Domain Question Answering with Knowledge Integration and Controllable Generation in Large Language Models
Tingjia Shen
Hao Wang
Chuan Qin
Ruijun Sun
Yang Song
Defu Lian
Hengshu Zhu
Enhong Chen
57
0
0
26 May 2025
SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety
Geon-hyeong Kim
Youngsoo Jang
Yu Jin Kim
Byoungjip Kim
Honglak Lee
Kyunghoon Bae
Moontae Lee
32
2
0
26 May 2025
Enhancing Visual Reliance in Text Generation: A Bayesian Perspective on Mitigating Hallucination in Large Vision-Language Models
Nanxing Hu
Xiaoyue Duan
Jinchao Zhang
Guoliang Kang
MLLM
76
0
0
26 May 2025
Efficient Data Selection at Scale via Influence Distillation
Mahdi Nikdan
Vincent Cohen-Addad
Dan Alistarh
Vahab Mirrokni
TDI
75
0
0
25 May 2025
Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation
Jiwan Chung
Junhyeok Kim
Siyeol Kim
Jaeyoung Lee
Min Soo Kim
Youngjae Yu
LRM
95
0
0
24 May 2025
Multi-Scale Manifold Alignment: A Unified Framework for Enhanced Explainability of Large Language Models
Yukun Zhang
Qi Dong
36
0
0
24 May 2025
Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning
Xingyu Tan
Xiaoyang Wang
Qing Liu
Xiwei Xu
Xin Yuan
Liming Zhu
Wenjie Zhang
RALM
LRM
62
0
0
23 May 2025
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models
Doohyuk Jang
Yoonjeon Kim
Chanjae Park
Hyun Ryu
Eunho Yang
LRM
100
0
0
22 May 2025
ECHO-LLaMA: Efficient Caching for High-Performance LLaMA Training
Maryam Dialameh
Rezaul Karim
Hossein Rajabzadeh
Omar Mohamed Awad
Hyock Ju Kwon
Boxing Chen
Walid Ahmed
Yang Liu
99
0
0
22 May 2025
LightRouter: Towards Efficient LLM Collaboration with Minimal Overhead
Yifan Zhang
Xinkui Zhao
Zuxin Wang
Guanjie Cheng
Yueshen Xu
Shuiguang Deng
Yuxiang Cai
100
0
0
22 May 2025
Shadows in the Attention: Contextual Perturbation and Representation Drift in the Dynamics of Hallucination in LLMs
Zeyu Wei
Shuo Wang
Xiaohui Rong
Xuemin Liu
He Li
HILM
45
0
0
22 May 2025
SC4ANM: Identifying Optimal Section Combinations for Automated Novelty Prediction in Academic Papers
Wenqing Wu
Chengzhi Zhang
Tong Bao
Yi Zhao
221
1
0
22 May 2025
AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-training
Huishuai Zhang
Bohan Wang
Luoxin Chen
ODL
232
0
0
22 May 2025
ChemMLLM: Chemical Multimodal Large Language Model
Qian Tan
Dongzhan Zhou
Peng Xia
Wanhao Liu
Wanli Ouyang
Lei Bai
Yuqiang Li
Tianfan Fu
MLLM
49
0
0
22 May 2025
Logic-of-Thought: Empowering Large Language Models with Logic Programs for Solving Puzzles in Natural Language
Naiqi Li
Peiyuan Liu
Zheng Liu
Tao Dai
Yong Jiang
Shu-Tao Xia
ReLM
LRM
37
0
0
22 May 2025
HASH-RAG: Bridging Deep Hashing with Retriever for Efficient, Fine Retrieval and Augmented Generation
Jinyu Guo
Xunlei Chen
Qiyang Xia
Zhaokun Wang
Jie Ou
Libo Qin
Shunyu Yao
Wenhong Tian
207
0
0
22 May 2025
UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset
Hua Li
Shijie Lian
Zhiyuan Li
Runmin Cong
Sam Kwong
VLM
85
0
0
21 May 2025
SUS backprop: linear backpropagation algorithm for long inputs in transformers
Sergey Pankov
Georges Harik
112
0
0
21 May 2025
Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack
Silvia Cappelletti
Tobia Poppi
Samuele Poppi
Zheng-Xin Yong
Diego Garcia-Olano
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
KELM
AAML
61
0
0
21 May 2025
Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution
Suvadeep Hajra
107
0
0
21 May 2025
The Effects of Data Augmentation on Confidence Estimation for LLMs
Rui Wang
Renyu Zhu
Minmin Lin
R. Wu
Tangjie Lv
Changjie Fan
Haobo Wang
23
0
0
21 May 2025
AAPO: Enhance the Reasoning Capabilities of LLMs with Advantage Momentum
Jian Xiong
Jingbo Zhou
Jingyong Ye
Dejing Dou
LRM
102
0
0
20 May 2025