Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
v1
v2
v3
v4
v5 (latest)
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
GoalLadder: Incremental Goal Discovery with Vision-Language Models
Alexey Zakharov
Shimon Whiteson
24
0
0
19 Jun 2025
From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem
Yanxu Mao
Tiehan Cui
Peipei Liu
Datao You
Hongsong Zhu
AAML
21
0
0
18 Jun 2025
SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling
Md Imbesat Hassan Rizvi
Xiaodan Zhu
Iryna Gurevych
LRM
43
0
0
18 Jun 2025
Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Roger Creus Castanyer
J. Obando-Ceron
Lu Li
Pierre-Luc Bacon
Glen Berseth
Aaron Courville
Pablo Samuel Castro
32
0
0
18 Jun 2025
Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models
Chenchen Yuan
Zheyu Zhang
Shuo Yang
Bardh Prenkaj
Gjergji Kasneci
43
0
0
17 Jun 2025
FEAST: A Flexible Mealtime-Assistance System Towards In-the-Wild Personalization
Rajat Kumar Jenamani
Tom Silver
Ben Dodson
Shiqin Tong
Anthony Song
Yuting Yang
Ziang Liu
Benjamin Howe
Aimee Whitneck
Tapomayukh Bhattacharjee
40
1
0
17 Jun 2025
GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors
Hengyuan Zhang
Xinrong Chen
Yingmin Qiu
Xiao Liang
Ziyue Li
...
Weiping Li
Tong Mo
Wenyue Li
Hayden Kwok-Hay So
Ngai Wong
MoE
ALM
37
0
0
17 Jun 2025
Don't throw the baby out with the bathwater: How and why deep learning for ARC
Jack Cole
Mohamed Osman
LRM
45
0
0
17 Jun 2025
Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models
Muhammad Reza Qorib
Junyi Li
Hwee Tou Ng
LRM
32
0
0
16 Jun 2025
ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection
Shang-Chi Tsai
Seiya Kawano
Angel García Contreras
Koichiro Yoshino
Yun-Nung Chen
LM&Ro
45
2
0
16 Jun 2025
ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
Zhaochen Hong
Haofei Yu
Jiaxuan You
25
0
0
14 Jun 2025
Theoretical Tensions in RLHF: Reconciling Empirical Success with Inconsistencies in Social Choice Theory
Jiancong Xiao
Zhekun Shi
Kaizhao Liu
Q. Long
Weijie J. Su
43
0
0
14 Jun 2025
Is your batch size the problem? Revisiting the Adam-SGD gap in language modeling
Teodora Srećković
Jonas Geiping
Antonio Orvieto
MoE
34
0
0
14 Jun 2025
Brewing Knowledge in Context: Distillation Perspectives on In-Context Learning
Chengye Li
Haiyun Liu
Yuanxi Li
31
0
0
13 Jun 2025
Long-Short Alignment for Effective Long-Context Modeling in LLMs
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
27
0
0
13 Jun 2025
One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers
Diana Abagyan
Alejandro Salamanca
Andres Felipe Cruz-Salinas
Kris Cao
Hangyu Lin
Acyr Locatelli
Marzieh Fadaee
Ahmet Üstün
Sara Hooker
CLL
150
0
0
12 Jun 2025
Disentangling Dual-Encoder Masked Autoencoder for Respiratory Sound Classification
Peidong Wei Shiyu Miao Lin Li
Shiyu Miao
Lin Li
130
0
0
12 Jun 2025
OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems
Xiaozhe Li
Jixuan Chen
Xinyu Fang
Shengyuan Ding
Haodong Duan
Qingwen Liu
Kai-xiang Chen
LLMAG
LRM
113
0
0
12 Jun 2025
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models
Haoyi Song
Ruihan Ji
Naichen Shi
Fan Lai
Raed Al Kontar
91
0
0
11 Jun 2025
Memorization in Language Models through the Lens of Intrinsic Dimension
Stefan Arnold
PILM
114
0
0
11 Jun 2025
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs
Xiyao Wang
Zhengyuan Yang
Chao Feng
Yongyuan Liang
Yuhang Zhou
...
Chung-Ching Lin
Kevin Lin
Linjie Li
Furong Huang
L. xilinx Wang
OffRL
LRM
66
0
0
11 Jun 2025
Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search
Dongge Han
Menglin Xia
Daniel Madrigal Diaz
Samuel Kessler
Ankur Mallick
Xuchao Zhang
Mirian Hipolito Garcia
Jin Xu
Victor Rühle
Saravan Rajmohan
LRM
50
0
0
10 Jun 2025
Low-resource domain adaptation while minimizing energy and hardware resource consumption
Hernán Maina
Nicolás Wolovick
Luciana Benotti
34
0
0
10 Jun 2025
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
Zheda Mai
A. Chowdhury
Zihe Wang
Sooyoung Jeon
Jingyan Bai
Jiacheng Hou
Jihyung Kil
Wei-Lun Chao
CoGe
61
0
0
10 Jun 2025
MIRA: Medical Time Series Foundation Model for Real-World Health Data
Hao Li
Bowen Deng
Chang Xu
Zhiyuan Feng
Viktor Schlegel
...
Yizheng Sun
Jingyuan Sun
Kailai Yang
Yiyao Yu
Jiang Bian
AI4TS
OOD
AI4CE
62
0
0
09 Jun 2025
Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models
Ramakrishna Appicharla
Baban Gain
Santanu Pal
Asif Ekbal
LRM
27
0
0
09 Jun 2025
Chain-of-Code Collapse: Reasoning Failures in LLMs via Adversarial Prompting in Code Generation
Jaechul Roh
Varun Gandhi
Shivani Anilkumar
Arin Garg
AAML
ReLM
LRM
48
0
0
08 Jun 2025
What Makes a Good Natural Language Prompt?
Do Xuan Long
Duy Dinh
Ngoc-Hai Nguyen
Kenji Kawaguchi
Nancy F. Chen
Shafiq Joty
Min-Yen Kan
43
0
0
07 Jun 2025
Elementary Math Word Problem Generation using Large Language Models
Nimesh Ariyarathne
Harshani Bandara
Yasith Heshan
Omega Gamage
Surangika Ranathunga
...
Gayathri Lihinikaduarachchi
Tharoosha Vihidun
Meenambika Chandirakumar
Sanujen Premakumar
Sanjula Gathsara
AI4Ed
76
0
0
06 Jun 2025
A Culturally-Rich Romanian NLP Dataset from "Who Wants to Be a Millionaire?" Videos
Alexandru-Gabriel Ganea
Antonia-Adelina Popovici
Adrian-Marius Dumitran
51
0
0
06 Jun 2025
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Tennison Liu
M. Schaar
AIFin
LRM
133
0
0
05 Jun 2025
Inference economics of language models
Ege Erdil
108
0
0
05 Jun 2025
Sample Complexity and Representation Ability of Test-time Scaling Paradigms
Baihe Huang
Shanda Li
Tianhao Wu
Yiming Yang
Ameet Talwalkar
Kannan Ramchandran
Michael I. Jordan
Jiantao Jiao
LRM
115
0
0
05 Jun 2025
Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
Zesheng Ye
C. Cai
Ruijiang Dong
Jianzhong Qi
Lei Feng
Pin-Yu Chen
Feng Liu
234
0
0
05 Jun 2025
Detection Method for Prompt Injection by Integrating Pre-trained Model and Heuristic Feature Engineering
Yi Ji
Runzhi Li
Baolei Mao
AAML
22
0
0
05 Jun 2025
Prompting LLMs: Length Control for Isometric Machine Translation
Dávid Javorský
Ondrej Bojar
François Yvon
104
0
0
05 Jun 2025
Towards LLM-Centric Multimodal Fusion: A Survey on Integration Strategies and Techniques
Jisu An
Junseok Lee
Jeoungeun Lee
Yongseok Son
158
0
0
05 Jun 2025
Adaptive Preconditioners Trigger Loss Spikes in Adam
Zhiwei Bai
Zhangchen Zhou
Jiajie Zhao
Xiaolong Li
Zhiyu Li
Feiyu Xiong
Hongkang Yang
Yaoyu Zhang
Z. Xu
ODL
114
0
0
05 Jun 2025
You Only Train Once
Christos Sakaridis
41
0
0
04 Jun 2025
Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models
Fangrui Zhu
Hanhui Wang
Yiming Xie
Jing Gu
Tianye Ding
Jianwei Yang
Huaizu Jiang
3DV
LRM
116
0
0
04 Jun 2025
Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks
Lin Mu
Guowei Chu
Li Ni
Lei Sang
Zhize Wu
Peiquan Jin
Yiwen Zhang
101
0
0
04 Jun 2025
Understanding Gender Bias in AI-Generated Product Descriptions
Markelle Kelly
Mohammad Tahaei
Padhraic Smyth
Lauren Wilcox
31
0
0
03 Jun 2025
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation
Siqi Chen
Xinyu Dong
Haolei Xu
Xingyu Wu
Fei Tang
...
Wenqi Zhang
Guiyang Hou
Yongliang Shen
Weiming Lu
Yueting Zhuang
VLM
66
0
0
03 Jun 2025
From Transformers to Large Language Models: A systematic review of AI applications in the energy sector towards Agentic Digital Twins
Gabriel Antonesi
T. Cioara
I. Anghel
Vasilis Michalakopoulos
Elissaios Sarmas
Liana Toderean
LLMAG
MedIm
AI4CE
20
0
0
03 Jun 2025
A Trustworthiness-based Metaphysics of Artificial Intelligence Systems
Andrea Ferrario
44
0
0
03 Jun 2025
Beyond Text Compression: Evaluating Tokenizers Across Scales
Jonas F. Lotz
António V. Lopes
Stephan Peitz
Hendra Setiawan
Leonardo Emili
63
0
0
03 Jun 2025
Taming LLMs by Scaling Learning Rates with Gradient Grouping
Siyuan Li
Juanxi Tian
Zedong Wang
Xin Jin
Zicheng Liu
Wentao Zhang
Dan Xu
52
0
0
01 Jun 2025
OntoRAG: Enhancing Question-Answering through Automated Ontology Derivation from Unstructured Knowledge Bases
Yash Tiwari
Owais Ahmad Lone
Mayukha Pal
36
0
0
31 May 2025
TreeRare: Syntax Tree-Guided Retrieval and Reasoning for Knowledge-Intensive Question Answering
Boyi Zhang
Zhuo Liu
Hangfeng He
LRM
29
0
0
31 May 2025
Exploring In-context Example Generation for Machine Translation
Dohyun Lee
Seungil Lee
Chanwoo Yang
Yujin Baek
Jaegul Choo
40
0
0
31 May 2025
1
2
3
4
...
85
86
87
Next