2204.02311
Cited By
v5 (latest)
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Papers citing "PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
Ecosystem Graphs: The Social Footprint of Foundation Models
Rishi Bommasani
Dilara Soylu
Thomas I. Liao
Kathleen A. Creel
Percy Liang
MLAU
81
35
0
28 Mar 2023
Language Models Trained on Media Diets Can Predict Public Opinion
Eric Chu
Jacob Andreas
S. Ansolabehere
Dwaipayan Roy
64
31
0
28 Mar 2023
Solving Regularized Exp, Cosh and Sinh Regression Problems
Zhihang Li
Zhao Song
Dinesh Manocha
97
39
0
28 Mar 2023
Foundation Models and Fair Use
Peter Henderson
Xuechen Li
Dan Jurafsky
Tatsunori Hashimoto
Mark A. Lemley
Percy Liang
92
126
0
28 Mar 2023
ChatGPT4PCG Competition: Character-like Level Generation for Science Birds
Pittawat Taveekitworachai
Febri Abdullah
Mury F. Dewantoro
R. Thawonmas
Julian Togelius
Jochen Renz
66
17
0
28 Mar 2023
ChatGPT as a Factual Inconsistency Evaluator for Text Summarization
Zheheng Luo
Qianqian Xie
Sophia Ananiadou
ELM
HILM
ALM
92
80
0
27 Mar 2023
Typhoon: Towards an Effective Task-Specific Masking Strategy for Pre-trained Language Models
Muhammed Shahir Abdurrahman
Hashem Elezabi
B. Xu
30
0
0
27 Mar 2023
On the Stepwise Nature of Self-Supervised Learning
James B. Simon
Maksis Knutins
Liu Ziyin
Daniel Geisz
Abraham J. Fetterman
Joshua Albrecht
SSL
101
35
0
27 Mar 2023
Scaling Pre-trained Language Models to Deeper via Parameter-efficient Architecture
Peiyu Liu
Ze-Feng Gao
Yushuo Chen
Wayne Xin Zhao
Ji-Rong Wen
MoE
74
0
0
27 Mar 2023
On the Creativity of Large Language Models
Giorgio Franceschelli
Mirco Musolesi
224
60
0
27 Mar 2023
MGTBench: Benchmarking Machine-Generated Text Detection
Xinlei He
Xinyue Shen
Zhenpeng Chen
Michael Backes
Yang Zhang
DeLMO
134
114
0
26 Mar 2023
Koala: An Index for Quantifying Overlaps with Pre-training Corpora
Thuy-Trang Vu
Xuanli He
Gholamreza Haffari
Ehsan Shareghi
CLL
84
15
0
26 Mar 2023
Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases
Yunjie Ji
Yong Deng
Yan Gong
Yiping Peng
Qiang Niu
Lefei Zhang
Baochang Ma
Xiangang Li
ALM
70
97
0
26 Mar 2023
An Evaluation of Memory Optimization Methods for Training Neural Networks
Xiaoxuan Liu
Siddharth Jha
Alvin Cheung
54
0
0
26 Mar 2023
Chat-REC: Towards Interactive and Explainable LLMs-Augmented Recommender System
Yunfan Gao
Tao Sheng
Youlin Xiang
Yun Xiong
Haofen Wang
Jiawei Zhang
RALM
KELM
198
297
0
25 Mar 2023
SmartBook: AI-Assisted Situation Report Generation for Intelligence Analysts
R. Reddy
Daniel Lee
Yi R. Fung
Khanh Duy Nguyen
Qi Zeng
Manling Li
Ziqi Wang
Clare R. Voss
Heng Ji
67
6
0
25 Mar 2023
GPT is becoming a Turing machine: Here are some ways to program it
A. Jojic
Zhen Wang
Nebojsa Jojic
LRM
113
17
0
25 Mar 2023
Errors are Useful Prompts: Instruction Guided Task Programming with Verifier-Assisted Iterative Prompting
Marta Skreta
Naruki Yoshikawa
Sebastian Arellano-Rubach
Zhi Ji
L. B. Kristensen
Kourosh Darvish
Alán Aspuru-Guzik
Florian Shkurti
Animesh Garg
131
58
0
24 Mar 2023
Accelerating Vision-Language Pretraining with Free Language Modeling
Teng Wang
Yixiao Ge
Feng Zheng
Ran Cheng
Ying Shan
Xiaohu Qie
Ping Luo
VLM
MLLM
118
10
0
24 Mar 2023
kNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference
Benfeng Xu
Quan Wang
Zhendong Mao
Yajuan Lyu
Qiaoqiao She
Yongdong Zhang
150
53
0
24 Mar 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
83
1
0
23 Mar 2023
Salient Span Masking for Temporal Understanding
Jeremy R. Cole
Aditi Chaudhary
Bhuwan Dhingra
Partha P. Talukdar
91
13
0
22 Mar 2023
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
Fengji Zhang
B. Chen
Yue Zhang
Jacky Keung
Jin Liu
Daoguang Zan
Yi Mao
Jian-Guang Lou
Weizhu Chen
75
245
0
22 Mar 2023
MEGA: Multilingual Evaluation of Generative AI
Kabir Ahuja
Harshita Diddee
Rishav Hada
Millicent Ochieng
Krithika Ramesh
...
T. Ganu
Sameer Segal
Maxamed Axmed
Kalika Bali
Sunayana Sitaram
LM&MA
LRM
ELM
153
292
0
22 Mar 2023
Fundamentals of Generative Large Language Models and Perspectives in Cyber-Defense
Andrei Kucharavy
Z. Schillaci
Loic Maréchal
Maxime Wursch
Ljiljana Dolamic
Remi Sabonnadiere
Dimitri Percia David
Alain Mermoud
Vincent Lenders
ELM
AI4CE
83
33
0
21 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
116
142
0
21 Mar 2023
Language Model Behavior: A Comprehensive Survey
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
120
109
0
20 Mar 2023
eP-ALM: Efficient Perceptual Augmentation of Language Models
Mustafa Shukor
Corentin Dancette
Matthieu Cord
MLLM
VLM
74
31
0
20 Mar 2023
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Kevin Qinghong Lin
E. Azarnasab
Faisal Ahmed
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
ReLM
KELM
LRM
128
397
0
20 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
148
289
0
20 Mar 2023
Context-faithful Prompting for Large Language Models
Wenxuan Zhou
Sheng Zhang
Hoifung Poon
Muhao Chen
KELM
61
65
0
20 Mar 2023
Unit Scaling: Out-of-the-Box Low-Precision Training
Charlie Blake
Douglas Orr
Carlo Luschi
MQ
87
7
0
20 Mar 2023
What does it take to catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring
Yonadav Shavit
80
23
0
20 Mar 2023
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
Zheng-Long Liu
Yue Huang
Xiao-Xing Yu
Lu Zhang
Zihao Wu
...
Dinggang Shen
Quanzheng Li
Tianming Liu
Dajiang Zhu
Xiang Li
LM&MA
MedIm
129
179
0
20 Mar 2023
Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao
Hailin Chen
Weishi Wang
Fangkai Jiao
Do Xuan Long
...
Bosheng Ding
Xiaobao Guo
Minzhi Li
Xingxuan Li
Shafiq Joty
133
89
0
20 Mar 2023
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Xiaozhe Ren
Pingyi Zhou
Xinfan Meng
Xinjing Huang
Yadao Wang
...
Jiansheng Wei
Xin Jiang
Teng Su
Qun Liu
Jun Yao
ALM
MoE
128
63
0
20 Mar 2023
Revisiting the Plastic Surgery Hypothesis via Large Language Models
Chun Xia
Yifeng Ding
Lingming Zhang
80
12
0
18 Mar 2023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Renze Lou
Kai Zhang
Wenpeng Yin
ALM
LRM
169
25
0
18 Mar 2023
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models
Vithursan Thangarasa
Abhay Gupta
William Marshall
Tianda Li
Kevin Leong
D. DeCoste
Sean Lie
Shreyas Saxena
MoE
AI4CE
90
22
0
18 Mar 2023
A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models
Junjie Ye
Xuanting Chen
Nuo Xu
Can Zu
Zekai Shao
...
Jie Zhou
Siming Chen
Tao Gui
Qi Zhang
Xuanjing Huang
ELM
85
337
0
18 Mar 2023
IRGen: Generative Modeling for Image Retrieval
Yidan Zhang
Ting Zhang
Dong Chen
Yujing Wang
Qi Chen
...
Qi Zhang
Fan Yang
Mao Yang
Q. Liao
B. Guo
3DV
VLM
143
15
0
17 Mar 2023
Measuring the Impact of Explanation Bias: A Study of Natural Language Justifications for Recommender Systems
K. Balog
Filip Radlinski
Andrey Petrov
45
3
0
16 Mar 2023
A Picture is Worth a Thousand Words: Language Models Plan from Pixels
Anthony Z. Liu
Lajanugen Logeswaran
Sungryull Sohn
Honglak Lee
LM&Ro
41
6
0
16 Mar 2023
ART: Automatic multi-step reasoning and tool-use for large language models
Bhargavi Paranjape
Scott M. Lundberg
Sameer Singh
Hannaneh Hajishirzi
Luke Zettlemoyer
Marco Tulio Ribeiro
KELM
ReLM
LRM
99
154
0
16 Mar 2023
Automated Interactive Domain-Specific Conversational Agents that Understand Human Dialogs
Yankai Zeng
Abhiramon Rajasekharan
Parth Padalkar
Kinjal Basu
Joaquín Arias
Gopal Gupta
53
17
0
15 Mar 2023
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul
Adian Liusie
Mark Gales
HILM
LRM
255
448
0
15 Mar 2023
Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!
Yubo Ma
Yixin Cao
YongChing Hong
Aixin Sun
RALM
187
157
0
15 Mar 2023
Lana: A Language-Capable Navigator for Instruction Following and Generation
Xiaohan Wang
Wenguan Wang
Jiayi Shao
Yi Yang
LLMAG
LM&Ro
98
41
0
15 Mar 2023
How Many Demonstrations Do You Need for In-context Learning?
Jiuhai Chen
Lichang Chen
Chen Zhu
Dinesh Manocha
LRM
94
43
0
14 Mar 2023
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family
Yiming Tan
Dehai Min
Y. Li
Wenbo Li
Nan Hu
Yongrui Chen
Guilin Qi
AI4MH
ELM
110
102
0
14 Mar 2023