Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
v1
v2
v3
v4
v5 (latest)
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
Balancing Privacy and Performance for Private Federated Learning Algorithms
Xiangjiang Hou
Sarit Khirirat
Mohammad Yaqub
Samuel Horváth
FedML
64
0
0
11 Apr 2023
Human-machine cooperation for semantic feature listing
Kushin Mukherjee
Siddharth Suresh
Timothy T. Rogers
VLM
56
2
0
11 Apr 2023
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Tao Lei
Junwen Bai
Siddhartha Brahma
Joshua Ainslie
Kenton Lee
...
Vincent Zhao
Yuexin Wu
Yue Liu
Yu Zhang
Ming-Wei Chang
BDL
AI4CE
115
64
0
11 Apr 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
88
21
0
10 Apr 2023
Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis
Wenhao Zhu
Hongyi Liu
Qingxiu Dong
Jingjing Xu
Shujian Huang
Lingpeng Kong
Jiajun Chen
Lei Li
LRM
149
152
0
10 Apr 2023
Randomized and Deterministic Attention Sparsification Algorithms for Over-parameterized Feature Dimension
Yichuan Deng
Sridhar Mahadevan
Zhao Song
64
35
0
10 Apr 2023
OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge
Wenyue Hua
Kai Mei
Jianchao Ji
Juntao Tan
Shuyuan Xu
Zelong Li
Yongfeng Zhang
VLM
LRM
128
232
0
10 Apr 2023
Is ChatGPT a Good Sentiment Analyzer? A Preliminary Study
Zengzhi Wang
Qiming Xie
Yi Feng
Zixiang Ding
Zinong Yang
Rui Xia
AI4MH
LLMAG
108
157
0
10 Apr 2023
A Preliminary Evaluation of ChatGPT for Zero-shot Dialogue Understanding
Wenbo Pan
Qiguang Chen
Xiao Xu
Wanxiang Che
Libo Qin
80
46
0
09 Apr 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
69
44
0
08 Apr 2023
Why think step by step? Reasoning emerges from the locality of experience
Ben Prystawski
Michael Y. Li
Noah D. Goodman
LRM
ReLM
91
107
0
07 Apr 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
Emilio Ferrara
SILM
134
264
0
07 Apr 2023
Interpretable Unified Language Checking
Tianhua Zhang
Hongyin Luo
Yung-Sung Chuang
Wei Fang
Luc Gaitskell
Thomas Hartvigsen
Xixin Wu
D. Fox
Helen M. Meng
James R. Glass
83
22
0
07 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
105
43
0
07 Apr 2023
Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster
Nolan Dey
Gurpreet Gosal
Zhiming Chen
Chen
Hemant Khachane
William Marshall
Ribhu Pathria
Marvin Tom
Joel Hestness
MoE
LRM
135
108
0
06 Apr 2023
Zero-Shot Next-Item Recommendation using Large Pretrained Language Models
Lei Wang
Ee-Peng Lim
LRM
82
58
0
06 Apr 2023
Conceptual structure coheres in human cognition but not in large language models
Siddharth Suresh
Kushin Mukherjee
Xizheng Yu
Wei-Chun Huang
Lisa Padua
Timothy T. Rogers
65
11
0
05 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
521
7,487
0
05 Apr 2023
Effective Theory of Transformers at Initialization
Emily Dinan
Sho Yaida
Susan Zhang
92
16
0
04 Apr 2023
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
Zhiqiang Hu
Lei Wang
Yihuai Lan
Wanyu Xu
Ee-Peng Lim
Lidong Bing
Xing Xu
Soujanya Poria
Roy Ka-wei Lee
ALM
184
275
0
04 Apr 2023
Optimizing Group Utility in Itinerary Planning: A Strategic and Crowd-Aware Approach
Junhua Liu
Kwan Hui Lim
Kristin L. Wood
Menglin Li
131
0
0
04 Apr 2023
Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks
Yixuan Weng
Minjun Zhu
Fei Xia
Bin Li
Shizhu He
Kang Liu
Jun Zhao
103
6
0
04 Apr 2023
One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era
Chaoning Zhang
Chenshuang Zhang
Chenghao Li
Yu Qiao
Sheng Zheng
...
Sung-Ho Bae
Lik-Hang Lee
Pan Hui
In So Kweon
Choong Seon Hong
LM&MA
AI4MH
LRM
ELM
106
138
0
04 Apr 2023
TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
N. Jouppi
George Kurian
Sheng Li
Peter C. Ma
R. Nagarajan
...
Brian Towles
C. Young
Xiaoping Zhou
Zongwei Zhou
David A. Patterson
BDL
VLM
182
371
0
04 Apr 2023
Scientists' Perspectives on the Potential for Generative AI in their Fields
Meredith Ringel Morris
AI4CE
71
43
0
04 Apr 2023
The Vector Grounding Problem
Dimitri Coelho Mollo
Raphael Milliere
148
28
0
04 Apr 2023
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Stella Biderman
Hailey Schoelkopf
Quentin G. Anthony
Herbie Bradley
Kyle O'Brien
...
USVSN Sai Prashanth
Edward Raff
Aviya Skowron
Lintang Sutawika
Oskar van der Wal
161
1,312
0
03 Apr 2023
Specialty-Oriented Generalist Medical AI for Chest CT Screening
Chuang Niu
Qing Lyu
Christopher D. Carothers
P. Kaviani
Josh Tan
Pingkun Yan
Mannudeep K. Kalra
C. Whitlow
Ge Wang
73
6
0
03 Apr 2023
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study
Yi Chen
Rui Wang
Haiyun Jiang
Shuming Shi
Ruifeng Xu
LM&MA
144
87
0
03 Apr 2023
Eight Things to Know about Large Language Models
Sam Bowman
ALM
103
117
0
02 Apr 2023
Towards Healthy AI: Large Language Models Need Therapists Too
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Kush R. Varshney
AI4MH
101
19
0
02 Apr 2023
Evaluating Large Language Models on a Highly-specialized Topic, Radiation Oncology Physics
J. Holmes
Zheng Liu
Hua Zhou
Yuzhen Ding
Terence T. Sio
...
Jonathan B. Ashman
Xiang Li
Tianming Liu
Jiajian Shen
Wen Liu
LM&MA
AI4CE
ELM
94
124
0
01 Apr 2023
Subject-driven Text-to-Image Generation via Apprenticeship Learning
Wenhu Chen
Hexiang Hu
Yandong Li
Nataniel Rui
Xuhui Jia
Ming-Wei Chang
William W. Cohen
DiffM
215
194
0
01 Apr 2023
Enhancing Large Language Models with Climate Resources
Mathias Kraus
J. Bingler
Markus Leippold
Tobias Schimanski
Chiara Colesanti-Senni
Dominik Stammbach
S. Vaghefi
Nicolas Webersinke
101
25
0
31 Mar 2023
A Closer Look at Parameter-Efficient Tuning in Diffusion Models
Chendong Xiang
Fan Bao
Chongxuan Li
Hang Su
Jun Zhu
DiffM
58
16
0
31 Mar 2023
CQSumDP: A ChatGPT-Annotated Resource for Query-Focused Abstractive Summarization Based on Debatepedia
Md Tahmid Rahman Laskar
Mizanur Rahman
Israt Jahan
Enamul Hoque
J. Huang
86
9
0
31 Mar 2023
Pair Programming with Large Language Models for Sampling and Estimation of Copulas
Jan Górecki
LLMAG
39
1
0
31 Mar 2023
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society
Ge Li
Hasan Hammoud
Hani Itani
Dmitrii Khizbullin
Guohao Li
SyDa
ALM
215
521
0
31 Mar 2023
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
175
913
0
30 Mar 2023
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X
Qinkai Zheng
Xiao Xia
Xu Zou
Yuxiao Dong
Shanshan Wang
...
Andi Wang
Yang Li
Teng Su
Zhilin Yang
Jie Tang
ELM
ALM
SyDa
178
347
0
30 Mar 2023
Language Models can Solve Computer Tasks
Geunwoo Kim
Pierre Baldi
Stephen Marcus McAleer
LLMAG
LM&Ro
170
374
0
30 Mar 2023
Humans in Humans Out: On GPT Converging Toward Common Sense in both Success and Failure
Philipp E. Koralus
Vincent Wang-Ma'scianica
LRM
47
13
0
30 Mar 2023
DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents
Varun Nair
Elliot Schumacher
Geoffrey Tso
Anitha Kannan
VLM
73
64
0
30 Mar 2023
Evaluating GPT-3.5 and GPT-4 Models on Brazilian University Admission Exams
Desnes Nunes
Ricardo Primi
Ramon Pires
R. Lotufo
Rodrigo Nogueira
ELM
42
34
0
29 Mar 2023
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators
Xingwei He
Zheng-Wen Lin
Yeyun Gong
Alex Jin
Hang Zhang
Chen Lin
Jian Jiao
Siu-Ming Yiu
Nan Duan
Weizhu Chen
119
201
0
29 Mar 2023
Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks
Haoqi Yuan
Chi Zhang
Hongchen Wang
Feiyang Xie
Penglin Cai
Hao Dong
Zongqing Lu
LM&Ro
LLMAG
137
23
0
29 Mar 2023
An Over-parameterized Exponential Regression
Yeqi Gao
Sridhar Mahadevan
Zhao Song
81
39
0
29 Mar 2023
Larger Probes Tell a Different Story: Extending Psycholinguistic Datasets Via In-Context Learning
Namrata Shivagunde
Vladislav Lialin
Anna Rumshisky
71
1
0
29 Mar 2023
On Codex Prompt Engineering for OCL Generation: An Empirical Study
Seif Abukhalaf
Mohammad Hamdaqa
Foutse Khomh
74
23
0
28 Mar 2023
Hallucinations in Large Multilingual Translation Models
Nuno M. Guerreiro
Duarte M. Alves
Jonas Waldendorf
Barry Haddow
Alexandra Birch
Pierre Colombo
André F.T. Martins
VLM
HILM
LRM
201
154
0
28 Mar 2023
Previous
1
2
3
...
72
73
74
...
85
86
87
Next