Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
v1
v2
v3
v4
v5 (latest)
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
Disentangled Phonetic Representation for Chinese Spelling Correction
Zihong Liang
Xiaojun Quan
Qifan Wang
70
20
0
24 May 2023
Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models
Amirhossein Kazemnejad
Mehdi Rezagholizadeh
Prasanna Parthasarathi
Sarath Chandar
ELM
65
2
0
24 May 2023
Allies: Prompting Large Language Model with Beam Search
Hao Sun
Xiao Liu
Yeyun Gong
Yan Zhang
Daxin Jiang
Linjun Yang
Nan Duan
RALM
105
6
0
24 May 2023
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language Models
Sheng Shen
Le Hou
Yan-Quan Zhou
Nan Du
Shayne Longpre
...
Vincent Zhao
Hongkun Yu
Kurt Keutzer
Trevor Darrell
Denny Zhou
ALM
MoE
109
60
0
24 May 2023
Can Transformers Learn to Solve Problems Recursively?
Shizhuo Zhang
Curt Tigges
Stella Biderman
Maxim Raginsky
Talia Ringer
54
17
0
24 May 2023
Emergent inabilities? Inverse scaling over the course of pretraining
J. Michaelov
Benjamin Bergen
LRM
ReLM
68
3
0
24 May 2023
Selectively Answering Ambiguous Questions
Jeremy R. Cole
Michael J.Q. Zhang
D. Gillick
Julian Martin Eisenschlos
Bhuwan Dhingra
Jacob Eisenstein
UQLM
123
31
0
24 May 2023
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding
Li Sun
F. Luisier
Kayhan Batmanghelich
D. Florêncio
Changrong Zhang
VLM
51
6
0
23 May 2023
Unraveling ChatGPT: A Critical Analysis of AI-Generated Goal-Oriented Dialogues and Annotations
Tiziano Labruna
Sofia Brenna
Andrea Zaninello
Bernardo Magnini
50
15
0
23 May 2023
Sources of Hallucination by Large Language Models on Inference Tasks
Nick McKenna
Tianyi Li
Liang Cheng
Mohammad Javad Hosseini
Mark Johnson
Mark Steedman
LRM
HILM
107
201
0
23 May 2023
NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders
Livio Baldini Soares
D. Gillick
Jeremy R. Cole
Tom Kwiatkowski
79
1
0
23 May 2023
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
Zhiheng Xi
Senjie Jin
Yuhao Zhou
Rui Zheng
Songyang Gao
Tao Gui
Qi Zhang
Xuanjing Huang
ReLM
LRM
91
48
0
23 May 2023
Language Model Self-improvement by Reinforcement Learning Contemplation
Jing-Cheng Pang
Pengyuan Wang
Kaiyuan Li
Xiong-Hui Chen
Jiacheng Xu
Zongzhang Zhang
Yang Yu
LRM
KELM
64
52
0
23 May 2023
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
Hong Liu
Zhiyuan Li
David Leo Wright Hall
Percy Liang
Tengyu Ma
VLM
144
149
0
23 May 2023
Automatic Model Selection with Large Language Models for Reasoning
Xu Zhao
Yuxi Xie
Kenji Kawaguchi
Junxian He
Qizhe Xie
ReLM
LRM
87
40
0
23 May 2023
Evaluating and Modeling Attribution for Cross-Lingual Question Answering
Benjamin Muller
John Wieting
J. Clark
Tom Kwiatkowski
Sebastian Ruder
Livio Baldini Soares
Roee Aharoni
Jonathan Herzig
Xinyi Wang
HILM
87
17
0
23 May 2023
What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems
Navita Goyal
Eleftheria Briakou
Amanda Liu
Connor Baumler
C. Bonial
J. Micher
Clare R. Voss
Marine Carpuat
Hal Daumé
91
9
0
23 May 2023
DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation
Susung Hong
Junyoung Seo
Heeseong Shin
Sung‐Jin Hong
Seung Wook Kim
DiffM
VGen
106
36
0
23 May 2023
Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation
Da Yin
Xiao Liu
Fan Yin
Ming Zhong
Hritik Bansal
Jiawei Han
Kai-Wei Chang
ALM
101
39
0
23 May 2023
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models
Zhongfu Chen
Kun Zhou
Beichen Zhang
Zheng Gong
Wayne Xin Zhao
Ji-Rong Wen
KELM
LRM
143
31
0
23 May 2023
RET-LLM: Towards a General Read-Write Memory for Large Language Models
Ali Modarressi
Ayyoob Imani
Mohsen Fayyaz
Hinrich Schütze
KELM
LLMAG
71
37
0
23 May 2023
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models
Cheng Qian
Chi Han
Yi R. Fung
Yujia Qin
Zhiyuan Liu
Heng Ji
LRM
113
38
0
23 May 2023
Debiasing should be Good and Bad: Measuring the Consistency of Debiasing Techniques in Language Models
Robert D Morabito
Jad Kabbara
Ali Emami
54
7
0
23 May 2023
Query Rewriting for Retrieval-Augmented Large Language Models
Xinbei Ma
Yeyun Gong
Pengcheng He
Hai Zhao
Nan Duan
KELM
LRM
113
115
0
23 May 2023
Active Learning Principles for In-Context Learning with Large Language Models
Katerina Margatina
Timo Schick
Nikolaos Aletras
Jane Dwivedi-Yu
113
44
0
23 May 2023
Hierarchical Prompting Assists Large Language Model on Web Navigation
Abishek Sridhar
Robert Lo
Frank F. Xu
Hao Zhu
Shuyan Zhou
LRM
95
35
0
23 May 2023
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Manuel Tran
Yashin Dicente Cid
Amal Lahiani
Fabian J. Theis
Tingying Peng
Eldad Klaiman
87
2
0
23 May 2023
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
Ning Ding
Yulin Chen
Bokai Xu
Yujia Qin
Zhi Zheng
Shengding Hu
Zhiyuan Liu
Maosong Sun
Bowen Zhou
ALM
178
555
0
23 May 2023
mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
Jonas Pfeiffer
Francesco Piccinno
Massimo Nicosia
Xinyi Wang
Machel Reid
Sebastian Ruder
VLM
LRM
106
31
0
23 May 2023
Question Answering as Programming for Solving Time-Sensitive Questions
Xinyu Zhu
Cheng Yang
B. Chen
Siheng Li
Jian-Guang Lou
Yujiu Yang
KELM
104
11
0
23 May 2023
Exploring Chain-of-Thought Style Prompting for Text-to-SQL
Chang-You Tai
Ziru Chen
Tianshu Zhang
Xiang Deng
Huan Sun
LRM
AI4CE
ReLM
86
57
0
23 May 2023
Skill-Based Few-Shot Selection for In-Context Learning
Shengnan An
Bo Zhou
Zeqi Lin
Qiang Fu
B. Chen
Nanning Zheng
Weizhu Chen
Jian-Guang Lou
121
34
0
23 May 2023
Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks
Tiedong Liu
K. H. Low
ALM
100
87
0
23 May 2023
DetGPT: Detect What You Need via Reasoning
Renjie Pi
Jiahui Gao
Shizhe Diao
Boyao Wang
Hanze Dong
...
Lewei Yao
Jianhua Han
Hang Xu
Lingpeng Kong Tong Zhang
Tong Zhang
LRM
LM&Ro
90
99
0
23 May 2023
Dr.ICL: Demonstration-Retrieved In-context Learning
Man Luo
Xin Xu
Zhuyun Dai
Panupong Pasupat
Mehran Kazemi
Chitta Baral
Vaiva Imbrasaite
Vincent Zhao
RALM
127
58
0
23 May 2023
When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale
Christos Baziotis
Biao Zhang
Alexandra Birch
Barry Haddow
150
2
0
23 May 2023
Better Zero-Shot Reasoning with Self-Adaptive Prompting
Xingchen Wan
Ruoxi Sun
H. Dai
Sercan O. Arik
Tomas Pfister
ReLM
OffRL
LRM
86
54
0
23 May 2023
CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
Aswanth Kumar
Ratish Puduppully
Raj Dabre
Anoop Kunchukuttan
108
13
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
144
3
0
23 May 2023
Revisiting Acceptability Judgements
Hai Hu
Ziyin Zhang
Wei-Ping Huang
J. Lai
Aini Li
Yi Ma
Jiahui Huang
Peng Zhang
Chien-Jer Charles Lin
Rui Wang
81
2
0
23 May 2023
One-stop Training of Multiple Capacity Models
Lan Jiang
Haoyang Huang
Dongdong Zhang
R. Jiang
Furu Wei
115
0
0
23 May 2023
Can Language Models Understand Physical Concepts?
Lei Li
Jingjing Xu
Qingxiu Dong
Ce Zheng
Qi Liu
Lingpeng Kong
Xu Sun
ALM
61
22
0
23 May 2023
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
Seungone Kim
Se June Joo
Doyoung Kim
Joel Jang
Seonghyeon Ye
Jamin Shin
Minjoon Seo
ALM
RALM
LRM
143
107
0
23 May 2023
Improving Language Models via Plug-and-Play Retrieval Feedback
Wenhao Yu
Zhihan Zhang
Zhenwen Liang
Meng Jiang
Ashish Sabharwal
KELM
LRM
114
78
0
23 May 2023
Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model
Leo Liu
Tim Dettmers
Xi Lin
Ves Stoyanov
Xian Li
MoE
71
10
0
23 May 2023
Generating Data for Symbolic Language with Large Language Models
Jiacheng Ye
Chengzu Li
Lingpeng Kong
Tao Yu
91
11
0
23 May 2023
Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought
Vaishnavi Himakunthala
Andy Ouyang
Daniel Philip Rose
Ryan He
Alex Mei
Yujie Lu
Chinmay Sonar
Michael Stephen Saxon
William Y. Wang
MLLM
LRM
91
2
0
23 May 2023
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
Xuekai Zhu
Biqing Qi
Kaiyan Zhang
Xingwei Long
Zhouhan Lin
Bowen Zhou
ALM
LRM
67
21
0
23 May 2023
Selective Pre-training for Private Fine-tuning
Da Yu
Sivakanth Gopi
Janardhan Kulkarni
Zinan Lin
Saurabh Naik
Tomasz Religa
Jian Yin
Huishuai Zhang
96
19
0
23 May 2023
A Trip Towards Fairness: Bias and De-Biasing in Large Language Models
Leonardo Ranaldi
Elena Sofia Ruzzetti
Davide Venditti
Dario Onorati
Fabio Massimo Zanzotto
93
37
0
23 May 2023
Previous
1
2
3
...
66
67
68
...
85
86
87
Next