Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
v1
v2
v3
v4
v5 (latest)
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
Language models are not naysayers: An analysis of language models on negation benchmarks
Thinh Hung Truong
Timothy Baldwin
Karin Verspoor
Trevor Cohn
128
60
0
14 Jun 2023
INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Yuji Chai
John Gkountouras
Glenn G. Ko
David Brooks
Gu-Yeon Wei
MQ
66
19
0
13 Jun 2023
Large-scale Language Model Rescoring on Long-form Data
Tongzhou Chen
Cyril Allauzen
Yinghui Huang
Daniel S. Park
David Rybach
...
Rodrigo Cabrera
Kartik Audhkhasi
Bhuvana Ramabhadran
Pedro J. Moreno
Michael Riley
98
16
0
13 Jun 2023
AVIS: Autonomous Visual Information Seeking with Large Language Model Agent
Ziniu Hu
Ahmet Iscen
Chen Sun
Kai-Wei Chang
Yizhou Sun
David A. Ross
Cordelia Schmid
Alireza Fathi
92
11
0
13 Jun 2023
Generating Images with 3D Annotations Using Diffusion Models
Wufei Ma
Qihao Liu
Jiahao Wang
Angtian Wang
Xiaoding Yuan
...
Ruxiao Duan
Yongrui Qi
Adam Kortylewski
Yaoyao Liu
Alan Yuille
DiffM
86
5
0
13 Jun 2023
FLamE: Few-shot Learning from Natural Language Explanations
Yangqiaoyu Zhou
Yiming Zhang
Chenhao Tan
LRM
FAtt
97
11
0
13 Jun 2023
BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information
Mehran Kazemi
Quan Yuan
Deepti Bhatia
Najoung Kim
Xin Xu
Vaiva Imbrasaite
Deepak Ramachandran
LRM
108
50
0
13 Jun 2023
Image Captioners Are Scalable Vision Learners Too
Michael Tschannen
Manoj Kumar
Andreas Steiner
Xiaohua Zhai
N. Houlsby
Lucas Beyer
VLM
CLIP
116
60
0
13 Jun 2023
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Xiao Liu
Hanyu Lai
Hao Yu
Yifan Xu
Aohan Zeng
Zhengxiao Du
Peng Zhang
Yuxiao Dong
Jie Tang
80
105
0
13 Jun 2023
ReadProbe: A Demo of Retrieval-Enhanced Large Language Models to Support Lateral Reading
Dake Zhang
Ronak Pradeep
RALM
30
2
0
13 Jun 2023
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
Longtao Zheng
Rongpin Wang
Xinrun Wang
Bo An
LLMAG
105
73
0
13 Jun 2023
Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models
Yin Fang
Xiaozhuan Liang
Ningyu Zhang
Kangwei Liu
Rui Huang
Zhuo Chen
Xiaohui Fan
Huajun Chen
139
88
0
13 Jun 2023
SqueezeLLM: Dense-and-Sparse Quantization
Sehoon Kim
Coleman Hooper
A. Gholami
Zhen Dong
Xiuyu Li
Sheng Shen
Michael W. Mahoney
Kurt Keutzer
MQ
173
198
0
13 Jun 2023
SayTap: Language to Quadrupedal Locomotion
Yujin Tang
Wenhao Yu
Jie Tan
Heiga Zen
Aleksandra Faust
Tatsuya Harada
108
43
0
13 Jun 2023
Lost in Translation: Large Language Models in Non-English Content Analysis
Gabriel Nicholas
Aliya Bhatia
ELM
96
39
0
12 Jun 2023
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
Wenqi Zhang
Yongliang Shen
Weiming Lu
Yueting Zhuang
LLMAG
140
56
0
12 Jun 2023
InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions
Jiale Xu
Xintao Wang
Yannan Cao
Weihao Cheng
Ying Shan
Shenghua Gao
DiffM
88
11
0
12 Jun 2023
Gradient Ascent Post-training Enhances Language Model Generalization
Dongkeun Yoon
Joel Jang
Sungdong Kim
Minjoon Seo
VLM
AI4CE
82
3
0
12 Jun 2023
Recursion of Thought: A Divide-and-Conquer Approach to Multi-Context Reasoning with Language Models
Soochan Lee
Gunhee Kim
ReLM
LRM
65
27
0
12 Jun 2023
Recurrent Attention Networks for Long-text Modeling
Xianming Li
Zongxi Li
Xiaotian Luo
Haoran Xie
Xing Lee
Yingbin Zhao
Fu Lee Wang
Qing Li
RALM
102
15
0
12 Jun 2023
Valley: Video Assistant with Large Language model Enhanced abilitY
Ruipu Luo
Ziwang Zhao
Min Yang
Junwei Dong
Da Li
Pengcheng Lu
Tao Wang
Linmei Hu
Ming-Hui Qiu
MLLM
152
209
0
12 Jun 2023
AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing
Asaad Alghamdi
Xinyu Duan
Wei Jiang
Zhenhai Wang
Yimeng Wu
...
Yifei Zheng
Mehdi Rezagholizadeh
Baoxing Huai
Peilun Cheng
Abbas Ghaddar
VLM
64
9
0
11 Jun 2023
Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability
Jiacheng Ye
Xijia Tao
Lingpeng Kong
LRM
77
27
0
11 Jun 2023
Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
William Chen
Xuankai Chang
Yifan Peng
Zhaoheng Ni
Soumi Maiti
Shinji Watanabe
SSL
95
27
0
11 Jun 2023
Empowering Molecule Discovery for Molecule-Caption Translation with Large Language Models: A ChatGPT Perspective
Jiatong Li
Yunqing Liu
Wenqi Fan
Xiao Wei
Hui Liu
Jiliang Tang
Qing Li
104
96
0
11 Jun 2023
14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon
Kevin Maik Jablonka
Qianxiang Ai
Alexander H Al-Feghali
S. Badhwar
Joshua D. Bocarsly Andres M Bran
...
Aristana Scourtas
K. J. Schmidt
Ian Foster
Andrew D. White
Ben Blaiszik
172
111
0
09 Jun 2023
Measuring and Modifying Factual Knowledge in Large Language Models
Pouya Pezeshkpour
KELM
77
18
0
09 Jun 2023
Value function estimation using conditional diffusion models for control
Bogdan Mazoure
Walter A. Talbott
Miguel Angel Bautista
R. Devon Hjelm
Alexander Toshev
J. Susskind
DiffM
83
4
0
09 Jun 2023
The Age of Synthetic Realities: Challenges and Opportunities
J. P. Cardenuto
Jing Yang
Rafael Padilha
Renjie Wan
Daniel Moreira
Haoliang Li
Shiqi Wang
Fernanda A. Andaló
Sébastien Marcel
Anderson de Rezende Rocha
DeLMO
121
30
0
09 Jun 2023
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?
Wissam Antoun
Virginie Mouilleron
Benoît Sagot
Djamé Seddah
DeLMO
85
33
0
09 Jun 2023
How Can Recommender Systems Benefit from Large Language Models: A Survey
Jianghao Lin
Xinyi Dai
Yunjia Xi
Weiwen Liu
Bo Chen
...
Chenxu Zhu
Huifeng Guo
Yong Yu
Ruiming Tang
Weinan Zhang
LRM
193
224
0
09 Jun 2023
On the Challenges and Perspectives of Foundation Models for Medical Image Analysis
Shaoting Zhang
Dimitris N. Metaxas
LM&MA
VLM
MedIm
AI4CE
106
156
0
09 Jun 2023
Customizing General-Purpose Foundation Models for Medical Report Generation
Bang-ju Yang
Asif Raza
Yuexian Zou
Tong Zhang
MedIm
91
11
0
09 Jun 2023
The economic trade-offs of large language models: A case study
Kristen Howell
Gwen Christian
P. Fomitchov
Gitit Kehat
Julianne Marzulla
Leanne Rolston
Jadin Tredup
Ilana Zimmerman
Ethan Selfridge
Joe Bradley
39
1
0
08 Jun 2023
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization
Yidong Wang
Zhuohao Yu
Zhengran Zeng
Linyi Yang
Cunxiang Wang
...
Jindong Wang
Xingxu Xie
Wei Ye
Shi-Bo Zhang
Yue Zhang
ALM
ELM
175
249
0
08 Jun 2023
DLAMA: A Framework for Curating Culturally Diverse Facts for Probing the Knowledge of Pretrained Language Models
Amr Keleg
Walid Magdy
KELM
HILM
97
11
0
08 Jun 2023
ModuleFormer: Modularity Emerges from Mixture-of-Experts
Songlin Yang
Zheyu Zhang
Tianyou Cao
Shawn Tan
Zhenfang Chen
Chuang Gan
KELM
MoE
79
10
0
07 Jun 2023
The Two Word Test: A Semantic Benchmark for Large Language Models
Nicholas Riccardi
Rutvik H. Desai
ELM
57
5
0
07 Jun 2023
Fine-Grained Visual Prompting
Lingfeng Yang
Yueze Wang
Xiang Li
Xinlong Wang
Jian Yang
ObjD
VLM
126
68
0
07 Jun 2023
Benchmarking Foundation Models with Language-Model-as-an-Examiner
Yushi Bai
Jiahao Ying
Yixin Cao
Xin Lv
Yuze He
...
Yijia Xiao
Haozhe Lyu
Jiayin Zhang
Juanzi Li
Lei Hou
ALM
ELM
130
149
0
07 Jun 2023
World Models for Math Story Problems
Andreas Opedal
Niklas Stoehr
Abulhair Saparov
Mrinmaya Sachan
ReLM
124
13
0
07 Jun 2023
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments
Xiulong Liu
Sudipta Paul
Moitreya Chatterjee
A. Cherian
82
9
0
06 Jun 2023
Certified Deductive Reasoning with Language Models
Gabriel Poesia
Kanishk Gandhi
E. Zelikman
Noah D. Goodman
ELM
ReLM
LRM
87
0
0
06 Jun 2023
Triggering Multi-Hop Reasoning for Question Answering in Language Models using Soft Prompts and Random Walks
Kanishka Misra
Cicero Nogueira dos Santos
Siamak Shakeri
KELM
LRM
73
2
0
06 Jun 2023
Deductive Verification of Chain-of-Thought Reasoning
Z. Ling
Yunhao Fang
Xuanlin Li
Zhiao Huang
Mingu Lee
Roland Memisevic
Hao Su
ReLM
LRM
121
136
0
06 Jun 2023
Iterative Translation Refinement with Large Language Models
Pinzhen Chen
Zhicheng Guo
Barry Haddow
Kenneth Heafield
LRM
73
23
0
06 Jun 2023
The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter
Ajay Jaiswal
Shiwei Liu
Tianlong Chen
Zhangyang Wang
VLM
93
34
0
06 Jun 2023
Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models
Fobo Shi
Peijun Qing
Ke Wang
Nan Wang
Youbo Lei
H. Lu
Xiaodong Lin
Duantengchuan Li
VLM
ReLM
LLMAG
LRM
95
12
0
06 Jun 2023
Early Weight Averaging meets High Learning Rates for LLM Pre-training
Sunny Sanyal
A. Neerkaje
Jean Kaddour
Abhishek Kumar
Sujay Sanghavi
MoMe
102
19
0
05 Jun 2023
RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems
Tianyang Liu
Canwen Xu
Julian McAuley
ALM
116
171
0
05 Jun 2023
Previous
1
2
3
...
62
63
64
...
85
86
87
Next