ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways
v1v2v3v4v5 (latest)

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILMLRM
ArXiv (abs)PDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,332 papers shown
Title
Can neural networks do arithmetic? A survey on the elementary numerical
  skills of state-of-the-art deep learning models
Can neural networks do arithmetic? A survey on the elementary numerical skills of state-of-the-art deep learning models
Alberto Testolin
AIMat
72
22
0
14 Mar 2023
Query2doc: Query Expansion with Large Language Models
Query2doc: Query Expansion with Large Language Models
Liang Wang
Nan Yang
Furu Wei
141
113
0
14 Mar 2023
The Life Cycle of Knowledge in Big Language Models: A Survey
The Life Cycle of Knowledge in Big Language Models: A Survey
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
KELM
95
29
0
14 Mar 2023
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on
  Consistency with Human Preferences
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences
Yunjie Ji
Yan Gong
Yiping Peng
Chao Ni
Peiyan Sun
Dongyu Pan
Baochang Ma
Xiangang Li
ELMALMAI4MH
76
38
0
14 Mar 2023
Architext: Language-Driven Generative Architecture Design
Architext: Language-Driven Generative Architecture Design
Theodoros Galanos
Antonios Liapis
Georgios N. Yannakakis
VLMAI4CE
80
6
0
13 Mar 2023
FlexGen: High-Throughput Generative Inference of Large Language Models
  with a Single GPU
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng
Lianmin Zheng
Binhang Yuan
Zhuohan Li
Max Ryabinin
...
Joseph E. Gonzalez
Percy Liang
Christopher Ré
Ion Stoica
Ce Zhang
225
404
0
13 Mar 2023
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched
  Visual Descriptions
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions
Deyao Zhu
Jun Chen
Kilichbek Haydarov
Xiaoqian Shen
Wenxuan Zhang
Mohamed Elhoseiny
MLLM
100
106
0
12 Mar 2023
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and
  Multilingual Natural Language Generation
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
Bang-ju Yang
Fenglin Liu
Yuexian Zou
Xian Wu
Yaowei Wang
David Clifton
88
9
0
11 Mar 2023
Parachute: Evaluating Interactive Human-LM Co-writing Systems
Parachute: Evaluating Interactive Human-LM Co-writing Systems
Hua Shen
Tongshuang Wu
KELM
52
16
0
11 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
95
47
0
10 Mar 2023
ChatGPT may Pass the Bar Exam soon, but has a Long Way to Go for the
  LexGLUE benchmark
ChatGPT may Pass the Bar Exam soon, but has a Long Way to Go for the LexGLUE benchmark
Ilias Chalkidis
ALMELMAI4MHAILaw
85
45
0
09 Mar 2023
Greener yet Powerful: Taming Large Code Generation Models with
  Quantization
Greener yet Powerful: Taming Large Code Generation Models with Quantization
Xiaokai Wei
Sujan Kumar Gonugondla
W. Ahmad
Shiqi Wang
Baishakhi Ray
...
Ben Athiwaratkun
Mingyue Shang
M. K. Ramanathan
Parminder Bhatia
Bing Xiang
MQ
61
6
0
09 Mar 2023
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for
  Document Information Extraction
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
Jiabang He
Lei Wang
Yingpeng Hu
Ning Liu
Hui-juan Liu
Xingdong Xu
Hengtao Shen
MLLM
80
46
0
09 Mar 2023
Baldur: Whole-Proof Generation and Repair with Large Language Models
Baldur: Whole-Proof Generation and Repair with Large Language Models
E. First
M. Rabe
Talia Ringer
Yuriy Brun
147
107
0
08 Mar 2023
A Prompt Log Analysis of Text-to-Image Generation Systems
A Prompt Log Analysis of Text-to-Image Generation Systems
Yutong Xie
Zhaoying Pan
Jing Ma
Jie Luo
Qiaozhu Mei
DiffM
166
43
0
08 Mar 2023
Streaming Kernel PCA Algorithm With Small Space
Streaming Kernel PCA Algorithm With Small Space
Yichuan Deng
Zhao Song
Zifan Wang
Hangke Zhang
114
4
0
08 Mar 2023
MenuCraft: Interactive Menu System Design with Large Language Models
MenuCraft: Interactive Menu System Design with Large Language Models
Amir Hossein Kargaran
Nafiseh Nikeghbal
Abbas Heydarnoori
Hinrich Schütze
LLMAG
82
4
0
08 Mar 2023
Does Synthetic Data Generation of LLMs Help Clinical Text Mining?
Does Synthetic Data Generation of LLMs Help Clinical Text Mining?
Ruixiang Tang
Xiaotian Han
Xiaoqian Jiang
Helen Zhou
LM&MAAI4MHSyDa
104
186
0
08 Mar 2023
Provable Pathways: Learning Multiple Tasks over Multiple Paths
Provable Pathways: Learning Multiple Tasks over Multiple Paths
Yingcong Li
Samet Oymak
MoE
73
4
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
120
554
0
07 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
SemEval-2023 Task 10: Explainable Detection of Online Sexism
Hannah Rose Kirk
Wenjie Yin
Bertie Vidgen
Paul Röttger
85
122
0
07 Mar 2023
Foundation Models for Decision Making: Problems, Methods, and
  Opportunities
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&RoOffRLLRMAI4CE
206
172
0
07 Mar 2023
Larger language models do in-context learning differently
Larger language models do in-context learning differently
Jerry W. Wei
Jason W. Wei
Yi Tay
Dustin Tran
Albert Webson
...
Xinyun Chen
Hanxiao Liu
Da Huang
Denny Zhou
Tengyu Ma
ReLMLRM
125
374
0
07 Mar 2023
Large Language Models as Zero-Shot Human Models for Human-Robot
  Interaction
Large Language Models as Zero-Shot Human Models for Human-Robot Interaction
Bowen Zhang
Harold Soh
LLMAGLM&Ro
155
44
0
06 Mar 2023
Towards provably efficient quantum algorithms for large-scale
  machine-learning models
Towards provably efficient quantum algorithms for large-scale machine-learning models
Junyu Liu
Minzhao Liu
Jin-Peng Liu
Ziyu Ye
Yunfei Wang
Yuri Alexeev
Jens Eisert
Liang Jiang
162
61
0
06 Mar 2023
PaLM-E: An Embodied Multimodal Language Model
PaLM-E: An Embodied Multimodal Language Model
Danny Driess
F. Xia
Mehdi S. M. Sajjadi
Corey Lynch
Aakanksha Chowdhery
...
Marc Toussaint
Klaus Greff
Andy Zeng
Igor Mordatch
Peter R. Florence
LM&Ro
166
1,679
0
06 Mar 2023
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code
  Understanding, Generation, Translation and Retrieval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Mohammad Abdullah Matin Khan
M Saiful Bari
Xuan Long Do
Weishi Wang
Md. Rizwan Parvez
Shafiq Joty
ALMELM
124
23
0
06 Mar 2023
FoundationTTS: Text-to-Speech for ASR Customization with Generative
  Language Model
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model
Rui Xue
Yanqing Liu
Lei He
Xuejiao Tan
Linquan Liu
Ed Lin
Sheng Zhao
120
7
0
06 Mar 2023
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations
  and Infographics using Large Language Models
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Victor C. Dibia
VLM
139
92
0
06 Mar 2023
Prismer: A Vision-Language Model with Multi-Task Experts
Prismer: A Vision-Language Model with Multi-Task Experts
Shikun Liu
Linxi Fan
Edward Johns
Zhiding Yu
Chaowei Xiao
Anima Anandkumar
VLMMLLM
145
25
0
04 Mar 2023
MathPrompter: Mathematical Reasoning using Large Language Models
MathPrompter: Mathematical Reasoning using Large Language Models
Shima Imani
Liang Du
H. Shrivastava
KELMReLMLRM
117
214
0
04 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu
Tianlong Chen
Zhenyu Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zhangyang Wang
87
28
0
03 Mar 2023
Investigating the Translation Performance of a Large Multilingual
  Language Model: the Case of BLOOM
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
Rachel Bawden
François Yvon
VLMLRM
90
65
0
03 Mar 2023
Ada-Grouper: Accelerating Pipeline Parallelism in Preempted Network by
  Adaptive Group-Scheduling for Micro-Batches
Ada-Grouper: Accelerating Pipeline Parallelism in Preempted Network by Adaptive Group-Scheduling for Micro-Batches
Siyu Wang
Zongyan Cao
Chang Si
Lansong Diao
Jiamang Wang
W. Lin
55
0
0
03 Mar 2023
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable
  Transformers
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers
Tianlong Chen
Zhenyu Zhang
Ajay Jaiswal
Shiwei Liu
Zhangyang Wang
MoE
121
50
0
02 Mar 2023
Grounded Decoding: Guiding Text Generation with Grounded Models for
  Embodied Agents
Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents
Wenlong Huang
Fei Xia
Dhruv Shah
Danny Driess
Andy Zeng
...
Pete Florence
Igor Mordatch
Sergey Levine
Karol Hausman
Brian Ichter
LM&Ro
91
49
0
01 Mar 2023
R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility
  Across Random User Intents
R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents
Daniel D. Johnson
Daniel Tarlow
Christian J. Walder
77
6
0
01 Mar 2023
A Framework for Neurosymbolic Robot Action Planning using Large Language
  Models
A Framework for Neurosymbolic Robot Action Planning using Large Language Models
Alessio Capitanelli
Fulvio Mastrogiovanni
LM&RoLLMAG
80
7
0
01 Mar 2023
Competence-Based Analysis of Language Models
Competence-Based Analysis of Language Models
Adam Davies
Jize Jiang
Chengxiang Zhai
ELM
71
5
0
01 Mar 2023
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language
  Understanding Tasks
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks
Xuanting Chen
Junjie Ye
Can Zu
Nuo Xu
Rui Zheng
Minlong Peng
Jie Zhou
Tao Gui
Qi Zhang
Xuanjing Huang
AI4MHELM
72
83
0
01 Mar 2023
The Trade-off between Universality and Label Efficiency of
  Representations from Contrastive Learning
The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning
Zhenmei Shi
Jiefeng Chen
Kunyang Li
Jayaram Raghuram
Xi Wu
Yingyu Liang
S. Jha
SSL
79
20
0
28 Feb 2023
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for
  Instruction Following
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
Seonghyeon Ye
Hyeonbin Hwang
Sohee Yang
Hyeongu Yun
Yireun Kim
Minjoon Seo
LRM
100
39
0
28 Feb 2023
A Study of Comfortability between Interactive AI and Human
A Study of Comfortability between Interactive AI and Human
Yi Ru Wang
Jiafei Duan
Sidharth Talia
Hao Zhu
111
1
0
28 Feb 2023
Task-Oriented Grasp Prediction with Visual-Language Inputs
Task-Oriented Grasp Prediction with Visual-Language Inputs
Chao Tang
Dehao Huang
Lingxiao Meng
Weiyu Liu
Kuanqi Cai
75
37
0
28 Feb 2023
Language Is Not All You Need: Aligning Perception with Language Models
Language Is Not All You Need: Aligning Perception with Language Models
Shaohan Huang
Li Dong
Wenhui Wang
Y. Hao
Saksham Singhal
...
Johan Bjorck
Vishrav Chaudhary
Subhojit Som
Xia Song
Furu Wei
VLMLRMMLLM
146
567
0
27 Feb 2023
Full Stack Optimization of Transformer Inference: a Survey
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
167
106
0
27 Feb 2023
LLaMA: Open and Efficient Foundation Language Models
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALMPILM
1.8K
13,560
0
27 Feb 2023
Fast Attention Requires Bounded Entries
Fast Attention Requires Bounded Entries
Josh Alman
Zhao Song
116
86
0
26 Feb 2023
On pitfalls (and advantages) of sophisticated large language models
On pitfalls (and advantages) of sophisticated large language models
A. Strasser
82
14
0
25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
AugGPT: Leveraging ChatGPT for Text Data Augmentation
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Quanzheng Li
Dinggang Shen
Tianming Liu
Xiang Li
143
161
0
25 Feb 2023
Previous
123...747576...858687
Next