ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways
v1v2v3v4v5 (latest)

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILMLRM
ArXiv (abs)PDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,332 papers shown
Title
Distilling Reasoning Capabilities into Smaller Language Models
Distilling Reasoning Capabilities into Smaller Language Models
Kumar Shridhar
Alessandro Stolfo
Mrinmaya Sachan
LRMReLM
134
176
0
01 Dec 2022
Explicit Knowledge Transfer for Weakly-Supervised Code Generation
Explicit Knowledge Transfer for Weakly-Supervised Code Generation
Zhangir Azerbayev
Ansong Ni
Hailey Schoelkopf
Dragomir R. Radev
49
5
0
30 Nov 2022
An Efficient Split Fine-tuning Framework for Edge and Cloud
  Collaborative Learning
An Efficient Split Fine-tuning Framework for Edge and Cloud Collaborative Learning
Shaoshuai Shi
Qing Yang
Yang Xiang
Shuhan Qi
Xinyu Wang
82
1
0
30 Nov 2022
Chaining Simultaneous Thoughts for Numerical Reasoning
Chaining Simultaneous Thoughts for Numerical Reasoning
Zhihong Shao
Fei Huang
Minlie Huang
AIMatAI4CE
71
18
0
29 Nov 2022
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information
Arnold Overwijk
Chenyan Xiong
X. Liu
Cameron VandenBerg
Jamie Callan
3DV
41
16
0
29 Nov 2022
What learning algorithm is in-context learning? Investigations with
  linear models
What learning algorithm is in-context learning? Investigations with linear models
Ekin Akyürek
Dale Schuurmans
Jacob Andreas
Tengyu Ma
Denny Zhou
138
493
0
28 Nov 2022
Fine-tuning language models to find agreement among humans with diverse
  preferences
Fine-tuning language models to find agreement among humans with diverse preferences
Michiel A. Bakker
Martin Chadwick
Hannah R. Sheahan
Michael Henry Tessler
Lucy Campbell-Gillingham
...
Nat McAleese
Amelia Glaese
John Aslanides
M. Botvinick
Christopher Summerfield
ALM
112
237
0
28 Nov 2022
Understanding BLOOM: An empirical study on diverse NLP tasks
Understanding BLOOM: An empirical study on diverse NLP tasks
Parag Dakle
Sai Krishna Rallabandi
Preethi Raghavan
AI4CE
96
4
0
27 Nov 2022
Solving math word problems with process- and outcome-based feedback
Solving math word problems with process- and outcome-based feedback
J. Uesato
Nate Kushman
Ramana Kumar
Francis Song
Noah Y. Siegel
L. Wang
Antonia Creswell
G. Irving
I. Higgins
FaMLReLMAIMatLRM
147
362
0
25 Nov 2022
PipeFisher: Efficient Training of Large Language Models Using Pipelining
  and Fisher Information Matrices
PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices
Kazuki Osawa
Shigang Li
Torsten Hoefler
AI4CE
89
26
0
25 Nov 2022
Complementary Explanations for Effective In-Context Learning
Complementary Explanations for Effective In-Context Learning
Xi Ye
Srini Iyer
Asli Celikyilmaz
Ves Stoyanov
Greg Durrett
Ramakanth Pasunuru
ReLMLRM
116
96
0
25 Nov 2022
Signed Binary Weight Networks
Sachit Kuhar
Alexey Tumanov
Judy Hoffman
MQ
89
1
0
25 Nov 2022
GitHub Considered Harmful? Analyzing Open-Source Projects for the
  Automatic Generation of Cryptographic API Call Sequences
GitHub Considered Harmful? Analyzing Open-Source Projects for the Automatic Generation of Cryptographic API Call Sequences
Catherine Tony
Nicolás E. Díaz Ferreyra
Riccardo Scandariato
59
4
0
24 Nov 2022
Automatic Generation of Socratic Subquestions for Teaching Math Word
  Problems
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems
Kumar Shridhar
Jakub Macina
Mennatallah El-Assady
Tanmay Sinha
Manu Kapur
Mrinmaya Sachan
AIMat
104
49
0
23 Nov 2022
Program of Thoughts Prompting: Disentangling Computation from Reasoning
  for Numerical Reasoning Tasks
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
Wenhu Chen
Xueguang Ma
Xinyi Wang
William W. Cohen
ReLMReCodLRM
406
829
0
22 Nov 2022
HyperTuning: Toward Adapting Large Language Models without
  Back-propagation
HyperTuning: Toward Adapting Large Language Models without Back-propagation
Jason Phang
Yi Mao
Pengcheng He
Weizhu Chen
96
34
0
22 Nov 2022
TEMPERA: Test-Time Prompting via Reinforcement Learning
TEMPERA: Test-Time Prompting via Reinforcement Learning
Tianjun Zhang
Xuezhi Wang
Denny Zhou
Dale Schuurmans
Joseph E. Gonzalez
VLM
63
39
0
21 Nov 2022
Deanthropomorphising NLP: Can a Language Model Be Conscious?
Deanthropomorphising NLP: Can a Language Model Be Conscious?
Matthew Shardlow
Piotr Przybyła
77
7
0
21 Nov 2022
A Theory of Unsupervised Translation Motivated by Understanding Animal
  Communication
A Theory of Unsupervised Translation Motivated by Understanding Animal Communication
S. Goldwasser
David F. Gruber
Adam Tauman Kalai
Orr Paradise
83
3
0
20 Nov 2022
The Stack: 3 TB of permissively licensed source code
The Stack: 3 TB of permissively licensed source code
Denis Kocetkov
Raymond Li
Loubna Ben Allal
Jia Li
Chenghao Mou
...
Sean M. Hughes
Thomas Wolf
Dzmitry Bahdanau
Leandro von Werra
H. D. Vries
111
339
0
20 Nov 2022
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large
  Language Models
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Guangxuan Xiao
Ji Lin
Mickael Seznec
Hao Wu
Julien Demouth
Song Han
MQ
273
847
0
18 Nov 2022
PAL: Program-aided Language Models
PAL: Program-aided Language Models
Luyu Gao
Aman Madaan
Shuyan Zhou
Uri Alon
Pengfei Liu
Yiming Yang
Jamie Callan
Graham Neubig
ReLMLRM
147
462
0
18 Nov 2022
VeLO: Training Versatile Learned Optimizers by Scaling Up
VeLO: Training Versatile Learned Optimizers by Scaling Up
Luke Metz
James Harrison
C. Freeman
Amil Merchant
Lucas Beyer
...
Naman Agrawal
Ben Poole
Igor Mordatch
Adam Roberts
Jascha Narain Sohl-Dickstein
143
60
0
17 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
66
22
0
16 Nov 2022
Prompting PaLM for Translation: Assessing Strategies and Performance
Prompting PaLM for Translation: Assessing Strategies and Performance
David Vilar
Markus Freitag
Colin Cherry
Jiaming Luo
Viresh Ratnakar
George F. Foster
LRM
122
167
0
16 Nov 2022
Towards Computationally Verifiable Semantic Grounding for Language
  Models
Towards Computationally Verifiable Semantic Grounding for Language Models
Chris Alberti
Kuzman Ganchev
Michael Collins
Sebastian Gehrmann
Ciprian Chelba
50
0
0
16 Nov 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala
Gabriel Dulac-Arnold
Julien Mairal
Jean Ponce
Cordelia Schmid
OffRL
79
27
0
16 Nov 2022
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
Xingcheng Xu
83
0
0
16 Nov 2022
On the Compositional Generalization Gap of In-Context Learning
On the Compositional Generalization Gap of In-Context Learning
Arian Hosseini
Ankit Vani
Dzmitry Bahdanau
Alessandro Sordoni
Rameswar Panda
81
25
0
15 Nov 2022
QAmeleon: Multilingual QA with Only 5 Examples
QAmeleon: Multilingual QA with Only 5 Examples
Priyanka Agrawal
Chris Alberti
Fantine Huot
Joshua Maynez
Ji Ma
Sebastian Ruder
Kuzman Ganchev
Dipanjan Das
Mirella Lapata
75
30
0
15 Nov 2022
A Universal Discriminator for Zero-Shot Generalization
A Universal Discriminator for Zero-Shot Generalization
Haike Xu
Zongyu Lin
Jing Zhou
Yanan Zheng
Zhilin Yang
AI4CE
66
16
0
15 Nov 2022
Teaching Algorithmic Reasoning via In-context Learning
Teaching Algorithmic Reasoning via In-context Learning
Hattie Zhou
Azade Nova
Hugo Larochelle
Rameswar Panda
Behnam Neyshabur
Hanie Sedghi
LRMReLM
125
117
0
15 Nov 2022
Evaluating How Fine-tuning on Bimodal Data Effects Code Generation
Evaluating How Fine-tuning on Bimodal Data Effects Code Generation
Gabriel Orlanski
Seonhye Yang
Michael Healy
ALM
57
5
0
15 Nov 2022
Prompting Language Models for Linguistic Structure
Prompting Language Models for Linguistic Structure
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
127
44
0
15 Nov 2022
General Intelligence Requires Rethinking Exploration
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktaschel
Edward Grefenstette
LRM
83
20
0
15 Nov 2022
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Ippei Fujisawa
Ryota Kanai
ELMLRM
76
4
0
14 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at
  Scale
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLMCLIP
269
730
0
14 Nov 2022
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum
  Bayes Risk Decoding
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
Mirac Suzgun
Luke Melas-Kyriazi
Dan Jurafsky
81
46
0
14 Nov 2022
UGIF: UI Grounded Instruction Following
UGIF: UI Grounded Instruction Following
S. Venkatesh
Partha P. Talukdar
S. Narayanan
140
12
0
14 Nov 2022
High-Resource Methodological Bias in Low-Resource Investigations
High-Resource Methodological Bias in Low-Resource Investigations
Maartje ter Hoeve
David Grangier
Natalie Schluter
78
2
0
14 Nov 2022
Are Hard Examples also Harder to Explain? A Study with Human and
  Model-Generated Explanations
Are Hard Examples also Harder to Explain? A Study with Human and Model-Generated Explanations
Swarnadeep Saha
Peter Hase
Nazneen Rajani
Joey Tianyi Zhou
LRM
70
15
0
14 Nov 2022
Towards Robust Numerical Question Answering: Diagnosing Numerical
  Capabilities of NLP Systems
Towards Robust Numerical Question Answering: Diagnosing Numerical Capabilities of NLP Systems
Jialiang Xu
Mengyu Zhou
Xinyi He
Shi Han
Dongmei Zhang
AAMLELMAIMat
58
6
0
14 Nov 2022
Metaphors We Learn By
Metaphors We Learn By
Roland Memisevic
68
0
0
11 Nov 2022
The CRINGE Loss: Learning what language not to model
The CRINGE Loss: Learning what language not to model
Leonard Adolphs
Tianyu Gao
Jing Xu
Kurt Shuster
Sainbayar Sukhbaatar
Jason Weston
MU
95
37
0
10 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with
  Deformable Convolutions
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
190
700
0
10 Nov 2022
Collateral facilitation in humans and language models
Collateral facilitation in humans and language models
J. Michaelov
Benjamin Bergen
118
11
0
09 Nov 2022
Grammatical Error Correction: A Survey of the State of the Art
Grammatical Error Correction: A Survey of the State of the Art
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
91
88
0
09 Nov 2022
Large Language Models with Controllable Working Memory
Large Language Models with Controllable Working Memory
Daliang Li
A. S. Rawat
Manzil Zaheer
Xin Wang
Michal Lukasik
Andreas Veit
Felix X. Yu
Surinder Kumar
KELM
146
171
0
09 Nov 2022
Efficiently Scaling Transformer Inference
Efficiently Scaling Transformer Inference
Reiner Pope
Sholto Douglas
Aakanksha Chowdhery
Jacob Devlin
James Bradbury
Anselm Levskaya
Jonathan Heek
Kefan Xiao
Shivani Agrawal
J. Dean
124
327
0
09 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
510
2,401
0
09 Nov 2022
Previous
123...798081...858687
Next