ResearchTrend.AI
PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM LRM

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,332 papers shown
Who Needs Decoders? Efficient Estimation of Sequence-level Attributes
Yassir Fathullah
Puria Radmard
Adian Liusie
Mark Gales
OODD
74
1
0
09 May 2023
The Current State of Summarization
Fabian Retkowski
83
6
0
08 May 2023
How Do In-Context Examples Affect Compositional Generalization?
Shengnan An
Zeqi Lin
Qiang Fu
B. Chen
Nanning Zheng
Jian-Guang Lou
Dongmei Zhang
126
55
0
08 May 2023
Augmented Large Language Models with Parametric Knowledge Guiding
Ziyang Luo
Can Xu
Pu Zhao
Xiubo Geng
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
KELM RALM
117
47
0
08 May 2023
Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
Gibbeum Lee
Volker Hartmann
Jongho Park
Dimitris Papailiopoulos
Kangwook Lee
88
67
0
08 May 2023
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
Lei Wang
Wanyu Xu
Yihuai Lan
Zhiqiang Hu
Yunshi Lan
Roy Ka-wei Lee
Ee-Peng Lim
ReLM LRM
158
358
0
06 May 2023
Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects
Kehui Tan
Tianqi Pang
Chenyou Fan
Song Yu
70
16
0
05 May 2023
Using ChatGPT for Entity Matching
Ralph Peeters
Christian Bizer
AI4MH
113
29
0
05 May 2023
Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming
Hanlin Zhang
Jiani Huang
Ziyang Li
Mayur Naik
Eric P. Xing
ReLM LRM
87
28
0
05 May 2023
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework
Ruochen Zhao
Xingxuan Li
Shafiq Joty
Chengwei Qin
Lidong Bing
LRM KELM
102
170
0
05 May 2023
Neuromodulation Gated Transformer
Kobe Knowles
Joshua Bensemann
Diana Benavides-Prado
Vithya Yogarajan
Michael Witbrock
Gillian Dobbie
Yang Chen
70
0
0
05 May 2023
LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics
Shervin Ardeshir
73
0
0
04 May 2023
Generating Virtual On-body Accelerometer Data from Virtual Textual Descriptions for Human Activity Recognition
Zi-Jian Leng
Hyeokhyen Kwon
Thomas Plötz
62
21
0
04 May 2023
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Jinyang Li
Binyuan Hui
Ge Qu
Jiaxi Yang
Binhua Li
...
Guoliang Li
Kevin C. C. Chang
Fei Huang
Reynold Cheng
Yongbin Li
LMTD
186
422
0
04 May 2023
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Zhiqing Sun
Yikang Shen
Qinhong Zhou
Hongxin Zhang
Zhenfang Chen
David D. Cox
Yiming Yang
Chuang Gan
SyDa ALM
163
339
0
04 May 2023
The Application of Affective Measures in Text-based Emotion Aware Recommender Systems
John Kalung Leung
Igor Griva
W. Kennedy
J. Kinser
Sohyun Park
Seoyoon Lee
120
2
0
04 May 2023
Should ChatGPT and Bard Share Revenue with Their Data Providers? A New Business Model for the AI Era
Dong Zhang
46
3
0
04 May 2023
AutoML-GPT: Automatic Machine Learning with GPT
Shujian Zhang
Chengyue Gong
Lemeng Wu
Xingchao Liu
Mi Zhou
LLMAG
149
67
0
04 May 2023
Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs
Deepak Narayanan
Keshav Santhanam
Peter Henderson
Rishi Bommasani
Tony Lee
Percy Liang
194
3
0
03 May 2023
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Cheng-Yu Hsieh
Chun-Liang Li
Chih-Kuan Yeh
Hootan Nakhost
Yasuhisa Fujii
Alexander Ratner
Ranjay Krishna
Chen-Yu Lee
Tomas Pfister
ALM
373
563
0
03 May 2023
GPT-RE: In-context Learning for Relation Extraction using Large Language Models
Zhen Wan
Fei Cheng
Zhuoyuan Mao
Qianying Liu
Haiyue Song
Jiwei Li
Sadao Kurohashi
LRM
117
94
0
03 May 2023
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
Nitay Calderon
Subhabrata Mukherjee
Roi Reichart
Amir Kantor
102
17
0
03 May 2023
Psychologically-Inspired Causal Prompts
Zhiheng Lyu
Zhijing Jin
Justus Mattern
Rada Mihalcea
Mrinmaya Sachan
Bernhard Schoelkopf
CML
70
0
0
02 May 2023
Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner
Zhengxiang Shi
Aldo Lipani
VLM CLL
94
22
0
02 May 2023
Finding Neurons in a Haystack: Case Studies with Sparse Probing
Wes Gurnee
Neel Nanda
Matthew Pauly
Katherine Harvey
Dmitrii Troitskii
Dimitris Bertsimas
MILM
293
218
0
02 May 2023
How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?
Xin Xu
Yuqi Zhu
Xiaohan Wang
Ningyu Zhang
KELM LRM
131
55
0
02 May 2023
Mitigating Approximate Memorization in Language Models via Dissimilarity Learned Policy
Aly M. Kassem
67
2
0
02 May 2023
VPGTrans: Transfer Visual Prompt Generator across LLMs
Ao Zhang
Hao Fei
Yuan Yao
Wei Ji
Li Li
Zhiyuan Liu
Tat-Seng Chua
MLLM VLM
92
89
0
02 May 2023
Complex Logical Reasoning over Knowledge Graphs using Large Language Models
Nurendra Choudhary
Chandan K. Reddy
LRM
112
25
0
02 May 2023
RadAdapt: Radiology Report Summarization via Lightweight Domain Adaptation of Large Language Models
Dave Van Veen
Cara Van Uden
Maayane Attias
Anuj Pareek
Christian Blüthgen
...
Jean-Benoit Delbrouck
Juan Manuel Zambrano Chaves
C. Langlotz
Akshay S. Chaudhari
John M. Pauly
LM&MA
87
29
0
02 May 2023
Learning to Reason and Memorize with Self-Notes
Jack Lanchantin
Shubham Toshniwal
Jason Weston
Arthur Szlam
Sainbayar Sukhbaatar
ReLM LRM LLMAG
160
30
0
01 May 2023
Self-Evaluation Guided Beam Search for Reasoning
Yuxi Xie
Kenji Kawaguchi
Yiran Zhao
Xu Zhao
Min-Yen Kan
Junxian He
Qizhe Xie
LRM
281
159
0
01 May 2023
Are Emergent Abilities of Large Language Models a Mirage?
Rylan Schaeffer
Alycia Lee
Oluwasanmi Koyejo
LRM
165
439
0
28 Apr 2023
MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks
Lei Zhang
Yuge Zhang
Kan Ren
Dongsheng Li
Yuqing Yang
LLMAG
99
41
0
28 Apr 2023
ResiDual: Transformer with Dual Residual Connections
Shufang Xie
Huishuai Zhang
Junliang Guo
Xu Tan
Jiang Bian
Hany Awadalla
Arul Menezes
Tao Qin
Rui Yan
103
20
0
28 Apr 2023
Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation
Wenqing Zheng
S. Sharan
Ajay Jaiswal
Kevin Wang
Yihan Xi
Dejia Xu
Zhangyang Wang
135
27
0
28 Apr 2023
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
Minghao Wu
Abdul Waheed
Chiyu Zhang
Muhammad Abdul-Mageed
Alham Fikri Aji
ALM
213
128
0
27 Apr 2023
CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants
A. Sun
Varun Nair
Elliot Schumacher
Anitha Kannan
90
3
0
27 Apr 2023
q2d: Turning Questions into Dialogs to Teach Models How to Search
Yonatan Bitton
Shlomi Cohen-Ganor
Ido Hakimi
Yoad Lewenberg
Roee Aharoni
Enav Weinreb
97
4
0
27 Apr 2023
Controlled Text Generation with Natural Language Instructions
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ethan Gotlieb Wilcox
Ryan Cotterell
Mrinmaya Sachan
227
92
0
27 Apr 2023
Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering
Xiangyang Liu
Tianqi Pang
Chenyou Fan
FedML LRM
91
27
0
27 Apr 2023
Exploiting Simulated User Feedback for Conversational Search: Ranking, Rewriting, and Beyond
Paul Owoicho
Ivan Sekulić
Mohammad Aliannejadi
Jeffrey Stephen Dalton
Fabio Crestani
120
34
0
26 Apr 2023
TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation
Zhaoyan Liu
Noël Vouitsis
S. Gorti
Jimmy Ba
Gabriel Loaiza-Ganem
ViT
78
1
0
26 Apr 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Xia Hu
LM&MA
223
687
0
26 Apr 2023
Multimodal Grounding for Embodied AI via Augmented Reality Headsets for Natural Language Driven Task Planning
Selma Wanna
Fabian Parra
R. Valner
Karl Kruusamäe
Mitch Pryor
LM&Ro
77
2
0
26 Apr 2023
The Roles of Symbols in Neural-based AI: They are Not What You Think!
D. Silver
Tom Michael Mitchell
18
4
0
26 Apr 2023
Multidimensional Evaluation for Text Style Transfer Using ChatGPT
Huiyuan Lai
Antonio Toral
Malvina Nissim
99
17
0
26 Apr 2023
Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation
Krishnam Hasija
Shrishti Pradhan
Manasi Patwardhan
Raveendra Kumar Medicherla
Lovekesh Vig
Ravindra Naik
67
2
0
26 Apr 2023
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression
Shuai Li
Zhao Song
Yu Xia
Tong Yu
Tianyi Zhou
84
43
0
26 Apr 2023
SAFE: Machine Unlearning With Shard Graphs
Yonatan Dukler
Benjamin Bowman
Alessandro Achille
Aditya Golatkar
A. Swaminathan
Stefano Soatto
MU
89
26
0
25 Apr 2023