PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM LRM

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,332 papers shown
Title
Role-Play with Large Language Models
Murray Shanahan
Kyle McDonell
Laria Reynolds
LLMAG
84
307
0
25 May 2023
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation
Niels Mündler
Jingxuan He
Slobodan Jenko
Martin Vechev
HILM
85
119
0
25 May 2023
Towards Language-guided Interactive 3D Generation: LLMs as Layout Interpreter with Generative Feedback
Yiqi Lin
Hao Wu
Ruichen Wang
H. Lu
Xiaodong Lin
Hui Xiong
Lin Wang
3DV
76
13
0
25 May 2023
Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
Sotiris Anagnostidis
Dario Pavllo
Luca Biggio
Lorenzo Noci
Aurelien Lucchi
Thomas Hofmann
121
57
0
25 May 2023
On the Planning Abilities of Large Language Models : A Critical Investigation
Karthik Valmeekam
Matthew Marquez
S. Sreedharan
Subbarao Kambhampati
LLMAG LRM
65
241
0
25 May 2023
The False Promise of Imitating Proprietary LLMs
Arnav Gudibande
Eric Wallace
Charles Burton Snell
Xinyang Geng
Hao Liu
Pieter Abbeel
Sergey Levine
Dawn Song
ALM
169
208
0
25 May 2023
RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting
Lei Shu
Liangchen Luo
Jayakumar Hoskere
Yun Zhu
Canoee Liu
Simon Tong
Jindong Chen
Lei Meng
KELM LRM
98
51
0
25 May 2023
BookGPT: A General Framework for Book Recommendation Empowered by Large Language Model
Aakas Zhiyuli
YanFang Chen
Xuan Zhang
Xun Liang
ALM LLMAG
97
33
0
25 May 2023
Scaling Data-Constrained Language Models
Niklas Muennighoff
Alexander M. Rush
Boaz Barak
Teven Le Scao
Aleksandra Piktus
Nouamane Tazi
S. Pyysalo
Thomas Wolf
Colin Raffel
ALM
198
226
0
25 May 2023
Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation
Yuan Yang
Siheng Xiong
Ali Payani
Ehsan Shareghi
Faramarz Fekri
LRM
84
41
0
24 May 2023
Large Language Models are Few-Shot Health Learners
Xin Liu
Daniel J. McDuff
G. Kovács
I. Galatzer-Levy
Jacob Sunshine
Jiening Zhan
M. Poh
Shun Liao
P. Achille
Shwetak N. Patel
LM&MA AI4MH
132
117
0
24 May 2023
The Larger They Are, the Harder They Fail: Language Models do not Recognize Identifier Swaps in Python
Antonio Valerio Miceli Barone
Fazl Barez
Ioannis Konstas
Shay B. Cohen
50
32
0
24 May 2023
Large Language Models for User Interest Journeys
Konstantina Christakopoulou
Alberto Lalama
Cj Adams
Iris Qu
Yifat Amir
...
Dina Bseiso
Sarah Scodel
Lucas Dixon
Ed H. Chi
Minmin Chen
107
30
0
24 May 2023
Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective
Guhao Feng
Bohang Zhang
Yuntian Gu
Haotian Ye
Di He
Liwei Wang
LRM
168
262
0
24 May 2023
Gorilla: Large Language Model Connected with Massive APIs
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
ELM CLL ALM SyDa
196
572
0
24 May 2023
Breaking the Curse of Quality Saturation with User-Centric Ranking
Zhuokai Zhao
Yang Yang
Wenyu Wang
Chi-Yu Liu
Yunluo Shi
Wenjie Hu
Haotian Zhang
Shuangjun Yang
68
3
0
24 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
MLLM
130
51
0
24 May 2023
Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples
Abulhair Saparov
Richard Yuanzhe Pang
Vishakh Padmakumar
Nitish Joshi
Seyed Mehran Kazemi
Najoung Kim
He He
ELM LRM
158
94
0
24 May 2023
Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration
Kejuan Yang
Xiao Liu
Kaiwen Men
Aohan Zeng
Yuxiao Dong
Jie Tang
LLMAG LRM
60
3
0
24 May 2023
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
Eliya Nachmani
Alon Levkovitch
Roy Hirsch
Julián Salazar
Chulayutsh Asawaroengchai
Soroosh Mariooryad
Ehud Rivlin
RJ Skerry-Ryan
Michelle Tadmor Ramanovich
AuLLM
120
45
0
24 May 2023
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
Geewook Kim
Hodong Lee
D. Kim
Haeji Jung
S. Park
Yoon Kim
Sangdoo Yun
Taeho Kil
Bado Lee
Seunghyun Park
VLM
111
4
0
24 May 2023
Meta-Learning Online Adaptation of Language Models
Nathan J. Hu
E. Mitchell
Christopher D. Manning
Chelsea Finn
KELM
101
37
0
24 May 2023
PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology
Yuxuan Sun
Chenglu Zhu
S. Zheng
Kai Zhang
Xiaoxuan Yu
Zhongyi Shui
Yunlong Zhang
Honglin Li
Lin Yang
LM&MA MedIm
137
49
0
24 May 2023
AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models
Siqi Ouyang
Lei Li
LM&Ro LLMAG
54
10
0
24 May 2023
Lawyer LLaMA Technical Report
Quzhe Huang
Mingxu Tao
Chen Zhang
Zhenwei An
Cong Jiang
Zhibin Chen
Zirui Wu
Yansong Feng
ELM ALM AILaw
131
55
0
24 May 2023
Who Wrote this Code? Watermarking for Code Generation
Taehyun Lee
Seokhee Hong
Jaewoo Ahn
Ilgee Hong
Hwaran Lee
Sangdoo Yun
Jamin Shin
Gunhee Kim
WaLM
69
98
0
24 May 2023
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Alessandro Stolfo
Yonatan Belinkov
Mrinmaya Sachan
MILM KELM LRM
113
54
0
24 May 2023
Self-ICL: Zero-Shot In-Context Learning with Self-Generated Demonstrations
Wei-Lin Chen
Cheng-Kuang Wu
Yun-Nung Chen
Hsin-Hsi Chen
115
35
0
24 May 2023
Dior-CVAE: Pre-trained Language Models and Diffusion Priors for Variational Dialog Generation
Tianyu Yang
Thy Thy Tran
Iryna Gurevych
DiffM
81
1
0
24 May 2023
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models
Gen Luo
Yiyi Zhou
Tianhe Ren
Shen Chen
Xiaoshuai Sun
Rongrong Ji
VLM MLLM
124
98
0
24 May 2023
Unlocking Temporal Question Answering for Large Language Models Using Code Execution
Xingxuan Li
Liying Cheng
Qingyu Tan
Hwee Tou Ng
Shafiq Joty
Lidong Bing
LRM AI4CE
81
0
0
24 May 2023
Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation
Haonan Li
Fajri Koto
Minghao Wu
Alham Fikri Aji
Timothy Baldwin
ALM
81
76
0
24 May 2023
Sentiment Analysis in the Era of Large Language Models: A Reality Check
Wenxuan Zhang
Yue Deng
Bing-Quan Liu
Sinno Jialin Pan
Lidong Bing
AI4MH
103
312
0
24 May 2023
The Art of SOCRATIC QUESTIONING: Recursive Thinking with Large Language Models
Jingyuan Qi
Zhiyang Xu
Ying Shen
Minqian Liu
Dingnan Jin
Qifan Wang
Lifu Huang
ReLM LRM KELM
63
13
0
24 May 2023
Reasoning with Language Model is Planning with World Model
Shibo Hao
Yi Gu
Haodi Ma
Joshua Jiahua Hong
Zhen Wang
D. Wang
Zhiting Hu
ReLM LRM LLMAG
170
604
0
24 May 2023
Investigating Table-to-Text Generation Capabilities of LLMs in Real-World Information Seeking Scenarios
Yilun Zhao
Haowei Zhang
Shengyun Si
Linyong Nan
Xiangru Tang
Arman Cohan
LMTD
108
12
0
24 May 2023
GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP
Md. Tawkat Islam Khondaker
Abdul Waheed
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM LM&MA
103
70
0
24 May 2023
How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench
Qinyuan Ye
Harvey Yiyun Fu
Xiang Ren
Robin Jia
ELM
115
24
0
24 May 2023
In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Leonard Salewski
Stephan Alaniz
Isabel Rio-Torto
Eric Schulz
Zeynep Akata
102
159
0
24 May 2023
Universal Self-Adaptive Prompting
Xingchen Wan
Ruoxi Sun
Hootan Nakhost
H. Dai
Julian Martin Eisenschlos
Sercan O. Arik
Tomas Pfister
LRM
110
12
0
24 May 2023
Frugal Prompting for Dialog Models
Bishal Santra
Sakya Basak
Abhinandan De
Manish Gupta
Pawan Goyal
43
2
0
24 May 2023
PURR: Efficiently Editing Language Model Hallucinations by Denoising Language Model Corruptions
Anthony Chen
Panupong Pasupat
Sameer Singh
Hongrae Lee
Kelvin Guu
125
48
0
24 May 2023
PIVOINE: Instruction Tuning for Open-world Information Extraction
Keming Lu
Xiaoman Pan
Kaiqiang Song
Hongming Zhang
Dong Yu
Jianshu Chen
75
11
0
24 May 2023
Extracting Psychological Indicators Using Question Answering
Luka Pavlović
21
0
0
24 May 2023
Leveraging GPT-4 for Automatic Translation Post-Editing
Vikas Raunak
Amr Sharaf
Yiren Wang
H. Awadallah
Arul Menezes
80
69
0
24 May 2023
Pre-RMSNorm and Pre-CRMSNorm Transformers: Equivalent and Efficient Pre-LN Transformers
Zixuan Jiang
Jiaqi Gu
Hanqing Zhu
David Z. Pan
AI4CE
104
18
0
24 May 2023
PromptNER: Prompting For Named Entity Recognition
D. Ashok
Zachary Chase Lipton
101
38
0
24 May 2023
Mitigating Temporal Misalignment by Discarding Outdated Facts
Michael J.Q. Zhang
Eunsol Choi
KELM HILM
103
20
0
24 May 2023
Estimating Large Language Model Capabilities without Labeled Test Data
Harvey Yiyun Fu
Qinyuan Ye
Albert Xu
Xiang Ren
Robin Jia
73
9
0
24 May 2023
Anthropomorphization of AI: Opportunities and Risks
Ameet Deshpande
Tanmay Rajpurohit
Karthik Narasimhan
Ashwin Kalyan
83
24
0
24 May 2023