ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways
v1v2v3v4v5 (latest)

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILMLRM
ArXiv (abs)PDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,332 papers shown
Title
Stable and low-precision training for large-scale vision-language models
Stable and low-precision training for large-scale vision-language models
Mitchell Wortsman
Tim Dettmers
Luke Zettlemoyer
Ari S. Morcos
Ali Farhadi
Ludwig Schmidt
MQMLLMVLM
144
44
0
25 Apr 2023
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Ori Yoran
Tomer Wolfson
Ben Bogin
Uri Katz
Daniel Deutch
Jonathan Berant
ReLMLRMKELM
111
101
0
25 Apr 2023
PEFT-Ref: A Modular Reference Architecture and Typology for
  Parameter-Efficient Finetuning Techniques
PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Mohammed Sabry
Anya Belz
111
8
0
24 Apr 2023
On the Challenges of Using Black-Box APIs for Toxicity Evaluation in
  Research
On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research
Luiza Amador Pozzobon
Beyza Ermis
Patrick Lewis
Sara Hooker
93
48
0
24 Apr 2023
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam
  and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted
  Medical Education and Decision Making in Radiation Oncology
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation Oncology
Yixing Huang
A. Gomaa
S. Semrau
M. Haderlein
S. Lettmaier
...
L. Distel
Andreas Maier
R. Fietkau
Christoph Bert
F. Putz
ELMLM&MAAI4MH
73
9
0
24 Apr 2023
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in
  Large Language Models
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
Jiashuo Sun
Yi Luo
Yeyun Gong
Chen Lin
Yelong Shen
Jian Guo
Nan Duan
LRM
120
21
0
23 Apr 2023
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment
  of Performance, Explainability, Calibration, and Faithfulness
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
Bo Li
Gexiang Fang
Yang Yang
Quansen Wang
Wei Ye
Wen Zhao
Shikun Zhang
ELMAI4MH
147
168
0
23 Apr 2023
Divide and Prompt: Chain of Thought Prompting for Text-to-SQL
Divide and Prompt: Chain of Thought Prompting for Text-to-SQL
X. Liu
Zhao Tan
ReLMLRM
103
17
0
23 Apr 2023
LLM+P: Empowering Large Language Models with Optimal Planning
  Proficiency
LLM+P: Empowering Large Language Models with Optimal Planning Proficiency
B. Liu
Yuqian Jiang
Xiaohan Zhang
Qian Liu
Shiqi Zhang
Joydeep Biswas
Peter Stone
LM&RoLLMAG
128
421
0
22 Apr 2023
Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism
Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism
Xin Chen
Hengheng Zhang
Xiaotao Gu
Kaifeng Bi
Lingxi Xie
Qi Tian
MoE
54
4
0
22 Apr 2023
Learn What NOT to Learn: Towards Generative Safety in Chatbots
Learn What NOT to Learn: Towards Generative Safety in Chatbots
Leila Khalatbari
Yejin Bang
Jane Polak Scowcroft
Willy Chung
Saeedeh Ghadimi
Hossein Sameti
Pascale Fung
78
7
0
21 Apr 2023
Emergent and Predictable Memorization in Large Language Models
Emergent and Predictable Memorization in Large Language Models
Stella Biderman
USVSN Sai Prashanth
Lintang Sutawika
Hailey Schoelkopf
Quentin G. Anthony
Shivanshu Purohit
Edward Raf
94
125
0
21 Apr 2023
ChatABL: Abductive Learning via Natural Language Interaction with
  ChatGPT
ChatABL: Abductive Learning via Natural Language Interaction with ChatGPT
Tianyang Zhong
Yaonai Wei
Li Yang
Zihao Wu
Zheng Liu
...
Xi Jiang
Jun-Feng Han
Dinggang Shen
Tianming Liu
Tuo Zhang
LRM
81
29
0
21 Apr 2023
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
Archiki Prasad
Swarnadeep Saha
Xiang Zhou
Joey Tianyi Zhou
LRM
129
50
0
21 Apr 2023
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large
  Language Models
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
Deyao Zhu
Jun Chen
Xiaoqian Shen
Xiang Li
Mohamed Elhoseiny
VLMMLLM
177
2,080
0
20 Apr 2023
GPT-NER: Named Entity Recognition via Large Language Models
GPT-NER: Named Entity Recognition via Large Language Models
Shuhe Wang
Xiaofei Sun
Xiaoya Li
Rongbin Ouyang
Leilei Gan
Tianwei Zhang
Jiwei Li
Guoyin Wang
134
202
0
20 Apr 2023
Attention Scheme Inspired Softmax Regression
Attention Scheme Inspired Softmax Regression
Yichuan Deng
Zhihang Li
Zhao Song
93
43
0
20 Apr 2023
A Latent Space Theory for Emergent Abilities in Large Language Models
A Latent Space Theory for Emergent Abilities in Large Language Models
Hui Jiang
LRM
115
40
0
19 Apr 2023
Chameleon: Plug-and-Play Compositional Reasoning with Large Language
  Models
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
Pan Lu
Baolin Peng
Hao Cheng
Michel Galley
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Jianfeng Gao
KELMMLLMLRM
159
326
0
19 Apr 2023
MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation
MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation
Chongjian Ge
Junsong Chen
Enze Xie
Zhongdao Wang
Lanqing Hong
Huchuan Lu
Zhenguo Li
Ping Luo
MoE
80
12
0
19 Apr 2023
Progressive-Hint Prompting Improves Reasoning in Large Language Models
Progressive-Hint Prompting Improves Reasoning in Large Language Models
Chuanyang Zheng
Zhengying Liu
Enze Xie
Zhenguo Li
Yu Li
LLMAGReLMLRM
123
106
0
19 Apr 2023
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved
  Access to Biomedical Information
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information
Qiao Jin
Yifan Yang
Qingyu Chen
Zhiyong Lu
LM&MALLMAG
75
156
0
19 Apr 2023
A Theory on Adam Instability in Large-Scale Machine Learning
A Theory on Adam Instability in Large-Scale Machine Learning
Igor Molybog
Peter Albert
Moya Chen
Zach DeVito
David Esiobu
...
Puxin Xu
Yuchen Zhang
Melanie Kambadur
Stephen Roller
Susan Zhang
AI4CE
87
35
0
19 Apr 2023
Loss Minimization Yields Multicalibration for Large Neural Networks
Loss Minimization Yields Multicalibration for Large Neural Networks
Jarosław Błasiok
Parikshit Gopalan
Lunjia Hu
Adam Tauman Kalai
Preetum Nakkiran
FaMLUQCV
95
13
0
19 Apr 2023
UniMax: Fairer and more Effective Language Sampling for Large-Scale
  Multilingual Pretraining
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Hyung Won Chung
Noah Constant
Xavier Garcia
Adam Roberts
Yi Tay
Sharan Narang
Orhan Firat
116
57
0
18 Apr 2023
An Evaluation on Large Language Model Outputs: Discourse and
  Memorization
An Evaluation on Large Language Model Outputs: Discourse and Memorization
Adrian de Wynter
Xun Wang
Alex Sokolov
Qilong Gu
Si-Qing Chen
ELM
141
34
0
17 Apr 2023
Visual Instruction Tuning
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDaVLMMLLM
587
4,950
0
17 Apr 2023
Supporting Qualitative Analysis with Large Language Models: Combining
  Codebook with GPT-3 for Deductive Coding
Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding
Ziang Xiao
Xingdi Yuan
Q. V. Liao
Rania Abdelghani
Pierre-Yves Oudeyer
80
149
0
17 Apr 2023
Solving Math Word Problems by Combining Language Models With Symbolic
  Solvers
Solving Math Word Problems by Combining Language Models With Symbolic Solvers
Joy He-Yueya
Gabriel Poesia
Rose E. Wang
Noah D. Goodman
92
121
0
16 Apr 2023
ArguGPT: evaluating, understanding and identifying argumentative essays
  generated by GPT models
ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models
Yikang Liu
Ziyin Zhang
Wanyang Zhang
Shisen Yue
Xiaojing Zhao
Xinyuan Cheng
Yiwen Zhang
Hai Hu
DeLMO
103
55
0
16 Apr 2023
STen: Productive and Efficient Sparsity in PyTorch
STen: Productive and Efficient Sparsity in PyTorch
Andrei Ivanov
Nikoli Dryden
Tal Ben-Nun
Saleh Ashkboos
Torsten Hoefler
71
4
0
15 Apr 2023
DINOv2: Learning Robust Visual Features without Supervision
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLMCLIPSSL
600
3,536
0
14 Apr 2023
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong
Wei Xiong
Deepanshu Goyal
Yihan Zhang
Winnie Chow
Boyao Wang
Shizhe Diao
Jipeng Zhang
Kashun Shum
Tong Zhang
ALM
177
470
0
13 Apr 2023
Shall We Pretrain Autoregressive Language Models with Retrieval? A
  Comprehensive Study
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
Wei Ping
Ming-Yu Liu
Peng Xu
Lawrence C. McAfee
Zihan Liu
...
Oleksii Kuchaiev
Yue Liu
Chaowei Xiao
Anima Anandkumar
Bryan Catanzaro
RALM
114
60
0
13 Apr 2023
Verbs in Action: Improving verb understanding in video-language models
Verbs in Action: Improving verb understanding in video-language models
Liliane Momeni
Mathilde Caron
Arsha Nagrani
Andrew Zisserman
Cordelia Schmid
111
71
0
13 Apr 2023
Solving Tensor Low Cycle Rank Approximation
Solving Tensor Low Cycle Rank Approximation
Yichuan Deng
Yeqi Gao
Zhao Song
80
6
0
13 Apr 2023
SpectFormer: Frequency and Attention is what you need in a Vision
  Transformer
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
94
49
0
13 Apr 2023
LINGO : Visually Debiasing Natural Language Instructions to Support Task
  Diversity
LINGO : Visually Debiasing Natural Language Instructions to Support Task Diversity
Anjana Arunkumar
Shubham Sharma
Rakhi Agrawal
Sriramakrishnan Chandrasekaran
Chris Bryan
82
0
0
12 Apr 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image
  Generation
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
185
413
0
12 Apr 2023
Boosted Prompt Ensembles for Large Language Models
Boosted Prompt Ensembles for Large Language Models
Silviu Pitis
Michael Ruogu Zhang
Andrew Wang
Jimmy Ba
LRMLLMAG
81
43
0
12 Apr 2023
Meta-Learned Models of Cognition
Meta-Learned Models of Cognition
Marcel Binz
Ishita Dasgupta
Akshay K. Jagadish
M. Botvinick
Jane X. Wang
Eric Schulz
111
27
0
12 Apr 2023
chatClimate: Grounding Conversational AI in Climate Science
chatClimate: Grounding Conversational AI in Climate Science
S. Vaghefi
Qian Wang
V. Muccione
Jingwei Ni
Mathias Kraus
...
Tobias Schimanski
Chiara Colesanti-Senni
Nicolas Webersinke
Christrian Huggel
Markus Leippold
KELMAI4MHHILM
117
74
0
11 Apr 2023
ChemCrow: Augmenting large-language models with chemistry tools
ChemCrow: Augmenting large-language models with chemistry tools
Andres M Bran
Sam Cox
Oliver Schilter
Carlo Baldassari
Andrew D. White
P. Schwaller
LLMAG
182
409
0
11 Apr 2023
Toxicity in ChatGPT: Analyzing Persona-assigned Language Models
Toxicity in ChatGPT: Analyzing Persona-assigned Language Models
Ameet Deshpande
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
LM&MALLMAG
114
374
0
11 Apr 2023
Emergent autonomous scientific research capabilities of large language
  models
Emergent autonomous scientific research capabilities of large language models
Daniil A. Boiko
R. MacKnight
Gabe Gomes
ELMLM&RoAI4CELLMAG
169
129
0
11 Apr 2023
RRHF: Rank Responses to Align Language Models with Human Feedback
  without tears
RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Zheng Yuan
Hongyi Yuan
Chuanqi Tan
Wei Wang
Songfang Huang
Feiran Huang
ALM
221
385
0
11 Apr 2023
Approximating Online Human Evaluation of Social Chatbots with Prompting
Approximating Online Human Evaluation of Social Chatbots with Prompting
Ekaterina Svikhnushina
Pearl Pu
ELM
76
13
0
11 Apr 2023
A Billion-scale Foundation Model for Remote Sensing Images
A Billion-scale Foundation Model for Remote Sensing Images
Keumgang Cha
Junghoon Seo
Taekyung Lee
121
71
0
11 Apr 2023
Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Ahmet Iscen
Alireza Fathi
Cordelia Schmid
VLM3DV
87
26
0
11 Apr 2023
Teaching Large Language Models to Self-Debug
Teaching Large Language Models to Self-Debug
Xinyun Chen
Maxwell Lin
Nathanael Scharli
Denny Zhou
LRM
169
711
0
11 Apr 2023
Previous
123...717273...858687
Next