ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways
v1v2v3v4v5 (latest)

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILMLRM
ArXiv (abs)PDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,332 papers shown
Title
Strategic Reasoning with Language Models
Strategic Reasoning with Language Models
Kanishk Gandhi
Dorsa Sadigh
Noah D. Goodman
LM&RoLRM
90
41
0
30 May 2023
AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot
  Manipulation
AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Chuhao Jin
Wenhui Tan
Jiange Yang
Bei Liu
Ruihua Song
Limin Wang
Jianlong Fu
LM&RoLRM
62
24
0
30 May 2023
Generate then Select: Open-ended Visual Question Answering Guided by
  World Knowledge
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Xingyu Fu
Shenmin Zhang
Gukyeong Kwon
Pramuditha Perera
Henghui Zhu
...
Zhiguo Wang
Vittorio Castelli
Patrick Ng
Dan Roth
Bing Xiang
90
22
0
30 May 2023
Universality and Limitations of Prompt Tuning
Universality and Limitations of Prompt Tuning
Yihan Wang
Jatin Chauhan
Wei Wang
Cho-Jui Hsieh
147
18
0
30 May 2023
Domain Specialization as the Key to Make Large Language Models
  Disruptive: A Comprehensive Survey
Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey
Chen Ling
Xujiang Zhao
Jiaying Lu
Chengyuan Deng
Can Zheng
...
Chris White
Quanquan Gu
Jian Pei
Carl Yang
Liang Zhao
ALM
172
140
0
30 May 2023
Faith and Fate: Limits of Transformers on Compositionality
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLMLRM
224
388
0
29 May 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Xi Chen
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Soravit Changpinyo
...
Mojtaba Seyedhosseini
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
VLM
164
203
0
29 May 2023
Brainformers: Trading Simplicity for Efficiency
Brainformers: Trading Simplicity for Efficiency
Yan-Quan Zhou
Nan Du
Yanping Huang
Daiyi Peng
Chang Lan
...
Zhifeng Chen
Quoc V. Le
Claire Cui
J.H.J. Laundon
J. Dean
MoE
92
27
0
29 May 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward
  Model
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
407
4,190
0
29 May 2023
Contextual Object Detection with Multimodal Large Language Models
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjDVLMMLLM
129
88
0
29 May 2023
Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning
Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning
Zhanming Jie
Wei Lu
LRMReLM
90
16
0
29 May 2023
BigTranslate: Augmenting Large Language Models with Multilingual
  Translation Capability over 100 Languages
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
Wen Yang
Chong Li
Jiajun Zhang
Chengqing Zong
LRM
99
54
0
29 May 2023
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark
  Datasets
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Md Tahmid Rahman Laskar
M Saiful Bari
Mizanur Rahman
Md Amran Hossen Bhuiyan
Shafiq Joty
J. Huang
LM&MAELMALM
131
193
0
29 May 2023
Large Language Models are not Fair Evaluators
Large Language Models are not Fair Evaluators
Peiyi Wang
Lei Li
Liang Chen
Zefan Cai
Dawei Zhu
Binghuai Lin
Yunbo Cao
Qi Liu
Tianyu Liu
Zhifang Sui
ALM
178
575
0
29 May 2023
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Zechun Liu
Barlas Oğuz
Changsheng Zhao
Ernie Chang
Pierre Stock
Yashar Mehdad
Yangyang Shi
Raghuraman Krishnamoorthi
Vikas Chandra
MQ
134
209
0
29 May 2023
Diffusion Model is an Effective Planner and Data Synthesizer for
  Multi-Task Reinforcement Learning
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffMOffRL
101
98
0
29 May 2023
Vec2Gloss: definition modeling leveraging contextualized vectors with
  Wordnet gloss
Vec2Gloss: definition modeling leveraging contextualized vectors with Wordnet gloss
Yu-Hsiang Tseng
Mao-Chang Ku
Wei-Ling Chen
Yu-Lin Chang
S. Hsieh
59
2
0
29 May 2023
Large Language Models, scientific knowledge and factuality: A systematic
  analysis in antibiotic discovery
Large Language Models, scientific knowledge and factuality: A systematic analysis in antibiotic discovery
Magdalena Wysocka
Oskar Wysocki
Maxime Delmas
V. Mutel
André Freitas
LM&MA
74
6
0
28 May 2023
ConvGenVisMo: Evaluation of Conversational Generative Vision Models
ConvGenVisMo: Evaluation of Conversational Generative Vision Models
Narjes Nikzad Khasmakhi
M. Asgari-Chenaghlu
Nabiha Asghar
Philipp Schaer
Dietlind Zuhlke
36
2
0
28 May 2023
Language Models are Bounded Pragmatic Speakers: Understanding RLHF from
  a Bayesian Cognitive Modeling Perspective
Language Models are Bounded Pragmatic Speakers: Understanding RLHF from a Bayesian Cognitive Modeling Perspective
Khanh Nguyen
LRM
142
8
0
28 May 2023
Mitigating Label Biases for In-context Learning
Mitigating Label Biases for In-context Learning
Yu Fei
Buse Giledereli
Zeming Chen
Antoine Bosselut
103
76
0
28 May 2023
FuseCap: Leveraging Large Language Models for Enriched Fused Image
  Captions
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Noam Rotstein
David Bensaid
Shaked Brody
Roy Ganz
Ron Kimmel
VLM
89
31
0
28 May 2023
Knowledge-Augmented Reasoning Distillation for Small Language Models in
  Knowledge-Intensive Tasks
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks
Minki Kang
Seanie Lee
Jinheon Baek
Kenji Kawaguchi
Sung Ju Hwang
ALMLRM
123
66
0
28 May 2023
Plug-and-Play Document Modules for Pre-trained Models
Plug-and-Play Document Modules for Pre-trained Models
Chaojun Xiao
Zhengyan Zhang
Xu Han
Chi-Min Chan
Yankai Lin
Zhiyuan Liu
Xiangyang Li
Zhonghua Li
Bo Zhao
Maosong Sun
KELM
110
6
0
28 May 2023
FERMAT: An Alternative to Accuracy for Numerical Reasoning
FERMAT: An Alternative to Accuracy for Numerical Reasoning
Jasivan Sivakumar
N. Moosavi
ReLMLRM
93
4
0
27 May 2023
Query-Efficient Black-Box Red Teaming via Bayesian Optimization
Query-Efficient Black-Box Red Teaming via Bayesian Optimization
Deokjae Lee
JunYeong Lee
Jung-Woo Ha
Jin-Hwa Kim
Sang-Woo Lee
Hwaran Lee
Hyun Oh Song
AAML
96
25
0
27 May 2023
Augmentation-Adapted Retriever Improves Generalization of Language
  Models as Generic Plug-In
Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In
Zichun Yu
Chenyan Xiong
S. Yu
Zhiyuan Liu
KELMVLM
109
69
0
27 May 2023
Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language
  Models
Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models
Yuhui Zhang
Michihiro Yasunaga
Zhengping Zhou
Jeff Z. HaoChen
James Zou
Percy Liang
Serena Yeung
103
9
0
27 May 2023
Chain-of-Thought Hub: A Continuous Effort to Measure Large Language
  Models' Reasoning Performance
Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance
Yao Fu
Litu Ou
Mingyu Chen
Yuhao Wan
Hao-Chun Peng
Tushar Khot
LLMAGELMLRMReLM
84
115
0
26 May 2023
Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale
Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale
Vijeta Deshpande
Dan Pechi
Shree Thatte
Vladislav Lialin
Anna Rumshisky
132
8
0
26 May 2023
Im-Promptu: In-Context Composition from Image Prompts
Im-Promptu: In-Context Composition from Image Prompts
Bhishma Dedhia
Michael Chang
Jake C. Snell
Thomas Griffiths
N. Jha
LRMMLLM
111
2
0
26 May 2023
RAMP: Retrieval and Attribute-Marking Enhanced Prompting for
  Attribute-Controlled Translation
RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation
Gabriele Sarti
Phu Mon Htut
Xing Niu
B. Hsu
Anna Currey
Georgiana Dinu
Maria Nadejde
LRM
104
12
0
26 May 2023
Large Language Models as Tool Makers
Large Language Models as Tool Makers
Tianle Cai
Xuezhi Wang
Tengyu Ma
Xinyun Chen
Denny Zhou
LLMAG
109
212
0
26 May 2023
Manifold Regularization for Memory-Efficient Training of Deep Neural
  Networks
Manifold Regularization for Memory-Efficient Training of Deep Neural Networks
Shadi Sartipi
Edgar A. Bernal
50
0
0
26 May 2023
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large
  Language Models
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Gengze Zhou
Yicong Hong
Qi Wu
ELMLM&RoLLMAGLRM
147
164
0
26 May 2023
MixCE: Training Autoregressive Language Models by Mixing Forward and
  Reverse Cross-Entropies
MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Shiyue Zhang
Shijie Wu
Ozan Irsoy
Steven Lu
Joey Tianyi Zhou
Mark Dredze
David S. Rosenberg
95
10
0
26 May 2023
On Evaluating Adversarial Robustness of Large Vision-Language Models
On Evaluating Adversarial Robustness of Large Vision-Language Models
Yunqing Zhao
Tianyu Pang
Chao Du
Xiao Yang
Chongxuan Li
Ngai-Man Cheung
Min Lin
VLMAAMLMLLM
161
184
0
26 May 2023
Do GPTs Produce Less Literal Translations?
Do GPTs Produce Less Literal Translations?
Vikas Raunak
Arul Menezes
Matt Post
H. Awadallah
78
33
0
26 May 2023
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate
  Model
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
Yibo Miao
Hongcheng Gao
Hao Zhang
Zhijie Deng
DeLMO
103
20
0
26 May 2023
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in
  Language Models
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models
Yao Yao
Z. Li
Hai Zhao
ReLMLRM
85
22
0
26 May 2023
CONA: A novel CONtext-Aware instruction paradigm for communication using
  large language model
CONA: A novel CONtext-Aware instruction paradigm for communication using large language model
Nan Zhou
Xinghui Tao
Xi Chen
31
0
0
26 May 2023
Imitating Task and Motion Planning with Visuomotor Transformers
Imitating Task and Motion Planning with Visuomotor Transformers
Murtaza Dalal
Ajay Mandlekar
Caelan Reed Garrett
Ankur Handa
Ruslan Salakhutdinov
Dieter Fox
177
57
0
25 May 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&RoSyDa
194
844
0
25 May 2023
Diversity-Aware Coherence Loss for Improving Neural Topic Models
Diversity-Aware Coherence Loss for Improving Neural Topic Models
Raymond Li
Felipe González-Pizarro
Linzi Xing
Gabriel Murray
Giuseppe Carenini
BDL
474
4
0
25 May 2023
Scan and Snap: Understanding Training Dynamics and Token Composition in
  1-layer Transformer
Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer
Yuandong Tian
Yiping Wang
Beidi Chen
S. Du
MLT
120
79
0
25 May 2023
Training Data Extraction From Pre-trained Language Models: A Survey
Training Data Extraction From Pre-trained Language Models: A Survey
Shotaro Ishihara
124
48
0
25 May 2023
ChatBridge: Bridging Modalities with Large Language Model as a Language
  Catalyst
ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst
Zijia Zhao
Longteng Guo
Tongtian Yue
Si-Qing Chen
Shuai Shao
Xinxin Zhu
Zehuan Yuan
Jing Liu
MLLM
117
61
0
25 May 2023
UFO: Unified Fact Obtaining for Commonsense Question Answering
UFO: Unified Fact Obtaining for Commonsense Question Answering
Zhifeng Li
Yifan Fan
Bowei Zou
Yu Hong
HILMLRM
74
1
0
25 May 2023
Efficient Document Embeddings via Self-Contrastive Bregman Divergence
  Learning
Efficient Document Embeddings via Self-Contrastive Bregman Divergence Learning
Daniel Saggau
Mina Rezaei
Bernd Bischl
Ilias Chalkidis
SSLMedIm
75
2
0
25 May 2023
SING: A Plug-and-Play DNN Learning Technique
SING: A Plug-and-Play DNN Learning Technique
Adrien Courtois
Damien Scieur
Jean-Michel Morel
Pablo Arias
Thomas Eboli
70
0
0
25 May 2023
Previous
123...646566...858687
Next