ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways
v1v2v3v4v5 (latest)

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILMLRM
ArXiv (abs)PDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,332 papers shown
Title
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World
  Model Disentanglement
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Zhi Wang
Li Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
106
9
0
15 Oct 2024
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs
Syeda Nahida Akter
Shrimai Prabhumoye
John Kamalu
S. Satheesh
Eric Nyberg
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
LRMSyDaReLM
169
2
0
15 Oct 2024
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent
Bo Chen
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
154
22
0
15 Oct 2024
Data Quality Control in Federated Instruction-tuning of Large Language Models
Data Quality Control in Federated Instruction-tuning of Large Language Models
Yaxin Du
Guangyi Liu
Fengting Yuchi
W. Zhao
Jingjing Qu
Yanjie Wang
Siheng Chen
ALMFedML
129
2
0
15 Oct 2024
Will LLMs Replace the Encoder-Only Models in Temporal Relation
  Classification?
Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification?
Gabriel Roccabruna
Massimo Rizzoli
Giuseppe Riccardi
90
1
0
14 Oct 2024
Scalable Multi-Domain Adaptation of Language Models using Modular
  Experts
Scalable Multi-Domain Adaptation of Language Models using Modular Experts
Peter Schafhalter
Shun Liao
Yanqi Zhou
Chih-Kuan Yeh
Arun Kandoor
James Laudon
MoE
84
1
0
14 Oct 2024
AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality
AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality
Peijun Qing
Chongyang Gao
Yefan Zhou
Xingjian Diao
Yaoqing Yang
Soroush Vosoughi
MoMeMoE
104
10
0
14 Oct 2024
ChuLo: Chunk-Level Key Information Representation for Long Document Processing
ChuLo: Chunk-Level Key Information Representation for Long Document Processing
Yan Li
Soyeon Caren Han
Yue Dai
Feiqi Cao
91
0
0
14 Oct 2024
Parenting: Optimizing Knowledge Selection of Retrieval-Augmented Language Models with Parameter Decoupling and Tailored Tuning
Parenting: Optimizing Knowledge Selection of Retrieval-Augmented Language Models with Parameter Decoupling and Tailored Tuning
Yongxin Xu
Ruizhe Zhang
Xinke Jiang
Yujie Feng
Yuzhen Xiao
Xinyu Ma
Runchuan Zhu
Xu Chu
Junfeng Zhao
Yasha Wang
KELM
107
4
0
14 Oct 2024
A Multi-LLM Orchestration Engine for Personalized, Context-Rich
  Assistance
A Multi-LLM Orchestration Engine for Personalized, Context-Rich Assistance
Sumedh Rasal
62
0
0
13 Oct 2024
ImagineNav: Prompting Vision-Language Models as Embodied Navigator
  through Scene Imagination
ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination
Xinxin Zhao
Wenzhe Cai
Likun Tang
Teng Wang
LM&Ro
73
10
0
13 Oct 2024
'Quis custodiet ipsos custodes?' Who will watch the watchmen? On
  Detecting AI-generated peer-reviews
'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews
Sandeep Kumar
Mohit Sahu
Vardhan Gacche
Tirthankar Ghosal
Asif Ekbal
DeLMO
101
2
0
13 Oct 2024
Reverse Modeling in Large Language Models
Reverse Modeling in Large Language Models
S. Yu
Yuanchen Xu
Cunxiao Du
Yanying Zhou
Minghui Qiu
Q. Sun
Hao Zhang
Jiawei Wu
162
2
0
13 Oct 2024
Skipping Computations in Multimodal LLMs
Skipping Computations in Multimodal LLMs
Mustafa Shukor
Matthieu Cord
76
3
0
12 Oct 2024
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Wenlong Deng
Yize Zhao
V. Vakilian
Minghui Chen
Xiaoxiao Li
Christos Thrampoulidis
238
7
0
12 Oct 2024
Mentor-KD: Making Small Language Models Better Multi-step Reasoners
Mentor-KD: Making Small Language Models Better Multi-step Reasoners
Hojae Lee
Junho Kim
SangKeun Lee
LRM
77
3
0
11 Oct 2024
Diffusion Models Need Visual Priors for Image Generation
Diffusion Models Need Visual Priors for Image Generation
Xiaoyu Yue
Zidong Wang
Zeyu Lu
S. Sun
Meng Wei
Wanli Ouyang
Junlin Wu
Luping Zhou
VLM
101
1
0
11 Oct 2024
Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks
Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks
Mahmood Hegazy
LLMAGLRMAI4CE
100
2
0
10 Oct 2024
Mars: Situated Inductive Reasoning in an Open-World Environment
Mars: Situated Inductive Reasoning in an Open-World Environment
Xiaojuan Tang
Jiaqi Li
Yitao Liang
Song-chun Zhu
Muhan Zhang
Zilong Zheng
LM&RoLRMLLMAG
77
5
0
10 Oct 2024
Teaching-Inspired Integrated Prompting Framework: A Novel Approach for
  Enhancing Reasoning in Large Language Models
Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models
Wenting Tan
Dongxiao Chen
Jieting Xue
Zihao Wang
Taijie Chen
LRM
87
2
0
10 Oct 2024
Executing Arithmetic: Fine-Tuning Large Language Models as Turing
  Machines
Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines
Junyu Lai
Jiahe Xu
Yao Yang
Yunpeng Huang
Chun Cao
Jingwei Xu
LRM
76
3
0
10 Oct 2024
Dialectical Behavior Therapy Approach to LLM Prompting
Dialectical Behavior Therapy Approach to LLM Prompting
Oxana Vitman
Nika Amaglobeli
Paul Plachinda
LRM
19
0
0
10 Oct 2024
Plug-and-Play Performance Estimation for LLM Services without Relying on
  Labeled Data
Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data
Can Wang
Dianbo Sui
Hongliang Sun
Hao Ding
Jiahao Wang
Zhiying Tu
64
0
0
10 Oct 2024
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency
Preferred Elements
:
Kenshin Abe
Kaizaburo Chubachi
Yasuhiro Fujita
...
Yoshihiko Ozaki
Shotaro Sano
Shuji Suzuki
Tianqi Xu
Toshihiko Yanase
101
0
0
10 Oct 2024
LecPrompt: A Prompt-based Approach for Logical Error Correction with
  CodeBERT
LecPrompt: A Prompt-based Approach for Logical Error Correction with CodeBERT
Zhenyu Xu
Victor S. Sheng
KELM
86
0
0
10 Oct 2024
Efficient Pretraining Data Selection for Language Models via Multi-Actor Collaboration
Efficient Pretraining Data Selection for Language Models via Multi-Actor Collaboration
Tianyi Bai
Ling Yang
Zhen Hao Wong
Fupeng Sun
Jiahui Peng
...
Lijun Wu
Jiantao Qiu
Wentao Zhang
Binhang Yuan
Conghui He
LLMAG
88
6
0
10 Oct 2024
DemoShapley: Valuation of Demonstrations for In-Context Learning
DemoShapley: Valuation of Demonstrations for In-Context Learning
Shan Xie
Man Luo
Chadly Daniel Stern
Jundong Li
Lu Cheng
119
1
0
10 Oct 2024
Exploring Prompt Engineering: A Systematic Review with SWOT Analysis
Exploring Prompt Engineering: A Systematic Review with SWOT Analysis
Aditi Singh
Abul Ehtesham
Gaurav Kumar Gupta
Nikhil Kumar Chatta
Saket Kumar
T. T. Khoei
81
2
0
09 Oct 2024
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation
  Experts
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts
Peng Jin
Bo Zhu
Li Yuan
Shuicheng Yan
MoE
117
6
0
09 Oct 2024
Personalized Visual Instruction Tuning
Personalized Visual Instruction Tuning
Renjie Pi
Jianshu Zhang
Tianyang Han
Jipeng Zhang
Boyao Wang
Tong Zhang
MLLM
91
9
0
09 Oct 2024
UniAutoML: A Human-Centered Framework for Unified Discriminative and
  Generative AutoML with Large Language Models
UniAutoML: A Human-Centered Framework for Unified Discriminative and Generative AutoML with Large Language Models
Jiayi Guo
Zan Chen
Yingrui Ji
Liyun Zhang
Daqin Luo
Zhigang Li
Yiqin Shen
HAI
71
0
0
09 Oct 2024
Stanceformer: Target-Aware Transformer for Stance Detection
Stanceformer: Target-Aware Transformer for Stance Detection
Krishna Garg
Cornelia Caragea
73
1
0
09 Oct 2024
Towards Generalisable Time Series Understanding Across Domains
Towards Generalisable Time Series Understanding Across Domains
Özgün Turgut
Philip Muller
Fernando Navarro
Daniel Rueckert
AI4TS
139
3
0
09 Oct 2024
Multi-Task Program Error Repair and Explanatory Diagnosis
Multi-Task Program Error Repair and Explanatory Diagnosis
Zhenyu Xu
Victor S. Sheng
KELMLRM
201
0
0
09 Oct 2024
On the Similarity of Circuits across Languages: a Case Study on the
  Subject-verb Agreement Task
On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task
Javier Ferrando
Marta R. Costa-jussá
62
7
0
09 Oct 2024
MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction
  Equations Using Massive PINN-Based Prior Data
MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data
Mingu Kang
Dongseok Lee
Woojin Cho
Jaehyeon Park
Kookjin Lee
Anthony Gruber
Youngjoon Hong
Noseong Park
DiffMAI4CE
72
0
0
09 Oct 2024
Fine-tuning can Help Detect Pretraining Data from Large Language Models
Fine-tuning can Help Detect Pretraining Data from Large Language Models
Han Zhang
Songxin Zhang
Bingyi Jing
Jianguo Huang
152
1
0
09 Oct 2024
TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
Wanchao Liang
Tianyu Liu
Less Wright
Will Constable
Andrew Gu
...
Howard Huang
Junjie Wang
Sanket Purandare
Gokul Nadathur
Stratos Idreos
OffRL
126
19
0
09 Oct 2024
Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs
Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs
Ruijia Niu
D. Wu
Rose Yu
Yi-An Ma
132
2
0
09 Oct 2024
Data Selection via Optimal Control for Language Models
Data Selection via Optimal Control for Language Models
Yuxian Gu
Li Dong
Hongning Wang
Y. Hao
Qingxiu Dong
Furu Wei
Minlie Huang
AI4CE
179
9
0
09 Oct 2024
Auto-Evolve: Enhancing Large Language Model's Performance via
  Self-Reasoning Framework
Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework
Krishna Aswani
Huilin Lu
Pranav Patankar
Priya Dhalwani
Iris Tan
Jayant Ganeshmohan
Simon Lacasse
ReLMLLMAGLRM
71
1
0
08 Oct 2024
O1 Replication Journey: A Strategic Progress Report -- Part 1
O1 Replication Journey: A Strategic Progress Report -- Part 1
Yiwei Qin
Xuefeng Li
Haoyang Zou
Yixiu Liu
Shijie Xia
...
Yixin Ye
Weizhe Yuan
Hector Liu
Yuezun Li
Pengfei Liu
VLM
110
92
0
08 Oct 2024
A second-order-like optimizer with adaptive gradient scaling for deep
  learning
A second-order-like optimizer with adaptive gradient scaling for deep learning
Jérôme Bolte
Ryan Boustany
Edouard Pauwels
Andrei Purica
ODL
72
0
0
08 Oct 2024
Retrieving, Rethinking and Revising: The Chain-of-Verification Can
  Improve Retrieval Augmented Generation
Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Bolei He
Nuo Chen
Xinran He
Lingyong Yan
Zhenkai Wei
Jinchang Luo
Zhen-Hua Ling
RALMLRM
56
2
0
08 Oct 2024
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing
  with Language Models
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models
Ranchi Zhao
Zhen Leng Thai
Yifan Zhang
Shengding Hu
Yunqi Ba
Jie Zhou
Jie Cai
Zhiyuan Liu
Maosong Sun
147
1
0
08 Oct 2024
Attribute Controlled Fine-tuning for Large Language Models: A Case Study
  on Detoxification
Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Tao Meng
Ninareh Mehrabi
Palash Goyal
Anil Ramakrishna
Aram Galstyan
Richard Zemel
Kai-Wei Chang
Rahul Gupta
Charith Peris
28
1
0
07 Oct 2024
CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with
  Explanatory Argumentative Structures
CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures
Ekaterina Sviridova
Anar Yeginbergen
A. Estarrona
Elena Cabrio
S. Villata
Rodrigo Agerri
99
6
0
07 Oct 2024
Initialization of Large Language Models via Reparameterization to
  Mitigate Loss Spikes
Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes
Kosuke Nishida
Kyosuke Nishida
Kuniko Saito
56
2
0
07 Oct 2024
Rationale-Aware Answer Verification by Pairwise Self-Evaluation
Rationale-Aware Answer Verification by Pairwise Self-Evaluation
Akira Kawabata
Saku Sugawara
LRM
119
5
0
07 Oct 2024
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation
Chuanyang Zheng
Yihang Gao
Han Shi
Jing Xiong
Jiankai Sun
...
Xiaozhe Ren
Michael Ng
Xin Jiang
Zhenguo Li
Yu Li
85
3
0
07 Oct 2024
Previous
123...121314...858687
Next