ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways
v1v2v3v4v5 (latest)

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILMLRM
ArXiv (abs)PDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,332 papers shown
Title
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for
  Complex Problem Solving
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving
Pei Chen
Boran Han
Shuai Zhang
LRMLLMAG
83
5
0
26 Apr 2024
Large Language Model Agent as a Mechanical Designer
Large Language Model Agent as a Mechanical Designer
Yayati Jadhav
A. Farimani
AI4CELLMAG
201
11
0
26 Apr 2024
IndicGenBench: A Multilingual Benchmark to Evaluate Generation
  Capabilities of LLMs on Indic Languages
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages
Harman Singh
Nitish Gupta
Shikhar Bharadwaj
Dinesh Tewari
Partha P. Talukdar
ELM
86
28
0
25 Apr 2024
Continual Learning of Large Language Models: A Comprehensive Survey
Continual Learning of Large Language Models: A Comprehensive Survey
Haizhou Shi
Zihao Xu
Hengyi Wang
Weiyi Qin
Wenyuan Wang
Yibin Wang
Zifeng Wang
Sayna Ebrahimi
Hao Wang
CLLKELMLRM
167
88
0
25 Apr 2024
Evaluating Large Language Models on Time Series Feature Understanding: A
  Comprehensive Taxonomy and Benchmark
Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark
Elizabeth Fons
Rachneet Kaur
Soham Palande
Zhen Zeng
Svitlana Vyetrenko
T. Balch
AI4TS
52
13
0
25 Apr 2024
Energy-Latency Manipulation of Multi-modal Large Language Models via
  Verbose Samples
Energy-Latency Manipulation of Multi-modal Large Language Models via Verbose Samples
Kuofeng Gao
Jindong Gu
Yang Bai
Shu-Tao Xia
Philip Torr
Wei Liu
Zhifeng Li
132
13
0
25 Apr 2024
How to Parameterize Asymmetric Quantization Ranges for
  Quantization-Aware Training
How to Parameterize Asymmetric Quantization Ranges for Quantization-Aware Training
Jaeseong You
Minseop Park
Kyunggeun Lee
Seokjun An
Chirag I. Patel
Markus Nagel
MQ
70
2
0
25 Apr 2024
zkLLM: Zero Knowledge Proofs for Large Language Models
zkLLM: Zero Knowledge Proofs for Large Language Models
Haochen Sun
Jason Li
Hongyang Zhang
ALM
121
28
0
24 Apr 2024
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Timin Gao
Peixian Chen
Mengdan Zhang
Chaoyou Fu
Yunhang Shen
...
Shengchuan Zhang
Xiawu Zheng
Xing Sun
Liujuan Cao
Rongrong Ji
MLLMLRM
126
22
0
24 Apr 2024
A Human-Computer Collaborative Tool for Training a Single Large Language
  Model Agent into a Network through Few Examples
A Human-Computer Collaborative Tool for Training a Single Large Language Model Agent into a Network through Few Examples
Lihang Pan
Yuxuan Li
Chun Yu
Yuanchun Shi
LLMAG
82
2
0
24 Apr 2024
Nyonic Technical Report
Nyonic Technical Report
Junfeng Tian
Rui Wang
Cong Li
Yudong Zhou
Jun Liu
Jun Wang
58
1
0
24 Apr 2024
Graph Machine Learning in the Era of Large Language Models (LLMs)
Graph Machine Learning in the Era of Large Language Models (LLMs)
Wenqi Fan
Shijie Wang
Jiani Huang
Zhikai Chen
Yu Song
...
Haitao Mao
Hui Liu
Xiaorui Liu
D. Yin
Qing Li
AI4CE
132
29
0
23 Apr 2024
Beyond the Speculative Game: A Survey of Speculative Execution in Large
  Language Models
Beyond the Speculative Game: A Survey of Speculative Execution in Large Language Models
Chen Zhang
Zhuorui Liu
Dawei Song
LRM
85
4
0
23 Apr 2024
EEGEncoder: Advancing BCI with Transformer-Based Motor Imagery
  Classification
EEGEncoder: Advancing BCI with Transformer-Based Motor Imagery Classification
Wangdan Liao
Weidong Wang
70
4
0
23 Apr 2024
Advances and Open Challenges in Federated Learning with Foundation
  Models
Advances and Open Challenges in Federated Learning with Foundation Models
Chao Ren
Han Yu
Hongyi Peng
Xiaoli Tang
Anran Li
...
A. Tan
Bo Zhao
Xiaoxiao Li
Zengxiang Li
Qiang Yang
FedMLAIFinAI4CE
154
11
0
23 Apr 2024
Watch Out for Your Guidance on Generation! Exploring Conditional Backdoor Attacks against Large Language Models
Watch Out for Your Guidance on Generation! Exploring Conditional Backdoor Attacks against Large Language Models
Jiaming He
Wenbo Jiang
Guanyu Hou
Wenshu Fan
Rui Zhang
Hongwei Li
AAML
96
0
0
23 Apr 2024
Retrieval Augmented Generation for Domain-specific Question Answering
Retrieval Augmented Generation for Domain-specific Question Answering
Sanat Sharma
David Seunghyun Yoon
Franck Dernoncourt
Dewang Sultania
Karishma Bagga
Mengjiao Zhang
Trung Bui
Varun Kotte
RALM
87
9
0
23 Apr 2024
MisgenderMender: A Community-Informed Approach to Interventions for
  Misgendering
MisgenderMender: A Community-Informed Approach to Interventions for Misgendering
Tamanna Hossain
Sunipa Dev
Sameer Singh
111
5
0
23 Apr 2024
Pegasus-v1 Technical Report
Pegasus-v1 Technical Report
Raehyuk Jung
Hyojun Go
Jaehyuk Yi
Jiho Jang
Daniel Kim
...
Maninder Saini
Meredith Sanders
Soyoung Lee
Sue Kim
Travis Couture
MLLMVLM
54
5
0
23 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
217
61
0
23 Apr 2024
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Qihuang Zhong
Kang Wang
Ziyang Xu
Juhua Liu
Liang Ding
Bo Du
LRMAIMat
170
4
0
23 Apr 2024
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks
Amir Saeidi
Shivanshu Verma
Chitta Baral
Chitta Baral
ALM
123
26
0
23 Apr 2024
Integrating Disambiguation and User Preferences into Large Language
  Models for Robot Motion Planning
Integrating Disambiguation and User Preferences into Large Language Models for Robot Motion Planning
Mohammed Abugurain
Shinkyu Park
58
1
0
22 Apr 2024
AutoAD III: The Prequel -- Back to the Pixels
AutoAD III: The Prequel -- Back to the Pixels
Tengda Han
Max Bain
Arsha Nagrani
Gül Varol
Weidi Xie
Andrew Zisserman
VGenDiffM
135
22
0
22 Apr 2024
Filtered Direct Preference Optimization
Filtered Direct Preference Optimization
Tetsuro Morimura
Mitsuki Sakamoto
Yuu Jinnai
Kenshi Abe
Kaito Air
124
15
0
22 Apr 2024
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based
  Mixture of Experts
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts
Dengchun Li
Yingzi Ma
Naizheng Wang
Zhengmao Ye
Zhiyuan Cheng
...
Yan Zhang
Lei Duan
Jie Zuo
Cal Yang
Mingjie Tang
MoE
128
59
0
22 Apr 2024
PARAMANU-GANITA: Can Small Math Language Models Rival with Large Language Models on Mathematical Reasoning?
PARAMANU-GANITA: Can Small Math Language Models Rival with Large Language Models on Mathematical Reasoning?
Mitodru Niyogi
Arnab Bhattacharya
LRMReLM
65
0
0
22 Apr 2024
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation
Yuying Ge
Sijie Zhao
Jinguo Zhu
Yixiao Ge
Kun Yi
Lin Song
Chen Li
Xiaohan Ding
Ying Shan
VLM
146
142
0
22 Apr 2024
EPI-SQL: Enhancing Text-to-SQL Translation with Error-Prevention
  Instructions
EPI-SQL: Enhancing Text-to-SQL Translation with Error-Prevention Instructions
X. Liu
Zhao Tan
71
7
0
21 Apr 2024
Unified Scene Representation and Reconstruction for 3D Large Language
  Models
Unified Scene Representation and Reconstruction for 3D Large Language Models
Tao Chu
Pan Zhang
Xiao-wen Dong
Yuhang Zang
Qiong Liu
Jiaqi Wang
120
2
0
19 Apr 2024
Sample Design Engineering: An Empirical Study of What Makes Good
  Downstream Fine-Tuning Samples for LLMs
Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs
Biyang Guo
He Wang
Wenyilin Xiao
Hong Chen
Zhuxin Lee
Songqiao Han
Hailiang Huang
123
3
0
19 Apr 2024
Towards Universal Performance Modeling for Machine Learning Training on
  Multi-GPU Platforms
Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
Zhongyi Lin
Ning Sun
Pallab Bhattacharya
Xizhou Feng
Louis Feng
John Douglas Owens
124
2
0
19 Apr 2024
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Chengwei Qin
Wenhan Xia
Tan Wang
Fangkai Jiao
Yuchen Hu
Bosheng Ding
Ruirui Chen
Shafiq Joty
LRM
129
5
0
19 Apr 2024
UIClip: A Data-driven Model for Assessing User Interface Design
UIClip: A Data-driven Model for Assessing User Interface Design
Jason Wu
Yi-Hao Peng
Amanda Li
Amanda Swearngin
Jeffrey P. Bigham
Jeffrey Nichols
HAI
89
8
0
18 Apr 2024
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation
Chao Jin
Zili Zhang
Xuanlin Jiang
Fangyue Liu
Xin Liu
Xuanzhe Liu
Xin Jin
120
47
0
18 Apr 2024
Characterizing LLM Abstention Behavior in Science QA with Context
  Perturbations
Characterizing LLM Abstention Behavior in Science QA with Context Perturbations
Bingbing Wen
Bill Howe
Lucy Lu Wang
79
12
0
18 Apr 2024
BLINK: Multimodal Large Language Models Can See but Not Perceive
BLINK: Multimodal Large Language Models Can See but Not Perceive
Xingyu Fu
Yushi Hu
Bangzheng Li
Yu Feng
Haoyu Wang
Xudong Lin
Dan Roth
Noah A. Smith
Wei-Chiu Ma
Ranjay Krishna
VLMLRMMLLM
156
150
0
18 Apr 2024
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language
  Models
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models
Aitor Ormazabal
Che Zheng
Cyprien de Masson dÁutume
Dani Yogatama
Deyu Fu
...
Yazheng Yang
Yi Tay
Yuqi Wang
Zhongkai Zhu
Zhihui Xie
LRMVLMReLM
100
52
0
18 Apr 2024
MedThink: Explaining Medical Visual Question Answering via Multimodal
  Decision-Making Rationale
MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale
Xiaotang Gai
Chenyi Zhou
Jiaxiang Liu
Yang Feng
Jian Wu
Zuo-Qiang Liu
MedIm
109
6
0
18 Apr 2024
Stance Detection on Social Media with Fine-Tuned Large Language Models
Stance Detection on Social Media with Fine-Tuned Large Language Models
Ilker Gül
R. Lebret
Karl Aberer
52
9
0
18 Apr 2024
mABC: multi-Agent Blockchain-Inspired Collaboration for root cause
  analysis in micro-services architecture
mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture
Wei Emma Zhang
Hongcheng Guo
Jian Yang
Yi Zhang
Chaoran Yan
...
Chao Chen
Yi Liang
Xu Shi
Liangfan Zheng
Bowei Zhang
92
10
0
18 Apr 2024
The devil is in the object boundary: towards annotation-free instance
  segmentation using Foundation Models
The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models
Cheng Shi
Sibei Yang
VLM
96
4
0
18 Apr 2024
TriForce: Lossless Acceleration of Long Sequence Generation with
  Hierarchical Speculative Decoding
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Hanshi Sun
Zhuoming Chen
Xinyu Yang
Yuandong Tian
Beidi Chen
121
65
0
18 Apr 2024
From Image to Video, what do we need in multimodal LLMs?
From Image to Video, what do we need in multimodal LLMs?
Suyuan Huang
Haoxin Zhang
Yan Gao
Honggu Chen
Yan Gao
Yao Hu
Zhan Qin
VLM
113
8
0
18 Apr 2024
AgentKit: Flow Engineering with Graphs, not Coding
AgentKit: Flow Engineering with Graphs, not Coding
Yue Wu
Yewen Fan
So Yeon Min
Shrimai Prabhumoye
Stephen Marcus McAleer
Yonatan Bisk
Ruslan Salakhutdinov
Yuanzhi Li
Tom Michael Mitchell
AI4CE
106
1
0
17 Apr 2024
A Preference-driven Paradigm for Enhanced Translation with Large
  Language Models
A Preference-driven Paradigm for Enhanced Translation with Large Language Models
D. Zhu
Sony Trenous
Xiaoyu Shen
Dietrich Klakow
Bill Byrne
Eva Hasler
105
3
0
17 Apr 2024
Towards Human Awareness in Robot Task Planning with Large Language
  Models
Towards Human Awareness in Robot Task Planning with Large Language Models
Yuchen Liu
Luigi Palmieri
Sebastian Koch
Ilche Georgievski
Marco Aiello
LM&Ro
84
6
0
17 Apr 2024
Small Language Models are Good Too: An Empirical Study of Zero-Shot
  Classification
Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification
Pierre Lepagnol
Thomas Gerald
Sahar Ghannay
Christophe Servan
Sophie Rosset
90
11
0
17 Apr 2024
A Survey on Retrieval-Augmented Text Generation for Large Language
  Models
A Survey on Retrieval-Augmented Text Generation for Large Language Models
Yizheng Huang
Jimmy X. Huang
3DVRALM
154
51
0
17 Apr 2024
SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA
  of LLMs
SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs
Jaehyung Kim
Jaehyun Nam
Sangwoo Mo
Jongjin Park
Sang-Woo Lee
Minjoon Seo
Jung-Woo Ha
Jinwoo Shin
AIFinRALMELM
121
51
0
17 Apr 2024
Previous
123...262728...858687
Next