ResearchTrend.AI
PaLM: Scaling Language Modeling with Pathways


5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILM
    LRM

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,245 papers shown
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
Banghua Zhu
Michael I. Jordan
Jiantao Jiao
36
25
0
29 Jan 2024
LLaMandement: Large Language Models for Summarization of French Legislative Proposals
Joseph Gesnouin
Yannis Tannier
Christophe Gomes Da Silva
Hatim Tapory
Camille Brier
...
Emmanuel Cortes
Pierre-Etienne Devineau
Ulrich Tan
Esther Mac Namara
Su Yang
AILaw
46
8
0
29 Jan 2024
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Fuzhao Xue
Zian Zheng
Yao Fu
Jinjie Ni
Zangwei Zheng
Wangchunshu Zhou
Yang You
MoE
36
90
0
29 Jan 2024
Corrective Retrieval Augmented Generation
Shi-Qi Yan
Jia-Chen Gu
Yun Zhu
Zhen-Hua Ling
RALM
155
74
0
29 Jan 2024
PRE: A Peer Review Based Large Language Model Evaluator
Zhumin Chu
Qingyao Ai
Yiteng Tu
Haitao Li
Yiqun Liu
LRM
ALM
41
21
0
28 Jan 2024
Quantifying Stereotypes in Language
Yang Liu
40
1
0
28 Jan 2024
ProtAgents: Protein discovery via large language model multi-agent collaborations combining physics and machine learning
Alireza Ghafarollahi
Markus J. Buehler
LLMAG
AI4CE
34
24
0
27 Jan 2024
To Burst or Not to Burst: Generating and Quantifying Improbable Text
Kuleen Sasse
Samuel Barham
Efsun Sarioglu Kayi
Edward W. Staley
DeLMO
27
1
0
27 Jan 2024
Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
Yuxin Liang
Zhuoyang Song
Hao Wang
Jiaxing Zhang
HILM
45
30
0
27 Jan 2024
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
56
24
0
27 Jan 2024
Learning Universal Predictors
Jordi Grau-Moya
Tim Genewein
Marcus Hutter
Laurent Orseau
Grégoire Delétang
...
Anian Ruoss
Wenliang Kevin Li
Christopher Mattern
Matthew Aitchison
J. Veness
41
13
0
26 Jan 2024
Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora
Zhaoye Fei
Yunfan Shao
Linyang Li
Zhiyuan Zeng
Conghui He
Hang Yan
Dahua Lin
Xipeng Qiu
36
8
0
26 Jan 2024
Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling
David Dukić
Jan Šnajder
33
14
0
25 Jan 2024
The Case for Co-Designing Model Architectures with Hardware
Quentin G. Anthony
Jacob Hatef
Deepak Narayanan
Stella Biderman
Stas Bekman
Junqi Yin
Hari Subramoni
Dhabaleswar Panda
3DV
27
4
0
25 Jan 2024
ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models
Yao Fu
Leyang Xue
Yeqi Huang
Andrei-Octavian Brabete
Dmitrii Ustiugov
Yuvraj Patel
Luo Mai
28
28
0
25 Jan 2024
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization
Jaavid Aktar Husain
Raj Dabre
Aswanth Kumar
Jay Gala
Thanmay Jayakumar
Ratish Puduppully
Anoop Kunchukuttan
43
12
0
25 Jan 2024
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Tianhe Ren
Shilong Liu
Ailing Zeng
Jing Lin
Kunchang Li
...
Feng Li
Jie Yang
Hongyang Li
Qing Jiang
Lei Zhang
VLM
61
387
0
25 Jan 2024
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
Weihao Tan
Wentao Zhang
Shanqi Liu
Longtao Zheng
Xinrun Wang
Bo An
OffRL
46
17
0
25 Jan 2024
LanDA: Language-Guided Multi-Source Domain Adaptation
Zhenbin Wang
Lei Zhang
Lituan Wang
Minjuan Zhu
40
10
0
25 Jan 2024
Accelerating Retrieval-Augmented Language Model Serving with Speculation
Zhihao Zhang
Alan Zhu
Lijie Yang
Yihua Xu
Lanting Li
P. Phothilimthana
Zhihao Jia
RALM
KELM
56
16
0
25 Jan 2024
Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution
Cheng Qian
Shihao Liang
Yujia Qin
Yining Ye
Xin Cong
Yankai Lin
Yesai Wu
Zhiyuan Liu
Maosong Sun
LLMAG
24
13
0
25 Jan 2024
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
Hongliang He
Wenlin Yao
Kaixin Ma
Wenhao Yu
Yong Dai
Hongming Zhang
Zhenzhong Lan
Dong Yu
LLMAG
40
126
0
25 Jan 2024
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks
Jing Yu Koh
Robert Lo
Lawrence Jang
Vikram Duvvur
Ming Chong Lim
Po-Yu Huang
Graham Neubig
Shuyan Zhou
Ruslan Salakhutdinov
Daniel Fried
23
0
0
24 Jan 2024
How Good is ChatGPT at Face Biometrics? A First Look into Recognition, Soft Biometrics, and Explainability
Ivan Deandres-Tame
Ruben Tolosana
R. Vera-Rodríguez
Aythami Morales
Julian Fierrez
J. Ortega-Garcia
CVBM
57
22
0
24 Jan 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang
Yahan Yu
Jiahua Dong
Chenxing Li
Dan Su
Chenhui Chu
Dong Yu
OffRL
LRM
56
183
0
24 Jan 2024
Question answering systems for health professionals at the point of care -- a systematic review
Gregory Kell
A. Roberts
Serge Umansky
Linglong Qian
Davide Ferrari
Frank Soboczenski
Byron Wallace
Nikhil Patel
Iain J. Marshall
AI4MH
34
7
0
24 Jan 2024
ChatterBox: Multi-round Multimodal Referring and Grounding
Yunjie Tian
Tianren Ma
Lingxi Xie
Jihao Qiu
Xi Tang
Yuan Zhang
Jianbin Jiao
Qi Tian
Qixiang Ye
33
14
0
24 Jan 2024
Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models
Hongzhan Lin
Ziyang Luo
Wei Gao
Jing Ma
Bo Wang
Ruichao Yang
42
13
0
24 Jan 2024
GraphiMind: LLM-centric Interface for Information Graphics Design
Qiruin Huang
Min Lu
J. Lanir
Dani Lischinski
Daniel Cohen-Or
Hui Huang
MLLM
37
7
0
24 Jan 2024
TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data
Fengbin Zhu
Ziyang Liu
Fuli Feng
Chao Wang
Moxin Li
Tat-Seng Chua
LMTD
LRM
34
15
0
24 Jan 2024
SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection
Ke Ye
Heinrich Jiang
Afshin Rostamizadeh
Ayan Chakrabarti
Giulia DeSalvo
Jean-François Kagy
Lazaros Karydas
Gui Citovsky
Sanjiv Kumar
41
0
0
24 Jan 2024
ULTRA: Unleash LLMs' Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Self-Refinement
Xinliang Frederick Zhang
Carter Blum
Temma Choji
Shalin S Shah
Alakananda Vempala
71
12
0
24 Jan 2024
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
Lingfeng Shen
Weiting Tan
Sihao Chen
Yunmo Chen
Jingyu Zhang
Haoran Xu
Boyuan Zheng
Philipp Koehn
Daniel Khashabi
39
38
0
23 Jan 2024
HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments
Qinhong Zhou
Sunli Chen
Yisong Wang
Haozhe Xu
Weihua Du
Hongxin Zhang
Yilun Du
Josh Tenenbaum
Chuang Gan
AI4CE
34
15
0
23 Jan 2024
Raidar: geneRative AI Detection viA Rewriting
Chengzhi Mao
Carl Vondrick
Hao Wang
Junfeng Yang
DeLMO
31
25
0
23 Jan 2024
DsDm: Model-Aware Dataset Selection with Datamodels
Logan Engstrom
Axel Feldmann
Aleksander Madry
OODD
30
51
0
23 Jan 2024
Dynamic Layer Tying for Parameter-Efficient Transformers
Tamir David Hay
Lior Wolf
33
3
0
23 Jan 2024
Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study
Wenjie Huang
Cyril Allauzen
Tongzhou Chen
Kilol Gupta
Ke Hu
James Qin
Yu Zhang
Yongqiang Wang
Shuo-yiin Chang
Tara N. Sainath
MoMe
45
10
0
23 Jan 2024
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
Boyuan Chen
Zhuo Xu
Sean Kirmani
Brian Ichter
Danny Driess
Pete Florence
Dorsa Sadigh
Leonidas J. Guibas
Fei Xia
LRM
ReLM
57
213
0
22 Jan 2024
An Empirical Study of In-context Learning in LLMs for Machine Translation
Pranjal A. Chitale
Jay Gala
Raj Dabre
LRM
36
5
0
22 Jan 2024
LLM-based policy generation for intent-based management of applications
Kristina Dzeparoska
Jieyu Lin
A. Tizghadam
Alberto Leon-Garcia
11
30
0
22 Jan 2024
Distilling Mathematical Reasoning Capabilities into Small Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
LRM
40
10
0
22 Jan 2024
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang
Zhaochen Yu
Chenlin Meng
Minkai Xu
Stefano Ermon
Bin Cui
CoGe
DiffM
52
118
0
22 Jan 2024
In-context Learning with Retrieved Demonstrations for Language Models: A Survey
Man Luo
Xin Xu
Yue Liu
Panupong Pasupat
Mehran Kazemi
RALM
40
56
0
21 Jan 2024
PRILoRA: Pruned and Rank-Increasing Low-Rank Adaptation
Nadav Benedek
Lior Wolf
32
13
0
20 Jan 2024
Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
Kuofeng Gao
Yang Bai
Jindong Gu
Shu-Tao Xia
Philip Torr
Zhifeng Li
Wei Liu
VLM
22
39
0
20 Jan 2024
The Radiation Oncology NLP Database
Zheng Liu
J. Holmes
Wenxiong Liao
Chenbin Liu
Lian-Cheng Zhang
...
Quanzheng Li
Xiang Li
Tianming Liu
Jiajian Shen
Wei Liu
LM&MA
AI4CE
37
2
0
19 Jan 2024
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Tianle Cai
Yuhong Li
Zhengyang Geng
Hongwu Peng
Jason D. Lee
De-huai Chen
Tri Dao
60
259
0
19 Jan 2024
LangBridge: Multilingual Reasoning Without Multilingual Supervision
Dongkeun Yoon
Joel Jang
Sungdong Kim
Seungone Kim
Sheikh Shafayat
Minjoon Seo
LRM
26
15
0
19 Jan 2024
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis
Chao Zhang
Yuren Mao
Yijiang Fan
Yu Mi
Yunjun Gao
Lu Chen
Dongfang Lou
Jinshu Lin
45
23
0
19 Jan 2024