ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILM
    LRM
ArXivPDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,245 papers shown
Title
Analysis of Plan-based Retrieval for Grounded Text Generation
Analysis of Plan-based Retrieval for Grounded Text Generation
Ameya Godbole
Nicholas Monath
Seungyeon Kim
A. S. Rawat
Andrew McCallum
Manzil Zaheer
RALM
50
2
0
20 Aug 2024
Demystifying the Communication Characteristics for Distributed
  Transformer Models
Demystifying the Communication Characteristics for Distributed Transformer Models
Quentin G. Anthony
Benjamin Michalowicz
Jacob Hatef
Lang Xu
Mustafa Abduljabbar
Hari Subramoni
Hari Subramoni
D. Panda
AI4CE
38
2
0
19 Aug 2024
Geometry Informed Tokenization of Molecules for Language Model
  Generation
Geometry Informed Tokenization of Molecules for Language Model Generation
Xiner Li
Limei Wang
Youzhi Luo
Carl Edwards
Shurui Gui
Yuchao Lin
Heng Ji
Shuiwang Ji
54
6
0
19 Aug 2024
A Transcription Prompt-based Efficient Audio Large Language Model for
  Robust Speech Recognition
A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition
Yangze Li
Xiong Wang
Songjun Cao
Yike Zhang
Long Ma
Lei Xie
AuLLM
58
0
0
18 Aug 2024
Crossing New Frontiers: Knowledge-Augmented Large Language Model
  Prompting for Zero-Shot Text-Based De Novo Molecule Design
Crossing New Frontiers: Knowledge-Augmented Large Language Model Prompting for Zero-Shot Text-Based De Novo Molecule Design
Sakhinana Sagar Srinivas
Venkataramana Runkana
49
1
0
18 Aug 2024
PEDAL: Enhancing Greedy Decoding with Large Language Models using
  Diverse Exemplars
PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars
Sumanth Prabhu
42
1
0
16 Aug 2024
Context-Aware Assistant Selection for Improved Inference Acceleration
  with Large Language Models
Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
Jerry Huang
Prasanna Parthasarathi
Mehdi Rezagholizadeh
Sarath Chandar
54
1
0
16 Aug 2024
A theory of understanding for artificial intelligence: composability,
  catalysts, and learning
A theory of understanding for artificial intelligence: composability, catalysts, and learning
Zijian Zhang
Sara Aronowitz
Alán Aspuru-Guzik
42
0
0
16 Aug 2024
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
Do Xuan Long
Hai Nguyen Ngoc
Tiviatis Sim
Hieu Dao
Shafiq Joty
Kenji Kawaguchi
Nancy F. Chen
Min-Yen Kan
36
8
0
16 Aug 2024
ScalingFilter: Assessing Data Quality through Inverse Utilization of
  Scaling Laws
ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws
Ruihang Li
Yixuan Wei
Miaosen Zhang
Nenghai Yu
Han Hu
Houwen Peng
50
2
0
15 Aug 2024
BAM! Just Like That: Simple and Efficient Parameter Upcycling for
  Mixture of Experts
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
Qizhen Zhang
Nikolas Gritsch
Dwaraknath Gnaneshwar
Simon Guo
David Cairuz
...
Jakob N. Foerster
Phil Blunsom
Sebastian Ruder
Ahmet Üstün
Acyr Locatelli
MoMe
MoE
56
5
0
15 Aug 2024
Csi-LLM: A Novel Downlink Channel Prediction Method Aligned with LLM
  Pre-Training
Csi-LLM: A Novel Downlink Channel Prediction Method Aligned with LLM Pre-Training
Shilong Fan
Zhenyu Liu
Xinyu Gu
Haoyang Li
20
7
0
15 Aug 2024
Enhancing Large Language Model-based Speech Recognition by
  Contextualization for Rare and Ambiguous Words
Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words
Kento Nozawa
Takashi Masuko
Toru Taniguchi
48
1
0
15 Aug 2024
Training Language Models on the Knowledge Graph: Insights on
  Hallucinations and Their Detectability
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability
Jiri Hron
Laura J. Culp
Gamaleldin F. Elsayed
Rosanne Liu
Ben Adlam
...
T. Warkentin
Lechao Xiao
Kelvin Xu
Jasper Snoek
Simon Kornblith
45
1
0
14 Aug 2024
Kraken: Inherently Parallel Transformers For Efficient Multi-Device
  Inference
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference
R. Prabhakar
Hengrui Zhang
D. Wentzlaff
36
0
0
14 Aug 2024
Training Overhead Ratio: A Practical Reliability Metric for Large
  Language Model Training Systems
Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems
Ning Lu
Qian Xie
Hao Zhang
Wenyi Fang
Yang Zheng
Zheng Hu
Jiantao Ma
22
1
0
14 Aug 2024
DataVisT5: A Pre-trained Language Model for Jointly Understanding Text
  and Data Visualization
DataVisT5: A Pre-trained Language Model for Jointly Understanding Text and Data Visualization
Zhuoyue Wan
Yuanfeng Song
Shuaimin Li
Chen Jason Zhang
Raymond Chi-Wing Wong
VLM
37
1
0
14 Aug 2024
A Survey on Model MoErging: Recycling and Routing Among Specialized
  Experts for Collaborative Learning
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
Prateek Yadav
Colin Raffel
Mohammed Muqeeth
Lucas Caccia
Haokun Liu
Tianlong Chen
Joey Tianyi Zhou
Leshem Choshen
Alessandro Sordoni
MoMe
51
21
0
13 Aug 2024
LoRA$^2$ : Multi-Scale Low-Rank Approximations for Fine-Tuning Large
  Language Models
LoRA2^22 : Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models
Jia-Chen Zhang
Yu-Jie Xiong
He-Xi Qiu
Dong-Hai Zhu
Chun-Ming Xia
MoE
32
0
0
13 Aug 2024
SparkRA: A Retrieval-Augmented Knowledge Service System Based on Spark
  Large Language Model
SparkRA: A Retrieval-Augmented Knowledge Service System Based on Spark Large Language Model
Dayong Wu
Jiaqi Li
Baoxin Wang
Honghong Zhao
Siyuan Xue
...
Li Qian
Bo Wang
Shijin Wang
Zhixiong Zhang
Guoping Hu
RALM
57
0
0
13 Aug 2024
MGH Radiology Llama: A Llama 3 70B Model for Radiology
MGH Radiology Llama: A Llama 3 70B Model for Radiology
Yucheng Shi
Peng Shu
Zhengliang Liu
Zihao Wu
Quanzheng Li
Xiang Li
LM&MA
30
0
0
13 Aug 2024
Towards Autonomous Agents: Adaptive-planning, Reasoning, and Acting in
  Language Models
Towards Autonomous Agents: Adaptive-planning, Reasoning, and Acting in Language Models
Yen-Che Hsiao
Abhishek Dutta
LLMAG
LM&Ro
LRM
43
0
0
12 Aug 2024
Animate, or Inanimate, That is the Question for Large Language Models
Animate, or Inanimate, That is the Question for Large Language Models
Leonardo Ranaldi
Giulia Pucci
Fabio Massimo Zanzotto
37
0
0
12 Aug 2024
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation
  Agents
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Xiao-Yang Liu
Tianjie Zhang
Yu Gu
Iat Long Iong
Yifan Xu
...
Zhengxiao Du
Chan Hee Song
Yu Su
Yuxiao Dong
Jie Tang
VLM
LLMAG
55
23
0
12 Aug 2024
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced
  Data
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Haoran Sun
Renren Jin
Shaoyang Xu
Leiyu Pan
Supryadi
...
Lei Yang
Ling Shi
Juesi Xiao
Shaolin Zhu
Deyi Xiong
65
2
0
12 Aug 2024
Defining Boundaries: A Spectrum of Task Feasibility for Large Language
  Models
Defining Boundaries: A Spectrum of Task Feasibility for Large Language Models
Wenbo Zhang
Zihang Xu
Hengrui Cai
40
1
0
11 Aug 2024
LaWa: Using Latent Space for In-Generation Image Watermarking
LaWa: Using Latent Space for In-Generation Image Watermarking
Ahmad Rezaei
Mohammad Akbari
Saeed Ranjbar Alvar
Arezou Fatemi
Yong Zhang
WIGM
54
13
0
11 Aug 2024
Document-Level Event Extraction with Definition-Driven ICL
Document-Level Event Extraction with Definition-Driven ICL
Zhuoyuan Liu
Yilin Luo
86
1
0
10 Aug 2024
Revisiting Multi-Modal LLM Evaluation
Revisiting Multi-Modal LLM Evaluation
Jian Lu
Shikhar Srivastava
Junyu Chen
Robik Shrestha
Manoj Acharya
Kushal Kafle
Christopher Kanan
38
3
0
09 Aug 2024
Learning the Simplicity of Scattering Amplitudes
Learning the Simplicity of Scattering Amplitudes
Clifford Cheung
Aurélien Dersy
Matthew D. Schwartz
33
3
0
08 Aug 2024
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
Łukasz Borchmann
Michał Pietruszka
Wojciech Ja'skowski
Dawid Jurkiewicz
Piotr Halama
...
Gabriela Nowakowska
Artur Zawłocki
Łukasz Duhr
Paweł Dyda
Michał Turski
VLM
41
1
0
08 Aug 2024
MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with
  Large Language Models
MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models
Haoxuan Li
Zhengmao Yang
Yunshan Ma
Yi Bin
Yang Yang
Tat-Seng Chua
46
0
0
08 Aug 2024
Evaluating Language Model Math Reasoning via Grounding in Educational
  Curricula
Evaluating Language Model Math Reasoning via Grounding in Educational Curricula
L. Lucy
Tal August
Rose E. Wang
Luca Soldaini
Courtney Allison
Kyle Lo
ReLM
LRM
31
3
0
08 Aug 2024
Patchview: LLM-Powered Worldbuilding with Generative Dust and Magnet
  Visualization
Patchview: LLM-Powered Worldbuilding with Generative Dust and Magnet Visualization
John Joon Young Chung
Max Kreminski
50
10
0
07 Aug 2024
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language
  Modeling
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
William Y. Zhu
Keren Ye
Junjie Ke
Jiahui Yu
Leonidas J. Guibas
P. Milanfar
Feng Yang
51
2
0
07 Aug 2024
Advancing Multimodal Large Language Models with Quantization-Aware Scale
  Learning for Efficient Adaptation
Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation
Jingjing Xie
Yuxin Zhang
Mingbao Lin
Liujuan Cao
Rongrong Ji
MQ
41
4
0
07 Aug 2024
Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble
  Exploitation
Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation
Weiqi Feng
Yangrui Chen
Shaoyu Wang
Size Zheng
Haibin Lin
Minlan Yu
MLLM
AI4CE
42
4
0
07 Aug 2024
AgentsCoMerge: Large Language Model Empowered Collaborative Decision Making for Ramp Merging
AgentsCoMerge: Large Language Model Empowered Collaborative Decision Making for Ramp Merging
Senkang Hu
Zhengru Fang
Zihan Fang
Yiqin Deng
Xianhao Chen
Yuguang Fang
Sam Kwong
65
14
0
07 Aug 2024
Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral
  7B Model and Data Augmentation
Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral 7B Model and Data Augmentation
Artur Guimarães
Bruno Martins
João Magalhães
26
0
0
06 Aug 2024
Evaluation of Segment Anything Model 2: The Role of SAM2 in the
  Underwater Environment
Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment
Shijie Lian
Hua Li
VLM
43
5
0
06 Aug 2024
VisionUnite: A Vision-Language Foundation Model for Ophthalmology
  Enhanced with Clinical Knowledge
VisionUnite: A Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge
Zihan Li
Diping Song
Zefeng Yang
Deming Wang
Fei Li
Xiulan Zhang
P. E. Kinahan
Yu Qiao
VLM
LM&MA
27
3
0
05 Aug 2024
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
Ryan Aponte
Ryan A. Rossi
Shunan Guo
Franck Dernoncourt
Tong Yu
Xiang Chen
Subrata Mitra
Nedim Lipka
OffRL
36
0
0
05 Aug 2024
From Recognition to Prediction: Leveraging Sequence Reasoning for Action
  Anticipation
From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation
Xin Liu
Chao Hao
Zitong Yu
Huanjing Yue
Jingyu Yang
41
1
0
05 Aug 2024
Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan:
  A Multi-Player Cooperative Game under Imperfect Information
Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information
Yauwai Yim
Chunkit Chan
Tianyu Shi
Zheye Deng
Wei Fan
Tianshi Zheng
Yangqiu Song
LLMAG
41
10
0
05 Aug 2024
ExoViP: Step-by-step Verification and Exploration with Exoskeleton
  Modules for Compositional Visual Reasoning
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Yuanda Wang
Alan Yuille
Zhuowan Li
Zilong Zheng
LRM
46
3
0
05 Aug 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Ping Luo
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
82
48
0
05 Aug 2024
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future
Haolin Jin
Linghan Huang
Haipeng Cai
Jun Yan
Bo Li
Huaming Chen
78
30
0
05 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey
  on Methods and Datasets
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
50
0
0
04 Aug 2024
Effective Demonstration Annotation for In-Context Learning via Language
  Model-Based Determinantal Point Process
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process
Peng Wang
Xiaobin Wang
Chao Lou
Shengyu Mao
Pengjun Xie
Yong-jia Jiang
54
0
0
04 Aug 2024
MAO: A Framework for Process Model Generation with Multi-Agent
  Orchestration
MAO: A Framework for Process Model Generation with Multi-Agent Orchestration
Leilei Lin
Yumeng Jin
Yingming Zhou
Wenlong Chen
Chen Qian
57
1
0
04 Aug 2024
Previous
123...141516...838485
Next