ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILM
    LRM
ArXivPDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,245 papers shown
Title
A Survey on LoRA of Large Language Models
A Survey on LoRA of Large Language Models
Yuren Mao
Yuhang Ge
Yijiang Fan
Wenyi Xu
Yu Mi
Zhonghao Hu
Yunjun Gao
ALM
63
26
0
08 Jul 2024
An Empirical Comparison of Vocabulary Expansion and Initialization
  Approaches for Language Models
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
Nandini Mundra
Aditya Nanda Kishore
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Mitesh Khapra
35
4
0
08 Jul 2024
PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation
PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation
Jinpeng Hu
Tengteng Dong
Luo Gang
Hui Ma
Peng Zou
Xiao Sun
Dan Guo
Meng Wang
AI4MH
44
4
0
08 Jul 2024
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with
  Inverse-Instruct
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct
Yutong Wu
Di Huang
Wenxuan Shi
Wei Wang
Lingzhe Gao
...
Qi Guo
Yewen Pu
Dawei Yin
Xing Hu
Yunji Chen
SyDa
33
2
0
08 Jul 2024
On Speeding Up Language Model Evaluation
On Speeding Up Language Model Evaluation
Jin Peng Zhou
Christian K. Belardi
Ruihan Wu
Travis Zhang
Carla P. Gomes
Wen Sun
Kilian Q. Weinberger
58
1
0
08 Jul 2024
AI Safety in Generative AI Large Language Models: A Survey
AI Safety in Generative AI Large Language Models: A Survey
Jaymari Chua
Yun Yvonna Li
Shiyi Yang
Chen Wang
Lina Yao
LM&MA
47
12
0
06 Jul 2024
Re-Tuning: Overcoming the Compositionality Limits of Large Language
  Models with Recursive Tuning
Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning
Eric Pasewark
Kyle Montgomery
Kefei Duan
Dawn Song
Chenguang Wang
LRM
CLL
ReLM
44
1
0
05 Jul 2024
Rethinking Visual Prompting for Multimodal Large Language Models with
  External Knowledge
Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge
Yuanze Lin
Yunsheng Li
Dongdong Chen
Weijian Xu
Ronald Clark
Philip Torr
Lu Yuan
LRM
VLM
35
8
0
05 Jul 2024
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based
  Speech Recognition
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Ye Bai
Jingping Chen
Jitong Chen
Wei Chen
Zhuo Chen
...
Wanyi Zhang
Yang Zhang
Yawei Zhang
Yijie Zheng
Ming Zou
AuLLM
57
19
0
05 Jul 2024
Defense Against Syntactic Textual Backdoor Attacks with Token
  Substitution
Defense Against Syntactic Textual Backdoor Attacks with Token Substitution
Xinglin Li
Xianwen He
Yao Li
Minhao Cheng
26
1
0
04 Jul 2024
A Survey on Natural Language Counterfactual Generation
A Survey on Natural Language Counterfactual Generation
Yongjie Wang
Xiaoqi Qiu
Yu Yue
Xu Guo
Zhiwei Zeng
Yuhong Feng
Zhiqi Shen
42
6
0
04 Jul 2024
Uncertainty-Guided Optimization on Large Language Model Search Trees
Uncertainty-Guided Optimization on Large Language Model Search Trees
Julia Grosse
Ruotian Wu
Ahmad Rashid
Philipp Hennig
Pascal Poupart
Agustinus Kristiadi
45
1
0
04 Jul 2024
TongGu: Mastering Classical Chinese Understanding with
  Knowledge-Grounded Large Language Models
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models
Jiahuan Cao
Dezhi Peng
Peirong Zhang
Yongxin Shi
Yang Liu
Kai Ding
Lianwen Jin
31
0
0
04 Jul 2024
Cognitive Modeling with Scaffolded LLMs: A Case Study of Referential
  Expression Generation
Cognitive Modeling with Scaffolded LLMs: A Case Study of Referential Expression Generation
Polina Tsvilodub
Michael Franke
Fausto Carcassi
42
0
0
04 Jul 2024
MSfusion: A Dynamic Model Splitting Approach for Resource-Constrained
  Machines to Collaboratively Train Larger Models
MSfusion: A Dynamic Model Splitting Approach for Resource-Constrained Machines to Collaboratively Train Larger Models
Jin Xie
Songze Li
FedML
49
0
0
04 Jul 2024
An Empirical Study on Capability of Large Language Models in
  Understanding Code Semantics
An Empirical Study on Capability of Large Language Models in Understanding Code Semantics
Thu-Trang Nguyen
Thanh Trong Vu
H. Vo
Son Nguyen
ELM
47
2
0
04 Jul 2024
InternLM-XComposer-2.5: A Versatile Large Vision Language Model
  Supporting Long-Contextual Input and Output
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Pan Zhang
Xiaoyi Dong
Yuhang Zang
Yuhang Cao
Rui Qian
...
Kai Chen
Jifeng Dai
Yu Qiao
Dahua Lin
Jiaqi Wang
47
100
0
03 Jul 2024
Universal Length Generalization with Turing Programs
Universal Length Generalization with Turing Programs
Kaiying Hou
David Brandfonbrener
Sham Kakade
Samy Jelassi
Eran Malach
54
8
0
03 Jul 2024
Enhancing Translation Accuracy of Large Language Models through
  Continual Pre-Training on Parallel Data
Enhancing Translation Accuracy of Large Language Models through Continual Pre-Training on Parallel Data
Minato Kondo
T. Utsuro
Masaaki Nagata
CLL
44
4
0
03 Jul 2024
Towards More Realistic Extraction Attacks: An Adversarial Perspective
Towards More Realistic Extraction Attacks: An Adversarial Perspective
Yash More
Prakhar Ganesh
G. Farnadi
AAML
79
6
0
02 Jul 2024
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
Binxu Li
Tiankai Yan
Yuanting Pan
Zhe Xu
Jie Luo
Ruiyang Ji
Shilong Liu
Haoyu Dong
Zihao Lin
Yixin Wang
LM&MA
49
26
0
02 Jul 2024
Is Your AI-Generated Code Really Safe? Evaluating Large Language Models
  on Secure Code Generation with CodeSecEval
Is Your AI-Generated Code Really Safe? Evaluating Large Language Models on Secure Code Generation with CodeSecEval
Jiexin Wang
Xitong Luo
Liuwen Cao
Hongkui He
Hailin Huang
Jiayuan Xie
Adam Jatowt
Yi Cai
ELM
46
14
0
02 Jul 2024
Meerkat: Audio-Visual Large Language Model for Grounding in Space and
  Time
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time
Sanjoy Chowdhury
Sayan Nag
Subhrajyoti Dasgupta
Jun Chen
Mohamed Elhoseiny
Ruohan Gao
Dinesh Manocha
VLM
MLLM
49
9
0
01 Jul 2024
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
Shihan Deng
Weikai Xu
Hongda Sun
Wei Liu
Tao Tan
...
Ang Li
Jian Luan
Bin Wang
Rui Yan
Shuo Shang
LLMAG
52
8
0
01 Jul 2024
Universal Approximation Theory: The Basic Theory for Transformer-based
  Large Language Models
Universal Approximation Theory: The Basic Theory for Transformer-based Large Language Models
Wei Wang
Qing Li
44
0
0
01 Jul 2024
SplitLoRA: A Split Parameter-Efficient Fine-Tuning Framework for Large
  Language Models
SplitLoRA: A Split Parameter-Efficient Fine-Tuning Framework for Large Language Models
Zheng Lin
Xuanjie Hu
Yuxin Zhang
Zhe Chen
Zihan Fang
Xianhao Chen
Ang Li
Praneeth Vepakomma
Yue Gao
54
32
0
01 Jul 2024
Eliminating Position Bias of Language Models: A Mechanistic Approach
Eliminating Position Bias of Language Models: A Mechanistic Approach
Ziqi Wang
Hanlin Zhang
Xiner Li
Kuan-Hao Huang
Chi Han
Shuiwang Ji
Sham Kakade
Hao Peng
Heng Ji
72
12
0
01 Jul 2024
From Introspection to Best Practices: Principled Analysis of Demonstrations in Multimodal In-Context Learning
From Introspection to Best Practices: Principled Analysis of Demonstrations in Multimodal In-Context Learning
Nan Xu
Fei Wang
Sheng Zhang
Hoifung Poon
Muhao Chen
45
6
0
01 Jul 2024
LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image
  Generation
LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation
Mushui Liu
Yuhang Ma
Yang Zhen
Jun Dan
Yunlong Yu
Zeng Zhao
Zhipeng Hu
Bai Liu
Changjie Fan
VLM
DiffM
73
14
0
30 Jun 2024
Parm: Efficient Training of Large Sparsely-Activated Models with
  Dedicated Schedules
Parm: Efficient Training of Large Sparsely-Activated Models with Dedicated Schedules
Xinglin Pan
Wenxiang Lin
Shaoshuai Shi
Xiaowen Chu
Weinong Sun
Bo Li
MoE
57
3
0
30 Jun 2024
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Sujan Dutta
Sayantan Mahinder
R. Anantha
Bortik Bandyopadhyay
ALM
44
4
0
28 Jun 2024
YuLan: An Open-source Large Language Model
YuLan: An Open-source Large Language Model
Yutao Zhu
Kun Zhou
Kelong Mao
Wentong Chen
Yiding Sun
...
Wenbing Huang
Ze-Feng Gao
Yueguo Chen
Weizheng Lu
Ji-Rong Wen
ALM
ELM
44
1
0
28 Jun 2024
Direct Preference Knowledge Distillation for Large Language Models
Direct Preference Knowledge Distillation for Large Language Models
Yixing Li
Yuxian Gu
Li Dong
Dequan Wang
Yu Cheng
Furu Wei
50
6
0
28 Jun 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to
  Cutting-Edge Reasoning
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CML
LRM
49
4
0
27 Jun 2024
Resolving Discrepancies in Compute-Optimal Scaling of Language Models
Resolving Discrepancies in Compute-Optimal Scaling of Language Models
Tomer Porian
Mitchell Wortsman
J. Jitsev
Ludwig Schmidt
Y. Carmon
63
21
0
27 Jun 2024
Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services
Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services
Ali Doosthosseini
Jonathan Decker
Hendrik Nolte
Julian M. Kunkel
55
2
0
27 Jun 2024
Fairness and Bias in Multimodal AI: A Survey
Fairness and Bias in Multimodal AI: A Survey
Tosin Adewumi
Lama Alkhaled
Namrata Gurung
G. V. Boven
Irene Pagliai
63
9
0
27 Jun 2024
SSP: Self-Supervised Prompting for Cross-Lingual Transfer to
  Low-Resource Languages using Large Language Models
SSP: Self-Supervised Prompting for Cross-Lingual Transfer to Low-Resource Languages using Large Language Models
Vipul Rathore
Aniruddha Deb
Ankish Chandresh
Parag Singla
Mausam
LRM
52
0
0
27 Jun 2024
FFN: a Fine-grained Chinese-English Financial Domain Parallel Corpus
FFN: a Fine-grained Chinese-English Financial Domain Parallel Corpus
Yuxin Fu
Shijing Si
Leyi Mai
Xi-ang Li
50
1
0
27 Jun 2024
Lifelong Robot Library Learning: Bootstrapping Composable and
  Generalizable Skills for Embodied Control with Language Models
Lifelong Robot Library Learning: Bootstrapping Composable and Generalizable Skills for Embodied Control with Language Models
Georgios Tziafas
Hamidreza Kasaei
KELM
LM&Ro
51
8
0
26 Jun 2024
LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context
  Parallelism
LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism
Diandian Gu
Peng Sun
Qinghao Hu
Ting Huang
Xun Chen
...
Jiarui Fang
Yonggang Wen
Tianwei Zhang
Xin Jin
Xuanzhe Liu
LRM
48
7
0
26 Jun 2024
Unveiling and Controlling Anomalous Attention Distribution in
  Transformers
Unveiling and Controlling Anomalous Attention Distribution in Transformers
Ruiqing Yan
Xingbo Du
Haoyu Deng
Linghan Zheng
Qiuzhuang Sun
Jifang Hu
Yuhang Shao
Penghao Jiang
Jinrong Jiang
Lian Zhao
41
1
0
26 Jun 2024
Assessing "Implicit" Retrieval Robustness of Large Language Models
Assessing "Implicit" Retrieval Robustness of Large Language Models
Xiaoyu Shen
Rexhina Blloshmi
Dawei Zhu
Jiahuan Pei
Wei Zhang
RALM
KELM
55
0
0
26 Jun 2024
JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large
  Language and Vision-Language Models
JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models
Haibo Jin
Leyang Hu
Xinuo Li
Peiyan Zhang
Chonghan Chen
Jun Zhuang
Haohan Wang
PILM
43
26
0
26 Jun 2024
Learning Neural Networks with Sparse Activations
Learning Neural Networks with Sparse Activations
Pranjal Awasthi
Nishanth Dikkala
Pritish Kamath
Raghu Meka
49
2
0
26 Jun 2024
PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models
PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models
Huixuan Zhang
Yun Lin
Xiaojun Wan
53
0
0
26 Jun 2024
Banishing LLM Hallucinations Requires Rethinking Generalization
Banishing LLM Hallucinations Requires Rethinking Generalization
Johnny Li
Saksham Consul
Eda Zhou
James Wong
Naila Farooqui
...
Zhuxiaona Wei
Tian Wu
Ben Echols
Sharon Zhou
Gregory Diamos
LRM
34
10
0
25 Jun 2024
The FineWeb Datasets: Decanting the Web for the Finest Text Data at
  Scale
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Guilherme Penedo
Hynek Kydlícek
Loubna Ben Allal
Anton Lozhkov
Margaret Mitchell
Colin Raffel
Leandro von Werra
Thomas Wolf
56
195
0
25 Jun 2024
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment
  and Knowledge Aggregation
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation
Yusheng Liao
Shuyang Jiang
Yanfeng Wang
Yu Wang
57
2
0
25 Jun 2024
A Three-Pronged Approach to Cross-Lingual Adaptation with Multilingual
  LLMs
A Three-Pronged Approach to Cross-Lingual Adaptation with Multilingual LLMs
Vaibhav Singh
Amrith Krishna
Karthika NJ
Ganesh Ramakrishnan
36
4
0
25 Jun 2024
Previous
123...171819...838485
Next