ResearchTrend.AI
PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILM
    LRM

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,245 papers shown
Title
Rationale-Aware Answer Verification by Pairwise Self-Evaluation
Akira Kawabata
Saku Sugawara
LRM
41
3
0
07 Oct 2024
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation
Chuanyang Zheng
Yihang Gao
Han Shi
Jing Xiong
Jiankai Sun
...
Xiaozhe Ren
Michael Ng
Xin Jiang
Zhenguo Li
Yu Li
38
3
0
07 Oct 2024
MM-R$^3$: On (In-)Consistency of Multi-modal Large Language Models (MLLMs)
Shih-Han Chou
Shivam Chandhok
James J. Little
Leonid Sigal
42
0
0
07 Oct 2024
Mirror-Consistency: Harnessing Inconsistency in Majority Voting
Siyuan Huang
Zhiyuan Ma
Jintao Du
Changhua Meng
Weiqiang Wang
Zhouhan Lin
LRM
34
4
0
07 Oct 2024
Rule-based Data Selection for Large Language Models
Xiaomin Li
Mingye Gao
Zhiwei Zhang
Chang Yue
Hong Hu
42
5
0
07 Oct 2024
Post-hoc Study of Climate Microtargeting on Social Media Ads with LLMs: Thematic Insights and Fairness Evaluation
Tunazzina Islam
Dan Goldwasser
43
1
0
07 Oct 2024
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
Kaiyue Wen
Huaqing Zhang
Hongzhou Lin
Jingzhao Zhang
MoE
LRM
66
2
0
07 Oct 2024
DEPT: Decoupled Embeddings for Pre-training Language Models
Alex Iacob
Lorenzo Sani
Meghdad Kurmanji
William F. Shen
Xinchi Qiu
Dongqi Cai
Yan Gao
Nicholas D. Lane
VLM
238
0
0
07 Oct 2024
MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Lai Wei
Wenkai Wang
Xiaoyu Shen
Yu Xie
Zhihao Fan
Xiaojin Zhang
Zhongyu Wei
Wei Chen
39
4
0
06 Oct 2024
RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference
Yige Xu
Xu Guo
Zhiwei Zeng
Chunyan Miao
44
0
0
06 Oct 2024
Realizing Video Summarization from the Path of Language-based Semantic Understanding
Kuan-Chen Mu
Zhi-Yi Chin
Wei-Chen Chiu
28
0
0
06 Oct 2024
Grokking at the Edge of Linear Separability
Alon Beck
Noam Levi
Yohai Bar-Sinai
36
1
0
06 Oct 2024
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
62
17
0
06 Oct 2024
HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis
Yuto Nishimura
Takumi Hirose
Masanari Ohi
Hideki Nakayama
Nakamasa Inoue
VLM
42
1
0
06 Oct 2024
OD-Stega: LLM-Based Near-Imperceptible Steganography via Optimized Distributions
Yu-Shin Huang
Peter Just
Krishna Narayanan
Chao Tian
52
1
0
06 Oct 2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
Tianjian Li
Haoran Xu
Weiting Tan
Kenton Murray
Daniel Khashabi
35
1
0
06 Oct 2024
Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations
Sagi Shaier
Ari Kobren
Philip Ogren
HILM
39
6
0
05 Oct 2024
LongGenBench: Long-context Generation Benchmark
Xiang Liu
Peijie Dong
Bo Li
Xiaowen Chu
RALM
55
8
0
05 Oct 2024
Table Question Answering for Low-resourced Indic Languages
Vaishali Pal
Evangelos Kanoulas
Andrew Yates
Maarten de Rijke
LMTD
33
0
0
04 Oct 2024
MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation
Hongcheng Wang
Peiqi Liu
Wenzhe Cai
Mingdong Wu
Zhengyu Qian
Hao Dong
31
0
0
04 Oct 2024
Can Mamba Always Enjoy the "Free Lunch"?
Ruifeng Ren
Zhicong Li
Yong Liu
44
1
0
04 Oct 2024
How Much Can We Forget about Data Contamination?
Sebastian Bordt
Suraj Srinivas
Valentyn Boreiko
U. V. Luxburg
54
1
0
04 Oct 2024
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages
Seonjeong Hwang
Yunsu Kim
Gary Geunbae Lee
45
0
0
04 Oct 2024
Margin Matching Preference Optimization: Enhanced Model Alignment with Granular Feedback
Kyuyoung Kim
Ah Jeong Seo
Hao Liu
Jinwoo Shin
Kimin Lee
30
2
0
04 Oct 2024
LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy
Rongzhi Zhang
Kuang Wang
Liyuan Liu
Shuohang Wang
Hao Cheng
Chao Zhang
Yelong Shen
MQ
28
5
0
04 Oct 2024
Frame-Voyager: Learning to Query Frames for Video Large Language Models
Sicheng Yu
Chengkai Jin
Huanyu Wang
Zhenghao Chen
Sheng Jin
...
Zhenbang Sun
Bingni Zhang
Jiawei Wu
Hao Zhang
Qianru Sun
77
5
0
04 Oct 2024
Scaling Large Motion Models with Million-Level Human Motions
Ye Wang
Sipeng Zheng
Bin Cao
Qianshan Wei
Qin Jin
Zongqing Lu
VGen
47
0
0
04 Oct 2024
NL-Eye: Abductive NLI for Images
Mor Ventura
Michael Toker
Nitay Calderon
Zorik Gekhman
Yonatan Bitton
Roi Reichart
38
1
0
03 Oct 2024
Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Insights from Multi-Agent Collaboration
Weikang Yuan
Junjie Cao
Zhuoren Jiang
Yangyang Kang
Jun Lin
Kaisong Song
Tianqianjin Lin
Pengwei Yan
Changlong Sun
Wei Lu
AILaw
ELM
LRM
34
2
0
03 Oct 2024
SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations
Nikolaos Giakoumoglou
Tania Stathaki
SSL
51
0
0
03 Oct 2024
Visual Prompting in LLMs for Enhancing Emotion Recognition
Qixuan Zhang
Zhifeng Wang
Dylan Zhang
Wenjia Niu
Sabrina Caldwell
Tom Gedeon
Yang Liu
Zhenyue Qin
37
0
0
03 Oct 2024
Agent-Oriented Planning in Multi-Agent Systems
Ao Li
Yuexiang Xie
Songze Li
Fugee Tsung
Bolin Ding
Yaliang Li
AIFin
188
6
0
03 Oct 2024
A Two-Stage Proactive Dialogue Generator for Efficient Clinical Information Collection Using Large Language Model
Xueshen Li
Xinlong Hou
Nirupama Ravi
Ziyi Huang
Yu Gan
LM&MA
34
0
0
02 Oct 2024
Auto-Demo Prompting: Leveraging Generated Outputs as Demonstrations for Enhanced Batch Prompting
Longyu Feng
Mengze Hong
Chen Jason Zhang
47
2
0
02 Oct 2024
CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs
Kangsheng Wang
Xiao Zhang
Hao Liu
Songde Han
Huimin Ma
Tianyu Hu
LRM
59
5
0
02 Oct 2024
Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia
Miao Yu
Junyuan Mao
Guibin Zhang
Jingheng Ye
Sihang Li
Aoxiao Zhong
Yang Liu
Yuxuan Liang
Kun Wang
Qingsong Wen
44
2
0
02 Oct 2024
Stars, Stripes, and Silicon: Unravelling the ChatGPT's All-American, Monochrome, Cis-centric Bias
Federico Torrielli
34
0
0
02 Oct 2024
Selective Aggregation for Low-Rank Adaptation in Federated Learning
Pengxin Guo
Shuang Zeng
Y. Wang
Huijie Fan
Feifei Wang
Liangqiong Qu
FedML
50
11
0
02 Oct 2024
ENTP: Encoder-only Next Token Prediction
Ethan Ewer
Daewon Chae
Thomas Zeng
Jinkyu Kim
Kangwook Lee
38
3
0
02 Oct 2024
Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding
Yanming Liu
Xinyue Peng
Jiannan Cao
Shi Bo
Yanxin Shen
Tianyu Du
Sheng Cheng
Xun Wang
Jianwei Yin
Xuhong Zhang
71
9
0
02 Oct 2024
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliability
Weitong Zhang
Chengqi Zang
Bernhard Kainz
41
0
0
01 Oct 2024
Causal Representation Learning with Generative Artificial Intelligence: Application to Texts as Treatments
Kosuke Imai
Kentaro Nakamura
CML
30
4
0
01 Oct 2024
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models
Keivan Alizadeh
Iman Mirzadeh
Hooman Shahrokhi
Dmitry Belenko
Frank Sun
Minsik Cho
Mohammad Hossein Sekhavat
Moin Nabi
Mehrdad Farajtabar
MoE
33
1
0
01 Oct 2024
M$^{2}$M: Learning controllable Multi of experts and multi-scale operators are the Partial Differential Equations need
Aoming Liang
Zhaoyang Mu
Pengxiao Lin
Cong Wang
Mingming Ge
Ling Shao
Dixia Fan
Hao Tang
AI4CE
36
0
0
01 Oct 2024
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation
Ziyang Luo
Xin Li
Hongzhan Lin
Jing Ma
Lidong Bing
VLM
32
0
0
01 Oct 2024
Dynamic Planning for LLM-based Graphical User Interface Automation
Shaoqing Zhang
Zhuosheng Zhang
Kehai Chen
Xinbei Ma
Muyun Yang
Tiejun Zhao
Min Zhang
LLMAG
37
8
0
01 Oct 2024
Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness
Xiao Peng
Xufan Geng
LLMAG
29
0
0
01 Oct 2024
Recent Advances in Speech Language Models: A Survey
Wenqian Cui
Dianzhi Yu
Xiaoqi Jiao
Ziqiao Meng
Guangyan Zhang
Qichao Wang
Yiwen Guo
Irwin King
AuLLM
61
17
0
01 Oct 2024
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining
Vinayak Arannil
Neha Narwal
Sourav Sanjukta Bhabesh
Sai Nikhil Thirandas
Darren Yow-Bang Wang
Graham Horwood
Alex Anto Chirayath
Gouri Pandeshwar
43
0
0
30 Sep 2024
Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation
P. H. Paiola
Gabriel Lino Garcia
João Renato Ribeiro Manesco
Mateus Roder
Douglas Rodrigues
João Paulo Papa
LM&MA
24
0
0
30 Sep 2024